Title Hits
EMC Data Science Question 1: A data scientist is asked to implement an article recommendation feature for an on-line magazine. The magazine does not want to use client tracking technologies such as cookies or reading history. Therefore, only the style an 1988
EMC Data Science Question 2: Which method is used to solve for the coefficients b0, b1, ..., bn in your linear regression model? 1742
EMC Data Science Question 3: What describes a true limitation of the logistic regression method? 1632
EMC Data Science Question 4: What is a core deliverable at the end of the analytic project? 1876
EMC Data Science Question 5: You have been assigned to run a logistic regression model for each of 100 countries, and all the data is currently stored in a PostgreSQL database. Which tool/library would you use to produce these models with the least effor 1411
EMC Data Science Question 6: Your organization has a website where visitors randomly receive one of two coupons. It is also possible that visitors to the website will not receive a coupon. You have been asked to determine if offering a coupon to visitor 1577
EMC Data Science Question 7: Imagine you are trying to hire a Data Scientist for your team. In addition to technical ability and quantitative background, which additional essential trait would you look for in people applying for this position? 1384
EMC Data Science Question 8: You have run the association rules algorithm on your data set, and the two rules {banana, apple} => {grape} and {apple, orange} => {grape} have been found to be relevant. What else must be true? 1617
EMC Data Science Question 9: When would you use a Wilcoxon Rank Sum test? 2065
EMC Data Science Question 10: Consider a database with 4 transactions: 1440
EMC Data Science Question 11: You are using the Apriori algorithm to determine the likelihood that a person who owns a home has a good credit score. You have determined that the confidence for the rules used in the algorithm is > 75%. You calculate lift = 1777
EMC Data Science Question 12: Consider a database with 4 transactions: Transaction 1: {cheese, bread, milk} Transaction 2: {soda, bread, milk} 1519
EMC Data Science Question 13: Under which circumstance do you need to implement N-fold cross-validation after creating a regression model? 1699
EMC Data Science Question 14: What is an appropriate data visualization to use in a presentation for an analyst audience? 1591
EMC Data Science Question 15: When would you use the GROUP BY ROLLUP clause in your OLAP query? 1500
EMC Data Science Question 16: Which type of numeric value does a logistic regression model estimate? 1622
EMC Data Science Question 17: Your colleague, who is new to Hadoop, approaches you with a question. They want to know how best to access their data. This colleague has a strong background in data flow languages and programming. Which query interface woul 1559
EMC Data Science Question 18: The web analytics team uses Hadoop to process access logs. They now want to correlate this data with structured user data residing in a production single-instance JDBC database. They collaborate with the production team to im 1624
EMC Data Science Question 19: In R, functions like plot() and hist() are known as what? 1841