www.HadoopExam.com

HadoopExam Learning Resources

Display # 
Title Hits
EMC Data Science Question 1 : A data scientist is asked to implement an article recommendation feature for an on-line magazine. The magazine does not want to use client tracking technologies such as cookies or reading history. Therefore, only the style an 1221
EMC Data Science Question : 2 Which method is used to solve for coefficients b0, b1, .., bn in your linear regression model : 1098
EMC Data Science Question : 3 What describes a true limitation of Logistic Regression method? 989
EMC Data Science Question 4 : What is a core deliverable at the end of the analytic project? 1167
EMC Data Science Question 5 : You have been assigned to run a logistic regression model for each of 100 countries, and all the data is currently stored in a PostgreSQL database. Which tool/library would you use to produce these models with the least effor 856
EMC Data Science Question 6 : Your organization has a website where visitors randomly receive one of two coupons. It is also possible that visitors to the website will not receive a coupon. You have been asked to determine if offering a coupon to visitor 959
EMC Data Science Question 7 : Imagine you are trying to hire a Data Scientist for your team. In addition to technical ability and quantitative background, which additional essential trait would you look for in people applying for this position? 851
EMC Data Science Question 8 : You have run the association rules algorithm on your data set, and the two rules {banana, apple} => {grape} and {apple, orange}=> {grape} have been found to be relevant. What else must be true? 1015
EMC Data Science Question 9 : When would you use a Wilcoxson Rank Sum test? 1286
Question 10: Consider a database with 4 transactions: 853
EMC Data Science Question 11: You are using the Apriori algorithm to determine the likelihood that a person who owns a home has a good credit score. You have determined that the confidence for the rules used in the algorithm is > 75%. You calculate lift = 1094
EMC Data Science Question 12: Consider a database with 4 transactions: Transaction 1: {cheese, bread, milk} Transaction 2: {soda, bread, milk} 908
EMC Data Science Question 13: Under which circumstance do you need to implement N-fold cross-validation after creating a regression model? 1061
EMC Data Science Question 14: What is an appropriate data visualization to use in a presentation for an analyst audience? 941
EMC Data Science Question 15: When would you use GROUP BY ROLLUP clause in your OLAP query? 899
EMC Data Science Question 16: Which type of numeric value does a logistic regression model estimate? 1031
EMC Data Science Question 17: Your colleague, who is new to Hadoop, approaches you with a question. They want to know how best to access their data. This colleague has a strong background in data flow languages and programming. Which query interface woul 1054
EMC Data Science Question 18: The web analytics team uses Hadoop to process access logs. They now want to correlate this data with structured user data residing in a production single-instance JDBC database. They collaborate with the production team to im 1036
EMC Data Science Question 19 : In R, functions like plot() and hist() are known as what? 1181
You are here: Home EMC Certification EMC Data Science