www.HadoopExam.com

HadoopExam Learning Resources

Title Hits
EMC Data Science Question 1 : A data scientist is asked to implement an article recommendation feature for an on-line magazine. The magazine does not want to use client tracking technologies such as cookies or reading history. Therefore, only the style an 889
EMC Data Science Question 2 : Which method is used to solve for coefficients b0, b1, ..., bn in your linear regression model? 824
EMC Data Science Question 3 : What describes a true limitation of the Logistic Regression method? 716
EMC Data Science Question 4 : What is a core deliverable at the end of the analytic project? 906
EMC Data Science Question 5 : You have been assigned to run a logistic regression model for each of 100 countries, and all the data is currently stored in a PostgreSQL database. Which tool/library would you use to produce these models with the least effor 651
EMC Data Science Question 6 : Your organization has a website where visitors randomly receive one of two coupons. It is also possible that visitors to the website will not receive a coupon. You have been asked to determine if offering a coupon to visitor 709
EMC Data Science Question 7 : Imagine you are trying to hire a Data Scientist for your team. In addition to technical ability and quantitative background, which additional essential trait would you look for in people applying for this position? 625
EMC Data Science Question 8 : You have run the association rules algorithm on your data set, and the two rules {banana, apple} => {grape} and {apple, orange}=> {grape} have been found to be relevant. What else must be true? 792
EMC Data Science Question 9 : When would you use a Wilcoxon Rank Sum test? 974
EMC Data Science Question 10: Consider a database with 4 transactions: 617
EMC Data Science Question 11: You are using the Apriori algorithm to determine the likelihood that a person who owns a home has a good credit score. You have determined that the confidence for the rules used in the algorithm is > 75%. You calculate lift = 829
EMC Data Science Question 12: Consider a database with 4 transactions: Transaction 1: {cheese, bread, milk} Transaction 2: {soda, bread, milk} 661
EMC Data Science Question 13: Under which circumstance do you need to implement N-fold cross-validation after creating a regression model? 807
EMC Data Science Question 14: What is an appropriate data visualization to use in a presentation for an analyst audience? 680
EMC Data Science Question 15: When would you use the GROUP BY ROLLUP clause in your OLAP query? 661
EMC Data Science Question 16: Which type of numeric value does a logistic regression model estimate? 804
EMC Data Science Question 17: Your colleague, who is new to Hadoop, approaches you with a question. They want to know how best to access their data. This colleague has a strong background in data flow languages and programming. Which query interface woul 848
EMC Data Science Question 18: The web analytics team uses Hadoop to process access logs. They now want to correlate this data with structured user data residing in a production single-instance JDBC database. They collaborate with the production team to im 799
EMC Data Science Question 19 : In R, functions like plot() and hist() are known as what? 859