What is one modeling or descriptive statistical function in MADlib that is typically not provided in a standard relational database?
You are analyzing data in order to build a classifier model. You discover non-linear data and discontinuities that will affect the model. Which analytical method would you recommend?
In unsupervised learning which statements correctly applies
You are doing advanced analytics for the one of the medical application using the regression and you have two variables which are weight and height and they are very important input variables, which cannot be ignored and they are also highly co-related. What is the best solution for that?
Find out the classifier which assumes independence among all its features?
In which phase of the analytic lifecycle would you expect to spend most of the project time?
Suppose there are three events then which formula must always be equal to P(E1|E2,E3)?
You are working with the Clustering solution of the customer datasets. There are almost 40 variables are available for each customer and almost 1.00,0000 customer's data is available. You want to reduce the number of variables for clustering, what would you do?
You are asked to create a model to predict the total number of monthly subscribers for a specific magazine. You are provided with 1 year's worth of subscription and payment data, user demographic data, and 10 years worth of content of the magazine (articles and pictures). Which algorithm is the most appropriate for building a predictive model for subscribers?
Spam filtering of the emails is an example of
Which of the following are advantages of the Support Vector machines?
Question-13. Which of the following is not the Classification algorithm?
Under which circumstance do you need to implement N-fold cross-validation after creating a regression model?
In which of the scenario you can use the linear regression model?
Select the choice where Regression algorithms are not best fit
What are the advantages of the Hashing Features?
Question-3: In machine learning, feature hashing, also known as the hashing trick (by analogy to the kernel trick), is a fast and space-efficient way of vectorizing features (such as the words in a language), i.e., turning arbitrary features into indices in a vector or matrix. It works by applying a hash function to the features and using their hash values modulo the number of features as indices directly, rather than looking the indices up in an associative array. So what is the primary reason of the hashing trick for building classifiers?
Which of the following true with regards to the K-Means clustering algorithm?
You are using one approach for the classification where to teach the agent not by giving explicit categorizations, but by using some sort of reward system to indicate success, where agents might be rewarded for doing certain actions and punished for doing others. Which kind of this learning
You have collected the 100's of parameters about the 1000's of websites e.g. daily hits, average time on the websites, number of unique visitors, number of returning visitors etc. Now you have find the most important parameters which can best describe a website, so which of the following technique you will use