Which method is used to solve for the coefficients b0, b1, ..., bn in your linear regression model?
A. Apriori Algorithm
B. Ridge and Lasso
C. Ordinary Least Squares
D. Integer programming
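As background for this question, here is a minimal sketch of how ordinary least squares estimates b0 and b1 for a single predictor, using the closed-form solution. The function name fit_ols and the sample data are made up for illustration; a model with more predictors would normally be solved with a linear algebra library instead.

```python
# Closed-form ordinary least squares for one predictor: y ~ b0 + b1 * x.
# fit_ols and the data below are hypothetical, for illustration only.

def fit_ols(xs, ys):
    """Return (b0, b1) minimizing the sum of squared residuals."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance(x, y) / variance(x); the 1/n factors cancel.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    b1 = sxy / sxx
    # The fitted line passes through the point of means.
    b0 = mean_y - b1 * mean_x
    return b0, b1

xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]  # generated from y = 1 + 2x with no noise
b0, b1 = fit_ols(xs, ys)
print(b0, b1)
```

Because the sample points lie exactly on a line, the fitted coefficients recover that line's intercept and slope.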
Which technique would you use to solve the problem statement below? "What is the probability that an individual customer will not repay the loan amount?"
A. Classification
B. Clustering
C. Linear Regression
D. Logistic Regression
E. Hypothesis testing
Which of the following problems can you solve using the binomial distribution?
A. A manufacturer of metal pistons finds that, on average, 12% of his pistons are rejected because they are either oversize or undersize. What is the probability that a batch of 10 pistons will contain no more than 2 rejects?
B. A life insurance salesman sells, on average, 3 life insurance policies per week. Use Poisson's law to calculate the probability that in a given week he will sell some policies.
C. Vehicles pass through a junction on a busy road at an average rate of 300 per hour. Find the probability that none passes in a given minute.
D. It was found that the mean length of 100 parts produced by a lathe was 20.05 mm with a standard deviation of 0.02 mm. Find the probability that a part selected at random would have a length between 20.03 mm and 20.08 mm.
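As an illustration of the binomial calculation, the piston scenario (10 independent trials, each with a 12% rejection probability) can be sketched as follows. The helper name binomial_cdf is made up for this example, and treating the pistons as independent is an assumption.

```python
from math import comb

def binomial_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p): sum the pmf from 0 to k."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))

# Probability that a batch of 10 pistons contains no more than 2 rejects,
# assuming each piston is rejected independently with probability 0.12.
p_at_most_2 = binomial_cdf(2, 10, 0.12)
print(round(p_at_most_2, 4))
```

The result is the sum of the probabilities of exactly 0, 1, and 2 rejects.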
Under which circumstance do you need to implement N-fold cross-validation after creating a regression model?
A. The data is unformatted.
B. There is not enough data to create a test set.
C. There are missing values in the data.
D. There are categorical variables in the model.
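For context, N-fold cross-validation splits the data into N folds and holds each fold out once for evaluation while training on the rest. A minimal sketch of the index splitting follows; the function name k_fold_indices is made up for illustration, and it does not shuffle the data, which a real workflow usually would.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs; each sample is held out exactly once."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size

folds = list(k_fold_indices(10, 3))
for train_idx, test_idx in folds:
    # A model would be fitted on train_idx and scored on test_idx here.
    print(len(train_idx), len(test_idx))
```

Averaging the per-fold scores gives an error estimate that uses every observation for both training and testing, though never in the same fold.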
Marie is getting married tomorrow, at an outdoor ceremony in the desert. In recent years, it has rained only 5 days each year. Unfortunately, the weatherman has predicted rain for tomorrow. When it actually rains, the weatherman correctly forecasts rain 90% of the time. When it doesn't rain, he incorrectly forecasts rain 10% of the time. Which of the following will you use to calculate the probability that it will rain on the day of Marie's wedding?
A. Naive Bayes
B. Logistic Regression
C. Random Decision Forests
D. All of the above
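The numbers in this scenario can be combined with Bayes' theorem. The sketch below assumes a 365-day year, so the prior probability of rain is 5/365; the variable names are chosen for illustration.

```python
# Prior: it rains on 5 days out of an assumed 365-day year.
p_rain = 5 / 365

# Forecast reliability as given in the question.
p_forecast_given_rain = 0.90  # rain is forecast when it actually rains
p_forecast_given_dry = 0.10   # rain is forecast when it stays dry

# Total probability of a rain forecast (law of total probability).
p_forecast = (p_forecast_given_rain * p_rain
              + p_forecast_given_dry * (1 - p_rain))

# Bayes' theorem: P(rain | forecast) = P(forecast | rain) * P(rain) / P(forecast).
p_rain_given_forecast = p_forecast_given_rain * p_rain / p_forecast
print(round(p_rain_given_forecast, 3))
```

Under these assumptions the posterior works out to 4.5 / 40.5, or about 11%, because rain is so rare that false alarms outnumber correct rain forecasts.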
If E1 and E2 are two events, how do you represent the conditional probability that E2 occurs given that E1 has occurred?
A. P(E1)/P(E2)
B. P(E1+E2)/P(E1)
C. P(E2)/P(E1)
D. P(E2)/P(E1+E2)
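For reference, the standard definition of conditional probability divides the probability that both events occur by the probability of the conditioning event. The sketch below checks this on a fair six-sided die; the events chosen (E1 = even roll, E2 = roll greater than 3) are made up for illustration.

```python
from fractions import Fraction

outcomes = {1, 2, 3, 4, 5, 6}                # fair six-sided die
e1 = {o for o in outcomes if o % 2 == 0}     # E1: the roll is even
e2 = {o for o in outcomes if o > 3}          # E2: the roll is greater than 3

def prob(event):
    """Probability of an event when all outcomes are equally likely."""
    return Fraction(len(event), len(outcomes))

# P(E2 | E1) = P(E1 and E2) / P(E1)
p_e2_given_e1 = prob(e1 & e2) / prob(e1)
print(p_e2_given_e1)
```

Of the three even outcomes, two (4 and 6) are greater than 3, which matches the computed ratio.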
Select the unsupervised learning algorithm:
A. K-Nearest Neighbors
B. K-Means
C. Support Vector Machines
D. Naive Bayes
RMSE measures the error of a predicted:
A. Numerical value
B. Categorical value
C. Both numerical and categorical values
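As a reminder of what RMSE computes, it is the square root of the mean of the squared differences between actual and predicted values. A minimal sketch follows; the function name rmse and the sample values are made up for illustration.

```python
from math import sqrt

def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences of numbers."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return sqrt(sum(squared_errors) / len(squared_errors))

# Hypothetical actual vs. predicted values; the errors are 1, 0, and -2.
error = rmse([3.0, 5.0, 7.0], [2.0, 5.0, 9.0])
print(error)
```

The calculation subtracts and squares values, so it presupposes that the predictions are numbers on a meaningful scale.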
What type of output is generated by linear regression?
A. Continuous variable
B. Discrete Variable
C. Either a continuous or a discrete variable
D. Values between 0 and 1
A data scientist is asked to implement an article recommendation feature for an online magazine.
The magazine does not want to use client tracking technologies such as cookies or reading history. Therefore, only the style and subject matter of the current article are available for making recommendations. All of the magazine's articles are stored in a database in a format suitable for analytics.
Which method should the data scientist try first?
A. K Means Clustering
B. Naive Bayesian
C. Logistic Regression
D. Association Rules