FILL BLANK
Refer to the REG procedure output:
How many observations are used in the analysis? Enter your numeric answer in the space below.
FILL BLANK
Refer to the confusion matrix:
An analyst determines that loan defaults occur at the rate of 3% in the overall population. The above confusion matrix is from an oversampled test set (1 = default).
What is the sensitivity adjusted for the population event probability?
Enter your answer in the space below. Round to three decimals (example: n.nnn).
Refer to the ROC curve:
As you move along the curve, what changes?
A. The priors in the population
B. The true negative rate in the population
C. The proportion of events in the training data
D. The probability cutoff for scoring
Refer to the lift chart:
What does the reference line at lift = 1 corresponds to?
A. The predicted lift for the best 50% of validation data cases
B. The predicted lift if the entire population is scored as event cases
C. The predicted lift if none of the population are scored as event cases
D. The predicted lift if 50% of the population are randomly scored as event cases
An analyst knows that the categorical predictor, storeId, is an important predictor of the target.
However, store_Id has too many levels to be a feasible predictor in the model. The analyst wants to combine stores and treat them as members of the same class level.
What are the two most effective ways to address the problem? (Choose two.)
A. Eliminate store_id as a predictor in the model because it has too many levels to be feasible.
B. Cluster by using Greenacre's method to combine stores that are similar.
C. Use subject matter expertise to combine stores that are similar.
D. Randomly combine the stores into five groups to keep the stochastic variation among the observations intact.
Refer to the following exhibit:
What is a correct interpretation of this graph?
A. The association between the continuous predictor and the binary response is quadratic.
B. The association between the continuous predictor and the log-odds is quadratic.
C. The association between the continuous predictor and the continuous response is quadratic.
D. The association between the binary predictor and the log-odds is quadratic.
A marketing manager attempts to determine those customers most likely to purchase additional products as the result of a nation-wide marketing campaign.
The manager possesses a historical dataset (CAMPAIGN) of a similar campaign from last year.
It has the following characteristics:
1.
Target variable Respond (0, 1)
2.
Continuous predictor Income
3.
Categorical predictor Homeowner(Y, N) Which SAS program performs this analysis?
A. Option A
B. Option B
C. Option C
D. Option D
A non-contributing predictor variable (Pr > |t| = 0.658) is removed from an existing multiple linear regression model. What will be the result?
A. An increase in R-Square
B. A decrease in R-Square
C. A decrease in Mean Square Error
D. No change in R-Square
When working with smaller data sets (N<200), which method is preferred to perform honest assessment?
A. Training: 40% Validation: 30% Testing: 30%
B. K-fold cross validation
C. Cross validation using 4th quartile observations
D. Use the AIC goodness of fit statistic
Refer to the confusion matrix:
Calculate the accuracy and error rate (0 - negative outcome, 1 - positive outcome)
A. Accuracy = 58/102, Error Rate = 23/48
B. Accuracy = 83/102, Error Rate = 67/102
C. Accuracy = 25/150, Error Rate = 44/150
D. Accuracy = 83/150, Error Rate = 67/150