Klausur


Kartei Details

Karten 64
Sprache English
Kategorie Finanzen
Stufe Universität
Erstellt / Aktualisiert 08.02.2025 / 08.02.2025
Weblink
https://card2brain.ch/box/20250208_data_analytics_wissen
Einbinden
<iframe src="https://card2brain.ch/box/20250208_data_analytics_wissen/embed" width="780" height="150" scrolling="no" frameborder="0"></iframe>

What is the 10-fold cross validation?

It involves comparing the mean performances of the tuning parameters on test data blocks.

What do you do to find the optimal tuning parameter?

Try different tuning parameters, evaluate their performance, and compare. Performance is commonly checked using 10-fold cross-validation.

When should you use Ridge and when Lasso?

Ridge: If most variables in the dataset are useful. Lasso: If most variables in the dataset are useless. If uncertain: Use Elastic Net Regression.

Why are some coefficients in LASSO equal to zero and in RIDGE not?

Because the constraint of the RIDGE forms a circle, so the ellipse never exactly touches zero.

What is the ELASTIC NET?

A combination of RIDGE and LASSO.

What is the LASSO?

We minimize the sum of squared residuals + a shrinkage penalty with an absolute value term (power of one) to find ßs.

What is the RIDGE?

We minimize the sum of squared residuals + a shrinkage penalty with a squared term (power of two) to find ßs.

What is the OLS?

We minimize the sum of squared residuals to find ßs.

What is the trade-off the ridge has to deal with?

Ridge has a larger bias than the OLS, but a lower variance. This reflects the Bias-Variance Trade-Off.

What are Lasso, Ridge and Elastic Net regression good for?

They can be used for prediction or classification when we have large data sets.

What is the Type 2 Error?

False Negative

What is the Type 1 Error?

False Positive

For what do you need logistic regression?

When the target variable y is categorical (e.g., color). We only deal with binary outcomes (yes (1), no (0)).

What is the difference between a simple and multiple regression model regarding the RMSE?

The RMSE of the training data is lower in the multiple regression compared to the simple one.

What is a log-log regression and how do you interpret it?

log y = ß log x + E. If x increases by 1 percent, then y changes by ß percent.

What is a level-log regression and how do you interpret it?

y = ß log x + E. If x increases by 100 percent, then y changes by ß units.

What is a log-level regression and how do you interpret it?

log y = ßx + E. If x increases by one unit, then y changes by ß * 100 percent.

What is a level-level regression and how do you interpret it?

y = ßx + E. If x increases by one unit, then y changes by ß units.

How do you measure the accuracy of a prediction?

Compare the root mean squared error and mean absolute error of training and validation data.

What is the Mean Error?

Gives an idea of systematic over- or underprediction.

What is the Mean Absolute Error?

Gives an idea of the magnitude of errors.

How do you interpret a lift chart?

Taking the 10% of observations most likely classified as 1s by the model yields almost eight times as many 1s as a random selection of 10% of cases.

What are the six procedure steps of a Lift Chart?

1. Usually in deciles. 2. First: Observations are ordered along predicted probabilities. 3. Calculate the proportion of 1s in each decile. 4. Divide by the average proportion of 1s in the data set. 5. This ratio gives the lift value.

How do you measure the ROC curve?

Higher area under the curve: - 0.5 indicates no better than random assignment - 1.0 indicates perfect separation of classes

What is the ROC curve?

Receiver Operating Characteristic. It illustrates sensitivity and specificity when the cutoff value decreases from 1 to 0. Better performance is shown by an ROC curve closer to the upper left corner.

What is the Specificity (true negative rate)?

True Negative / (True Negative + False Positive)

What is Sensitivity (true positive rate)?

True Positive / (True Positive + False Negative)

What is the Accuracy?

1 - Error Rate

How do you decide who gets into which class of interest?

For example: Class 1 (acceptance of credit) vs. Class 0 (rejection of credit). Calculate the probability of belonging to Class 1. If it is lower than 0.5 (threshold), classify as Class 0; otherwise, classify as Class 1.

How does the separation of observations affect the error?

High separation: Predictor variables lead to a low error. Low separation: Predictor variables do not significantly improve the naive rule.

What is the Naive rule?

Classify all observations as belonging to the most frequent class (benchmark).

What is an Error Rate?

Proportion of misclassified observations out of all observations of the datasets in the validation data.

What is an Error?

Classification of an observation as belonging to one class, although it belongs to another.

What does a Bar Chart show?

In the simplest version, a bar chart shows only the frequency in each category.

What is the histogram useful for?

Visualization of the distribution of a continuous variable.

What are grouped boxplots?

Grouped boxplots allow comparison between categories of a potential predictor.

What is the boxplot useful for?

The boxplot is very useful to get an overview of the overall distribution of a continuous variable.

What is a Scatter Plot Matrix?

The Scatter Plot Matrix offers a combination of bivariate scatter plots and distribution plots.

What shows the visualization the scatterplot?

Displays the relationship between two numerical variables.

What are the major visualizations for classification?

Study the relationship of the outcome variable to categorical predictors using bar charts with the outcome variable on the y-axis. Study the relationship of the outcome variable to pairs of numerical predictors via color-coded scatter plots. Study the relationship between the outcome variable and numerical predictors via side-by-side boxplots.