Karten 118 Karten
Lernende 1 Lernende
Sprache English
Stufe Universität
Erstellt / Aktualisiert 15.06.2020 / 30.06.2020
Lizenzierung Keine Angabe
0 Exakte Antworten 118 Text Antworten 0 Multiple Choice Antworten
Fenster schliessen

Which trends are driving the data science "revolution"?

Mainly Big Data and Machine Learning, 

Fenster schliessen

Give a definition of Data science:

Data science is about the extraction of useful
information and knowledge from large
volumes of data, on order to improve
business decision-making.

Data science is an interdisciplinary subject with 3 key areas:
- Statistics
- Computer Science
- Domain expertise

Fenster schliessen

Why is Data Science important?

In the past, data analysis was typically slow: Needed teams of statisticians, analysts etc. to explore data manually.

Today colume, velocity and variety make manual analysis impossible but fast computers and good algorithms allow much deeper analyses than before.

--> data-driven decision making
--> base decisions on alysis of data, not intuition

Fenster schliessen

Draw the Data Science performing process:

Lizenzierung: Keine Angabe

- Iterative process
- Non-sequential
- Early termination
- Established processes, e.g. CRISP-DM

Fenster schliessen

Name the approximately year of invention of Machine Learning, Deep learning and Artificial Intelligence:

  • AI 1950's
    Creation of first "intelligent" algorithms and programs
  • ML 1980's
    Statistical models and algorithms that can learn from data
  • DL 2010's
    Statistical models and algorithms inspired by neurones that can learn from data
Fenster schliessen

Name the 3 main branches of ML and some of its applications:

  • Supervised Learning
    • Classification
      • Diagnostics
      • Customer Retention (Kundenbindung)
      • Image Classification
    • Regression
      • Estimating life expextancy
      • Population Growht Prediction
      • Market Forecasting
  • Unsupervised Learning
    • Clustering
      • Recommender System
      • Customer Segmentation
      • Targetted marketing
    • Dimensionality Reduction
      • Big data Visualisation
      • Structure Discovery
  • Reinforcement Learning
    • Game AI
    • Robot Navigation
    • Real-time decisions
Fenster schliessen

Explain supervised learning:

In supervised learning the training data consicts of input / output pairs and we train a function to map the inputs to the outputs. The predicted variable consists is therby either a continuous variable like Price / Cost / Weight (Regression Problems) or categorical variable like A, B or C / Dogs or Cats.

Fenster schliessen

Explain unsupervised learning:

In unsupervised learning there are no labels available, insights are gained without prior knowledge.

For Anomaly / Outlier detection is the task, finding samples in a dataset tat raise suspicion.
The problem therby is, that you usally do not know, what you are looking for.
The solution is to use statistics and characteristics of the dataset to find outliers.