Which trends are driving the data science "revolution"?
Mainly Big Data and Machine Learning,
Give a definition of Data science:
1.
Data science is about the extraction of useful
information and knowledge from large
volumes of data, on order to improve
business decision-making.
2.
Data science is an interdisciplinary subject with 3 key areas:
- Statistics
- Computer Science
- Domain expertise
Why is Data Science important?
In the past, data analysis was typically slow: Needed teams of statisticians, analysts etc. to explore data manually.
Today colume, velocity and variety make manual analysis impossible but fast computers and good algorithms allow much deeper analyses than before.
--> data-driven decision making
--> base decisions on alysis of data, not intuition
Draw the Data Science performing process:
Name the approximately year of invention of Machine Learning, Deep learning and Artificial Intelligence:
Name the 3 main branches of ML and some of its applications:
Explain supervised learning:
In supervised learning the training data consicts of input / output pairs and we train a function to map the inputs to the outputs. The predicted variable consists is therby either a continuous variable like Price / Cost / Weight (Regression Problems) or categorical variable like A, B or C / Dogs or Cats.
Explain unsupervised learning:
In unsupervised learning there are no labels available, insights are gained without prior knowledge.
For Anomaly / Outlier detection is the task, finding samples in a dataset tat raise suspicion.
The problem therby is, that you usally do not know, what you are looking for.
The solution is to use statistics and characteristics of the dataset to find outliers.