Cartes mémoires DOE

Cartes-fiches	60
Langue	English
Catégorie	Informatique
Niveau	Université
Crée / Actualisé	28.10.2022 / 28.10.2022
Lien de web	https://card2brain.ch/box/20221028_doemsc
Intégrer	<iframe src="https://card2brain.ch/box/20221028_doemsc/embed" width="780" height="150" scrolling="no" frameborder="0"></iframe>

8.1.1 Example classical A/B Testing

A/B Testing

Description

We want to check how different versions of our newsletter (which is send by e-mail) is changing the open rate. We have two versions of the newsletter.

We assign the newsletter recipients randomly into two groups. Each group receives a different version of the newsletter.

In the end we check which newsletter was opened more, thus this version wins and will be further used.

Advantages

Planning and execution effort is low/simple.
Easy to analyze the results of the test

Disadvantages

Depending on the prerequisites (number of variants, evaluation procedures, etc.) many observations are needed according to power analysis -> potential profit gets lost

8.1.1 A/B Testing with Bandit algorithm

Dynamic-algorithmic procedure

Description

We want to check how different versions of our newsletter (which is send by e-mail) is changing the open rate. We have two versions of the newsletter.

The experiment takes place over 4 days. We send out the newsletter every 6 hours. For the first batch, an equal count of recipients is assigned.

Depending on the success (the open rate), we change the allocation of recipients to the two different versions. The version which has the higher open rate receives more recipients. After 4 days we check which version has received more recipients. The version with the higher amount of recipients wins (= higher open rate)

Advantages

Realtime data shows (potentially) quickly an outcome of the test
The test can be adjusted

Disadvantages

The dynamic process produces ‘only’ one best result.

- No further models can be estimated

- No dependence of the conversion rate on other factors can be investigated

8.2 Error sources and pitfalls

Research design

A/B Testing cannot be used in all research questions, Ex. For a complete redesign of a website! (Quality aspects, survey, UX testing)

Sampling

Sampling bias / population is unknown, Example: Conducting an experiment on a webpage layout that includes only heavy users.

Sample size

Large samples Hypothesis test is significant, but information on effect sizes is missing.

Metrics

The measurement uses an unsuitable metric.

8.3 Manipulation / Dark patterns

Dark patterns are patterns of persuasion and influence, examples:

Misdirection: The design purposefully focuses your attention on one thing to distract your attention from something else.
Price Comparison Prevention: The retailer makes it hard for you to compare the price of an item with another item, so you cannot make an informed decision.
Privacy Zuckering: You are tricked into publicly sharing more information about yourself than you really intended to. Named after Facebook CEO Mark Zuckerberg

9 Factorial Designs

How to arrange Factors and to get information out of them. One or more independent variables (= factors) with several levels are investigated simultaneously.

Factors = independent variables

The research team has several factors with several levels in which they have an interest, and levels of all factors are varied simultaneously. As a result, all possible combination of factor levels can be studied. The factor levels are determined before the experiment.

-> Interaction can be tested!

9.1 Full factorial designs - Single factorial

---

9.1 Full factorial designs - Multifactorial

---

9.2 Full factorial design

---

9.2 Fractional factorial design

---

9.3 Latin square

The more factors and the more characteristics, the larger the number of groups. Provided that there are NO interactions, the experiment can be conducted with a reduced number of groups By using Latin squares or related design, the number of groups required can be reduced compared to a full factorial design

= Relationship between a complete design and a Latin square (standard form)

Level ^ factors -1

3^4 = 84 -> 3^3 = 27

a. Each level of C is combined only once with each level of A and each level of B

-> Latin square is completely balanced in terms of the main effects

b. Each level of C occurs partially with other combinations of A and B

-> Latin square is only partially balance with respect to first-order interaction

10 Large data quantities

Volume (Datenmenge)

Describes the extreme amount of data. Increasing amounts of data must be stored and processed.

Important aspects for DoE

- Relation of the number of variables vs. sample size

- Relation of sample / administrative dataset vs. population

Velocity

(Geschwindigkeit)

Describes the increased speed with which the data must be produced, but also processed at the same time

Variety

(Vielfalt)

Describes the variety of different file structures: structured, semi-structured and unstructured.

Veracity

(Wahrhaftigkeit)

The uncertainty (veracity) of the data and data quality. In some cases, data from various sources does not arrive in the desired quality and can therefore not be used as intended or must be preprocessed at great expense

10.1 Made and found data

---

10.1 Made and found data (compared)

Made data

Are taken as part of a study, answers research question, data is generated by researcher

Found data

Collected for various reasons, serves documentary and administrative purposes, data is obtained administratively and technically

Grey area

Lies between: data from full surveys (=census), social media, representative surveys

10.2 Bias

Bias is the deviation (= Abweichung) between the population mean µ0 and the sample mean x. There are three elements determining the bias:

10.2.1Statistical paradise and paradox

Calculating the deviation between the sample mean and the mean in the population produces:

statistical paradises: Have large data quantities → Big data (absolute)
paradoxes in big data: It is not about the absolute size, but the relation!
- Often not the paradise we expected!

If the size n of the administrative dataset goes towards N, the bias becomes arbitrarily small. The absolute size n is meaningless without specifying N! Example: n = 2’000’000 “is big”, but the proportion is still small with f = n/N = 2’000’000 / 200’000’000 = 1%

1.1 Observational studies

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

Researchers observe the effect of a risk factor (diagnostic test, treatment, or other intervention) without trying to change it. That means they just record what they see. There are two types of observational studies:

a) Cohort Study: For research purposes, a cohort is any group of people who are linked in some way. For instance, a birth cohort includes all people born within a given time frame. Researchers compare what happens to members of the cohort that have been exposed to a particular variable to what happens to the other members who have not been exposed.

b) Case control study: Here researchers identify people with an existing health problem (“cases”) and a similar group without the problem (“controls”) and then compare them with respect to an exposure(s).

-> Problem: They run the risk of containing confounding biases!

1.1.1Confounder

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

Confounding (“Verwirrung”) means that a factor (confounder) that is not directly investigated is associated with both the independent variable and the dependent variable and accordingly causes the relationship between the two variables (Spurious Correlation = “falsche Korrelation”)

Example: The fewer storks nest in an area, the lower human birth rate in the area

direct relationship is most unlikely
explainable trough urbanization (confounder)
Urbanization leads to a decline in the stork population because more land is being developed
Urbanization also leads to a lower birth rate because women tend to be more educated

Solution: Randomization

1.2 Experimental studies

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

These studies are the solution to the disadvantages of observational studies. Experimental studies are ones where researchers introduce an intervention and study the effects. The Randomized double blind placebo control study/trial (= RCT) is considered the “gold standard” for producing reliable evidence because little is left to chance.

Aim: Treatment/exposures is planned, and its effects are investigated

1.2.1Randomization

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

Eligible people are randomly assigned to one of two or more groups. This ensures the comparability of the groups at study begin. Aim of randomization:

Elimination of selection bias: is an experimental error that occurs when the participant pool is not representative of the target population. (parents with higher incomes >> more informed >> get their child vaccinated >> proportion would be larger in the treatment group)
Elimination of confounding: Confounding means that a factor (=confounder), that it not directly investigated, relates to both the independent and dependent variable. This confounder then causes the relationship between the two variables (= Spurious Correlation) and not the treatment itself.
Ensuring the comparability of the groups at study initiation (baseline data)

1.2.2 Blinding

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

Blinding is a suitable technique to avoid distortion (“Verzerrung”) due to knowledge. Blinding eliminates conscious and unconscious influences on the treatment results. Also it ensures the comparability of the groups during / in the course of the study trough the similar treatment of the two groups and the independent evaluation of the effect of the exposure. There are several stages of blinding (cumulative):

Open: no blinding
Single-blind: Tested person has no knowledge of group membership.
Double-blind: The person carrying out the experiment has no knowledge of the group membership.
Triple-blind: Person doing the analysis has no knowledge about the group membership.

In Short:

Effect and aim of blindness.

Elimination of conscious and unconscious influences of the treatment result (Placebo)
Ensuring comparability regarding treatment of groups during the study
Ensuring comparability on the final evaluation of both groups

2. Principles

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

2.1 Observational study - Descriptive study

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

Descriptive study

Descriptive character, suitable for forming hypotheses but insufficient for testing them.

Cross sectional study

Sample with variables of interest are collected and described at a specific time “Quality of Life of master’s students.”

-> Survey is typical

Correlational study

Same as cross sectional, but variables are correlated:

“Relationship between the time that master’s students spend studying on their own and their general quality of life”

2.1 Observational study - Analytical study

I

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

dentification and quantification of effects of relationship, not fully suitable for hypothesis testing.

Case control study

Examines how cases and controls differ from having been exposed previously Looking back at the exposition (retrospective)

Example with Pädos (Vergangenheit wird angeschaut, wieso wurden sie so?)

Cohort study

Cohort = group of people with comparable initial conditions

One group is exposed to an influencing factor afterwards the groups are compared

2.2 Experimental study

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

Randomized controlled (RCT)

Treatment and control are determined by a random process (randomization) before the intervention.

uitable for hypothesis testing.

Non-randomized controlled

Treatment and control are not determined by a random process (randomization) before the intervention. For example, control groups involve data obtained from previous studies («historical control») or from randomly allocated groups.

2.3 Empirical research

Commandes clavier:

= tourner,

= avant/arrière,

= faire défiler

3 Introduction to the design of experiments

An experiment is characterized by two conditions:

The experimenter systematically varies at least one variable and registers the effect from this active change.
At the same time, the experimenter disables the effect of other variables by using one of the techniques for controlling nuisance variables

Input

Trial objects, tests objects, test persons, materials

Process

Process in which controllable and non-controllable factors influence the input

Controllable factors

Influencing factors whose strength can be adjusted within defined limits.

Independent variable (IV)

Non-controllable factors

Influencing factors whose strength cannot be determined,

but measured:
- body weight, stress level, etc.
or not measured:
- random fluctuation in output, random fluctuation in measurement instruments

Nuisance variables (= Störvariablen)

Output

Input changed by the process, result of the test/experiment.

Dependent variable (DV)

3.1 Variance - Primary variance

Impact of factors on the variation of the output (focus of the study)

Ex. Body weight changes systematically when height changes:

3.1 Variance - Secondary variance

Variation of the output, caused by nuisance factors (not in the focus of the study, for example gender)

Ex. Boys and girls differ in the level of body weight, that’s why we can divide the sample into two groups

3.1 Variance - Error variance

Variation caused by measurement errors or random processes.

3.1 Variance ->

Goals:

The Variance of the dependent variable (primary variance) should be attributed to the systematic variation of the independent variable (IV). The secondary variance should be controlled, and the error variance minimized.

DOE_MSc

Créer ou copier des fichiers d'apprentissage

Créer ou copier des fichiers d'apprentissage

Connecte-toi pour voir toutes les cartes.

SWITCHaai

Office 365

Edulog

Apple ID

Google