Basic Statistics

Chapter 1: Data Collection

Chapter 1: Data Collection


Fichier Détails

Cartes-fiches 44
Langue English
Catégorie Mathématiques
Niveau Université
Crée / Actualisé 07.09.2015 / 01.11.2022
Lien de web
https://card2brain.ch/box/basic_statistics
Intégrer
<iframe src="https://card2brain.ch/box/basic_statistics/embed" width="780" height="150" scrolling="no" frameborder="0"></iframe>

Statistics

is science of collecting, organizing, summarizing, and analyzing information to draw conclusions or answer questions.

Data 

numeric values that are gathered through measurements and/or observations.

Data Set

a collection of data values

Data Value (datum)

an individual value  in a data set

Population

Entire group to be studied

Ex: people, animals, foods, 

Individual

person or object that is a member of the population being studied

Sample

is a subset of the population that is being studied

Statistic

numerical summary of a sample

Descriptive statistics

consist of organizing and summarizing data. Described data through numerical summaries, tables, and graphes.

Examples include:  the U.S. Census, Special Interest, Group Studies, Marketing Studies, Consumer

                                             Habits, etc

Inferential Statisitics

uses methods that take a result from a sample, extend it to the population, and measure the reliability of the result. 

Ex: Games of chance,  insurance industry, simulations, consumer taste, 

Parameter

numerical summary of population

Process of Statistics

1. Identify the research objective: A researcher must determine the questions he or she wants answered. The questions must be detailed so that it identifies the population that is to be studied. 

2. Collect the data needed to answer the questions posed in #1

3. Describe the data

4. Perform Inference: Apply appropriate techniques to extend the results obtained from the sample to the population and report a level of reliability of the results

Variables

are the characteristics of the individuals within the population

Qualitative, or categorical, variables

allow for classification of individuals based on some attribute or characteristic

Examples are: gender,  geographic area, religious affiliation, membership in a group, etc

Quantitative variables

provide numerical measures of individuals. The values of a quantitative variable can be added or subtracted and provide meaningful results. 

Examples : age,  weight,  height,  GPA, IQ,  distance, payments,  tax  owed, etc.

Approach

way to look at and organize a problem so that it can be solved 

Discrete Variable 

quantitative variable that has either a finite number of possible values or a countable number of possible values. The term countable means that the values result from counting, 1,2,3. A discrete variable cannot take on every possible value between any teo possible values. 

Examples:  # of Baseball Games won  by the Cardinals ,  # of members of a family,  # of  students in a class, # of  leaves on a tree,  # of  digits in a zip code,  # of  stars in the galaxy,  # of people who wear glasses, etc.

Continuous Variable

quantitative variable that has an infinite number of possible values that are not countable. A continuous variable may take on every possible value between any two values. 

Examples:   a person's temperature, distance between two places,  the speed of an object,  time,  amount of force applied,  etc

Nominal level of measurement

if the values of the variable name, label, or categorize. In addition, the naming sceme does not allow for the values of the variable to be arranged in a ranked or specific order. 

EX: Grouping people by their nationality, gender, their religion, zip code of residence, marital  status, etc.

Ordinal level of measurement

if it has the properties of the nominal level of measurement, however the naming scheme allows for the values of the variable to be arranged in a ranked or specific order 

Ex: Letter Grades  ( A, B, C, D, & F); Rating Scales ( Outstanding, Excellent,  Good, Average,  Poor, Unsatisfactory); Rankings ( General, Colonel, Captain, Sergeant , Corporal, Private)

Interval level of measurement

if it has the properties of the ordinal level of measurement and the differences in the values of the variable have meaning. A value of zero foes not mean the absence of the quanitity. Arthmetic operations such as addition and subtraction can be performed on values of the variable. 

EX: IQ  may be  100 or  110 ( An  IQ of zero does not exist), Temperature may be  90  or  91 degrees ( 0  temperature does not mean  no heat)

Ratio level of measurement

if it has the properties of the interval level of measurement and the ratios of the values of the variable have meaning. A value of zero means the absence of the quantity. Arithmetic operations such as multiplication and division can be performed on the values of the varibale. 

Ex: (Examples include height, weight,  area,  # of phone calls, etc.)

 

Random variable

is one whose values are determined by chance

Independent variable

is one whose value is chosen free of any influence by the value of other variables or any given situation

Dependent variable

is one whose value is determined by the value held by another variable at a given time. 

Grams of carbohydrates in a doughnut

Quantitative

Number of unpopped kernals in a bag of ACT microwave popcorn

Quantitative 

Phone number

Qualitative 

Goals scored in a season by a soccer player

Discrete

Length (in minutes) of a country song

Continuous

Temperature on a randomly selected day in Memphis, TN

Continuous

Temperature on a randomly selected day in Memphis, TN

Continuous

Points scored in an NCAA basketball game

Discrete

How varying the amount of an explanatory variable affect the value of response variable

Explanation

Observational study

Measures the value of the response variable without attempting to influence the value of either the response or explanatory variables. That is, in an observational study, the researcher observes the behavior of the individuals without trying to influence the outcome of the study. 

Designed Experiment

if researcher assigns the individuals in a study to a certain group, intentionally changes the value of an explanatory variable, and then records the value of the response variable for each group

Confounding

occurs in a study when the effects of two or more explanatory variable are not separated. Any relation that may exist between an explanatary variable and the response variable may be due to some othervariable or variables not accounted for in the study.

Ex: Desire for a type of food and how hungry the  subject is at the time of the study

Lurking variable

is an explanatory variable that was not considered in a study, but that affects the value of the response variable in the study. Lurking variables are typically related to explanatory variables considered in the study. 

3 major categories of observational studies:

1) Cross-sectional studies

2) Case-control studies

3) Cohort studies

1) collect information about individuals at a specific point in time or over a very short period of time

2) retrospective they require individuals to look back in time or require the researcher to look at existing records. In case control studies individuals who have a certain characteristics may be matched with those who do not. 

3)  identifies a group of individuals to participate in the study(cohort). Observed over a long period of time. Characteristics about individuals are recorded and come individuals will be exposed to certain factors and others will not. Downside since longer study  people drop out leading to scewed results. 

Census

list of individuals in a population along with certain characteristics of each individual.