DUD Chapter 1: Data and Statistical Thinking

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/52

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

53 Terms

1
New cards

statistics

the science of data; involves collecting, classifying, summarizing, organizing, analyzing and interpreting numerical info

2
New cards

descriptive statistics

utilizes numerical and graphical methods to explore data, summarize info, and present it in a convenient form

3
New cards

inferential statistics

utilizes sample data to make estimates, decisions, predictions, or other generalizations about a larger set of data

4
New cards

experimental (or observational) unit

object upon which we collect data

5
New cards

population

set of units we are interested in studying

6
New cards

variable

property of an individual experimental unit

7
New cards

sample

subset of the units of a population

8
New cards

statistical inference

estimate, prediction, or generalization about a population based on info contained in a sample

9
New cards

parameter

a numerical value that describes a characteristic of an entire population (ex. the true average height of all adult women in Canada)

10
New cards

statistic

a numerical value that describes a characteristic of a sample (a subset of the population) - ex. the average height of a random sample of 1,000 adult women from Canada; this statistic can estimate the population parameter in situations where measuring the parameter is impractical

11
New cards

sampling error and applications

a degree of uncertainty because statistics are only based on samples;

12
New cards
13
New cards

designing studies by choosing the right sample size and ensuring the sample represents the population

14
New cards
15
New cards

interpreting results by avoiding overgeneralization and communicating findings through understanding limitations

16
New cards

measure of reliability

a statement about the degree of uncertainty associated with a statistical inference

17
New cards

data generating process (DGP)

the underlying mechanism or process that produces the data we observe; economists can identify patterns, test theories, and propose policy interventions by studying the DGP

18
New cards

why understanding the DGP matters

we may be interested in describing the outcome of a process (distribution of wages), but understanding the DGP helps economists and policymakers make informed decisions by revealing the relationships between variables - this can lead to more effective policies: if the DGP shows that education significantly incs wages, this could justify prioritizing policies that enhance educational opps

19
New cards

quantitative data

measurements that are recorded on a naturally occurring numerical scale

20
New cards

qualitative data

measurements that cannot be measured on a natural numerical scale - can only be classified into one of a group of categories

21
New cards

obtaining data (ways)

published source, designed experiment, observation study (like a survey)

22
New cards

observational study

a data-collection method where the experimental units sampled are observed in their natural setting. no attempt is made to control the characteristics of the experimental units samples (ex. opinion polls, surveys, etc.)

23
New cards

designed experiment

a data-collection method where the researcher exerts full control over the characteristics of the experimental units sampled. these experiments typically involve a group of experimental units that are assigned the treatment and an untreated (control) group

24
New cards

randomized control trials (RCTs)

A research design in which the investigator assigns participants randomly to two or more groups who then receive systematic treatment (or, in some cases, no treatment), and the outcomes for each group are compared; help remedy the lack of data in poor countries and cheaper to conduct than in rich countries

25
New cards

importance of selection (a sample)

how a sample is selected from a population is of vital importance in statistical inference because the probability of an observed sample will be used to infer the characteristics of the population

26
New cards

representative sample

exhibits characteristics typical of those possessed by the population of interest

27
New cards

simple random sample

a sample selected from the population in such a way that every different sample of size n has an equal chance of selection

28
New cards

random number generator

an algorithm that generates a sequence of numbers that seem to occur in random order

29
New cards

stratified random sampling (purpose)

ensures that specific subgroups (strata) within a population are adequately represented

30
New cards

the process of stratified random sampling

  1. divide the pop into strata based on important characteristics

31
New cards
  1. randomly sample from each stratum. the number of units sampled from each stratum can be proportional to the stratum's size relative to the population or the same across strata

32
New cards
  1. combine the samples from all strata to form the final sample

33
New cards

cluster sampling (purpose)

makes sampling more practical and cost-effective for large, naturally divided populations

34
New cards

cluster sampling (process)

  1. divide the population into naturally occurring groups (clusters) (ex. schools, neighborhoods)

35
New cards
  1. randomly select a sample of clusters

36
New cards
  1. collect data from all units within the selected clusters

37
New cards

ex. studying student performance by randomly selecting a few schools and surveying all students within those schools

38
New cards

systematic sampling

a method where you select every kth item from a list of all items in the population

39
New cards

systematic sampling (process)

  1. list the population

40
New cards
  1. choose a random starting point

41
New cards
  1. select every kth item until the end of the list

42
New cards

ex. select every 10th student from a list of 1,000 students

43
New cards

*problem: if the list has a periodic pattern, this method might introduce bias

44
New cards

randomized response sampling

a technique to obtain truthful responses to sensitive questions by reducing the likelihood of false answers

45
New cards

randomized response sampling (process)

  1. participants answer one of two questions based on a randomizing device (ex. coin flip)

46
New cards
  1. the pollster doesn't know which question was answered, ensuring anonymity

47
New cards
  1. only the overall proportion of "yes"/"no" answers in the sample will be known

48
New cards

ex. participants flip a coin to decide whether to answer a sensitive or a neutral question

49
New cards

selection bias

occurs when a subset of the experimental units in the population is excluded, giving those units no chance of being selected for the sample; the sample may not accurately represent the population, leading to biased results

50
New cards

ex. conducting a phone survey that only includes landlines may exclude a significant portion of the population that only uses mobile phones

51
New cards

nonresponse bias

occurs when the researchers conducting a survey or study are unable to obtain data from all the experimental units selected for the sample; if non-respondents differ significantly from respondents, the results may not reflect the true characteristics of the population

52
New cards

measurement error

refers to inaccuracies in the values of the data recorded, often due to faulty data collection methods or instruments; can lead to incorrect conclusions about the population, reducing the reliability of the results

53
New cards

ex. a survey question worded ambiguously causes respondents to interpret it differently