Chapter 6 - Statistical Inference & Generalizations

studied byStudied by 1 person
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 30

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

31 Terms

1

Statistical inference (Inductive)

The process of using specific examples or observations to make general conclusions.

  • Basically a way of using information from a smaller group (called a sample) to make a guess or conclusion about a larger group (called a population).

  • Imagine you’re looking at the test scores of 10 students in a class. You could use those 10 scores to make conclusions about how the whole class might perform (that's statistical inference).

New cards
2

Statistical Generalization (Inductive)

Sample - population.

Features found in a sample (or individuals) as a premise and draws a conclusion about the features of a population.

Making a guess about a big group of things based on what you see in a smaller group.

They are a sample of the bigger population.

  • An inference that moves outward from facts about a sample to draw a conclusion about the group at large.

<p>Sample - population.</p><p><span>Features found in a sample (or individuals) as a premise and draws a conclusion about the features of a population.</span></p><p>Making a guess about a big group of things based on what you see in a smaller group.</p><p>They are a sample of the bigger population.</p><ul><li><p>An inference that moves outward from facts about a sample to draw a conclusion about the group at large.</p></li></ul><p></p>
New cards
3

Statistical Instantiation (Inductive)

Population - sample.

Uses features about a population as a premise and draws a conclusion about a sample (or individual) within that population.

Moves inward from a fact about the larger group to draw a conclusion about a sample.

  • If you know something about a big group, you can use that information to make an educated guess about a smaller part of that group.

<p>Population - sample. </p><p><span>Uses features about a population as a premise and draws a conclusion about a sample (or individual) within that population.</span></p><p>Moves inward from a fact about the larger group to draw a conclusion about a sample.</p><ul><li><p>If you know something about a big group, you can use that information to make an educated guess about a smaller part of that group.</p></li></ul><p></p>
New cards
4

Strength Test (2)

  1. The probability that the observation would be made if the hypothesis were true

  2. The probability that the observation would be made if the hypothesis were false.

New cards
5

Sampling Bias (Selection Effect)

Occurs when the individuals or units selected for a sample are not representative of the larger population you're trying to study. This leads to inaccurate or skewed results because the sample doesn't reflect the true diversity or characteristics of the entire population.

  • A selection effect in a sample created by the way in which we are sampling the population. 

  • Gathering evidence that is relevant to inductive inference.

  • Want samples that are evidence for the beliefs we are testing.

New cards
6

The Law of Large Numbers

The larger our sample, the more likely it is that its proportions closely resemble those of the population as a whole.

  • This is why we want a specifically large sample size. As we increase our sample size our confidence in our evidence increases.

New cards
7

Two Major Problems to Avoid with Statistical Generalization

  1. Selection effects

  2. Small sample size

These two issues share the same issues related to making it too likely that we would have made this observation even if our hypothesis was false.

New cards
8

Margin of Error

How much we can expect the results of a survey or poll to vary from the true result. Reflects the uncertainty or potential error in the results due to sampling.

  • For example, if you ask 100 people about their favorite ice cream flavor and 60% say chocolate, your margin of error tells you how much the real percentage of chocolate lovers could vary if you asked everyone.

  • Accounts for the inherent uncertainty in any survey or poll due to the fact that only a sample is being surveyed, not the entire population

New cards
9

Stratifying Random Sampling

A method where you divide the population into smaller groups based on important characteristics (like age or gender). Then, you randomly select people from each group in proportion to how big those groups are in the overall population. This helps make sure your sample accurately reflects the whole population.

  • Ensure that every group that matters to your study is proportionally represented.

New cards
10

Randomness (Randomize)

Means to make something random or to introduce randomness into a process. In simple terms, it means to mix things up so that there is no predictable pattern.

Every individual in the population has an equal chance of being selected for the sample, without any bias or predetermined pattern.

  • Stratification supplements but does not substitute for randomness.

  • But random sampling is a fair approach, it might under-represent or over-represent certain groups, especially if there are large differences between them (e.g., certain age groups, income groups, or geographic areas).

New cards
11

Participation biases

A selection effect arising from differences in the target population with regard to willingness to participate in a survey. Those who choose to respond might be importantly different from those who choose not to respond.

  • For example, those with strong opinions and who are less busy are more likely to take part in a survey than those who lack strong opinions or who are busier. 

  • A sample may fail to be representative even if the initial selection of potential participants was perfectly random.

New cards
12

Response Bias

Happens when people answer survey questions in a way that doesn't reflect their true thoughts or feelings. This can happen because they think there’s a certain answer expected, or they’re worried about what others will think of their response.

New cards
13

Summary statistics

The practice of summarizing and reporting statistical data. This involves making decisions about what the most important facts are, and how best to present them. Ask questions such as:

  • what features of the data are most important to us?

  • what's the clearest way to present those features?

New cards
14

Central Tendency (& Different Ways to Measure)

A way to find a single value that represents the "typical" or "average" point in a set of data. It helps summarize a bunch of numbers by showing us where most of the values are centered.

There are different ways to measure central tendency, like:

  • Mean (average): Add up all the numbers and divide by how many numbers there are.

  • Median: The middle number when the data is sorted from lowest to highest.

  • Mode: The number that appears the most.

  • Geometric mean: A type of average that multiplies all the numbers together and then takes the root.

  • Truncated mean: Similar to the mean but ignores some extreme values at the ends.

The choice of which one to use depends on the data and what you want to measure. To provide a sense of what value the trait has for a typical individual.

New cards
15

Outlier

An observation that is very distant from a dataset’s central tendency, conventionally three standard deviations.

  • When we say that an outlier is "three standard deviations away," it means that this number is so far from the average (mean) that it’s very unlikely to happen in a typical set of data. In other words, it's far enough away from the rest of the data that it stands out as unusual or extreme

New cards
16
<p><span>Geometric mean</span></p>

Geometric mean

A way to find the average of a set of numbers, but it’s different from the regular average (mean). Multipy all the numbers together and then take the root (like the square root, cube root, etc.) based on how many numbers you have.

  • It’s often used when you want to find the average of things like growth rates or percent changes over time. For example, if something grows by 10% in the first year and 20% in the second year, the geometric mean gives a better idea of the overall growth than a simple average would.

New cards
17

Standard Deviation

Measure of how spread out or different the numbers in a set of data are from the average (mean).

Here’s a simple way to think about it:

  • If the standard deviation is low, it means the numbers are close to the average (they’re similar).

  • If the standard deviation is high, it means the numbers are spread out more widely (they’re different from the average).

New cards
18

Truncated

Refers to when you remove or ignore the extreme values (either the highest or lowest) from a set of data.

  • For example, if you're looking at people's incomes, you might truncate the data by removing the very highest and lowest incomes because they might be extreme outliers.

  • You can do this when it is deemed unnecessary for the analysis or data.

New cards
19

Cherry Picking Data

When someone selects only the pieces of data that support their argument or point of view, while ignoring data that might contradict it.

New cards
20

Loose Generalization

When we associate one kind of thing or person with an attribute but we are unclear what proportions we take to be involved.

  • For example, we might believe that Canadians are polite without having much sense of what this means, statistically speaking.

  • Loose generalizations can be expressed using bare plurals or with "many" as in "Many Canadians are polite". 

New cards
21

Representativeness Heuristic

A mental shortcut we use when we're trying to figure out how likely something is based on how well it seems to match our mental picture or stereotype of that thing.

Instead of looking at actual statistics or evidence, we rely on our gut feeling or mental association between two things.

  • For example, we might be wondering how common a feature F is among individuals that are G and, instead of answering that question, we determine how closely we associate being F with being G (stereotype).

New cards
22

Base Rate

Refers to how common something is in a larger group or population. The general prevalence or frequency of an event or characteristic within a particular population or context.

For example, if we say that 10% of people in a city are left-handed, that 10% is the base rate for left-handedness in that city.

  • The overall proportion or probability of a feature in general or in the population at large. 

New cards
23

Confidence interval

Helps us understand the uncertainty or precision of an estimate. It provides a range of values within which we expect the true value of a population parameter (like a mean, proportion, or difference) to lie, with a certain level of confidence.

  • For example, if you wanted to know the average height of all people in a country, but you only measured a sample, a 95% confidence interval means you can be 95% sure that the true average height for everyone in the country falls within that range.

  • The size of this interval in either direction from the given value is called the margin of error. 

New cards
24

Convenience Sample

A group of observations or data that is chosen in a quick and easy way, without much care or planning.

  • For example, you might survey the people who are closest to you, or the first 10 people who walk by, because it's the easiest way to get data.

New cards
25

Representative Sample

A small group that accurately reflects the larger group you're studying. It means that the people or things you select for the sample should have the same characteristics or variety as the larger population in a way that might affect the result you’re looking for.

  • For example, if you want to know how people in a country feel about a political issue, you need your sample to have the same balance of Democrats, Republicans, and other groups that exist in the country. This way, the sample is a mini version of the whole population.

New cards
26

Steps to Find Geometric Mean

Multiply all the numbers together.

  • For example, if your numbers are 2, 4, and 8, you would multiply them: 2 Ă— 4 Ă— 8 = 64.

Take the root of the result based on how many numbers you have.

  • If you have 3 numbers, take the cube root (because 3 is the number of values).

  • If you have 4 numbers, take the 4th root, and so on.

For 3 numbers, the cube root of 64 is about 4.

So the geometric mean of 2, 4, and 8 is 4.

In short:

  • Multiply all the numbers.

  • Take the root based on how many numbers you started with.

<p><strong>Multiply all the numbers</strong> together.</p><ul><li><p>For example, if your numbers are 2, 4, and 8, you would multiply them: 2 Ă— 4 Ă— 8 = 64.</p></li></ul><p><strong>Take the root</strong> of the result based on how many numbers you have.</p><ul><li><p>If you have 3 numbers, take the <strong>cube root</strong> (because 3 is the number of values).</p></li><li><p>If you have 4 numbers, take the <strong>4th root</strong>, and so on.</p></li></ul><p>For 3 numbers, the cube root of 64 is about <strong>4</strong>.</p><p>So the <strong>geometric mean</strong> of 2, 4, and 8 is <strong>4</strong>.</p><p>In short:</p><ul><li><p>Multiply all the numbers.</p></li><li><p>Take the root based on how many numbers you started with.</p></li></ul><p></p>
New cards
27

Median

The middle number in a set of data when the numbers are arranged in order (from smallest to largest).

Here’s how you find the median:

  1. Arrange the numbers in order.

  2. If there’s an odd number of numbers, the median is the middle one.

  3. If there’s an even number of numbers, the median is the average of the two middle numbers.

For example:

  • If you have 3, 5, 7, the numbers are already in order, and the middle number is 5, so the median is 5.

  • If you have 2, 4, 6, 8, the middle numbers are 4 and 6, so the median is the average of these two: (4 + 6) Ă· 2 = 5.

New cards
28

Mode

The number that appears the most in a set of data.

For example:

  • If you have the numbers 2, 4, 4, 5, 7, the mode is 4 because it appears twice, more than any other number.

  • If all the numbers appear only once (like 2, 3, 5, 7), there is no mode.

New cards
29

Mean

What we usually call the average. It’s a way to find the typical value in a set of numbers.

Here’s how you find it:

  1. Add up all the numbers in the set.

  2. Divide the total by how many numbers there are.

For example:

  • If you have the numbers 3, 5, and 7:

    1. Add them together: 3 + 5 + 7 = 15.

    2. Divide by how many numbers there are (3 numbers): 15 Ă· 3 = 5.

So, the mean (average) of 3, 5, and 7 is 5.

New cards
30

Induction

Modes of reasoning that link facts / beliefs about observations with facts/beliefs about unobserved phenomena.

  • Lend probable cause to the conclusion.

New cards
31
New cards

Explore top notes

note Note
studied byStudied by 14 people
1005 days ago
4.0(1)
note Note
studied byStudied by 162 people
624 days ago
5.0(1)
note Note
studied byStudied by 16 people
122 days ago
5.0(1)
note Note
studied byStudied by 22 people
743 days ago
5.0(1)
note Note
studied byStudied by 61 people
882 days ago
4.0(1)
note Note
studied byStudied by 8 people
176 days ago
5.0(1)
note Note
studied byStudied by 10 people
898 days ago
5.0(1)
note Note
studied byStudied by 255 people
686 days ago
4.8(9)

Explore top flashcards

flashcards Flashcard (127)
studied byStudied by 31 people
911 days ago
5.0(1)
flashcards Flashcard (20)
studied byStudied by 19 people
266 days ago
5.0(1)
flashcards Flashcard (20)
studied byStudied by 8 people
784 days ago
5.0(1)
flashcards Flashcard (28)
studied byStudied by 29 people
737 days ago
5.0(2)
flashcards Flashcard (67)
studied byStudied by 9 people
837 days ago
5.0(1)
flashcards Flashcard (315)
studied byStudied by 51 people
763 days ago
5.0(4)
flashcards Flashcard (29)
studied byStudied by 15 people
379 days ago
5.0(1)
flashcards Flashcard (26)
studied byStudied by 84 people
17 days ago
5.0(1)
robot