Chapter 2: Frequency distributions

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/21

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

22 Terms

1
New cards

what is a Raw score?

is a data point that has not yet been transformed or analyzed

2
New cards

what is the frequency distribution?

-In stats we organize raw scores into frequency distributions: which describe a set of numbers by displaying a count or proportion for each possible value of a variable

<p>-In stats we organize raw scores into frequency distributions: which describe a set of numbers by displaying a count or proportion for each possible value of a variable</p>
3
New cards

what are the 3 different ways to organize information about a single variable or raw scores???

-Frequency tables: Is a visual depiction of data that shows how often each value occurred, that is how many scores were at each value (see lecture notes on how to create one lecture 2)

A frequency table is the best way to create an easy-to-understand distribution of data

-Grouped frequency tables: A visual depiction of data that reports frequencies within a given interval rather than frequencies for a specific value, which makes sense when data covers a huge range

-Histograms +

4
New cards

what is an outlier?

A value in a data set that is much less or much greater than the rest of the data. An outlier is 1.5 times the interquartile range less than the lower quartile or greater than the upper quartile.

5
New cards

How to create a frequency table

ex: 10 students wrote a test: their scores were: 80,83,90,82,80,83,85,83,88

1)determine the highest and lowest score

highest 90, lowest 80

2)create two columns: label the first with the variable name and the other with frequency

3)list the full range of values that encompass all the scores in the data from highest to lowest

4)count the number of scores at each values, and write those numbers in the frequency column

5)add a column for each relative frequency or percentage frequency (relativef (x)=f(x)/N, and you can multiply the 100 for percentage

<p>ex: 10 students wrote a test: their scores were: 80,83,90,82,80,83,85,83,88</p><p>1)determine the highest and lowest score</p><p>highest 90, lowest 80</p><p>2)create two columns: label the first with the variable name and the other with frequency</p><p>3)list the full range of values that encompass all the scores in the data from highest to lowest</p><p>4)count the number of scores at each values, and write those numbers in the frequency column</p><p>5)add a column for each relative frequency or percentage frequency (relativef (x)=f(x)/N, and you can multiply the 100 for percentage</p>
6
New cards

what is a grouped frequency table?

a visual depiction of data that reports frequencies within a given interval rather than the frequencies for a specific value

-makes sense when covering a huge range

7
New cards

Creating a Grouped Frequency Table

ex: 10 students wrote a test: their scores were: 80,83,90,82,80,83,85,83,88

1) find the highest and lowest scores of the frequency distribution highest 90, lowest 80

2) get the full range of data: Range = Xhighest - X lowest +1

(90-80)+1 = 11

3) determine the # of intervals and the best interval size

interval size v=5

number of intervals = (11/5)= 2.2 or 3 intervals

4a)figure out the number that will be the bottom of the interval

important must be a multiple of 5 i choose 80

4b)figure out the lower limit of the next interval: l(i)=l(i-1)+v

4c) determine the upper limits of each interval: each interval contains 5 numbers ex:80,81,82,83,84

5)finish the table by listing the intervals from highest to lowest and then counting the numbers of each

<p>ex: 10 students wrote a test: their scores were: 80,83,90,82,80,83,85,83,88</p><p>1) find the highest and lowest scores of the frequency distribution highest 90, lowest 80</p><p>2) get the full range of data: Range = Xhighest - X lowest +1</p><p>(90-80)+1 = 11</p><p>3) determine the # of intervals and the best interval size</p><p>interval size v=5</p><p>number of intervals = (11/5)= 2.2 or 3 intervals</p><p>4a)figure out the number that will be the bottom of the interval</p><p>important must be a multiple of 5 i choose 80</p><p>4b)figure out the lower limit of the next interval: l(i)=l(i-1)+v</p><p>4c) determine the upper limits of each interval: each interval contains 5 numbers ex:80,81,82,83,84</p><p>5)finish the table by listing the intervals from highest to lowest and then counting the numbers of each</p>
8
New cards

what are histograms?

-A graph that depicts frequency data for one variable, usually based on scale data, with the values of the variable on the x axis and the frequencies on the y axis

Values of the variable on the x axis (horizontal line)

Frequencies on the y axis (vertical line)

<p>-A graph that depicts frequency data for one variable, usually based on scale data, with the values of the variable on the x axis and the frequencies on the y axis</p><p>Values of the variable on the x axis (horizontal line)</p><p>Frequencies on the y axis (vertical line)</p>
9
New cards

Midpoint of an interval formula

knowt flashcard image
10
New cards

what is the midpoint of the interval 80-84?

knowt flashcard image
11
New cards

normal distribution (bell curve)

A specific frequency distribution that is a bell shaped symmetric, with a unimodal curve

<p>A specific frequency distribution that is a bell shaped symmetric, with a unimodal curve</p>
12
New cards

when data is not symmetrical, one of the tails of the distribution is pulled away from the centre, a positive skew is towards the?

a negative skew is towards the?

-tail to the right

-tails to the left

<p>-tail to the right</p><p>-tails to the left</p>
13
New cards

what is the difference between frequencies and grouped frequencies?

-a frequency is a count of how many times a score appears, a grouped frequency is a count for a defined interval or group of scores

14
New cards

when the bulk of data cluster together but the data tails off to the left the skew is?

when data tails off to the right the skew is?

-is a negative skew

-positive skew

15
New cards

Alzheimer's disease is typically diagnosed in adults older than the age of 70, cases diagnosed sooner are called early onset:

a)assuming that these early-onset cases represent unique tailing off of data on one side, would the skew be positive or negative?

b)does this data represent a ceiling effect or a floor effect?

a)early onset of Alzheimer's disease would create a negative skew in the distribution for age onset

b) represents a ceiling effect, because eventually all humans pass away

16
New cards

what is the difference between a histogram and a bar graph?

-bar graphs typically score for nominal data, and are not arranged in order where

-histograms provide frequencies for scale data and are arranged in order and are lowest to highest

17
New cards

what are the typical labels for the x -axis and the y-axis in a histogram?

a histogram looks like a bar graph but is usually to depict scale data, with the values(independent variable) or intervals of the variables on the x -axis (horizontal line)wheres the frequency (dependant variable) goes on the y-axis (vertical line)

<p>a histogram looks like a bar graph but is usually to depict scale data, with the values(independent variable) or intervals of the variables on the x -axis (horizontal line)wheres the frequency (dependant variable) goes on the y-axis (vertical line)</p>
18
New cards

what is floor effect ?and how does it affect a distribution?

-is a situation where a constraint prevents a variable from taking on values below a certain point,

-and when a distribution is positively skewed, the tail of the distribution extends to the right is usually a floor effect

19
New cards

What is the ceiling effect? and how does it affect a distribution?

-occurs when there are no scores above a certain point,

-the ceiling effect leads to a negatively skewed distribution because the upper part of the distribution is constrained

20
New cards

convert the following to percentages: 7 out of 39 and 122 out of 300?

a) 7/39=0.1794 x 100= 17.94%

b) 122/300=0.4066 x 100 = 40.66%

21
New cards

if you have data that range from 2 to 68 and you wanted seven intervals in a grouped frequency table, what would the intervals be?

1) find the range: x highest - x lowest + 1

68-2 + 1 = 67

2) divide the 67 by the 7 intervals desired

67/7= 9.57, round that up to 10

3)the seven intervals are: 0-9,10-19, 20-29,30,39,40-49,50-59,60-69

22
New cards

if the average person convicted of murder killed only one person, serial killers would create what kind of skew?

-serial killers would create a positive skew, adding high numbers of murders to the data that are clustered around 1

<p>-serial killers would create a positive skew, adding high numbers of murders to the data that are clustered around 1</p>