Chapter 2: Frequency distributions

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/21

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

22 Terms

New cards

what is a Raw score?

is a data point that has not yet been transformed or analyzed

New cards

what is the frequency distribution?

-In stats we organize raw scores into frequency distributions: which describe a set of numbers by displaying a count or proportion for each possible value of a variable

<p>-In stats we organize raw scores into frequency distributions: which describe a set of numbers by displaying a count or proportion for each possible value of a variable</p>

New cards

what are the 3 different ways to organize information about a single variable or raw scores???

-Frequency tables: Is a visual depiction of data that shows how often each value occurred, that is how many scores were at each value (see lecture notes on how to create one lecture 2)

A frequency table is the best way to create an easy-to-understand distribution of data

-Grouped frequency tables: A visual depiction of data that reports frequencies within a given interval rather than frequencies for a specific value, which makes sense when data covers a huge range

-Histograms +

New cards

what is an outlier?

A value in a data set that is much less or much greater than the rest of the data. An outlier is 1.5 times the interquartile range less than the lower quartile or greater than the upper quartile.

New cards

How to create a frequency table

ex: 10 students wrote a test: their scores were: 80,83,90,82,80,83,85,83,88

1)determine the highest and lowest score

highest 90, lowest 80

2)create two columns: label the first with the variable name and the other with frequency

3)list the full range of values that encompass all the scores in the data from highest to lowest

4)count the number of scores at each values, and write those numbers in the frequency column

5)add a column for each relative frequency or percentage frequency (relativef (x)=f(x)/N, and you can multiply the 100 for percentage

<p>ex: 10 students wrote a test: their scores were: 80,83,90,82,80,83,85,83,88</p><p>1)determine the highest and lowest score</p><p>highest 90, lowest 80</p><p>2)create two columns: label the first with the variable name and the other with frequency</p><p>3)list the full range of values that encompass all the scores in the data from highest to lowest</p><p>4)count the number of scores at each values, and write those numbers in the frequency column</p><p>5)add a column for each relative frequency or percentage frequency (relativef (x)=f(x)/N, and you can multiply the 100 for percentage</p>

New cards

what is a grouped frequency table?

a visual depiction of data that reports frequencies within a given interval rather than the frequencies for a specific value

-makes sense when covering a huge range

New cards

Creating a Grouped Frequency Table

ex: 10 students wrote a test: their scores were: 80,83,90,82,80,83,85,83,88

1) find the highest and lowest scores of the frequency distribution highest 90, lowest 80

2) get the full range of data: Range = Xhighest - X lowest +1

(90-80)+1 = 11

3) determine the # of intervals and the best interval size

interval size v=5

number of intervals = (11/5)= 2.2 or 3 intervals

4a)figure out the number that will be the bottom of the interval

important must be a multiple of 5 i choose 80

4b)figure out the lower limit of the next interval: l(i)=l(i-1)+v

4c) determine the upper limits of each interval: each interval contains 5 numbers ex:80,81,82,83,84

5)finish the table by listing the intervals from highest to lowest and then counting the numbers of each

<p>ex: 10 students wrote a test: their scores were: 80,83,90,82,80,83,85,83,88</p><p>1) find the highest and lowest scores of the frequency distribution highest 90, lowest 80</p><p>2) get the full range of data: Range = Xhighest - X lowest +1</p><p>(90-80)+1 = 11</p><p>3) determine the # of intervals and the best interval size</p><p>interval size v=5</p><p>number of intervals = (11/5)= 2.2 or 3 intervals</p><p>4a)figure out the number that will be the bottom of the interval</p><p>important must be a multiple of 5 i choose 80</p><p>4b)figure out the lower limit of the next interval: l(i)=l(i-1)+v</p><p>4c) determine the upper limits of each interval: each interval contains 5 numbers ex:80,81,82,83,84</p><p>5)finish the table by listing the intervals from highest to lowest and then counting the numbers of each</p>

New cards

what are histograms?

-A graph that depicts frequency data for one variable, usually based on scale data, with the values of the variable on the x axis and the frequencies on the y axis

Values of the variable on the x axis (horizontal line)

Frequencies on the y axis (vertical line)

<p>-A graph that depicts frequency data for one variable, usually based on scale data, with the values of the variable on the x axis and the frequencies on the y axis</p><p>Values of the variable on the x axis (horizontal line)</p><p>Frequencies on the y axis (vertical line)</p>

New cards

Midpoint of an interval formula

New cards

what is the midpoint of the interval 80-84?

New cards

normal distribution (bell curve)

A specific frequency distribution that is a bell shaped symmetric, with a unimodal curve

New cards

when data is not symmetrical, one of the tails of the distribution is pulled away from the centre, a positive skew is towards the?

a negative skew is towards the?

-tail to the right

-tails to the left

New cards

what is the difference between frequencies and grouped frequencies?

-a frequency is a count of how many times a score appears, a grouped frequency is a count for a defined interval or group of scores

New cards

when the bulk of data cluster together but the data tails off to the left the skew is?

when data tails off to the right the skew is?

-is a negative skew

-positive skew

New cards

Alzheimer's disease is typically diagnosed in adults older than the age of 70, cases diagnosed sooner are called early onset:

a)assuming that these early-onset cases represent unique tailing off of data on one side, would the skew be positive or negative?

b)does this data represent a ceiling effect or a floor effect?

a)early onset of Alzheimer's disease would create a negative skew in the distribution for age onset

b) represents a ceiling effect, because eventually all humans pass away

New cards

what is the difference between a histogram and a bar graph?

-bar graphs typically score for nominal data, and are not arranged in order where

-histograms provide frequencies for scale data and are arranged in order and are lowest to highest

New cards

what are the typical labels for the x -axis and the y-axis in a histogram?

a histogram looks like a bar graph but is usually to depict scale data, with the values(independent variable) or intervals of the variables on the x -axis (horizontal line)wheres the frequency (dependant variable) goes on the y-axis (vertical line)