POS Final

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/59

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

60 Terms

New cards

Univariate Distribution

Describing and analyzing one variable.

New cards

Statistic

An estimate of a parameter based on a sample.

New cards

Descriptive Statistics

Measurements used to summarize and organize the observed values of one variable.

New cards

Inferential statistics

Measurements used to make decisions about variable(s) by interpreting one variable.

New cards

Frequency Distributions

Number of cases for each category of a variable. To construct one, list the categories of variables and then list the number of observations is each.

New cards

Proportion

Number of observations in each category divided by the total number of observations

New cards

Percentage

Proportion multiplied by 100.

New cards

Central Tendency

The value around which most of the data are clustered.

New cards

Mode

The value that appears most frequently

New cards

Median

The midpoint in an ordered series of data. (N+1)/2. For an even #, is the average of the two middle values.

New cards

Bimodal

when two categories occur just as frequently

New cards

Mean

The sum of the observed values divided by the number of cases

New cards

Dispersion

The extent to which the data are spread out from their central tendency

New cards

Range

Indicates the difference between the lowest and highest values of the distribution

New cards

Quantiles

points taken at regular intervals of an ordered data set that divide the set into equal groups from lowest to highest.

New cards

Quartiles

4 equal parts

New cards

Quintiles

5 equal parts

New cards

Deciles

Ten equal parts

New cards

Percentiles

100 equal parts

New cards

Variance

A measure of dispersion for interval or ratio level data based on finding the variation around the mean value of the distribution.

Is the average of the total square of the deviation from the mean of the data.

How far the data is spread from the mean.

New cards

Standard Deviation

The square root of the variance - since the variance has transformed the data into squared units and we want to report them in original units.

New cards

Standard Deviation Advantages (over other measures of dispersion)

it is more stable from sample to sample since it is based on all observations

New cards

Standard Deviation is most useful when…

It is most useful when they are interpreted by comparing the dispersion of several clusters of data.

New cards

Six examples of frequency curve

Bell shaped
U-shaped
Positively Skewed j-curve
Negatively skewed j-curve
Bimodal
Rectangular

New cards

Bell Shaped

A symmetrical distribution (mean and median are identical and frequencies going toward the right and left tails are identical) - where most of the data (the mode) is centered near the median and mean.

New cards

U-shaped

A symmetrical distribution where most of the data is spread evenly away from the mean and median (Very few cases are average and most are at either end of the extreme).

New cards

Positively skewed j-curve

A non-symmetrical distribution with a large number of low scores and a few extremely high scores.

New cards

Negatively skewed j-curve

A non-symmetrical distribution with a large number of high scores and a few extremely low scores

New cards

Bimodal

A distribution where the data is clustered at two different points away from the mean. Takes an “m” shape.

New cards

Rectangular

A symmetrical distribution where the data is spread equally in every category.

New cards

Normal Curve

A special type of bell curve where the distribution of values nears a direct and known relationship to the size of the standard deviation. A given percentage of cases are within 1, 2, or 3 standard deviations from the mean.

New cards

Four properties of Normal Curves

Symmetrical and bell-shaped
Mode, median, and mean coincide at the center of the distribution
Curve is based on an infinite number of observations
A fixed proportion of observations lies between the mean and fixed units of standard deviations

New cards

Normal Distribution

When data is distributed normally the mean divides the data in half. The following holds true:

68.26% of the data lies within ±1 sd from the mean
95.46% of the observations fall within ±2 sd from the mean
99.73% of the observations fall within ±3 sd from the mean

New cards

Outlier

An observation in a normally distributed data set that lies beyond ±3 standard deviations from the mean (only .27% of observations fall in this category).

Are sometimes dropped from the data when it is analyzed since they do not represent the average cases and tend to skew the results.

New cards

Z-scores

Also known as standardized score; is the number of standard deviations an observation is above or below the mean. A positive score is above the mean, and a negative is below the mean.

Are used if the data is normally distributed.

New cards

To compute z-scores

Subtract the mean of the data from the score of a specific observation. Then divide the results by the standard deviation of the data.

New cards

Bivariate Relationships

Relationships between two variables.

New cards

Contingency Tables

aka Crosstabulation. Compares or cross-tabulates two nominal and/or ordinal variables in a table to see if the values of one are contingent on the other.

This helps determine if there is a relationship between the variables.

New cards

Cross Tabulations

a statistical method used to analyze the relationship between two or more categorical variables by displaying the frequency of their combinations in a table

New cards

“Percentage” the Table

Percent the Table and Subtract Across to Compute the Percentage Difference

New cards

Difference of Means

Comparing the appropriate central tendency in two groups to look for patterns

New cards

Tests of Statistical Significance

Determines whether a relationship between variables in a probability sample can be generalized to the population from which the sample was selected.

New cards

Significance Level

states the probability that a relationship in a probability sample occurred by chance and doesn’t really exist in the population. Frequently symbolized with the letter “p” for probability - “p-value”

New cards

Probability or “p” value

the significance level. The probability that a relationship in a probability sample occurred by chance.

New cards

.05 Cutoff Level

95% confidence. The most commonly used level to use as a cutoff point. If reported significance level is greater than .05, the relationship is assumed not significant. if it is less than .05, the relationship is assumed to be significant.

New cards

Chi - squared statistic for statistical significance

Used for contingency tables of nominal or ordinal data.

Computed by figuring out the difference between the observed and expected values of the variables in contingency table.

If it is significant at .05 or less then you can state that the sample relationship in the contingency table is significant. ..01 or .001 is Yes. .20 or .30 No.

New cards

Measures of Association

Allow us to summarize the strength of a relationship in more accurate ways than relying on percentage differences.

New cards

Strength of Relationships

Extent to which changes in one variable are accompanied by changes in another variable.

New cards

Yule’s Q

A summary measure for use in a bivariate table with nominal (or ordinal) data that indicates the strength of the relationship.

Is based on the number of cases that show a positive relationship minus the number of cases that show a negative relationship.

Q= ad-bc / ad+bc

New cards

Proportional Reduction of Error (PRE)

The amount by which errors in predicting the dependent variable can be reduced by knowing the relationship between the DV and the IV.

The extent to which we can reduce possible errors in estimating the value of a case if we know its value on a second variable.

errors w/o knowledge of iv - errors w/ knowledge of iv [divided by] errors w/out knowledge of iv

New cards

Lambda

Used with two nominal variables or with one nominal and one ordinal variable. Can be used for tables that are larger than two by two. It compares the modal value for each value of the IV.

New cards

Gamma

an ordinal measure of relationships. Measures the number of similarly ordered pairs as a proportion of all relevant pairs. Does not include tied pairs.

(identical to Yule’s Q for a 2X2 table). For Ordinal level data. Is frequently higher than the other measures and can over estimate relationships when there are many tied pairs since it does not include them.

New cards

Kendall’s Tau

an ordinal measure of relationships. is based on pairs of cases.

For Ordinal level data; can be used for nominal data when lambda is inappropriate (1 DV category very high). Tau b for square tables (2x2 or 3x3 for instance) and tau c for rectangular ones (2x3 or 2x4 or 3x4 for instance). Includes tied pairs in denominator in calculations (those not on diagonal) and thus gives more accurate (lowest, most conservative) measure. (tau c harder to interpret and can only say which of two tables of similar proportions is stronger.

New cards

Somer’s D

an ordinal measure of relationship.

Generally for ordinal level data. Only counts pairs tied on the DV in the denominator. This has the effect of focusing on pairs of cases where the IV actually changes. Thus it is better for causal analysis. Usually gives a moderate measure between those arrived at by Gamma and Tau.

New cards

List the Variables:

Dependent (Y), Independent (X), Intervening (Z)`

New cards

Dependent Variable

(Y). The variable we wish to explain or predict; its value is influenced by the Independent Variable

New cards

Independent Variable

(X) The variable we think causes a change in (has an effect on) the Dependent variable

New cards

Intervening Variable

(Z) A third variable that can affect the relationship between the independent variable and dependent variable. It is also another independent variable.

New cards

Multivariate Analysis

Examining a relationship between more than two variables; usually looks at the effect of several Independent variables on one Dependent variable.

New cards

Control

Examining a relationship between an Independent Variable and a Dependent variable by holding another Independent variable constant