Chapter 1: Introduction and Descriptive Statistics
The Use of Statistics in Science
Statistics is defined as the analysis and evaluation of scientific observations. It provides a structured mathematical framework for quantifying data and drawing meaningful conclusions. The field is broadly categorized into two main branches:
Descriptive Statistics: Used to summarize and describe the characteristics of a specific data set. It is the first step in statistical analysis, helping to make large volumes of data more manageable.
Inferential Statistics: Used to make generalizations or inferences about a larger population based on the data collected from a sample.
Descriptive and Inferential Statistics
Terminology: Data vs. Datum
Datum: A single measurement or observation, often referred to as a score or raw score.
Data: Multiple measurements or observations collected together.
Presentation of Data
Descriptive statistics involve the use of tables, graphs, and summary measures (like the mean or standard deviation). For example, making scientific observations requires a systematic way to present findings so patterns become apparent.
Populations and Samples
Population: The entire set of individuals, items, or data points of interest in a study.
Population Parameter: A characteristic (usually numeric) that describes the entire population.
Sample: A subset of individuals selected from the population, intended to represent the larger group.
Sample Statistic: A characteristic that describes the sample data. Statistics are used to estimate parameters.
Research Methods and Statistics
Data collection must follow strict rules to ensure validity. The Scientific Method, or Research Method, is the systematic process of gathering information.
1. Experimental Method
This method is designed to demonstrate cause-and-effect relationships by controlling conditions. Key requirements include:
Manipulation: The researcher actively changes the level of the Independent Variable (IV). The various levels of the IV are the conditions being compared.
Randomization (Random Assignment): Participants have an equal chance of being assigned to any group, which helps control for preexisting differences.
Comparison/Control: Including a control group (no treatment) to compare against treatment groups.
Dependent Variable (DV): The variable that is measured; it is hypothesized to be affected by the IV.
Operational Definition: A clear description of how a variable is measured or defined within a study (e.g., defining "fear" as a specific heart rate threshold).
2. Quasi-Experimental Method
Used when researchers test a hypothesis but lack full control over conditions.
Quasi-independent variable (Factor): A variable that differentiates groups (like gender or age) that cannot be randomly assigned.
This method lacks the random assignment found in true experiments, and there may not be a true comparison or control group.
3. Correlational Method
This method measures the extent to which two variables change together.
It involves no control over experimental conditions.
Researchers observe variables in their natural state to determine if a relationship (correlation) exists between them.
Scales of Measurement
Measurements are characterized by four properties: identity, order, difference, and ratio.
Nominal Scale: Represents an identity or category. It provides no information about rank or quantity. Coding is often used here (e.g., assigning 1 for "Male" and 2 for "Female").
Ordinal Scale: Conveys rank or order. It shows that one value is greater or less than another, but the distance between ranks is not equal.
Interval Scale: Features an equidistant scale where the difference between values is consistent. It lacks a True Zero (e.g., Temperature in Fahrenheit).
Ratio Scale: The most informative scale. It has equidistant points and a true zero point, representing the total absence of the variable (e.g., height, weight, or time).
Types of Variables
Continuous Variable: Can take on an infinite number of values between any two points (e.g., distance).
Discrete Variable: Measured in whole units or categories; there are no values between adjacent units (e.g., number of children).
Quantitative Variable: Data that represents amounts or counts.
Qualitative Variable: Data that describes labels, categories, or qualities.
SPSS in Focus: Entering and Defining Variables
SPSS (Statistical Package for the Social Sciences) is a common tool for data analysis.
Data Entry: Scores (such as GPA) are typically entered into a spreadsheet.
Columns: Usually represent variables.
Rows: Usually represent individual cases or subjects.
A final check of the data entry is essential for accuracy.
Chapter 2: Frequency Distributions in Tables and Graphs
Why Summarize Data?
Summarizing data allows for comparing scores and identifying the Frequency (the number of times a score occurs). This is especially critical for large data sets to make them meaningful and interpretable.
Simple Frequency Distributions for Grouped Data
A Simple Frequency Distribution summarizes the frequency of scores in categories. When there are many scores, Grouped Data is used to organize scores into Intervals.
Steps to Construct a Distribution:
Find the Real Range: The difference between the highest and lowest scores in the data set.
Find the Interval Width (i): IntervalWidth=NumberofintervalsRealrange
Construct the Frequency Distribution: List intervals and count the frequency of scores within each.
Rules for Grouped Data:
Intervals must be mutually exclusive (no overlapping boundaries).
All intervals must be the same width.
Open Interval: An interval with no specific upper or lower boundary (e.g., "90 and above").
Outlier: An extreme score that falls significantly outside the range of most of the data.
Other Ways of Summarizing Grouped Data
Cumulative Frequency (cf): The sum of frequencies at and below a particular interval (bottom-up) or at and above (top-down).
Relative Frequency (rf): The proportion of total scores in an interval. RelativeFrequency=TotalfrequencycountObservedfrequency
Relative Percent: The percentage of the total data set represented by an interval. RelativePercent=TotalfrequencycountObservedfrequency×100
Cumulative Relative Frequency / Cumulative Percent: The sum of relative frequencies or percents up to a certain point.
Percentile Points and Percentile Ranks
Percentile Point: The raw score value at or below which a specific percentage of scores fall.
Percentile Rank: The percentage of scores in a distribution that are at or below a given raw score.
Graphs for Continuous Data
Graphs provide an alternative visual way to display frequency distributions.
Histogram: A bar-style graph where bars touch, used for continuous data. The height represents frequency.
Frequency Polygon: A dot-and-line graph where dots are placed at the midpoint of each interval and connected by lines.
Ogive: A line graph representing cumulative frequencies.
Stem-and-Leaf Display: A display that lists individual scores. The Stem (left of the vertical line) represents the leading digits, and the Leaf (right of the line) represents the trailing digits. It retains the identity of every single score, unlike a histogram.
Graphs for Discrete and Categorical Data
Bar Chart: Similar to a histogram, but the bars do not touch, emphasizing the discrete nature of the categories.
Pie Chart: A circular graph where the "slices" or Sectors represent the proportion of the total for each category.
Chapter 3: Central Tendency
Introduction to Central Tendency
Central tendency describes the center of a distribution. Measures of central tendency for samples are Sample Statistics, while those for populations are Population Parameters (N vs. n).
Measures of Central Tendency
Mean (M or μ): The arithmetic average. - Weighted Mean: Used when combining groups with unequal sample sizes. Weightedmean=SumofweightsSumofweightedproducts
Median: The middle score in a distribution when ordered from lowest to highest. It is at the (n+1)/2 position. It is unaffected by outliers.
Mode: The value that occurs most frequently. A distribution can be unimodal, bimodal, multimodal, or nonmodal (rectangular).
Characteristics of the Mean
Every score in the distribution affects the mean.
Adding/removing a score usually changes the mean (unless the score equals the mean).
Adding, subtracting, multiplying, or dividing every score by a Constant changes the mean by that same constant.
The sum of the differences of each score from the mean is always zero: ∑(x−M)=0. This makes the mean the Balancing Point of the distribution.
The sum of the squared differences of scores from the mean ∑(x−M)2 is minimal compared to any other value.
Choosing a Measure
Mean: Best for Normal Distributions and Interval/Ratio data.
Median: Best for Skewed Distributions (where outliers pull the mean) and Ordinal data.
Mode: Best for Nominal data and describing Modal Distributions.
Chapter 4: Summarizing Data: Variability
Introduction to Variability
Variability measures the spread or dispersion of scores in a distribution. Knowing the mean is not enough; variability explains how far scores deviate from that center.
Basic Measures
Range: The difference between the largest and smallest values. It is sensitive to outliers and is often avoided by researchers.
Quartiles: Fractiles that divide the data into four equal parts. - Interquartile Range (IQR): Q3−Q1. - Semi-interquartile Range (SIQR): (Q3−Q1)/2, which describes the spread of the middle 50% of scores.
Variance
Variance measures the average squared distance from the mean.
Population Variance (σ2): σ2=N∑(X−μ)2
Sample Variance (s2): Uses n−1 in the denominator to be an Unbiased Estimator of the population variance. s2=n−1∑(X−M)2=dfSS
Sums of Squares (SS): The numerator of the variance formula (∑(X−M)2).
Standard Deviation
The standard deviation is the square root of the variance, bringing the unit of measurement back to the original scale.
Population (σ): σ=σ2
Sample (SD): SD=s2
Informativeness of Standard Deviation
Empirical Rule: In a normal distribution, approximately 68% of scores fall within 1 SD, 95% within 2 SD, and 99.7% within 3 SD of the mean.
Chebyshev’s Theorem: For any distribution, the proportion of scores within k standard deviations is at least 1−k2′1 . (Applies to non-normal distributions).
Standard Deviation Characteristics: Always positive, affected by every score, reported as M±SD
Chapter 5: Probability and the Foundations of Inferential Statistics
Introduction to Probability
Probability predicts the likelihood of random events. It ranges from 0 to 1 and can never be negative. P(A)=SamplespaceCount(A)
Relationships Between Outcomes
Mutually Exclusive: Outcomes that cannot occur at the same time. Use the Additive Rule: p(A∪B)=p(A)+p(B).
Independent: The occurrence of one outcome does not affect the other. Use the Multiplicative Law: p(A∩B)=p(A)×p(B).
Complementary: The sum of probabilities equals 1. p(A)=1−p(B).
Conditional: One outcome depends on another. p(U∣P)=p(P)p(P∩U).
Bayes’s Theorem: p(U∣P)=p(P)p(P∣U)p(U).
Probability Distributions
Random Variable: A variable whose values are determined by chance.
Expected Value: The mean of a probability distribution. { \mu = \sum [x \cdot p(x)] }$.
Chapter 6: Probability, Normal Distributions, and z Scores
Characteristics of the Normal Distribution
Mathematically defined by de Moivre, it is theoretical, symmetrical, and unimodal.
Mean, median, and mode are equal.
Area under the curve = 1.</p></li><li><p>Tailsare<strong>Asymptotic</strong>(nevertouchtheX−axis).</p></li><li><p>Standarddeviationisalwayspositive.</p></li></ul><h4id="60f092ba−37a8−4f14−a999−10dbb5dd0263"data−toc−id="60f092ba−37a8−4f14−a999−10dbb5dd0263"collapsed="false"seolevelmigrated="true">Thez−Transformation</h4><p>Convertsarawscore(X)intostandardunits(numberofstandarddeviationsfromthemean).</p><ul><li><p><strong>Population</strong>:{ z = \frac{X - \mu}{\sigma} }</p></li><li><p><strong>Sample</strong>:{ z = \frac{X - M}{SD} }</p></li></ul><h4id="f7f0f7cb−ef79−4769−a22a−10b4bc7de5ba"data−toc−id="f7f0f7cb−ef79−4769−a22a−10b4bc7de5ba"collapsed="false"seolevelmigrated="true">TheUnitNormalTable</h4><ul><li><p><strong>ColumnA</strong>:Listsz−scores.</p></li><li><p><strong>ColumnB</strong>:Areabetweenthemeanandthez−score.</p></li><li><p><strong>ColumnC</strong>:Areafromthez−scoretowardthetail.</p></li></ul><h4id="3c93ef8e−7a51−4f6f−8de9−a3268258159e"data−toc−id="3c93ef8e−7a51−4f6f−8de9−a3268258159e"collapsed="false"seolevelmigrated="true">NormalApproximationtotheBinomial</h4><p>Assamplesize(n)increases,thediscretebinomialdistributionapproximatesthecontinuousnormaldistribution.Conditionsmustbechecked(usually{np \geq 10}and{nq \geq 10}).</p><ul><li><p>Requiresfinding<strong>RealLimits</strong>fordiscretevaluesbeforecalculatingz.</p></li></ul><h3id="c3733220−7e33−488e−a9d3−c8228bd9e1dd"data−toc−id="c3733220−7e33−488e−a9d3−c8228bd9e1dd"collapsed="false"seolevelmigrated="true">Chapter7:ProbabilityandSamplingDistributions</h3><h4id="7191ad63−a76c−4033−9e98−9eaf31517068"data−toc−id="7191ad63−a76c−4033−9e98−9eaf31517068"collapsed="false"seolevelmigrated="true">SelectingSamples</h4><ul><li><p><strong>SamplingwithReplacement</strong>:Memberisreturnedtothepopulationbeforethenextdraw(keepsprobabilitiesconstant).</p></li><li><p><strong>SamplingwithoutReplacement</strong>:Memberisnotreturned(changesprobabilities).</p></li><li><p><strong>TheoreticalSampling</strong>:UseseverypossiblesampleofsizenfromapopulationN.Totalsamples=N^{n}</p></li><li><p><strong>ExperimentalSampling</strong>:Usedinbehavioralresearch.Totalsamples=\frac{N!}{n!\left(N-n\right)!}</p></li></ul><h4id="ff250a5e−bb7f−46a0−a810−907f0e196ab9"data−toc−id="ff250a5e−bb7f−46a0−a810−907f0e196ab9"collapsed="false"seolevelmigrated="true">CentralLimitTheorem</h4><p>Regardlessoftheshapeofthepopulationdistribution,thesamplingdistributionofthemeanwillbecomenormalassamplesize(n)increases.</p><ul><li><p><strong>UnbiasedEstimator</strong>:Themeanofthesamplingdistributionofthemean(M_M)isequaltothepopulationmean(\mu).</p></li></ul><h4id="3bfc6d51−43cb−4d8c−b37d−e6e74c71ec0c"data−toc−id="3bfc6d51−43cb−4d8c−b37d−e6e74c71ec0c"collapsed="false"seolevelmigrated="true">StandardErroroftheMean(s_M)</h4><p>Thestandarddeviationofthesamplingdistribution.Itmeasuresthe<strong>SamplingError</strong>(discrepancybetweensampleandpopulation).{ \sigma_M = \frac{\sigma}{\sqrt{n}} }</p><ul><li><p><span><span>DecreasedbyasmallerpopulationSD()oralargersamplesize()(</span><strong><span>LawofLargeNumbers</span></strong><span>).</span></span></p></li></ul><h3id="88598f24−cb5f−4a16−ac9f−a7c370dd0b71"data−toc−id="88598f24−cb5f−4a16−ac9f−a7c370dd0b71"collapsed="false"seolevelmigrated="true">Chapter8:HypothesisTesting:Significance,EffectSize,Estimation,andPower</h3><h4id="5dcda5c8−af9f−494d−a7c3−8d231d699c79"data−toc−id="5dcda5c8−af9f−494d−a7c3−8d231d699c79"collapsed="false"seolevelmigrated="true">TheInferentialStep</h4><p>Weobservesamplestolearnaboutpopulationeffects.<strong>NullHypothesisSignificanceTesting(NHST)</strong>followsfoursteps:</p><ol><li><p><strong>StatetheHypotheses</strong>:Null({H_0})usuallypredictsnoeffect.Alternative({H_1}) predicts an effect.
Set the Criteria: Select Alpha ($\alpha$), the probability of a Type I error (usually 0.05).</p></li><li><p><strong>ComputeTestStatistic</strong>:(e.g.,z−test,t−test).</p></li><li><p><strong>MakeaDecision</strong>:If{p < \alpha},reject{H_0}. The result is Statistically Significant.
Types of Errors
Type I Error ($\alpha$): Rejecting {H_0} when it is actually true (False Positive).
Type II Error ($\beta$): Failing to reject {H_0}whenitisactuallyfalse(FalseNegative).</p></li><li><p><strong>TypeIIIError</strong>:Makingadirectionalprediction,buttheeffectisintheoppositedirection.</p></li><li><p><strong>Power(1 - \beta)</strong>:Theprobabilityofcorrectlyrejectingafalsenullhypothesis.</p></li></ul><h4id="fbd0e842−116e−4d54−abbd−3f6312b5275b"data−toc−id="fbd0e842−116e−4d54−abbd−3f6312b5275b"collapsed="false"seolevelmigrated="true">TestingSignificance:Thez−Test</h4><p>Usedwhenthepopulationmeansandvarianceareknown.{ z_{obt} = \frac{M - \mu}{\sigma_M} }</p><ul><li><p><strong>Directional(One−tailed)</strong>:{H_1: > \mu}or{H_1: < \mu}.</p></li><li><p><strong>Nondirectional(Two−tailed)</strong>:{H_1: \neq \mu}.</p></li></ul><h4id="2fb24b56−a899−4c91−9061−0f943055a3fd"data−toc−id="2fb24b56−a899−4c91−9061−0f943055a3fd"collapsed="false"seolevelmigrated="true">EffectSize:Cohen’sd</h4><p>Hypothesistestsonlytellifaneffectexists;effectsizetellshowlargeitis.{ d = \frac{M - \mu}{\sigma} }</p><ul><li><p>Cohen′sConventions:Small(0.2),Medium(0.5),Large(0.8).</p></li></ul><h4id="f0d1b734−a07c−4067−83d6−a1471d29f5b6"data−toc−id="f0d1b734−a07c−4067−83d6−a1471d29f5b6"collapsed="false"seolevelmigrated="true">ConfidenceIntervals(CI)</h4><p>Arangeofvaluesthatlikelycontainsthepopulationmeans.{ CI = M \pm z_{crit}(\sigma_M) }</p><h3id="480b4434−f2e1−4924−90c5−c3f63d94fdb8"data−toc−id="480b4434−f2e1−4924−90c5−c3f63d94fdb8"collapsed="false"seolevelmigrated="true">Chapter9:TestingMeans:One−SampletTest</h3><h4id="b6405171−95ce−48c5−8627−549bd6313e30"data−toc−id="b6405171−95ce−48c5−8627−549bd6313e30"collapsed="false"seolevelmigrated="true">Goingfromztot</h4><p>Ifthepopulationvarianceisunknown,weusesamplevariancetocomputethe<strong>EstimatedStandardError</strong>(s_M).{ s_M = \frac{SD}{\sqrt{n}} }{ t_{obt} = \frac{M - \mu}{s_M} }</p><h4id="f49920eb−d344−4bbb−8e91−b94cf56f3d34"data−toc−id="f49920eb−d344−4bbb−8e91−b94cf56f3d34"collapsed="false"seolevelmigrated="true">DegreesofFreedom(df)</h4><p>Foraone−samplet−test,{ df = n - 1 }.Thet−distributionchangesshapebasedon{df},becomingmorenormalas{df}increases.</p><h4id="22444512−2b98−4883−b3f3−dc9df3792ff7"data−toc−id="22444512−2b98−4883−b3f3−dc9df3792ff7"collapsed="false"seolevelmigrated="true">EffectSizefort−Test</h4><ul><li><p><strong>EstimatedCohen’sd</strong>:<spanstyle="font−family:Arial;"><span>ameasureofeffectsizeintermsofthenumberofstandarddeviationsthatmeanscoresshiftaboveorbelowthepopulationmeanstatedbythenullhypothesis.d=\frac{M-\mu}{SD}
Eta-Squared: Proportion of variance explained by the treatment. { \eta^2 = \frac{t^2}{t^2 + df} }
Omega-Squared: A more conservative estimate than eta-squared. { \omega^2 = \frac{t^2 - 1}{t^2 + df} }</p></li></ul><h3id="f61cda2c−42b2−4b8b−abcc−e866cbde3bbd"data−toc−id="f61cda2c−42b2−4b8b−abcc−e866cbde3bbd"collapsed="false"seolevelmigrated="true">Chapter10:Two−Independent−SampletTest</h3><h4id="3f715b45−ca0f−40bf−8a2a−132e9a5add17"data−toc−id="3f715b45−ca0f−40bf−8a2a−132e9a5add17"collapsed="false"seolevelmigrated="true">Between−SubjectsDesign</h4><p>Comparestwoindependentsamples(differentpeopleineachgroup).</p><ul><li><p><strong>PooledVariance(s_p^2): Weighted average of the variances from both samples.
Main Effect: The effect of one factor regardless of the other.
Interaction: Occurs when the effect of one factor depends on the level of the other factor (seen as non-parallel lines on a graph).
Analysis
A two-way ANOVA provides three hypothesis tests: Main Effect A, Main Effect B, and the A×B Interaction.
Simple Main Effect Test: Conducted if an interaction is significant, testing the effect of one factor at a single level of the other factor.
Chapter 15: Correlation
Measuring Relationships
Correlation Coefficient (r): Measures the direction (+/-) and strength (0 to 1) of a relationship.
Pearson Correlation: Used for interval/ratio data.
Spearman Correlation (rs): Used for ordinal/ranked data. rs=1−n(n2−1)6ΣD2
Point-Biserial (rpb): One continuous and one dichotomous variable.
Phi Coefficient ($\phi$): Two dichotomous variables.
Limitations
Causality: Correlation does not imply causation.
Outliers: Can significantly obscure or exaggerate relationships.
Restriction of Range: Narrow data sets can hide true relationships.
Chapter 16: Linear Regression and Multiple Regression
Linear Regression
Uses the relationship from correlation to predict values.
Regression Line: The "best-fitting" line (Y=bX+a).
Slope (b) and y-intercept (a).
Method of Least Squares: Minimizes the sum of squared residuals (SSresidual).
Standard Error of Estimate (se): Measures average prediction error.
Multiple Regression
Predicts a criterion variable (Y) using multiple predictor variables (X1,X2,…).
Multicollinearity: A problem where predictors are too highly correlated with each other (checked via VIF).
Beta Coefficient ($\beta$): Standardized coefficients that show the unique relative contribution of each predictor.
Chapter 17: Nonparametric Tests: Chi-Square Tests
Introduction to Chi-Square (\chi^2)
Chi-Square tests are essential statistical tools used primarily to analyze categorical data, particularly for evaluating relationships between variables or testing hypotheses about observed frequencies across categories. Unlike parametric tests, which assume the data follow a specific distribution, Chi-Square tests do not require such assumptions, making them particularly useful in research contexts where the data are nominal or ordinal.
Goodness-of-Fit Test
This test assesses how well the observed frequency distribution of a categorical variable aligns with an expected distribution derived from a theoretical hypothesis.
Purpose: To ascertain whether a sample distribution fits a specified distribution.
Formula: The test statistic is calculated using the formula:
χobt2=∑fe(fo−fe)2
where:
- f_o: Observed frequency for each category. - f_e: Expected frequency under the null hypothesis.
Interpretation: A significant Chi-Square value indicates that the observed frequencies differ markedly from the expected frequencies, leading to rejection of the null hypothesis that there is no difference between the observed and expected frequencies.
Test for Independence
This test evaluates whether two categorical variables are independent, allowing researchers to determine whether a relationship exists.
Purpose: To explore the association between two categorical variables, often assessed in contingency tables.
Procedure: The test computes a Chi-Square statistic based on the frequencies observed in a cross-tabulation format, where f_o is the observed frequency from the table, and f_e is calculated based on the products of the row and column totals divided by the grand total.
Interpretation: A significant result indicates that the two variables are likely related, while a non-significant result suggests independence.
Assumptions of Chi-Square Tests
To properly utilize Chi-Square tests, certain assumptions must be met:
Independence: Each observation should be independent of others. This ensures that the data points are not influenced by one another.
Minimum Frequency: Each expected frequency in the contingency table should be at least 5. If this condition is not met, the test results may be unreliable.
Categorical Data: The data should be nominal or ordinal, as Chi-Square tests cannot be used for continuous data without first converting it to categories.
Cramer’s V
Cramer’s V is a measure of effect size associated with the Chi-Square test for independence. It quantifies the strength of association between two categorical variables, providing insight into the practical significance of the observed relationship.
Calculation: It is calculated as follows:
V=n⋅min(k−1,r−1)χ2
where:
- n = total number of observations
- k = number of columns
- r = number of rows
Interpretation: Values range from 0 to 1. A value closer to 0 suggests a weak association, while a value closer to 1 indicates a strong association.
Applications of Chi-Square Tests
Chi-Square tests are broadly applicable across various disciplines such as:
Social Sciences: Examining the relationship between responses to survey items.
Healthcare: Investigating the association between lifestyle factors and health outcomes.
Market Research: Exploring consumer preferences across different demographic groups.
Psychology: Analyzing frequencies of categorical responses to psychological assessments.
Limitations of Chi-Square Tests
Despite their utility, Chi-Square tests have limitations that researchers must consider:
Sensitivity to Sample Size: Large sample sizes can inflate Chi-Square statistics, leading to significant results that may not be practically significant.
Data Type Restrictions: They can only be used with categorical data and are not suited for continuous variables without proper categorization.
Loss of Information: Grouping continuous data into categories can lead to information loss, potentially affecting the results.
Conclusion
Chi-Square tests remain fundamental tools in statistical analysis for categorical data. Their flexibility and nonparametric nature make them accessible for researchers examining relationships in various fields, though careful attention to assumptions and limitations is crucial for valid and reliable results.