Lecture 2 Notes: Data Coding & Entry, Levels of Measurement, and Visual Plots

0.0(0)

Studied by 0 people

Call with Kai

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/40

Earn XP

Description and Tags

Vocabulary flashcards covering data coding, levels of measurement, SPSS data entry, survey types, data plots, and related concepts from Lecture 2.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

41 Terms

New cards

Quantitative data

Numerical data produced when researchers measure something and assign a number value.

New cards

Discrete data

Quantitative data with finite values or categories; the values can only be counted and cannot be mathematically manipulated.
also called categorical data - this will be nominal

New cards

Continuous data

Quantitative data that can take an infinite number of values; values are interval or ratio and can be mathematically manipulated.
eg blood pressure, height, weight, body temp

New cards

sources of quantitative data

surveys
observations
secondary databases - data collected by somebody other than you

New cards

types of surveys

questionnaires - paper or electronic instruments w questions to collect data from individuals - self-administered surveys
interview-type surveys — face-to-face or over the phone/online between researcher and individual — interviewer-administered

New cards

types of survey questions

close ended — predetermined answers, easiest to analyze
open ended — answer in their own words, harder to analyze but can be done
closed and open ended — still restrictions, but then later ask for open ended

New cards

Nominal level of measurement

Assign numerical values to categories with no inherent order; values cannot be ranked; categories are mutually exclusive and exhaustive. these cannot be manipulated.
dichotomous is only two options, but falls under dichotomous

New cards

Ordinal level of measurement

Categories can be ordered or ranked;
intervals between categories are not equal;
categories are mutually exclusive and exhaustive.
subjective ratins will have ordinal measurement properties

New cards

Scale (Normal) level of measurement

Also called interval/ratio or normal scale; infinite possible values, equal intervals, true zero; suitable for parametric statistics.
normal distribution, continuous
this is where we can do mathematical manipulation
allows for parametric tests, which have more power or ability to detect relationships and/or differences. non normal have to use nonparametric tests, and have less power to determine relationships and/or differences compared to parametric

New cards

Dichotomous variable

A nominal variable with two levels (dummy variable); examples: 1=Male, 2=Female.

New cards

Data coding

Assign numerical codes to categories or attributes to prepare data for SPSS.

New cards

Levels of measurement

Concept used to determine data type and appropriate statistical tests; main levels: nominal, ordinal, scale.
assignment of numerical values to attributes of variables according to some rules

New cards

Data dictionary / data codebook

Documentation describing variables, coding, and measurement to guide data entry and analysis.

New cards

SPSS Variable View

SPSS window where you define variable properties: names, labels, values, and measure.

New cards

SPSS Data View

SPSS window showing the actual data; rows are cases, columns are variables.

New cards

Variable name

Meaningful, concise identifier for a variable in SPSS.

New cards

Variable label

Descriptive text describing a variable to aid interpretation.

New cards

Variable values

Numeric codes used to represent categories within a variable.

New cards

Value labels

Text labels attached to numeric codes in SPSS to describe categories.

New cards

Missing values

Codes or blanks indicating missing data; SPSS can use codes to distinguish reasons for missingness.

New cards

detecting data entry problems

eyeball it when entering the data - double and triple check
generate and inspect visual plots
generate and inspect descriptive statistics
- like mean, minimum, maximum, range
- mean will be really useful because it will be altered by outliers. if the mean is too big or small, it might suggest that there is incorrect data for variables

New cards

Frequency distribution

A table showing counts (or percentages) of each value for a variable. — how many times each score or value occurs for that variable. this will use a table
frequency plot is a graphical representation of values on a variable and can be used to represent all types of data

New cards

normal distribution

most scores are for middle values, with a small number of scores in the low or high values

New cards

negatively skewed distribution

extreme scores, or the tail of the curve are on the low end, or the left sidea

New cards

Bar chart

Best for nominal, dichotomous, and ordinal data; not ideal for normal/scale data.

New cards

Histogram

Best for scale/normal data; not appropriate for nominal or dichotomous data.

New cards

Frequency polygon

Line-connecting plot for distributions; best for scale/normal data, not nominal.

New cards

Box and whisker plot

Plot useful for ordinal and normal/scale data; shows quartiles and spread.

New cards

mean

average - takes into account all available info in computing central tendency of a frequency distribution
add all raw scores and divide by number of scores

New cards

median

good measure of central tendency for ordinal level raw data
the middle value
especially good when you have outliers

New cards

mode

most common category or score
least precise info on central tendency
this would be the tallest bar in bar graph or histogram

New cards

measures of variability

if all scores in distribution are the same, there is no variability
if scores are all different and far apart, variability will be high

New cards

range

a measure of variability
highest minus lowest score

New cards

standard deviation

measure of variability
based on deviation of each score from the mean of all scores

New cards

interquartile range

in box plot, distance between top and bottom of box
the whiskers are the expected range

New cards

Pie chart

Alternative to bar chart for nominal data; bar charts are usually easier to read.

New cards

Parametric statistics

Statistical tests with higher power used with normal/scale data.

New cards

Nonparametric statistics

Statistical tests used for non-normal data; generally less powerful.

New cards

SPSS Data Editor concepts

Interface for entering and managing data, including Data View and Variable View.

New cards

Two views in SPSS

Data View shows data; Variable View shows variable properties and measurement.

New cards

Mutually exclusive and exhaustive

Categories do not overlap (mutually exclusive) and cover all possibilities (exhaustive).