Statistics 2201 - Exam 1 Flashcards

0.0(0)

Studied by 0 people

Knowt Play

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/53

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

54 Terms

New cards

Data

A column of numbers.

New cards

Population

The totality of elements in a well-defined group that is to be studied.

New cards

Sample

Any part of the population that is:

Small enough to measure, and in a good sample is
Representative of the population.

New cards

Individual

One element of a population.

New cards

Simple Random Sampling

A method of sampling that gives every individual in the population the same chance of being chosen.

New cards

With Replacement

To select and measure an individual, then return it back into the population, so there is some small chance that it could be selected again.

New cards

Without Replacement

To select and measure an individual, then do not return it to the population, so there is no chance that it could be selected again.

New cards

Data Value

A measurement from an individual, such as height, volts, beak length.

New cards

Qualitative Data

Consists of qualities or categories.

New cards

Discrete Data

Consists of numerical information that has only a few possible values in the population (less than 15 to 20 values).

New cards

Continuous Data

Consists of numerical information that has many, many possible values in the population.

New cards

Constant

When the data value is fixed to only one possible value by there being only one value in the population.

New cards

Variable

When the data values can vary by there being more than one value in the population.

New cards

Random variable

When the data values can vary, and the value varies randomly.

New cards

Data Set

A specific way to organize data by putting the information from:

Variables into columns, and
Individuals into rows.

New cards

Experimental Study

Where the individuals are in a highly controlled environment before measuring, so that physical controls can be used to allow only the variable of interest to have an effect.

New cards

Observational Study

Where the individuals are in an uncontrolled environment before measuring, so that statistical controls are needed to cancel out the effects of the non-interesting variables.

New cards

Probabilistic Data

Where the value of the next data value is not known (this is the short run), but the value of many, many data values is very well known (this is the long run)

New cards

Purpose of Statistics

To extract information from columns of sample data, to help make better decisions.

New cards

Descriptive Statistics

Statistical methods used to summarize and describe columns of data for the purpose of extracting information about the values of the data.

New cards

Summary Numbers

Numerical values used to summarize one characteristic from a column of data in order to communicate the largest amount of information as simply as possible.

New cards

Inferential Statistics

Statistical methods that combine sample with probability to get information about a population.

New cards

Distribution of Data

The shape, location, and spread of a column of data values.

New cards

Mathematical Distance

The number of mathematical units between two numbers. Found by taking the difference between the two numbers.

New cards

Statistical Distance

The number of spread units between two numbers. Found by dividing the mathematical distance by the spread of the data.

New cards

Concept of Close and Far

Uses the probability distribution of the population, expressed in terms of statistical distance, to determine which values are close to the population mean and which values are far from the population mean.

New cards

Relationship

Used to determine if, and how, the values of one variable relate to the values of another variable (i.e. the relationship).

New cards

Lurking Variable-

A known, or unknown, variable whose values affect the values of the variables being studied.

New cards

Analytical Thinking

To break a big problem down into smaller parts, solve each part individually, then put the parts back together to get the answer to the big problem.

New cards

Synthetic Thinking

To look at the whole problem at once, see what aspect of the problem is the most important, then use this aspect to solve the problem.

New cards

Process of Abstraction

To look at a problem, extracting only the information relevant to solving the problem, and ignoring all other, unneccessary, information.

New cards

Distribution of Data

To describe a column of data by giving the shape, location, and spread of all the data values in the column.

New cards

Shape

Refers to the pattern the data values make when graphed, usually over a real number line. Shape can be expressed with a bar chart for any data, or with an equation for pretty data.

New cards

Location

Gives the middle of the data values, again usually on a real number line. Location is often considered the most representative data value of the column.

New cards

Spread

Gives the width of the data values over a real number line in how far away the minimum data value is from the maximum data value, or by measuring how far the data values are from the middle on average.

New cards

Summary Number

A single number summarizing information about one characteristic from a column of data.

New cards

Parameter

A number summarizing information for a characteristic from a column containing population data.

Its value does not change when repeating the statistical process because the column of data values does not change.
Usually denoted with Greek letters (μ, σ, or ρ).

New cards

Statistic

A number summarizing information for a characteristic from a column containing sample data.

Its value changes when repeating the statistical process because the column of data values changes every time a new random sample is chosen.
Usually denoted with Roman letters (¯x, s, or r).

New cards

Size

The number of data values in a column of data.

Denoted: Denoted: N for population, and n for sample.

New cards

Degrees of Freedom

The number of units of information contained in a sample statistic.

New cards

Efficient Statistics

Summary numbers that extract the most information about a characteristic out of a column of data.

New cards

Resistant Statistics

Summary numbers that extract less, but more robust, information about a characteristic out of a column of data.

Used for discrete or continous data.
Strength is that they are weakly affected by extreme values (resistant).
Weakness is that they contain less information than efficient statistics.

New cards

Frequency Table

A table summarizing the shape information in a column of data by listing all possible data values, and recording how often each value occurs in the column of data.

New cards

Bar Chart

A graphical summary of a frequency table for qualitative data giving shape information by showing a non-touching bar for each category, with the height of the bar representing how many data values are in the category.

New cards

Histogram

A graphical summary of a frequency table for discrete data giving shape information by showing a touching bar for each category, with the height of the bar representing how many data values are in the category.

New cards

Binning

To separate continuous data into bins (or groups) to reduce the number of values (or categories) for use in a frequency table.

New cards

Stem-and-Leaf Plot

A graphical summary of continuous data giving shape information by displaying each data value as a stem for the category, and a leaf for the bar.

New cards

Boxplot

A graphical summary of continuous data giving shape information by displaying the middle 50% of the data values as a box, the median as a line inside the box, and the upper 25% and lower 25% of the data values as tails on either side of the box.

New cards

Overall Shape of Data

Overall shape of data will be simply defined with two characteristics:

Modality
Symmetry

New cards

Exceptions from the Overall Shape

There are three major exceptions to the overall shape of the data:

Extreme values
Gaps or peaks
Patterns or grouping.

New cards

Efficient statistics

Statistics that are designed to extract the most information about a characteristic from a column of data values.

New cards

Mean

New cards