U1

studied byStudied by 2 people
0.0(0)
get a hint
hint

Categorical data

1 / 41

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

42 Terms

1

Categorical data

refers to data that can be divided into categories or groups based on qualitative characteristics.

New cards
2

categorical variable

is one that represents characteristics or qualities rather than numerical values. It consists of categories or groups into which data can be classified.

New cards
3

center

refers to the middle or average value of a data set. It represents the typical or central value around which the data tends to cluster.

New cards
4

Descriptive statistics

involves organizing, summarizing, and presenting data in a meaningful way to describe its main features.

New cards
5

Inferential statistics

involves using sample data to make inferences or draw conclusions about a population.

New cards
6

Outliers

are extreme values that significantly differ from other values in a dataset. They can greatly affect statistical analyses and should be carefully examined.

New cards
7

Predictive modeling

involves using historical data and statistical algorithms to make predictions about future outcomes.

New cards
8

proportion

is a fraction or percentage that represents the relationship between a part and a whole in a population or sample.

New cards
9
  • Categorical

    • groups

    • proportions to measure

    • eye color, statuses

      • visualized via __________

  • bar graph, pie chart, table, mosaic plot

New cards
10
  • Quantitative

    • measured, counted

    • means to measure

    • height, age

      • visualized via _____

  • histogram, stem leaf, box whisker

New cards
11
  • Five number summary

    • left to right on box plot: ______

  • minimum, q1, median, q3, maximum

New cards
12
New cards
13

Quantitative data

refers to numerical information that can be measured or counted. It involves quantities and can be analyzed using mathematical methods.

New cards
14

SOCS to describe DISTRIBUTION

shape, outlier, center, spread

New cards
15

shape

symmetry skewness modality

New cards
16

outlier

strong variation from other values that affect stat measures; determine with 1.5IQR, 2SD rule

New cards
17

center

mean median mode

New cards
18

spread described by

range (IQR), variance, stand dev

New cards
19

IQR equals

q3-q1

New cards
20

standard deviation does

measures how far values are from mean

New cards
21

variance

measures variability

New cards
22

uniform

height approx same everywhere, no significant mode

New cards
23
  • _ is easily affected by outliers, best measure of central tendency unless skewed / outliers

  • __ is OUTLIER RESISTANT; better measure or central tendency when data skewed, outliersmean is easily affected by outliers, best measure of central tendency unless skewed / outlier

mean, median

New cards
24

z score is equal to

data point minus mean value all divided by standard deviation

New cards
25
  • categorical: ____

  • quantitative: _____

  • bar graphs, NOT _____

  • watch the scale!

  • frequency = ____

  • relative frequency =____

  • qualitative, numerical, histograms, count, percentage

New cards
26
  • describing distribution: SOCS (____________)

  • ____ is affected by skew, ____ is not

  • histograms: if comparing distributions w/ diff. sample sizes, use ______

shape outlier center spread, mean, median, relative frequency

New cards
27
  • IQR = Q_-Q_

    • middle 50% of observations

  • finding Q1,Q3:

    • odd: exclude _____ (Q2)

    • even: split ____

  • five-number summary (1-Var Stats): ____

    • turn into box plot (use TRACE to find outliers)

  • standard deviation: typical distance from ___

1,3, median, median, min, Q1, Q2, Q3, max, mean

New cards
28

describe how standard deviation is calculated

find the sum of initial x minus x raised to the negative second power. divide by

<p>find the sum of initial x minus x raised to the negative second power. divide by </p>
New cards
29

continuous data

numerical data that can take on any value within a given range. infinite possible values

New cards
30

interval level of measurement

is a type of measurement scale that not only categorizes data but also allows for meaningful comparisons between the values. It has equal intervals between the numbers, but there is no true zero point.

New cards
31

nominal level of measurement

is the lowest level of measurement where variables are categorized into distinct groups or categories based on their characteristics or attributes.

New cards
32

ordinal level of measurement

a type of measurement scale where variables are ranked or ordered based on their attributes. The order matters, but the differences between values may not be equal or meaningful.

New cards
33

ordinal variable

is a type of categorical variable that has a natural order or ranking. The categories can be ranked or ordered based on some characteristic or attribute.

New cards
34

contingency table is a type of table that is used to organize and (later on) analyze ____ data. It shows how the observations in a dataset are distributed among different _____ of two or more variables.

categorical, categories

New cards
35

ratio level of measurement

similar to interval level, as it allows for meaningful comparisons and equal intervals. However, ratio level also has a true zero point which represents an absence or complete lack of the measured attribute.

New cards
36
New cards
37
New cards
38
New cards
39
New cards
40
New cards
41
New cards
42
New cards

Explore top notes

note Note
studied byStudied by 17 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 31 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 14 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 55 people
Updated ... ago
5.0 Stars(3)
note Note
studied byStudied by 15 people
Updated ... ago
4.0 Stars(1)
note Note
studied byStudied by 21 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 4 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 5023 people
Updated ... ago
4.8 Stars(21)

Explore top flashcards

flashcards Flashcard30 terms
studied byStudied by 8 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard174 terms
studied byStudied by 10 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard36 terms
studied byStudied by 31 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard48 terms
studied byStudied by 7 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard30 terms
studied byStudied by 28 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard41 terms
studied byStudied by 2 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard63 terms
studied byStudied by 4 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard34 terms
studied byStudied by 32 people
Updated ... ago
5.0 Stars(2)