The Structure of Data: Cases, Variables, and Data Types (Pages 1-5)

Description and Tags

Vocabulary flashcards covering key concepts from the notes: cases, variables, dataset structure, quantitative vs categorical variables, explanatory vs response variables, and the specific example fields from Hollywood data, yogurt study, and college data.

26 Terms

1
Case (in a dataset)

A single unit of observation; a row describing one entity in the data (e.g., one movie).

2
Variable (in a dataset)

A characteristic measured for each case; a column in the dataset.

3
Dataset

A collection of cases (rows) and variables (columns) used for analysis.
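
The rows-and-columns structure can be sketched in a few lines of Python. This is only an illustration: the three rows are invented placeholders (not values from the Hollywood dataset), and storing each case as a dictionary is an assumption made here for convenience.

```python
# Hypothetical placeholder rows; the values are invented for illustration
# and are NOT taken from the Hollywood dataset.
dataset = [
    {"Title": "Movie A", "Genre": "Action", "Budget": 100.0},
    {"Title": "Movie B", "Genre": "Comedy", "Budget": 30.0},
    {"Title": "Movie C", "Genre": "Drama", "Budget": 15.0},
]

n_cases = len(dataset)               # one row per case
variables = list(dataset[0].keys())  # one column per variable

print(n_cases)
print(variables)
```

Each dictionary is one case, and each key is one variable measured on that case.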

4
Quantitative variable

A variable whose values are numbers representing measurements or counts (e.g., Budget, Opening, Rating).

5
Categorical variable

A variable that assigns each case to a named category (e.g., Genre, LeadStudio, Title).
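
One rough way to tell the two types apart in code is to check whether every value in a column is a number. The helper `variable_type` and the sample rows below are hypothetical, written only for this illustration; the check is a heuristic, not a rule from the notes.

```python
from numbers import Number

def variable_type(values):
    """Classify a column as 'quantitative' if all values are numeric, else 'categorical'."""
    numeric = all(isinstance(v, Number) and not isinstance(v, bool) for v in values)
    return "quantitative" if numeric else "categorical"

# Hypothetical placeholder rows (invented values, not real movie data).
rows = [
    {"Genre": "Action", "Budget": 100.0, "Rating": 60},
    {"Genre": "Comedy", "Budget": 30.0, "Rating": 75},
]

# Build one column per variable name and classify it.
types = {name: variable_type([row[name] for row in rows]) for name in rows[0]}
print(types)
```

The heuristic breaks down for numeric codes such as ZIP codes or ID numbers, which are stored as numbers but behave as categories, so the final call still depends on what the variable means.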

6
Explanatory variable

The variable used to explain or predict the outcome; also called the independent variable.

7
Response variable

The outcome variable that the explanatory variable is used to explain or predict; also called the dependent variable.
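
In code, choosing explanatory and response variables amounts to deciding which column plays which role. The sketch below uses a hypothetical helper `split_xy` and invented placeholder values; treating Budget as explanatory and Opening as response is an assumption made for the example.

```python
def split_xy(rows, explanatory, response):
    """Return (x, y): explanatory values and response values, aligned by case."""
    x = [row[explanatory] for row in rows]
    y = [row[response] for row in rows]
    return x, y

# Hypothetical placeholder rows (invented values, in millions of dollars).
rows = [
    {"Budget": 10.0, "Opening": 5.0},
    {"Budget": 20.0, "Opening": 10.0},
    {"Budget": 30.0, "Opening": 15.0},
]

# Assumed roles: Budget explains, Opening is the outcome.
x, y = split_xy(rows, explanatory="Budget", response="Opening")
print(x)
print(y)
```

The data do not change when the roles are swapped; only the question being asked does.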

8
Opening

Opening weekend gross, measured in millions of dollars.

9
Budget

Film budget, measured in millions of dollars.

10
Rating

Audience rating on a 100-point scale.

11
Genre

Category/type of movie; a categorical variable.

12
LeadStudio

Movie production studio; a categorical variable (e.g., Sony, Warner Bros).

13
Title

The film's name; the variable that identifies each case in the Hollywood dataset.

14
Cases in Hollywood dataset

The 136 movies (observations) in the full 2011 dataset; each movie is one case.

15
Two-variable relationship question

A question about how two variables relate or affect each other (e.g., budget vs. opening weekend).

16
Single-variable question

A question focusing on one variable (e.g., average rating across movies).
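
The difference between the two kinds of question shows up in how many columns a computation needs. The sketch below answers a single-variable question with a mean and a two-variable question with a Pearson correlation. The correlation is just one possible summary chosen for this illustration, the helper `pearson_r` is hypothetical, and all values are invented placeholders.

```python
from math import sqrt
from statistics import mean

def pearson_r(xs, ys):
    """Pearson correlation between two equal-length numeric lists."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical placeholder rows (invented values, not real movie data).
rows = [
    {"Rating": 60.0, "Budget": 10.0, "Opening": 5.0},
    {"Rating": 70.0, "Budget": 20.0, "Opening": 10.0},
    {"Rating": 80.0, "Budget": 30.0, "Opening": 15.0},
]

# Single-variable question: what is the average rating? (uses one column)
mean_rating = mean([row["Rating"] for row in rows])

# Two-variable question: how are budget and opening related? (uses two columns)
r = pearson_r([row["Budget"] for row in rows], [row["Opening"] for row in rows])

print(mean_rating)
print(r)
```

Because the placeholder openings are exactly half the budgets, the correlation here comes out at 1 (up to rounding), which says nothing about real movies.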

17
Explanatory variable (Yogurt study)

Yogurt consumption (whether or not a participant eats yogurt), used to explain potential weight change.

18
Response variable (Yogurt study)

Weight change (amount of weight lost or gained) observed as the outcome.

19
Cases (Yogurt study)

Individuals observed in the weight-loss study (participants).

20
Type of college

Private or Public; a categorical variable.

21
Enrollment

Total number of students enrolled; a quantitative variable.

22
Student-Faculty Ratio

Ratio of students to faculty members; a quantitative variable.

23
Graduation Rate

Percentage of students who graduate; a quantitative variable.

24
Tuition

Cost of tuition; a quantitative variable.

25
State

U.S. state where the college is located; a categorical variable.

26
Dataset fields (Hollywood example)

The variables listed for the dataset: Title, LeadStudio, Rating, Genre, Budget, Opening.