AP CSP Big Idea 2 (Data): Analysis, Metadata, and Preparing Data for Use

0.0(0)
Studied by 0 people
0%Big Idea 2 Mastery
0%Exam Mastery
Build your Mastery score
multiple choiceMultiple Choice
call kaiCall Kai
Supplemental Materials
Card Sorting

1/24

Last updated 3:08 PM on 3/12/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

25 Terms

1
New cards

Data (AP CSP sense)

Any information stored in a form a computer can process (e.g., numbers, text, images, sounds, locations, clicks, sensor readings).

2
New cards

Program (in data analysis)

A set of precise steps that takes data as input and can transform, summarize, and extract patterns from it.

3
New cards

Transform (data)

Change data’s form or representation (e.g., convert units, create new fields, recode categories).

4
New cards

Summarize (data)

Compute compact descriptions of a dataset such as counts, totals, averages, minimum/maximum, or distributions.

5
New cards

Pattern extraction

Using computation to find relationships, trends, clusters, or unusual values (anomalies) in data.

6
New cards

Iteration (through a dataset)

Looping through many records/values to compute results (e.g., checking every row in a table).

7
New cards

Filtering

Keeping only records that match a condition (e.g., only rows where grade = 12 or value > threshold).

8
New cards

Aggregation

Grouping and combining data to produce summaries (e.g., totals per category).

9
New cards

Visualization

Presenting results in charts/graphs/maps so humans can interpret patterns and summaries.

10
New cards

Data-processing pipeline

Common sequence of steps: input → parse → clean/validate → transform → analyze → output.

11
New cards

Parse

Interpret a data format into usable parts (e.g., split a CSV row into columns).

12
New cards

Clean/Validate

Detect and handle missing/invalid values, formatting issues, duplicates, and inconsistencies so analysis is reliable.

13
New cards

Selection (rows)

A process that keeps specific records based on a rule (e.g., appending only rows where grade = 12).

14
New cards

Counter (variable)

A variable that increases by 1 for each item that matches a condition (used for counting).

15
New cards

Sum (accumulator)

A running total that adds the data values themselves (used before computing totals/averages).

16
New cards

Average (mean)

A summary statistic computed as total sum divided by number of items; requires both sum and count (or LENGTH).

17
New cards

Missing values

Data entries not recorded or absent (e.g., blank, NA, null, ?), which can skew or break computations if not handled.

18
New cards

Duplicate records

The same person/event recorded multiple times; can distort counts and totals unless duplicates are appropriately handled.

19
New cards

Inconsistent categories

Same category represented in different text forms (e.g., "NY", "New York", "newyork"), preventing correct grouping/counting without standardization.

20
New cards

Outlier / impossible value

An unusually extreme or invalid entry (e.g., negative age, temperature of 999) that may indicate error or a rare real event.

21
New cards

Biased sample

Data that does not represent the target population; programs cannot fix this, so results may be misleading.

22
New cards

Correlation vs. causation

A correlation means two values vary together; it does not prove one causes the other.

23
New cards

Metadata

“Data about data”: context that explains meaning, units, quality, and constraints so data can be interpreted correctly.

24
New cards

Data dictionary

A metadata document describing each field/column (name, meaning, data type, allowed values, units, missing-value rules).

25
New cards

Imputation

Filling in missing data with an estimated/default value (e.g., group average), which can hide uncertainty and distort results if unjustified.

Explore top notes

note
Biology: Nervous System
Updated 1233d ago
0.0(0)
note
Microbiology Quiz 3 (BIO 210)
Updated 140d ago
0.0(0)
note
AP1
Updated 472d ago
0.0(0)
note
Chains Vocab Pt.2
Updated 1149d ago
0.0(0)
note
Chapter 6: Learning
Updated 1076d ago
0.0(0)
note
COMPOSITION
Updated 1016d ago
0.0(0)
note
Biology: Nervous System
Updated 1233d ago
0.0(0)
note
Microbiology Quiz 3 (BIO 210)
Updated 140d ago
0.0(0)
note
AP1
Updated 472d ago
0.0(0)
note
Chains Vocab Pt.2
Updated 1149d ago
0.0(0)
note
Chapter 6: Learning
Updated 1076d ago
0.0(0)
note
COMPOSITION
Updated 1016d ago
0.0(0)

Explore top flashcards

flashcards
RT 460 test 2
48
Updated 367d ago
0.0(0)
flashcards
88-110 biology prefixes
23
Updated 910d ago
0.0(0)
flashcards
Unit 2: Age of Discovery
20
Updated 540d ago
0.0(0)
flashcards
History Final Exam
72
Updated 1020d ago
0.0(0)
flashcards
Civics Unit 5 Test
54
Updated 1059d ago
0.0(0)
flashcards
Scramble for Africa
39
Updated 1080d ago
0.0(0)
flashcards
History Exam
134
Updated 1077d ago
0.0(0)
flashcards
RT 460 test 2
48
Updated 367d ago
0.0(0)
flashcards
88-110 biology prefixes
23
Updated 910d ago
0.0(0)
flashcards
Unit 2: Age of Discovery
20
Updated 540d ago
0.0(0)
flashcards
History Final Exam
72
Updated 1020d ago
0.0(0)
flashcards
Civics Unit 5 Test
54
Updated 1059d ago
0.0(0)
flashcards
Scramble for Africa
39
Updated 1080d ago
0.0(0)
flashcards
History Exam
134
Updated 1077d ago
0.0(0)