Data Analysis and Visualization: Key Concepts and Tools

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/24

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

25 Terms

1
New cards

What is data?

Tabular, computer-readable files composed of rows (records) and columns (fields).

2
New cards

What does the DIKW hierarchy stand for?

Data, Information, Knowledge, Wisdom.

3
New cards

Name three common data file formats.

CSV, XLSX, TXT.

4
New cards

What is raw data?

Unaggregated individual records.

5
New cards

What is summarized data?

Aggregated totals or averages.

6
New cards

What is the difference between primary and secondary data?

Primary data is collected by you; secondary data is collected by others.

7
New cards

What is metadata?

Information describing variables, units, and sources.

8
New cards

What is a 'data state of mind'?

A mindset to spot where data exists.

9
New cards

What are three reasons agencies collect data?

Legal mandates, mission needs, open-government policies.

10
New cards

What is FOIA?

Freedom of Information Act, a path to obtaining data.

11
New cards

What should you specify when requesting offline data?

Dataset name, fields, date range, preferred electronic format.

12
New cards

What is 'data dirt'?

Errors in data, such as duplicates, misspellings, or invalid entries.

13
New cards

What are two common sources of data dirt?

Duplicates and missing values.

14
New cards

What is the purpose of data integrity checks?

To validate datasets for completeness and consistency.

15
New cards

What Excel tools can be used for data integrity checks?

`COUNTIF`, `SUMIF`, `VLOOKUP`, pivot tables.

16
New cards

What is the importance of preserving the original file before cleaning?

To maintain a backup of the unaltered data.

17
New cards

What is the difference between mean and median?

Mean is the average; median is the middle value in a dataset.

18
New cards

Why normalize counts into rates before comparing across regions?

To account for differences in population size.

19
New cards

What are pivot tables used for?

To aggregate and summarize data in Excel.

20
New cards

What are key components of a clear chart?

Labeled axes, units, sources, and context.

21
New cards

What types of charts are best for comparing categories?

Bar charts.

22
New cards

Why should you avoid too many slices in a pie chart?

It can make the chart difficult to read and interpret.

23
New cards

What feature in Excel allows you to add regression lines to scatterplots?

Chart formatting options.

24
New cards

What factors should you consider when choosing a web visualization tool?

Usability, interactivity, and ethical publishing.

25
New cards

What is the first step in using a web visualization platform?

Clean data and export it as CSV.

Explore top flashcards

Peripheral Nerve
Updated 905d ago
flashcards Flashcards (62)
-4 Poverty, Part 1
Updated 1088d ago
flashcards Flashcards (61)
BIO-205 Chapter 12
Updated 263d ago
flashcards Flashcards (51)
Anime
Updated 51d ago
flashcards Flashcards (70)
Optics and Vision
Updated 45d ago
flashcards Flashcards (50)
Peripheral Nerve
Updated 905d ago
flashcards Flashcards (62)
-4 Poverty, Part 1
Updated 1088d ago
flashcards Flashcards (61)
BIO-205 Chapter 12
Updated 263d ago
flashcards Flashcards (51)
Anime
Updated 51d ago
flashcards Flashcards (70)
Optics and Vision
Updated 45d ago
flashcards Flashcards (50)