data science

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/44

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

45 Terms

1
New cards

Lookup Questions

could simply find the answer by searching for a specific value in the data

2
New cards

Compute Questions

requires you to use math to find the answer, it uses more than one value found in the data

3
New cards

Relate Questions

requires you to find a possible relationship between more than one value.

4
New cards

Question (could not be answered)

question requires additional information, data

5
New cards

Lookup Question Example

What was the gender of the shopper who purchased an item on January 5, 2019

6
New cards

Compute Question Example

What was the purchase total of the shopper on January 5, 2019

7
New cards

Relate Question Example

Do females or males spend more at the store?

8
New cards

Intersection of fields in data science

statistics, coding, and business knowledge

9
New cards

Data Science

the process of learning about the world using data and computation.

10
New cards

Statistical Questions

could have a variety of different answers, involves looking at more than one piece of data.

11
New cards

Use of data science

make predictions, draw reliable conclusions about the world.

12
New cards

Data Science Life Cycle

a sequence of steps taken to process and use data.

13
New cards

Ask Questions

formulate statistical questions that can be answered with data

14
New cards

Considered Data

collect or record data, or finding an existing data

15
New cards

Analyze Data

run calculations and or create data displays to identify patterns and relationships

16
New cards

Interpret Data

answer questions and determine results

17
New cards

Qualitative Data

data the can be divided into different categories, descriptive data

18
New cards

Quantitative Data

numeric data that can be counted or measured

19
New cards

Data Table

used to organize data in data science, each row represents a case and each column represents a variable

20
New cards

Column

A vertical stack of cells in a table.

21
New cards

Row

The horizontal placement of cells in a table.

22
New cards

Structured Data

Data that (1) are typically numeric or categorical; (2) can be organized in a way that is easy for computers to read, organize, and understand; and (3) can be inserted into a database.

23
New cards

Interpret Data

an observation that lies outside the overall pattern of a distribution

24
New cards

Data Cleaning

The process of fixing or removing incorrect, corrupted, incorrectly formatted, or duplicated data.

25
New cards

Sorting

Arranging data in a specified order.

26
New cards

Filtering

To create displays for relevant information only.

27
New cards

Line Chart

Chart used to illustrate changes in data over time

28
New cards

Pie Chart

used to display distribution, shows the relationship of a part to a whole

29
New cards

Bar Chart

effective for comparing data across different categories and display relationships

30
New cards

Data Visualization

the presentation of data in a pictorial or graphical format

31
New cards

Average

returns the average (arithmetic means) of its arguments, mean

32
New cards

Min

returns the smallest number in a set of values

33
New cards

Max

returns the highest value in a set of data

34
New cards

Revenue

multiplying the quantity of goods by price

35
New cards

If formula

performs logical comparisons and return different results depending on the outcome

36
New cards

Auto Sum

A function that automatically identifies and adds ranges of cells in your worksheet.

37
New cards

Absolute Reference

A cell reference that does not change when a formula is copied to a new location.

38
New cards

=

symbol was used at the beginning of each formula in Excel Spreadsheet

39
New cards

Data labels

text used to identify data points or categories, used to identify each value in the data series

40
New cards

Raw data

The original data as it was collected, not yet processed, formatted, or analyzed.

41
New cards

Data encryption

The process of encoding or translating data into another form so that only the intended recipient can decrypt and read the data

42
New cards

Data minimization

Limiting the collection of personal information to that which is directly relevant and necessary to accomplish a specific task.

43
New cards

Data anonymization

The process of protecting people's private or sensitive data by eliminating identifying information

44
New cards

Data aggregation

The process of collecting and organizing large amounts of information

45
New cards

Statistical questions

Questions that account for variability in the responses, or many different answers