Data Analytics Exam


52 Terms

1

Crosstab/Contingency table

slices data by 2 categorical variables, bivariate
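A minimal sketch of how a crosstab is built, using only the standard library. The survey records and the names `records` and `table` are made up for illustration.

```python
from collections import Counter

# Hypothetical survey records: (region, preferred_channel), both categorical.
records = [
    ("East", "Online"), ("East", "Store"), ("West", "Online"),
    ("West", "Online"), ("East", "Online"), ("West", "Store"),
]

# Count how often each (row category, column category) pair occurs.
counts = Counter(records)

rows = sorted({region for region, _ in records})
cols = sorted({channel for _, channel in records})

# Nested dict: table[row][col] -> frequency of that combination.
table = {r: {c: counts[(r, c)] for c in cols} for r in rows}

print("      ", cols)
for r in rows:
    print(r, [table[r][c] for c in cols])
```

Each cell holds the number of records that fall into both categories at once, which is what makes the table bivariate.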

2

Bar chart

utilizes a frequency table and categorical data, univariate (one variable), uses height/length

3

Histogram

usually used for continuous (numerical) data, univariate (one variable), uses height/length
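A rough sketch of the binning step behind a histogram: continuous values are grouped into equal-width intervals and the count per interval becomes the bar height. The data, `bin_width`, and `start` are assumptions chosen for illustration.

```python
# Hypothetical continuous measurements (e.g. delivery times in minutes).
values = [12.5, 14.1, 15.0, 17.3, 18.8, 21.2, 22.9, 23.4, 27.6, 31.0]

bin_width = 5.0
start = 10.0
n_bins = 5  # bins: [10,15), [15,20), [20,25), [25,30), [30,35)

counts = [0] * n_bins
for v in values:
    # Index of the bin that contains v (left edge included, right edge excluded).
    index = int((v - start) // bin_width)
    counts[index] += 1

# Text version of the chart: bar length encodes the frequency.
for i, c in enumerate(counts):
    lo = start + i * bin_width
    print(f"[{lo:.0f}, {lo + bin_width:.0f}) " + "#" * c)
```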

4

Positively/Right skewed

heavy to left side, light on right side (TAIL ON RIGHT)

5

Negatively/Left skewed

heavy to right side, light on left side (TAIL ON LEFT)
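One way to check skew direction from numbers alone is to compare the mean with the median: a long right tail usually pulls the mean above the median, and a long left tail pulls it below. This is a rule of thumb that does not hold for every distribution, and the data below are made up.

```python
from statistics import mean, median

right_skewed = [1, 2, 2, 3, 3, 3, 4, 20]          # long tail on the right
left_skewed = [-20, -4, -3, -3, -3, -2, -2, -1]   # mirror image, tail on the left


def skew_direction(data):
    """Rough heuristic only: compare the mean with the median."""
    m, med = mean(data), median(data)
    if m > med:
        return "right"
    if m < med:
        return "left"
    return "symmetric"


print(skew_direction(right_skewed))
print(skew_direction(left_skewed))
```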

6

Stacked bar/column chart

utilizes a contingency table, can have both data types, usually categorical though, multivariate

7

Line chart

shows how a measured value changes over time (2 variables: time and the measurement)

8

Scatterplot

relationship between 2 numeric variables; a third variable can be categorical, shown with a legend; uses position; can be bivariate or multivariate

9

Business analytics

data analyses for business applications

10

Data science

develop applications for end users

11

Sequence/Types of Analytics

Descriptive → Diagnostic → Predictive → Prescriptive

12

Statistical inference

the process of using data from a sample to gain information about the population
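A small sketch of the idea, assuming a simulated population: the mean of a random sample is used as an estimate of the population mean. The population parameters (mean 50, standard deviation 10) and the sample size are arbitrary choices for illustration.

```python
import random
from statistics import mean

random.seed(0)  # fixed seed so the sketch is repeatable

# Hypothetical population: 10,000 order values.
population = [random.gauss(50, 10) for _ in range(10_000)]

# Draw a simple random sample and use it to estimate the population mean.
sample = random.sample(population, 200)

print(f"population mean: {mean(population):.2f}")
print(f"sample estimate: {mean(sample):.2f}")
```

In practice the population mean is unknown, which is why the sample estimate is needed.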

13

Sampling bias


occurs when the method of selecting a sample causes the sample to differ from the population in some relevant way
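A toy illustration with made-up numbers: surveying only weekday customers gives an estimate far from the true population mean, while a sample that draws from both groups lands closer.

```python
from statistics import mean

# Hypothetical population: spend per customer, by day type.
weekday = [20, 22, 25, 21, 23, 24]
weekend = [40, 45, 42, 44, 41, 43]
population = weekday + weekend

# Biased selection method: only weekday customers can be chosen.
biased_sample = weekday[:4]

# More representative selection: customers from both groups.
balanced_sample = weekday[:2] + weekend[:2]

print("population mean:", mean(population))
print("biased estimate:", mean(biased_sample))
print("balanced estimate:", mean(balanced_sample))
```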

14

Time series data

data values observed over time

15

Cross sectional data

values observed at the same point in time

16

Structured Data

Reside in a pre-defined, row-column format; Spreadsheet or database applications; Enter, store, query, and analyze

17

Unstructured Data

Do not conform to a pre-defined, row-column format; Textual; Multimedia content

18

Discrete data

numerical, countable values (usually whole numbers, no decimals), more rigid, would be a more jagged graph

19

Continuous data

numerical, yes to decimals; can take any value in a range, so there is no single "next" number above 1

20

How to determine if it’s numerical or categorical

if you can perform a relevant calculation then it’s numerical, ex: avg/mean (the avg of zip codes is meaningless, so zip codes are nominal)
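A short sketch of that test, with made-up values: an average makes sense for order totals, while for zip codes the useful summary is a count per category. Storing zip codes as text is one common way to keep them from being averaged by accident.

```python
from collections import Counter
from statistics import mean

order_totals = [19.99, 35.50, 12.00, 35.50]        # numerical: a mean is meaningful
zip_codes = ["10001", "94105", "10001", "60601"]   # nominal: labels stored as text

avg_total = mean(order_totals)

# For nominal data, count occurrences instead of averaging.
top_zip = Counter(zip_codes).most_common(1)[0]

print(avg_total)
print(top_zip)
```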

21

Nominal data

categorical, no order, can be numeric but usually words, ex: 1=yes 0=no, uniform (jersey) numbers, zip code

22

Ordinal data

categorical, ranked (not necessarily a preference); the ORDER matters
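Because the order lives in the category definition rather than in the labels themselves, code has to be told the ranking explicitly. A minimal sketch, assuming a hypothetical small/medium/large scale:

```python
# Hypothetical ordinal scale: the labels have a defined order.
SIZE_ORDER = ["small", "medium", "large"]
rank = {label: i for i, label in enumerate(SIZE_ORDER)}

responses = ["large", "small", "medium", "small"]

# Sort by rank on the scale, not alphabetically.
sorted_responses = sorted(responses, key=rank.__getitem__)

print(sorted_responses)
print(sorted(responses))  # alphabetical order ignores the scale
```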

23

Lollipop chart

variation of a bar chart, uses height/length

24

Bullet graph

Encodes data using length/height, position and color to show actual compared to target and performance bands

25

Dot plot

a univariate plot for continuous data, uses position

26

Box and whisker plot

univariate, for continuous data, uses position and height/length
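A box and whisker plot is typically drawn from a five-number summary (minimum, first quartile, median, third quartile, maximum). The sketch below computes one with the standard library; the data are made up, and the `inclusive` quartile method is just one of several conventions, so other tools may give slightly different quartiles.

```python
from statistics import quantiles

# Hypothetical continuous measurements, one variable.
data = [2, 4, 4, 5, 7, 8, 9, 11, 15]

# Quartiles split the sorted data into four parts.
q1, q2, q3 = quantiles(data, n=4, method="inclusive")

five_number_summary = (min(data), q1, q2, q3, max(data))
iqr = q3 - q1  # height/length of the box

print(five_number_summary)
print("IQR:", iqr)
```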

27

Pie chart

uses angle, area and arc to show a part-to-whole comparison, univariate, can be categorical or continuous

28

Line chart

uses position and often shows trend over time, usually bivariate, time usually on x-axis and y-axis is usually numerical

29

Sparkline / Sparkbar

uses position (line) or height/length (bar) in a small, word-sized graphic

30

Bubble plot

Allows adding more variables to a scatterplot, can use color and size to visualize other (likely numerical) data, multivariate

31

Heat Map

uses color, uses numerical data but does not use numbers in the visualization, bivariate

32

Visual perception

the brain's ability to receive, interpret, and act upon visual stimuli

33

Preattentive attributes

visual properties that we notice without using conscious effort to do so

34

Important preattentive attributes used in graphs

Length, width, orientation (is it a different way than the others), size, shape, color hue, color intensity, position, texture

35

Marks to encode quantitative values

Points, lines, bars, boxes, shapes with 2-d areas, shapes with color intensity

36

Encoding categorical items

Hue, point shape, 2d position

37

Pie charts are

bad! we don’t like to use them

38

Business intelligence

Data + tools + brains

39

As data analytics changes from descriptive to diagnostic to predictive to prescriptive, more human input is required for making decisions and enacting them.

FALSE - less human input is required

40

The use of historical information to predict what could happen in the future describes prescriptive analytics.

FALSE - predictive analytics

41

Social media data, such as data from Facebook, Instagram, and TikTok, are examples of structured data.

FALSE - unstructured data

42

Supervised learning

Input & output data, classification, regression, predictive and prescriptive models
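A minimal regression sketch to show what "input & output data" means: the model is fit on inputs paired with known outputs, then used to predict an unseen input. The numbers and the names `ad_spend`, `sales`, and `predict` are made up, and ordinary least squares is just one of many supervised methods.

```python
from statistics import mean

# Labeled training data: each input x comes with a known output y.
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [3.0, 5.0, 7.0, 9.0, 11.0]

x_bar, y_bar = mean(ad_spend), mean(sales)

# Ordinary least squares for a single input variable.
slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(ad_spend, sales)) / sum(
    (x - x_bar) ** 2 for x in ad_spend
)
intercept = y_bar - slope * x_bar


def predict(x):
    """Predict the output for a new input using the fitted line."""
    return intercept + slope * x


print(predict(6.0))
```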

43

Unsupervised learning

Input data, clustering, association, PATTERN/structure discovery
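A small clustering sketch to show the contrast with supervised learning: there is no output column, and the algorithm finds the groups on its own. This is a simplified one-dimensional k-means with k = 2 and a deterministic starting guess; the data are made up.

```python
from statistics import mean

# Unlabeled inputs only: no "correct answer" column.
spend = [5.0, 6.0, 7.0, 48.0, 50.0, 52.0]

# Simple deterministic starting guess for the two cluster centers.
centers = [min(spend), max(spend)]

for _ in range(10):
    clusters = [[], []]
    for x in spend:
        # Assign each point to the nearest center.
        nearest = min(range(2), key=lambda i: abs(x - centers[i]))
        clusters[nearest].append(x)
    # Move each center to the mean of its assigned points.
    centers = [mean(c) for c in clusters]

print(clusters)
print(centers)
```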

44

Four Vs of Big Data

velocity, variety, volume, veracity (accuracy of data)

45

Descriptive Data Analytics

What is happening in my business?

46

Diagnostic Data Analytics

Why is it happening?

47

Prescriptive Data Analytics

What should be done?

48

Predictive Data Analytics

What will happen in the future?

49

Data analytics

the science of examining raw data to draw conclusions about that information; the process of inspecting, cleansing, transforming, and modeling data to discover useful information for decision-making.

50

Big Data

massive complex structured and unstructured data sets that are rapidly generated and transmitted from a wide variety of sources

51

Data Mining

a set of statistical and machine learning methods that inform decision-making. (Digging through vast stores of data in search of something interesting)

52

Information

a set of data that are organized and processed in a meaningful and purposeful way.