what is the definition of data analytics?
the process of evaluating data with the purpose of answering a business question or helping make a business decision
what are the 4 steps in the data analytics process?
ask the right question
extract, transform, and load relevant data (ETL process)
apply the appropriate data analytics technique
interpret and share the results with stakeholders
what is the difference between unstructured and structured data?
most data (roughly 80%) is unstructured (messy)
(text files, social media posts)
structured data is organized data
define statistics
what does the data look like?
what is the shape?
what are the values?
EMPIRICAL FACTS
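A minimal sketch of what "what does the data look like?" can mean in practice, using Python's standard `statistics` module. The `values` list is made-up sample data and the choice of summary measures is an assumption, not something taken from the cards.

```python
import statistics

# Made-up sample data, for illustration only.
values = [12, 15, 15, 18, 22, 22, 22, 30]

# A few common summary measures that describe the data's shape and values.
summary = {
    "count": len(values),
    "min": min(values),
    "max": max(values),
    "mean": statistics.mean(values),
    "median": statistics.median(values),
    "mode": statistics.mode(values),
    "stdev": statistics.stdev(values),  # sample standard deviation
}

for name, value in summary.items():
    print(f"{name}: {value}")
```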
what is an outlier?
a data point that significantly deviates from the rest.
how can you find outliers?
use graphs, standard deviations, or percentiles
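A rough sketch of the standard-deviation and percentile approaches, using only Python's standard library. The function names, the 2-standard-deviation threshold, the 1.5 x IQR rule, and the `sample` data are all assumptions chosen for illustration; the cards do not specify cutoffs.

```python
import statistics

def zscore_outliers(values, threshold=2.0):
    """Flag values more than `threshold` sample standard deviations from the mean."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return [v for v in values if abs(v - mean) > threshold * sd]

def iqr_outliers(values, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR], based on the 25th and 75th percentiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]

# Made-up sample with one obviously extreme value.
sample = [10, 12, 11, 13, 12, 11, 10, 95]

print(zscore_outliers(sample))
print(iqr_outliers(sample))
```

In a small sample the outlier itself inflates the standard deviation, so a stricter threshold (such as 3) may miss it; the percentile-based rule is less sensitive to that.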
what are 2 things you can do with outliers?
truncate (delete them)
winsorize (tuck extreme values in to a cutoff)
truncation or winsorization?
changes the number of observations we have, but doesn't change the remaining data values
truncation
truncation or winsorization?
actually changes the data values, but the number of observations stays the same
winsorization
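The difference between the two can be sketched in a few lines of Python. The helper names `truncate` and `winsorize`, the `sample` data, and the cutoffs `low` and `high` are hypothetical, chosen only to make the contrast visible.

```python
def truncate(values, low, high):
    """Drop every value outside [low, high]; the remaining values are unchanged."""
    return [v for v in values if low <= v <= high]

def winsorize(values, low, high):
    """Keep every value, but pull anything outside [low, high] in to the nearest cutoff."""
    return [min(max(v, low), high) for v in values]

# Made-up sample and hypothetical cutoffs.
sample = [10, 12, 11, 13, 12, 11, 10, 95]
low, high = 6.5, 16.5

truncated = truncate(sample, low, high)    # fewer values, none altered
winsorized = winsorize(sample, low, high)  # same count, extreme value replaced

print(len(sample), len(truncated), len(winsorized))
print(truncated)
print(winsorized)
```

Truncation shrinks the list from 8 values to 7, while winsorization keeps all 8 and replaces 95 with the upper cutoff.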