What are the steps of data analysis?
Problem formulation -> Getting the data -> Exploring the data -> Analysing the data -> Communicating the results
What is the first step in data analysis?
Problem formulation, where questions and patterns to investigate are identified.
What common file format is used for data sets in Python?
.csv (comma separated values)
What library is commonly used in Python to handle data frames?
pandas, which provides the DataFrame type.
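As a rough sketch of how a .csv data set might be loaded into a pandas DataFrame: the column names and values below are made up for illustration, and an in-memory string stands in for a real file so the example runs on its own.

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for a file on disk.
csv_text = "name,height_cm,age\nAva,162,15\nBen,175,16\nCara,168,15\n"

# pd.read_csv accepts a file path or a file-like object.
df = pd.read_csv(io.StringIO(csv_text))

print(df.head())   # first rows of the data frame
print(df.dtypes)   # data type pandas inferred for each column
```

With a real file, the call would typically be `pd.read_csv("some_file.csv")`, where the filename is whatever your data set is called.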
What issues might you find in a data set?
Missing data values or outliers.
What is the purpose of pre-processing data?
To find and fix issues such as incorrect or missing data.
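One possible way to find and fix such issues with pandas is sketched below. The data, the column name `height_cm`, and the "plausible" range of 100 to 220 are all assumptions chosen for illustration; the right fix depends on the data set.

```python
import pandas as pd

# Hypothetical data: one missing height and one obviously incorrect value.
df = pd.DataFrame({
    "height_cm": [162.0, float("nan"), 168.0, 1750.0],
    "age": [15, 16, 15, 16],
})

# Find: count the missing values in each column.
missing_counts = df.isna().sum()
print(missing_counts)

# Fix: drop rows with a missing height, then keep only plausible heights.
cleaned = df.dropna(subset=["height_cm"])
cleaned = cleaned[cleaned["height_cm"].between(100, 220)]
print(cleaned)
```

Dropping rows is only one option; depending on the question being asked, missing values could instead be filled in or flagged.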
How can you explore data in Python?
Using the describe() function, printing values or calculations from data, and graphing.
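A minimal sketch of this kind of exploration, using a small made-up data frame (the column names are assumptions):

```python
import pandas as pd

# Hypothetical data set.
df = pd.DataFrame({
    "height_cm": [160, 170, 180],
    "age": [15, 16, 17],
})

# describe() summarises each numeric column: count, mean, std, min, quartiles, max.
summary = df.describe()
print(summary)

# Individual values and calculations can also be printed directly.
mean_height = df["height_cm"].mean()
print("Mean height:", mean_height)
```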
What is the benefit of changing data types in a data set?
It helps visualization libraries like Seaborn interpret fields correctly.
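For instance, a year group stored as a number is really a category, and scores read in as text need to become numbers before they can be plotted or averaged. The sketch below assumes hypothetical columns `year_group` and `score`:

```python
import pandas as pd

# Hypothetical data: year_group is numeric but categorical in meaning,
# and score was read in as text.
df = pd.DataFrame({
    "year_group": [9, 10, 9, 11],
    "score": ["55", "72", "61", "80"],
})

# Mark year_group as categorical so plotting libraries treat it as groups.
df["year_group"] = df["year_group"].astype("category")

# Convert score from text to numbers.
df["score"] = pd.to_numeric(df["score"])

print(df.dtypes)
```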
What are the two main plot families in Seaborn for 1-dimensional charts?
catplot (for comparing categories) and displot (for comparing distributions). Both can be grouped by additional categories.
What does catplot create?
Charts of a single variable, such as box plots, strip plots, and violin plots.
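A small sketch of a box plot made with `catplot`, using invented data. The `kind` argument selects the chart type; `"strip"` and `"violin"` are other values it accepts. The non-interactive matplotlib backend is set here only so the example runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; not needed in a notebook

import pandas as pd
import seaborn as sns

# Hypothetical scores for two groups.
df = pd.DataFrame({
    "group": ["A", "A", "A", "B", "B", "B"],
    "score": [55, 61, 58, 72, 80, 75],
})

# One box per category of "group", showing the spread of "score".
grid = sns.catplot(data=df, x="group", y="score", kind="box")
```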
What does displot create?
Charts to show the distribution of a single variable, such as histograms (with equal interval widths and frequency on y axis).
How can comparisons in datasets be facilitated?
By superimposing the datasets on the same chart.
What does the term 'association' mean in data analysis?
It means the values of one variable are linked to the values of another.
Does association imply causation?
No, association does not imply causation.
How can you discover answers to questions about data?
By combining describe() with scatter diagrams.
What function can be used to compare different features in a data set?
groupby
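As an illustration, `groupby` can split a data frame by one feature and summarise another for each group. The column names and values here are hypothetical.

```python
import pandas as pd

# Hypothetical scores for two groups.
df = pd.DataFrame({
    "group": ["A", "A", "B", "B"],
    "score": [50, 60, 70, 90],
})

# Mean score for each group.
means = df.groupby("group")["score"].mean()
print(means)
```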
What function can be used to create box plots and compare charts for different features?
catplot
What function can be used to create and compare scatterplots based on two features to identify associations?
relplot
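A sketch of a scatterplot made with `relplot`, using invented data. The `hue` argument colours the points by a third feature, which is one way to compare groups on the same chart.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; not needed in a notebook

import pandas as pd
import seaborn as sns

# Hypothetical study data.
df = pd.DataFrame({
    "hours_studied": [1, 2, 3, 4, 5, 6],
    "score": [52, 55, 61, 70, 74, 81],
    "group": ["A", "B", "A", "B", "A", "B"],
})

# relplot draws a scatterplot by default; each point is one row.
grid = sns.relplot(data=df, x="hours_studied", y="score", hue="group")
```

A visible trend in a chart like this suggests an association between the two features, not that one causes the other.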
What can facilitate data visualization?
Using colours to enhance visual clarity, and comparing smaller slices of the data.
What is essential after investigating data?
Communicating the results clearly to the target audience.
What is the role of charts and calculated values in data analysis?
They help answer the initial problems or questions posed.
What can proper use of the data cycle allow?
It can reveal patterns or insights in the data.