EDA - Exploratory data analysis: Becoming more familiar with the data

studied byStudied by 1 person
0.0(0)
Get a hint
Hint

What is EDA about?

1 / 3

flashcard set

Earn XP

4 Terms

1

What is EDA about?

Understanding the data by

  • Understanding the data dictionary

  • How to deal with missing data dataframe.info()

  • Finding outliers

    dataframe.describe()

  • Which features to keep and discard

New cards
2

Positive vs. Negative correlation

  • 1.00 is a perfect correlation (age vs. age) is 100% correlated to itself. 0.00 means there is no correlation.

  • Comparing cp to target has a +0.43. So, it has a positive potential correlation to the target

    • As cp goes up. The target value also increases. As cp incraeses, the target (has heart disease) goes up

  • exang to target has a -0.44. So, it has a negative potential correlation to the target

    • As exange goes down. The target value will go up. As exang goes up the target (has heart disease) goes down

New cards
3

Explain ways to use EDA

  • dataframe.describe() …count, mean, stdDev, min, max

  • dataframe['target'].value_counts() will show you if, the data is balanced for dep var

  • Finding questions to ask the SME's

  • Using a correlation matrix dataframe.corr() to see how each variable is correlated to every other variable

  • Visualizing correlation matrix using a heat map to visually see how variables relate to each other

New cards
4

Explain what a crosstab is used for

  • To compare different a feature variable against the target variable

  • A crosstab compares two variables and puts them in a matrix

  • pd.crosstab(dataframe['target'], dataframe.sex)

New cards

Explore top notes

note Note
studied byStudied by 7 people
... ago
5.0(1)
note Note
studied byStudied by 12 people
... ago
5.0(1)
note Note
studied byStudied by 21 people
... ago
4.0(1)
note Note
studied byStudied by 32 people
... ago
5.0(1)
note Note
studied byStudied by 8 people
... ago
5.0(1)
note Note
studied byStudied by 9 people
... ago
5.0(1)
note Note
studied byStudied by 31 people
... ago
5.0(1)
note Note
studied byStudied by 357 people
... ago
5.0(5)

Explore top flashcards

flashcards Flashcard (24)
studied byStudied by 21 people
... ago
5.0(1)
flashcards Flashcard (51)
studied byStudied by 28 people
... ago
4.0(1)
flashcards Flashcard (198)
studied byStudied by 7 people
... ago
5.0(1)
flashcards Flashcard (34)
studied byStudied by 2 people
... ago
5.0(1)
flashcards Flashcard (39)
studied byStudied by 4 people
... ago
5.0(1)
flashcards Flashcard (61)
studied byStudied by 379 people
... ago
4.6(28)
flashcards Flashcard (116)
studied byStudied by 13 people
... ago
5.0(1)
flashcards Flashcard (65)
studied byStudied by 2352 people
... ago
4.6(14)
robot