EDA - Exploratory data analysis: Becoming more familiar with the data

studied byStudied by 1 person
0.0(0)
Get a hint
Hint

What is EDA about?

1 / 3

flashcard set

Earn XP

4 Terms

1

What is EDA about?

Understanding the data by

  • Understanding the data dictionary

  • How to deal with missing data dataframe.info()

  • Finding outliers

    dataframe.describe()

  • Which features to keep and discard

New cards
2

Positive vs. Negative correlation

  • 1.00 is a perfect correlation (age vs. age) is 100% correlated to itself. 0.00 means there is no correlation.

  • Comparing cp to target has a +0.43. So, it has a positive potential correlation to the target

    • As cp goes up. The target value also increases. As cp incraeses, the target (has heart disease) goes up

  • exang to target has a -0.44. So, it has a negative potential correlation to the target

    • As exange goes down. The target value will go up. As exang goes up the target (has heart disease) goes down

New cards
3

Explain ways to use EDA

  • dataframe.describe() …count, mean, stdDev, min, max

  • dataframe['target'].value_counts() will show you if, the data is balanced for dep var

  • Finding questions to ask the SME's

  • Using a correlation matrix dataframe.corr() to see how each variable is correlated to every other variable

  • Visualizing correlation matrix using a heat map to visually see how variables relate to each other

New cards
4

Explain what a crosstab is used for

  • To compare different a feature variable against the target variable

  • A crosstab compares two variables and puts them in a matrix

  • pd.crosstab(dataframe['target'], dataframe.sex)

New cards

Explore top notes

note Note
studied byStudied by 37 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 50 people
Updated ... ago
5.0 Stars(3)
note Note
studied byStudied by 59666 people
Updated ... ago
4.9 Stars(331)
note Note
studied byStudied by 10 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 16 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 5 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 79 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 26 people
Updated ... ago
5.0 Stars(2)

Explore top flashcards

flashcards Flashcard282 terms
studied byStudied by 42 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard100 terms
studied byStudied by 15 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard44 terms
studied byStudied by 1 person
Updated ... ago
5.0 Stars(1)
flashcards Flashcard243 terms
studied byStudied by 88 people
Updated ... ago
5.0 Stars(3)
flashcards Flashcard23 terms
studied byStudied by 6 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard66 terms
studied byStudied by 26 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard22 terms
studied byStudied by 7 people
Updated ... ago
5.0 Stars(2)
flashcards Flashcard79 terms
studied byStudied by 14 people
Updated ... ago
5.0 Stars(1)