Data Science

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/38

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

39 Terms

1
New cards

Qualititative Data

Can be divided into different categories

Ex. What’s your favorite type of coffee?

2
New cards

Quantitative Data

Is a numerical data that can be counted or measured

Ex. How many cups of coffee did you drink today?

3
New cards

Variable

A name that stores a piece of data

Ex. greeting = “Hello, World!”

print(greeting)

Output: Hello, World!

var name = greeting

data type = string

value = “Hello, World!”

4
New cards

Naming Rules 

Basic Guidelines 

  • Use descriptive names

  • Underscores take the place of spaces

  • Only lower case letters

  • Cannot start with a number

5
New cards

Integer 

A number can be positive, negative or zero w/o a decimal component 

Ex. -5;  500; ;0

6
New cards

Float

A number can be positive, negative or zero w/ a decimal component

Ex. 3.2; 0.0; 4.5623 

7
New cards

String

Contains sequence of letters, numbers, punctuations, spaces, etc.

Ex.”abcdehj”; “Hello, World!”

8
New cards

Character

Contains single character or punctuation

Ex. a; z; 1; !; #;

9
New cards

Boolean

Binary and evaluate to either true or false

Ex. True or false

10
New cards

==

equal to → 6 == 6

11
New cards

!= / ≠

not equal to → 6 != 7

12
New cards

>

greater than → 6 > 7

13
New cards

<

less than → 6 < 7

14
New cards

>=

greater than or equal to → 6 >= 7

15
New cards

<=

less than or equal to → 6 <= 7

16
New cards

and

is only true if both conditions are true

Ex. 6 > 7 and 6 > 3

* Print false because 6 is not greater than 7, only one statement is true not both

17
New cards

or

is only true if either condition is true

Ex. 6 > 7 or 6 > 3

* Print true because 6 is greater than 3. One statement is true, doesn’t require both statements to be true 

18
New cards

not

negates the truth value of the condition

Ex. not (6 > 7)

* print true because 6 isn’t greater than seven, the statement is false

19
New cards

List

a collection of ordered items

<p>a collection of ordered items </p><p></p>
20
New cards

Built in functions

knowt flashcard image
21
New cards

Python modules

  • Can be imported into your code

  • Contains predefined functions, variables and more

  • Helps build programs faster and w/ less difficulty

  • A package is a collection of related modules

  • A library is a collection of modules and packages

22
New cards

Series

One-dimensional labeled array or list that is formatted for like a single column of a data table. 

<p>One-dimensional labeled array or list that is formatted for like a single column of a data table.&nbsp;</p>
23
New cards

Mean

Use when points aren’t too spread out and there aren’t any outliers.

  • Use to avoid outliers from negatively affecting data

24
New cards

Median

Use when there are extreme outliers or the data isn’t balanced well.

25
New cards

Mode

Use when there is only a few different data values 

26
New cards

Documentation

A written set of instructions for using the python module or library.

  • Lists functions that are included

  • Directs how to use each function

  • Offers examples 

27
New cards

Measures of Spread

knowt flashcard image
28
New cards

Indices

Assigns names to elements in the list 

<p>Assigns names to elements in the list&nbsp;</p>
29
New cards

Dataframe

A data structure that stores and aligns data in a table using rows and columns

  • Essentially a collection of more than one series. 

<p>A data structure that stores and aligns data in a table using rows and columns</p><ul><li><p>Essentially a collection of more than one series.&nbsp;</p></li></ul><p></p>
30
New cards

table.dtypes

Lists the datatypes used in each column in the Data Frame. 

This one for examples print integers as a datatype because positive whole numbers are being used in the column.

<p> Lists the datatypes used in each column in the Data Frame.&nbsp;</p><p>This one for examples print integers as a datatype because positive whole numbers are being used in the column.</p><p></p>
31
New cards

table.shape

  • Displays the # of rows and columns within the table

<ul><li><p>Displays the # of rows and columns within the table</p></li></ul><p></p>
32
New cards

table.describe

  • Lists and describes all the specific statistics for each column

the round command rounds the decimal to whatever place the user inputs in this case, the decimal is rounded to 1 place. 

<ul><li><p>Lists and describes all the specific statistics for each column</p></li></ul><p>the round command rounds the decimal to whatever place the user inputs in this case, the decimal is rounded to 1 place.&nbsp;</p><p></p>
33
New cards

Rows Commands

Determine which rows of the table are displayed, instead of displaying the entire table, useful for using big data tables. 

<p>Determine which rows of the table are displayed, instead of displaying the entire table, useful for using big data tables.&nbsp;</p>
34
New cards

table.head / table.tail

displays the first or last few rows depending on which number is placed in the parentheses

* if there isn’t any number in the parentheses, the function will display the first / last five rows as a default.

35
New cards

table[a:b]

Choose a specific section of the table to display using brackets.

will list the rows using the indices from the first # to the last # EXCLUSIVELY

* Will not include the last #

36
New cards

Module

A python module:

  • Can be imported into your code

  • Contains predefined functions, variables, and more.

  • helps to build programs faster with less difficulty.

37
New cards

Package

  • A collection of related modules

38
New cards

Library

Is a collection of modules and packages that can be imported into a program

39
New cards