Business Analytics - Python Flashcards (Numpy and Pandas)

0.0(0)

Studied by 1 person

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Card Sorting

1/135

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

136 Terms

New cards

import numpy as np

Import convention for Numpy

New cards

np.zeroes(())

Create an array of zeroes

New cards

np.ones

Create an array of ones

New cards

np.arrange(10,25,5)

Create an array of evenly spaced values (step value_

New cards

np.linspace(0,2,9)

Create an array of evenly spaced values

New cards

np.full((2,2),7)

Create a constant array

New cards

np.eye(2)

Create a 2×2 identity matrix

New cards

np.random.random((2,2))

Create an array with random values

New cards

np.empty((3,2))

Create an empty array

New cards

np.int64

Signed 64-bit integer types

New cards

np.float32

Standard double-precision floating point

New cards

np.complex

Complex numbers represented by 128 floats

New cards

np.bool

Boolean type storing TRUE and FALSE values

New cards

np.object

Python object type

New cards

np.string_

Fixed-length string type

New cards

np.unicode_

Fixed-length unicode type

New cards

a.shape

Array dimensions

New cards

len(a)

Length of array

New cards

b.ndim

Number of array dimensions

New cards

e.size

Number of array elements

New cards

b.dtype

Data type of array elements

New cards

b.dtype.name

Name of data type

New cards

b.astype(int)

Convert an array to a different type

New cards

np.subtract(a,b)

Subtraction

New cards

np.add(b,a)

Addition

New cards

np.divide(a,b)

Division

New cards

np.multiply(a,b)

Multiplication

New cards

np.exp(b)

Exponentiation

New cards

np.sqrt(b)

Square root

New cards

np.sin(a)

Print sines of an array

New cards

np.cos(b)

Element-wise cosine

New cards

np.log(a)

Element-wise natural logarithm

New cards

e.dot(f)

Dot product

New cards

a == b

Element-wise comparison

New cards

np.array_equal(a, b)

Array-wise comparison

New cards

a.sum()

Array-wise sum

New cards

a.min()

Array-wise minimum value

New cards

b.max(axis=0)

Maximum value of an array row

New cards

b.cumsum(axis=1)

Cumulative sum of the elements

New cards

a.mean()

Mean

New cards

b.median()

Median

New cards

a.corrcoef()

Correlation coefficient

New cards

np.std(b)

Standard deviation

New cards

h = a.view()

Create a view of the array with the same data

New cards

np.copy(a)

Create a copy of the array

New cards

h = a.copy()

Create a deep copy of the array

New cards

a.sort()

Sort an array

New cards

c.sort(axis=0)

Sort the elements of an array's axis

New cards

a[2]

Select the element at the 2nd index

New cards

b[1,2]

Select the element at row 1 column 2

New cards

a[0:2]

Select items at index 0 and 1

New cards

b[0:2,1]

Select items at rows 0 and 1 in column 1

New cards

b[:1]

Select all items at row 0

New cards

a[ : :-1]

Reversed array a

New cards

a[a<2]

Select elements from a less than 2

New cards

b[[1, 0, 1, 0],[0, 1, 2, 0]]

Select elements (1,0),(0,1),(1,2) and (0,0)

New cards

b[[1, 0, 1, 0]][:,[0,1,2,0]]

Select a subset of the matrix’s rows and columns

New cards

i = np.transpose(b) or i.T

Permute array dimensions

New cards

b.ravel()

Flatten the array

New cards

g.reshape(3,-2)

Reshape, but don’t change data

New cards

h.resize((2,6))

Return a new array with shape (2,6)

New cards

np.append(h,g)

Append items to an array

New cards

np.insert(a, 1, 5)

Insert items in an array

New cards

np.delete(a,[1])

Delete items from an array

New cards

np.concatenate((a,d),axis=0)

Concatenate arrays

New cards

np.vstack((a,b))

Stack arrays vertically (row-wise)

New cards

np.r_[e,f]

Stack arrays vertically (row-wise)

New cards

np.hstack((e,f))

Stack arrays horizontally (column-wise)

New cards

np.column_stack((a,d)) or np.c_[a,d]

Create stacked column-wise arrays

New cards

np.hsplit(a,3)

Split the array horizontally at the 3rd index

New cards

np.vsplit(c,2)

Split the array vertically at the 2nd index

New cards

df = pd.DataFrame(

{"a" : [4 ,5, 6],

"b" : [7, 8, 9],

"c" : [10, 11, 12]},

index = [1, 2, 3])

Specify values for each column

New cards

df = pd.DataFrame(

[[4, 7, 10],

[5, 8, 11],

[6, 9, 12]],

index=[1, 2, 3],

columns=['a', 'b', 'c'])

Specify values for each row

New cards

df = pd.DataFrame(

{"a" : [4 ,5, 6],

"b" : [7, 8, 9],

"c" : [10, 11, 12]},

index = pd.MultiIndex.from_tuples(

[('d',1),('d',2),('e',2)],

names=['n','v'])))

Create DataFrame with a MultiIndex

New cards

pd.melt(df)

Gather columns into rows

New cards

pd.concat([df1,df2])

Append rows of DataFrames

New cards

df.pivot(columns='var', values='val')

Spread rows into columns

New cards

pd.concat([df1,df2], axis=1)

Append columns of DataFrames

New cards

df[df.Length > 7]

Extract rows that meet logical criteria.

New cards

df.drop_duplicates()

Remove duplicate rows (only considers columns)

New cards

df.head(n)

Select first n rows.

New cards

df.tail(n)

Select last n rows.

New cards

df.sample(frac=0.5)

Randomly select fraction of rows.

New cards

df.sample(n=10)

Randomly select n rows.

New cards

df.iloc[10:20]

Select rows by position

New cards

df.nlargest(n, 'value')

Select and order top n entries.

New cards

df.nsmallest(n, 'value')

Select and order bottom n entries.

New cards

df[['width','length','species']]

Select multiple columns with specific names.

New cards

df['width'] or df.width

Select single column with specific name.

New cards

df.filter(regex='regex')

Select columns whose name matches regular expression regex

New cards

df.loc[:,'x2':'x4']

Select all columns between x2 and x4 (inclusive).

New cards

df.iloc[:,[1,2,5]]

Select columns in positions 1, 2 and 5 (first column is 0).

New cards

df.loc[df['a'] > 10, ['a','c']]

Select rows meeting logical condition, and only the specific columns .

New cards

df['w'].value_counts()

Count number of rows with each unique value of variable

New cards

len(df)

# of rows in DataFrame

New cards

df['w'].nunique()

df.describe()

100

New cards

sum()

Sum values of each object.