1/34
These flashcards cover key terms and concepts related to pandas, a powerful data manipulation and analysis library in Python.
Name | Mastery | Learn | Test | Matching | Spaced | Call with Kai |
|---|
No analytics yet
Send a link to your students to track their progress
Pandas
A Python library built on top of NumPy, used for data analysis, data science, and machine learning.
Series
A one-dimensional labeled array in pandas, similar to a column in a spreadsheet.
DataFrame
A two-dimensional labeled data structure in pandas, similar to an Excel spreadsheet.
Panel Data
The term from which the name 'pandas' is derived, referring to multidimensional structured data.
Dunder Version
A method in pandas to check the current version of the library, accessed via pd.__version__.
Index
A label that identifies rows in a pandas Series or DataFrame.
Key Value Pairs
The structure used in Python dictionaries, consisting of keys and their corresponding values.
Custom Labels
User-defined identifying labels for data points in a Series or DataFrame.
Loc Property
A pandas property used to access data by label in a Series or DataFrame.
ILoc
A pandas property used to access data by integer location.
Filtering by Value
The process of creating a new DataFrame containing only rows that meet a specified condition.
Dictionary
A collection of key-value pairs in Python used for constructing Series or DataFrames.
Concatenation
The process of combining multiple DataFrames into one.
CSV File
A file format that stores tabular data in plain text, where each line represents a data record.
JSON File
A file format that represents data in a structured, human-readable way using JavaScript object notation.
Read CSV Function
A pandas function used to import data from a CSV file into a DataFrame.
To String Function
A pandas function to display the entire content of a DataFrame without truncation.
Aggregate Functions
Functions used to summarize and analyze data by reducing it to a single summary value.
Drop Function
A method in pandas to remove specified columns or rows from a DataFrame.
NaN
Represents a missing value in pandas, standing for 'Not a Number'.
Fill NA Function
A pandas method to replace missing values with specified replacements.
Standardization
The process of ensuring consistency in data representation, such as changing text to lowercase.
Duplicate Values
Redundant entries in a DataFrame that can be removed for data integrity.
Group By Function
A pandas method used to group DataFrame rows based on certain criteria.
Selection Techniques
Methods to access and display specific data from a DataFrame.
Condition
A logical expression used to filter data in a DataFrame.
Numeric Columns
Columns in a DataFrame that contain numeric data types suitable for mathematical operations.
Subscript Operator
The square brackets [] used in pandas to access elements in Series or DataFrames.
Column Selection
Accessing one or more columns from a DataFrame for analysis.
Row Selection
Accessing one or more rows from a DataFrame based on labels or indexes.
Intelligent Indexing
Accessing specific data points using logical conditions or integer positions.
Value Error
An error that occurs when operations involve inconsistent dimensions or incompatible data.
Software Libraries
Pre-written code utilized to simplify and enhance programming tasks.
Meta Data
Additional information about the data, such as its type or structure.
Unique Identifier
A specific label or value used to distinguish one entry from another in a dataset.