1/47
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
data
information in raw or structured forms that can be processed
database
organized collection of structured data
SQL
a programming language used to manage and manipulate relational databases.
big data
extremely large datasets that can be analyzed computationally to reveal patterns
data warehouse
a centralized repository for storing large volumes of structured data from different sources
data lake
a storage system that holds vast amounts of raw data in its native format until it is needed for analysis.
ETL
a process that extracts data from various sources
data modeling
the process of creating a visual representation (model) of a data structure or system
data cleansing
the process of detecting
data mining
the practice of analyzing large datasets to discover patterns
data integration
the process of combining data from different sources into a unified view for analysis or reporting.
data governance
the management and oversight of data assets
data quality
the measure of data's accuracy
NoSQL
a class of non-relational databases designed for handling unstructured and semi-structured data with flexible schemas.
hadoop
an open-source framework that enables the storage and processing of large datasets across distributed computing systems.
apache spark
a fast
MapReduce
a programming model for processing large datasets in parallel across distributed clusters of computers.
data pipeline
a series of data processing steps that collect
data architecture
the design and structure of an organization's data assets
data migration
the process of transferring data from one system
data transformation
the process of converting data from one format or structure to another for integration or analysis purposes.
data ingestion
the process of collecting and importing data from various sources into a database or data storage system.
data visualization
the graphical representation of data to help users understand and interpret complex data patterns and insights.
data analytics
the practice of analyzing datasets to derive actionable insights
data engineer
a professional who designs
data scientist
a professional who analyzes and interprets complex data to solve problems using techniques from statistics
data analyst
a professional who examines and interprets data to provide actionable insights
data stream
continuous flow of data generated in real-time
data schema
a blueprint that defines the structure
data security
measures taken to protect data from unauthorized access
data privacy
the practice of ensuring that personal or sensitive data is collected
machine learning
a branch of artificial intelligence focused on building systems that can learn and improve from experience without being explicitly programmed.
artificial intelligence
the simulation of human intelligence in machines designed to perform tasks that typically require human intelligence
predictive analytics
the use of historical data
batch processing
the execution of data-processing tasks in large volumes at once
real-time processing
data processing that occurs immediately as new data is generated
data recovery
the process of restoring lost
data compression
the technique of reducing the size of a data file or dataset to save storage space or bandwidth.
data serialization
the process of converting data into a format that can be easily transmitted
data center
a facility used to house computer systems and associated components
data retention
the policies and practices related to storing data for a set period of time
data lifecycle
the stages data goes through
data dictionary
a repository that contains definitions and descriptions of the data
data API
an application programming interface that allows data to be accessed
data query
a request for information or data from a database
data access
the ability to retrieve
data backup
the act of copying and archiving data to protect against data loss.
data replication
the process of copying and maintaining data in multiple locations to ensure high availability and redundancy.