ISC vocabulary 2: data engineering concepts (EN-defnitions)

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/47

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

48 Terms

1
New cards

data

information in raw or structured forms that can be processed

2
New cards

database

organized collection of structured data

3
New cards

SQL

a programming language used to manage and manipulate relational databases.

4
New cards

big data

extremely large datasets that can be analyzed computationally to reveal patterns

5
New cards

data warehouse

a centralized repository for storing large volumes of structured data from different sources

6
New cards

data lake

a storage system that holds vast amounts of raw data in its native format until it is needed for analysis.

7
New cards

ETL

a process that extracts data from various sources

8
New cards

data modeling

the process of creating a visual representation (model) of a data structure or system

9
New cards

data cleansing

the process of detecting

10
New cards

data mining

the practice of analyzing large datasets to discover patterns

11
New cards

data integration

the process of combining data from different sources into a unified view for analysis or reporting.

12
New cards

data governance

the management and oversight of data assets

13
New cards

data quality

the measure of data's accuracy

14
New cards

NoSQL

a class of non-relational databases designed for handling unstructured and semi-structured data with flexible schemas.

15
New cards

hadoop

an open-source framework that enables the storage and processing of large datasets across distributed computing systems.

16
New cards

apache spark

a fast

17
New cards

MapReduce

a programming model for processing large datasets in parallel across distributed clusters of computers.

18
New cards

data pipeline

a series of data processing steps that collect

19
New cards

data architecture

the design and structure of an organization's data assets

20
New cards

data migration

the process of transferring data from one system

21
New cards

data transformation

the process of converting data from one format or structure to another for integration or analysis purposes.

22
New cards

data ingestion

the process of collecting and importing data from various sources into a database or data storage system.

23
New cards

data visualization

the graphical representation of data to help users understand and interpret complex data patterns and insights.

24
New cards

data analytics

the practice of analyzing datasets to derive actionable insights

25
New cards

data engineer

a professional who designs

26
New cards

data scientist

a professional who analyzes and interprets complex data to solve problems using techniques from statistics

27
New cards

data analyst

a professional who examines and interprets data to provide actionable insights

28
New cards

data stream

continuous flow of data generated in real-time

29
New cards

data schema

a blueprint that defines the structure

30
New cards

data security

measures taken to protect data from unauthorized access

31
New cards

data privacy

the practice of ensuring that personal or sensitive data is collected

32
New cards

machine learning

a branch of artificial intelligence focused on building systems that can learn and improve from experience without being explicitly programmed.

33
New cards

artificial intelligence

the simulation of human intelligence in machines designed to perform tasks that typically require human intelligence

34
New cards

predictive analytics

the use of historical data

35
New cards

batch processing

the execution of data-processing tasks in large volumes at once

36
New cards

real-time processing

data processing that occurs immediately as new data is generated

37
New cards

data recovery

the process of restoring lost

38
New cards

data compression

the technique of reducing the size of a data file or dataset to save storage space or bandwidth.

39
New cards

data serialization

the process of converting data into a format that can be easily transmitted

40
New cards

data center

a facility used to house computer systems and associated components

41
New cards

data retention

the policies and practices related to storing data for a set period of time

42
New cards

data lifecycle

the stages data goes through

43
New cards

data dictionary

a repository that contains definitions and descriptions of the data

44
New cards

data API

an application programming interface that allows data to be accessed

45
New cards

data query

a request for information or data from a database

46
New cards

data access

the ability to retrieve

47
New cards

data backup

the act of copying and archiving data to protect against data loss.

48
New cards

data replication

the process of copying and maintaining data in multiple locations to ensure high availability and redundancy.