Dremio Study

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/61

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

62 Terms

1
New cards

MS SQL

Relational DB

2
New cards

MySQL

Relational DB

3
New cards

Oracle

Relational DB

4
New cards

Greenplum

Relational DB

5
New cards

Postgres

Relational DB

6
New cards

IBM DB2

Relational DB

7
New cards

Cassandra

NoSQL (We don’t integrate)

8
New cards

MongoDB

NoSQL

9
New cards

ElasticSearch

NoSQL

10
New cards

Data Lake

A centralized repository that stores, processes and secures large amounts of data in its original form. Can store structured, semi-structured, and unstructured data from any source, without sacrificing fidelity.

11
New cards

Hadoop

Data Lake (on-prem/open-source)

12
New cards

Amazon S3

Data Lake (cloud)

13
New cards

Azure Data Lake Service

Data Lake (cloud)

14
New cards

Google Cloud Storage

Data Lake (cloud)

15
New cards

MinIO

On-prem object storage

16
New cards

Dell Isilon

On-prem object storage

17
New cards

Dell ECS

On-prem object storage

18
New cards

Pure Storage

On-prem object storage

19
New cards

Cumulo

On-prem object storage

20
New cards

NetApp

On-prem object storage

21
New cards

Data Warehouse

Proprietary way of storing and managing data, built for analytics, massive costs, getting data in takes time

22
New cards

Terradata

Data warehouse (on-prem)

23
New cards

Vertica

Data warehouse (on prem)

24
New cards

Oracle

Data warehouse

25
New cards

Netezza

Data warehouse (on-prem)

26
New cards

Snowflake

Data warehouse (cloud)

27
New cards

Amazon Redshift

Data warehouse (cloud)

28
New cards

Microsoft Synapse

Data warehouse (cloud)

29
New cards

Firebolt

Data Warehouse

30
New cards

SQL Engine

Engine that queries data

31
New cards

Impala

Cloudera SQL Engine (assume Hadoop)

32
New cards

Drill

SQL Engine (MapR)(Hadoop)

33
New cards

Hive

SQL Engine (Horton Works)

34
New cards

Presto

SQL Engine (built by FB. Open-source, Free)(connects to all data lakes like we do)

35
New cards

Amazon Athena

SQL Engine (only on s3)

36
New cards

Starburst

SQL Engine (most like Dremio) connects to all data lakes, SQL and noSQL sources.

37
New cards

Trino

SQL Engine

38
New cards

Databricks SQL

SQL Engine

39
New cards

ETL (Extract, Transform, Load)

Moving data from one repository to another

40
New cards

Informatica

ETL

41
New cards

Microsoft SSIS

ETL

42
New cards

IBM Datastage

ETL

43
New cards

Data Prep

Collect, clean, transform and organize raw and incomplete data into a suitable and consistent format for further data processing.

44
New cards

Alteryx

Data prep

45
New cards

Paxata

Data prep

46
New cards
47
New cards

Trifacta

Data prep

48
New cards

BI/BA

Software applications that collect, process and analyze data to help businesses make sense of it.

49
New cards

Tableau

BI

50
New cards

Power BI

BI

51
New cards

Cognos

BI

52
New cards

Microstrategy

BI

53
New cards

Looker

BI

54
New cards

Qilk

BI

55
New cards

Machine Learning

Used to analyze data and identify patterns, which are then used to create a data model that can make predictions.

56
New cards

Jupyter

ML

57
New cards

Spark

ML (databricks)

58
New cards

SAS

ML

59
New cards

Anaconda

ML

60
New cards

Virtualization

Data virtualization is an approach to data management that allows an application to retrieve and manipulate data without requiring technical details about the data, such as how it is formatted at source, or where it is physically located, and can provide a single customer view of the overall data.

61
New cards

Denodo

Virtualization

62
New cards

Tibco

Virtualization