Batch processing
This can output data to a file store, relational DB, and a NoSQL DB
View
A saved SQL query that can include JOINs and calculated columns
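As a rough illustration of the card above, the sketch below defines a view with a JOIN and a calculated column. It uses Python's built-in sqlite3 module rather than an Azure database, and the table and column names (customers, orders, line_total) are made up for the example.

```python
import sqlite3

# In-memory SQLite database; all table and column names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (
        id INTEGER PRIMARY KEY,
        customer_id INTEGER REFERENCES customers(id),
        quantity INTEGER,
        unit_price REAL
    );
    INSERT INTO customers VALUES (1, 'Ada'), (2, 'Grace');
    INSERT INTO orders VALUES (1, 1, 2, 10.0), (2, 1, 1, 5.0), (3, 2, 3, 4.0);

    -- The view saves the query text, not a copy of the result rows.
    CREATE VIEW order_totals AS
    SELECT c.name AS customer,
           o.quantity * o.unit_price AS line_total  -- calculated column
    FROM orders AS o
    JOIN customers AS c ON c.id = o.customer_id;
""")

# Querying the view looks the same as querying a table.
view_rows = conn.execute(
    "SELECT customer, line_total FROM order_totals ORDER BY customer, line_total"
).fetchall()
print(view_rows)  # [('Ada', 5.0), ('Ada', 20.0), ('Grace', 12.0)]
```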
Relationships between employees…
…belong in Azure Cosmos DB using the Gremlin API
Image files
Are stored in Azure Blob Storage
Key/value pairs
Are stored in Azure Table Storage
Azure Data Lake Storage Gen2 hierarchy
Azure Resource Group → Azure Storage Account → Container → Folder → Files
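To make the hierarchy concrete, here is a small sketch that builds an abfss:// URI from the account, container, folder, and file levels. The helper name abfss_uri and the sample values are hypothetical. The resource group is a management-level grouping and does not appear in the URI.

```python
def abfss_uri(account, container, *path_parts):
    """Build an ADLS Gen2 URI from storage account, container, and folder/file parts."""
    path = "/".join(path_parts)
    return f"abfss://{container}@{account}.dfs.core.windows.net/{path}"

# Hypothetical names: storage account "mydatalake", container "raw",
# folder "sales/2024", file "orders.csv".
uri = abfss_uri("mydatalake", "raw", "sales", "2024", "orders.csv")
print(uri)  # abfss://raw@mydatalake.dfs.core.windows.net/sales/2024/orders.csv
```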
Pipelines
A logical grouping of activities that performs a unit of work and can be scheduled.
ETL process
Extract: from source systems
Transform: in a separate processing step, before loading
Load: into data warehouses or other target systems
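A minimal sketch of ETL ordering (extract, then transform, then load), assuming an in-memory SQLite database stands in for the warehouse. The source records and the function names extract, transform, and load are hypothetical. The point is that the data is cleaned in application code before it reaches the target.

```python
import sqlite3

# Hypothetical raw records, standing in for rows read from a source system.
source_rows = [
    {"sku": " a1 ", "amount": "10.50"},
    {"sku": "B2", "amount": "3.25"},
]

def extract():
    # Extract: read from the source system.
    return list(source_rows)

def transform(rows):
    # Transform: clean and type the data outside the warehouse.
    return [(r["sku"].strip().upper(), float(r["amount"])) for r in rows]

def load(conn, rows):
    # Load: write the already-transformed rows into the target.
    conn.execute("CREATE TABLE sales (sku TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)

warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract()))
etl_rows = warehouse.execute("SELECT sku, amount FROM sales ORDER BY sku").fetchall()
print(etl_rows)  # [('A1', 10.5), ('B2', 3.25)]
```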
ELT process
Extract: from source systems
Load: into data warehouses
Transform: inside data warehouses
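A minimal sketch of ELT ordering (extract, then load, then transform), again assuming an in-memory SQLite database stands in for the warehouse. The table names raw_sales and sales are hypothetical. Here the raw data lands first and the cleanup runs as SQL inside the target.

```python
import sqlite3

# Hypothetical raw rows extracted from a source system, untouched.
source_rows = [(" a1 ", "10.50"), ("B2", "3.25")]

warehouse = sqlite3.connect(":memory:")

# Load: copy the raw data as-is into a staging table.
warehouse.execute("CREATE TABLE raw_sales (sku TEXT, amount TEXT)")
warehouse.executemany("INSERT INTO raw_sales VALUES (?, ?)", source_rows)

# Transform: clean and type the data with SQL, inside the warehouse.
warehouse.execute("""
    CREATE TABLE sales AS
    SELECT UPPER(TRIM(sku)) AS sku, CAST(amount AS REAL) AS amount
    FROM raw_sales
""")

elt_rows = warehouse.execute("SELECT sku, amount FROM sales ORDER BY sku").fetchall()
print(elt_rows)  # [('A1', 10.5), ('B2', 3.25)]
```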
ALTER SQL statement
SQL statement that modifies an existing object in a relational DB, for example changing a view's definition to add a new column
Azure Data Lake Storage Gen2
This supports POSIX-compliant access control lists (ACLs)
Normalizing a database…
…does not increase the throughput of write transactions; it slows them down.
Transactional systems…
…are more normalized than analytics systems
Normalizing a database…
…does result in queries that require more JOINs.
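The extra JOINs can be seen in a small sketch using Python's sqlite3 module. The schema is hypothetical: one denormalized table that repeats customer and city on every row, and a normalized version split into three tables. Both queries return the same rows, but the normalized one needs two JOINs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Denormalized: customer and city are repeated on every order row.
    CREATE TABLE orders_flat (order_id INTEGER, customer TEXT, city TEXT);
    INSERT INTO orders_flat VALUES (1, 'Ada', 'London'), (2, 'Ada', 'London');

    -- Normalized: each fact is stored once and linked by keys.
    CREATE TABLE cities (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city_id INTEGER);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER);
    INSERT INTO cities VALUES (1, 'London');
    INSERT INTO customers VALUES (1, 'Ada', 1);
    INSERT INTO orders VALUES (1, 1), (2, 1);
""")

# No JOIN needed against the flat table.
flat = conn.execute(
    "SELECT order_id, customer, city FROM orders_flat ORDER BY order_id"
).fetchall()

# The same answer from the normalized schema takes two JOINs.
normalized = conn.execute("""
    SELECT o.id, c.name, ci.name
    FROM orders AS o
    JOIN customers AS c ON c.id = o.customer_id
    JOIN cities AS ci ON ci.id = c.city_id
    ORDER BY o.id
""").fetchall()

print(flat == normalized)  # True
```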
Graph DBs…
…natively support the analysis of relationships between entities, modeled as nodes (entities) and edges (relationships).
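The node-and-edge model can be sketched in plain Python. This is not Gremlin syntax or any graph database API, only an illustration of the data shape. The employee names and the reports_of helper are hypothetical.

```python
# Nodes are employees; each edge is a (source, relationship, target) triple.
edges = [
    ("alice", "manages", "bob"),
    ("alice", "manages", "carol"),
    ("bob", "manages", "dave"),
]

def reports_of(manager):
    """Return everyone below `manager`, following 'manages' edges transitively."""
    found = []
    stack = [manager]
    while stack:
        current = stack.pop()
        for src, rel, dst in edges:
            if src == current and rel == "manages":
                found.append(dst)
                stack.append(dst)
    return sorted(found)

print(reports_of("alice"))  # ['bob', 'carol', 'dave']
```

A graph database answers this kind of traversal directly, whereas a relational schema would typically need repeated self-JOINs or a recursive query.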
Azure Data Studio
This supports SQL notebooks
OLTP (Online Transaction Processing)
This type of DB is designed to serve as a persistent data store for business or front-end applications. It handles current, day-to-day transactional data and has a highly normalized schema.
OLAP (Online Analytical Processing)
This is focused on analyzing large volumes of historical and aggregated data to gain insights. It is read-intensive, optimized for complex queries, and usually uses a denormalized structure to speed up analysis.
sqlcmd
This is a command-line utility that lets you query Azure SQL DBs
You can use…
…existing Microsoft SQL Server licenses to reduce the cost of Azure SQL DBs.
Azure SQL Database
This is used for OLTP
PaaS means that…
You are not responsible for OS updates
Backups are automatically performed
You are not responsible for installation of the DB engine
Azure SQL Managed Instance
This has native support for cross-database queries and transactions.
Relational Database
This must be used when strong consistency (ACID) guarantees are required, not when a dynamic schema is required.
Clustered index
An object associated with a table that sorts and stores the data rows in a table based on their key values.
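The idea of storing rows in key order can be sketched in plain Python. This is a conceptual model only, not how any database engine implements a clustered index. The rows, integer keys, and helper names are hypothetical.

```python
import bisect

# The "table": rows are physically kept in key order, as (key, name) tuples.
rows = []

def insert(key, name):
    # insort places the row at its sorted position instead of appending.
    bisect.insort(rows, (key, name))

for key, name in [(30, "carol"), (10, "alice"), (20, "bob"), (40, "dave")]:
    insert(key, name)

def range_scan(lo, hi):
    """Return rows with lo <= key <= hi (integer keys) as one contiguous slice."""
    start = bisect.bisect_left(rows, (lo,))
    end = bisect.bisect_left(rows, (hi + 1,))
    return rows[start:end]

print(rows)                # [(10, 'alice'), (20, 'bob'), (30, 'carol'), (40, 'dave')]
print(range_scan(15, 30))  # [(20, 'bob'), (30, 'carol')]
```

Because the rows themselves are ordered by key, a range lookup reads one contiguous run rather than jumping around the table.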