1/51
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
Consists of facts and statistics about a person or object that are collected for reference or analysis.
Data
What can data include?
Numbers, words, measurements, observations, or even just the description of the object it references.
What are the six data elements that together comprise an IT system
1. Bit
2. Byte
3. Field
4. Record
5. File
6. Database
The smallest element in a computer system
Bits
Are composed of Bits
Bytes
Are composed of bytes
Fields
Are composed of fields
Records
Are composed of records
Files
Are composed of files
Databases
What are the two types of ways data is categorized as based on how that data is stored on a computer
1. Structured
2. Unstructured
Data that is organized and fits nicely into tables
Structured
Data that doesn't fit into a traditional table
Unstructured
What are some characteristics of structured data?
1. Defined data types like numerical, text, or date
2. Can easily be displayed as a table
3. Require less storage space and are easier to manage
What are some characteristics of unstructured data?
1. Not stored in a fixed field of a record or file
2. Images, audio, video, and more
3. Require more storage space and are harder to manage
4. Cannot easily be displayed as a table and are harder to manage
Refers to user-generated content and user participation on the internet with interactive applications. Read and write environment
Web 2.0
True or False: The internet was once a "read-only" environment where users mainly consumed published content
True
True or False: Over time, unstructured data has grown much more quickly than structured data
True
Data that doesn't change once it's created
Static Data
Data that may change after it is recorded and must be updated
Dynamic Data
A set of logically related files (tables) that contains an organized collection of data that is accessible for fast searching and retrieval.
Database
The most common type of database you'll encounter, organize structured data in interrelated tables which are connected by similarities between tables.
Relational Database
The systems that manage a database and retrieve data for a user. For example, Oracle, MySQL, SQL Server, and Microsoft Access
Database Management System (DBMS)
Requests for data that is stored within a database
Queries
Queries are written in a ______ that is unique to the database management system
Querying Languages
A logical diagram of a database's structure and organization. It depicts relationships between tables and acts as both a blueprint for database administrators when setting up a new database and a guide for users who access an established database
Schema
The capability of a database to manage increasing demands
Database Scalability
What are the 2 types of scaling?
1. Vertical Scaling
2. Horizontal Scaling
This type of scaling increases the size of the machine
Vertical Scaling
This type of scaling increases the number of machines
Horizontal Scaling
A database that is connected to an information system that is used for transactions in a business process. This database is "live"— the data is created and stored in real time as a result of a transaction being processed
Production Database
What are the 2 types of data storage?
1. Data Lake
2. Data Warehouse
Is designed specifically for reporting and data analysis and contains relevant data that has already been transformed for reporting use.
Data Warehouse
A vast pool of data as it is designed to contain all a company's data and acts as a central repository for data. Stores data that has not yet been cleaned, aggregated, or filtered.
Data Lake
A subset of a data warehouse that is designed for a specific business function. A department uses this for its individual reporting needs.
Data Mart
What is the type of data do data lakes use?
Unstructured and structured data from across the company (raw data)
What is the type of data do data warehouses use?
Historical data in a structured format designed for a relational database (processed data)
What is the purpose of data lakes?
Cost-effective storage of big data
What is the purpose of data warehouses?
Aggregated big data for analytics and business decisions
Who are the users of data lakes?
Data Scientists
Who are the users of data warehouses?
Data Analysts
What are the activities of a data lake?
1. Storing big data
2. Big data analytics (data science)
What are the activities of a data warehouse?
1. Supporting business analysis
2. Read-only queries for aggregating or extracting data
What is the scope of data of a data lake?
All data in a company
What is the scope of data of a data warehouse?
Only data relevant to analysis
True or False: Data flows through an enterprise data lake to an enterprise data warehouse
True
Refers to extremely large and complex data sets that can be analyzed using recent technological innovations to reveal patterns and associations. So large, generated so fast, and so unstructured that it surpasses the limitations of traditional systems and databases.
Big Data
What are the 5 characteristics of big data?
1. Volume
2. Velocity
3. Variety
4. Veracity
5. Value
The speed at which data is generated.
Velocity
The accuracy and truthfulness of the data.
Veracity
The diversity of data created or collected.
Variety
The quantity and scale of data generated every second.
Volume
Arguably the most important of the 5Vs because data isn't useful to a business unless it can be converted into valuable information. Extracting useful data.
Value