data
streams of raw facts
information
data shaped into meaningful form
database
groups of related tables (files)
table (file)
group of related rows (records)
row (record)
group of related columns (fields)
column (field)
Group of characters, words, numbers, etc.
entity
person, place, or thing about which we store information
attribute (metadata)
each characteristic, or quality, describing the entity
components of a database
tables or files + relationships among rows in tables + metadata
key field
field used to uniquely identify each record (row)
primary key
field in a table that serves as its key field, uniquely identifying each row
foreign key
primary key of one table, used in a second table as a look-up field to identify records from the original table
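A minimal sketch of the primary key / foreign key relationship, using Python's built-in sqlite3 module. The customer and orders tables and their columns are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (
        customer_id INTEGER PRIMARY KEY,   -- primary key: unique per row
        name        TEXT NOT NULL
    );
    CREATE TABLE orders (
        order_id    INTEGER PRIMARY KEY,
        customer_id INTEGER REFERENCES customer(customer_id),  -- foreign key
        item        TEXT NOT NULL
    );
    INSERT INTO customer VALUES (1, 'Ada'), (2, 'Grace');
    INSERT INTO orders VALUES (10, 1, 'keyboard'), (11, 1, 'mouse'), (12, 2, 'monitor');
""")

# The foreign key acts as a look-up field: it finds each order's customer row.
lookup = conn.execute("""
    SELECT orders.order_id, customer.name
    FROM orders
    JOIN customer ON orders.customer_id = customer.customer_id
    ORDER BY orders.order_id
""").fetchall()
print(lookup)  # [(10, 'Ada'), (11, 'Ada'), (12, 'Grace')]
```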
normalization
streamlining complex groupings of data to minimize redundant data elements and awkward many-to-many relationships
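One way to picture normalization, as a rough sketch: a flat table that repeats customer details on every order is split into two related tables. Table and column names here are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Unnormalized: customer name and city are repeated on every order row.
    CREATE TABLE flat_orders (order_id INTEGER, customer_name TEXT,
                              customer_city TEXT, item TEXT);
    INSERT INTO flat_orders VALUES
        (10, 'Ada',   'London',   'keyboard'),
        (11, 'Ada',   'London',   'mouse'),
        (12, 'Grace', 'New York', 'monitor');

    -- Normalized: each customer is stored once; orders point to it by key.
    CREATE TABLE customer (customer_id INTEGER PRIMARY KEY,
                           name TEXT UNIQUE, city TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY,
                         customer_id INTEGER REFERENCES customer(customer_id),
                         item TEXT);

    INSERT INTO customer (name, city)
        SELECT DISTINCT customer_name, customer_city FROM flat_orders;
    INSERT INTO orders (order_id, customer_id, item)
        SELECT f.order_id, c.customer_id, f.item
        FROM flat_orders AS f
        JOIN customer AS c ON c.name = f.customer_name;
""")

customer_rows = conn.execute("SELECT COUNT(*) FROM customer").fetchone()[0]
order_rows = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
print(customer_rows, order_rows)  # 2 3
```

After the split, a change to a customer's city touches one row instead of every order that customer placed.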
referential integrity
rules used by RDBMS to ensure relationships between tables remain consistent
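A small sketch of referential integrity in practice, assuming SQLite as the RDBMS (it enforces foreign keys only after PRAGMA foreign_keys = ON). The tables are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.execute("CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("""
    CREATE TABLE orders (
        order_id    INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customer(customer_id),
        item        TEXT
    )
""")
conn.execute("INSERT INTO customer VALUES (1, 'Ada')")
conn.execute("INSERT INTO orders VALUES (10, 1, 'keyboard')")  # customer 1 exists

try:
    # Customer 99 does not exist, so this row would be an orphan.
    conn.execute("INSERT INTO orders VALUES (11, 99, 'mouse')")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

print(rejected)  # True
```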
cardinality
measure of the number of links in a particular relationship between one or more data entities in a database
main operations within DBMS
read, insert, modify, delete data
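The four operations could look like this in SQL, run here through Python's sqlite3 module. The student table is an invented example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (student_id INTEGER PRIMARY KEY, name TEXT, major TEXT)")

# insert: add new rows
conn.execute("INSERT INTO student VALUES (?, ?, ?)", (1, "Ada", "Math"))
conn.execute("INSERT INTO student VALUES (?, ?, ?)", (2, "Grace", "CS"))

# modify: change a field in an existing row
conn.execute("UPDATE student SET major = ? WHERE student_id = ?", ("CS", 1))

# delete: remove a row
conn.execute("DELETE FROM student WHERE student_id = ?", (2,))

# read: fetch what is left
remaining = conn.execute("SELECT student_id, name, major FROM student").fetchall()
print(remaining)  # [(1, 'Ada', 'CS')]
```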
SQL script commands
select, from, where, order by
SQL - select
which fields do you want to see?
SQL - from
source tables for fields
SQL - where
operations and conditions on fields
SQL - order by
sorting
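Putting select, from, where, and order by together in one query, as a sketch run through Python's sqlite3 module. The product table and its rows are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product (name TEXT, category TEXT, price REAL)")
conn.executemany("INSERT INTO product VALUES (?, ?, ?)", [
    ("keyboard", "hardware", 45.0),
    ("monitor", "hardware", 180.0),
    ("mouse", "hardware", 20.0),
    ("editor", "software", 60.0),
])

query = """
    SELECT name, price                            -- which fields to show
    FROM product                                  -- source table for the fields
    WHERE category = 'hardware' AND price < 100   -- conditions on fields
    ORDER BY price DESC                           -- sorting
"""
result = conn.execute(query).fetchall()
print(result)  # [('keyboard', 45.0), ('mouse', 20.0)]
```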
big data
massive sets of unstructured/semi-structured data from web traffic, social media, sensors, and so on
data warehouse
stores current and historical data from many core operational transaction systems
Consolidates and standardizes information for use across the enterprise, but data cannot be altered
Provides analysis and reporting tools
data mart
Subset of data warehouse
Typically focus on single subject or line of business
Hadoop
Enables distributed parallel processing of big data across inexpensive computers
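Hadoop's processing model is MapReduce. The sketch below imitates the map, shuffle, and reduce steps on a single machine in plain Python. It does not use any Hadoop API; on a real cluster each step would be spread across many computers. The sample documents are invented.

```python
from collections import defaultdict

documents = ["big data big", "data lake", "big lake"]

def map_phase(doc):
    # Emit a (word, 1) pair for every word in one document.
    return [(word, 1) for word in doc.split()]

def shuffle(pairs):
    # Group all values that share the same key.
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(grouped):
    # Combine each key's values into a single total.
    return {key: sum(values) for key, values in grouped.items()}

mapped = [pair for doc in documents for pair in map_phase(doc)]
word_counts = reduce_phase(shuffle(mapped))
print(word_counts)  # {'big': 3, 'data': 2, 'lake': 2}
```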
data mining
find hidden patterns, relationships in datasets
text mining
extracts key elements from unstructured data
web mining
discovery and analysis from web (content, structure, usage)