1/80
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
What is data?
A collection of facts
Does data have meaning?
No meaning just raw facts
What is information?
Processed data
Is information meaningful?
Yes it is.
What does information require for it to be meaningful?
Context
What is knowledge?
Consume information, understand it, and apply it
What is data analysis?
Collection, transformation, and organization of data in order to draw conclusions, make predicitions and drive informed decision-making.
What does data ecosystem involve?
Involves hardware, software, and people
What is hardware?
Physical storage, personal computers
What is software?
Database management systems
What is the cloud?
A place to keep data online rather than a computer hard drive.
What is data driven decision making?
using facts to guide business strategy
What are the steps in data driven decision making?
Define the problem
Find data
Analyzes it
Uses it to uncover trends, patterns, and relationships
What does data do in a business setting?
Improve processes, identify opportunities and trends, launch new products, serve customers, and make thoughtful decisions
What is qualitative data?
Involves subjective and explainatory measures of qualities and characteristics
What is an example of qualitative data?
Favorite movie
What is quantitative data?
specific and objective measures of numerical facts
What is an example of quantitative data?
population of elephants in Africa
What is continuous data?
is measured and can have almost any numeric value.
What is an example of continuous data?
Temperature
What is discrete data?
is counted and has limited number of values
What is an example of discrete data?
Tickets sold in the current month
What is nominal data?
type of qualitative data that is not categorized with a set order
What is an example of nominal data?
First-time customer, returning customer, regular customer
What is ordinal data?
type of qualitative data with a set order or scale
What is an example of ordinal data?
income level (low, medium, high)
What is internal data?
lives inside a company’s own system
What is an example of internal data?
sales data by store location
What is external data?
lives outside of a company or organization
What is an example of external data?
credit reports for customers of an auto dealership
What is primary data?
Collected by a researcher from first-hand sources
What is an example of primary data?
data from an interview you conducted
What is secondary data?
gathered by other people or from other research
What is an example of secondary data?
demographic data collected by a university
What is structured data?
organized by a certain format, line rows and columns
What is an example of structured data?
store inventory
What is unstructured data?
not organized in any easily identifiable manner
What is an example of unstructured data?
email, social media comments
What is a population?
all possible data values in a particular data set
What is an example of a population?
all the cars in a given area
What is a sample?
Part of a population
What are the five things that represent good data?
Reliable, original, comprehensive, current, cited
What is reliable (good data)?
you trust the data. You are sure that the data is accurate and complete and it avoids unbiased data
What is original (good data)?
validate data with its original source
What is comprehensive (good data)?
contains all critical data needed to answer the question or find the solution
What is current (good data)?
Up to date
What is cited (good data)?
Cite datasets/data to prove more credible
What is metadata?
Data about data
What is an example of metadata?
creator of a file
What is data bias?
an error that systematically skews results in a specific direction
What is an example of a data bias?
bias in a question from a survey
What is sampling bias?
when a sample is not representative of the population
What is observer bias?
Tendency for different people to observe things differently
What is interpretation bias?
Tendency to always interpret ambiguous situations positively or negatively
What is confirmation bias?
Tendency to search for or interpret information in a way that confirms preexisting beliefs
What is data ethics?
Refers to well-founded standards of right and wrong that dictate how data is collected, shared, and used
What are the 6 aspects of data ethics?
Ownership, transaction transparency, consent, currency, privacy, openness
Data Ethics
Ownership
who owns data?
Data Ethics
Transaction transparency
Idea that all data processing activities and algorithms should be completely explainable and understood by individuals who provides their data
Data Ethics
Consent
Individual’s right to know explicit details about how and why their data will be used before agreeing to provide it
Data Ethics
Currency
Individuals should be aware of financial transactions resulting from using their personal data and the scale of these transactions
Data Ethics
Privacy
Preserving data subjects’ information and activity any time a data transaction occurs. Sometimes called data protection
Data Ethics
Openness
Free access, usage, and data sharing. Everyone must be able to use, reuse, and redistribute the data.
What is data modeling?
Creating a specific data model for a determined problem domain
What is the first step in designing a database?
Data modeling
What is a data model?
graphical, of complex real-world data structures
What are the 6 data model building blocks?
Entity/Table, attribute/field, field/attribute value. relationship, constraint, primary key/foreign key
Data Model Building Blocks
Entity/Table
a person,place, thing or event about which data will be collected and stored
Data Model Building Blocks
What is stored in a table?
Related fields/attributes
Data Model Building Blocks
Attribute/Field
a single characteristic/attribute of a person, place, object, event or idea.
Data Model Building Blocks
Attribute/Field Example
animal ID, animal name, animal type, breed
Data Model Building Blocks
Field/Attribute Value
Specific content
Data Model Building Blocks
What is the set of field values called?
Record
Data Model Building Blocks
Relationship
Association among entities/tables
Data Model Building Blocks
Types of Relationships
One-to-many
Many-to-many
One-to-one
Data Model Building Blocks
One-to-many relationship
Exists between two tables when one record in the first table matches zero, one, or many records in the second table, and when one record in the second table matches at most one record in the first table
Data Model Building Blocks
Constraint
Restriction placed on data that ensures data integrity
Data Model Building Blocks
Constraint- Example
GPA has to be between 0.00 and 4.00
Data Model Building Blocks
Primary key/Foreign key
Relationships between tables are created using a primary key and a foreign key/common field.
Data Model Building Blocks
Primary Key
collection of fields, that uniquely identify each record in a table
Data Model Building Blocks
Foreign Key
Key from one table as a field in a second table to form a relationship between two tables