1/39
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced | Call with Kai |
|---|
No analytics yet
Send a link to your students to track their progress
Data at Scale (Big Data)
Refers to very large volumes of data collected from many different sources and analyzed to discover insights and patterns
Examples of Data at Scale
Social media posts
Online shopping transactions
Mobile app usage
GPS location data
Health records
Data in Daily Routine
People interact with data daily through digital assistants and apps. For example asking devices like Alexa or Siri about the weather, news, or meetings.
Better Healthcare Predictions
Large data sets help doctors and researchers predict diseases and improve treatments.
Smarter Transportation Systems
Data helps improve traffic systems, navigation, and transportation efficiency
Personalized Recommendations
Systems recommend products, shows, or content based on user behavior and preferences.
Improved Disaster Response
Large datasets help governments respond quickly to natural disasters.
Privacy Invasion
When personal data is collected or used without a person’s permission.
Data Misuse
When companies or organizations use data in harmful or unethical ways.
Algorithm Bias
When algorithms produce unfair results because of biased data
Security Risks
Large data storage systems may be vulnerable to hacking or cyberattacks.
Privacy Violations
Concerns about whether someone’s privacy is violated by collecting their personal data.
Fairness and Transparency
Ensuring that decisions made using data (such as loans or insurance) are fair and understandable.
New Discoveries
Combining large datasets allows analysts to discover insights that cannot be found using a single data source
Online Platforms
Websites and apps that collect user data through online interactions
Sensors and IoT Devices
Devices connected to the internet that collect real-time data from the physical world.
Surveys and Forms
Tools used to collect responses from people through questionnaires or forms
Transaction Records
Data created whenever people make purchases or financial transactions
Raw Data
Unprocessed data that has not yet been analyzed or interpreted.
Data Cleaning
Removing errors, duplicates, or incorrect data
Data Organizing
Sorting and categorizing data so it becomes easier to analyze.
Data Analysis
Examining data to identify patterns, trends, or relationships
Data Interpretation
Explaining what the analyzed data means and drawing conclusions
Example of Data Analysis
Raw Data:
1000 students visited a website
After Analysis:
Most visits happen at 8 PM
Most users are 18–22 years old
Meaning: The website is popular among college students in the evening.
Data Visualization
The graphical representation of data (charts, graphs, etc.) to help people understand patterns and trends.
Visual Literacy
The ability to understand and interpret data visualizations.
Why Visualization Matters
Large datasets are difficult to understand in text form, so visuals help people quickly see patterns and trends
Data Exploration
The process of examining data before making conclusions.
Key Questions in Data Analysis
What kind of data is it?
What is the data about?
Why was it collected?
Why was it analyzed and represented that way?
Ethical Design
Designing systems that use data responsibly and protect users
Importance of Ethics
When working with data, we deal with real people, and misuse of data can cause harm.
Privacy
Collect only necessary data and respect user privacy.
Consent
Users must clearly agree to the collection and use of their data.
Bias and Fairness
Systems must avoid discrimination and treat all users fairly
Transparency
Users should know how their data is collected and used
Protecting Human Rights
Respect human rights and cultural diversity when handling data.
Fairness and Honesty
Systems must be fair, trustworthy, and respectful of privacy
Privacy by Design
Avoid collecting unnecessary or sensitive data that is not needed.
On-Device Processing
Analyzing data directly on a user’s device instead of sending it to the cloud.
Data at Scale Key Concept
Data at scale involves collecting large volumes of data from multiple sources and analyzing it to gain insights that cannot be discovered from a single dataset