Detailed Study Notes on Big Data & Business Intelligence
Overview of Big Data & Business Intelligence
Presentation Outline
Big Data
Business Intelligence
Why - Impacts
What is Big Data?
Definition: Big Data is the current buzzword in the business world, reshaping daily operations. It encompasses not only massive amounts of data but also the structure and processing of that data to provide tangible benefits to organizations.
Data Characteristics
Big Data Everywhere:
Data that is too large and complex for standard data tools to manage.
Examples:
Shares traded on US Stock Markets each day: 7 Billion transactions.
Data generated in one flight from New York to London: 10 Terabytes.
Tweets per day on Twitter: 400 Million.
Likes per day on Facebook: 3 Billion.
The 3 V's of Big Data
Volume
Refers to the amount of data generated.
Data stored in kilobytes or terabytes via records, tables, or files.
Variety
Denotes both structured and unstructured data types.
Examples involve online images, videos, human-generated texts, and machine-generated readings.
Velocity
Indicates the speed of data generation and modification.
Data can come in streams, batches, or bits.
Statistics: 90% of the world’s data has been generated in the last two years.
Additional Characteristics
5 V's of Big Data
Value
Access to big data is beneficial only if it can be converted into value for the organization.
Veracity
Refers to the reliability and trustworthiness of data.
Essential for verifying and validating data accuracy.
Big Data with 8 V's
Virality
Can the information be easily conveyed and shared?
Viscosity
Does the information trigger action?
Visualization
Can it be quickly understood, facilitating fast decision-making?
Velocity
Do changes and opportunities arise swiftly in the information flow?
Value
Is the information accessible when needed?
Variety
Is the data well-balanced in representation?
Data Types
Structured Data
Definition: Data arranged in rows and columns.
Examples: Numerical values, dates, and strings.
Characteristics: Easier to manage and secure; estimated to represent about 20% of enterprise data (Gartner).
Unstructured Data
Definition: Data that cannot be neatly organized in rows and columns.
Examples: Images, audio, video, email, and word processing files.
Characteristics: Comprises about 80% of enterprise data; more demanding on storage and management.
Data Processing and Analytics
Data Analytics: Uses data, IT, statistical analysis, and models to allow managers a better grasp of their business operations for fact-based decision-making.
### 4 Types of Data Analytics
Descriptive Analytics
Question: What's happening in the business?
Outcome: Provides comprehensive, accurate, and real-time observation.
Diagnostic Analytics
Question: Why is it happening?
Outcome: Isolates root causes and extraneous information.
Predictive Analytics
Question: What's likely to occur?
Outcome: Uses historical patterns and algorithms to anticipate outcomes.
Prescriptive Analytics
Question: What should I do?
Outcome: Involves recommended actions based on advanced analytical techniques.
Business Intelligence (BI)
Definition: While Big Data refers to large data sets, Business Intelligence is the process of performing analysis on that data. BI employs technology to extract valuable information from data, including normal and Big Data.
Components of BI:
Strategies and technologies for analyzing business data.
Merging enterprise information systems into a cohesive data warehouse for analytics.
Data Analytics Tools
Common Tools for BI
Tableau
Qlik Sense
Microsoft Power BI
Google Analytics
Common Tools for Big Data
Hadoop
Spark
Hive
Cassandra
Storm
Impacts of Big Data and BI
Education
Healthcare
Agriculture
Manufacturing
Data Management and Reporting Systems
Structure of Business Reporting Systems:
Focuses on collecting, processing, and delivering business information.
Enables decision-making at operational, tactical, and strategic levels.
Future Trends
Cloud Computing: Shifts towards online storage and collaboration models.
Increased reliance on analyzing large volumes of data in various formats can enhance business workflows.
Security and Ethical Considerations
Ethics: Guiding principles for responsible technology use to prevent harm.
Privacy: Controls regarding personal data usage and sharing.
Security: Ensures confidentiality, integrity, and availability of information systems through various technical and administrative measures.
Conclusion
Understanding Big Data and Business Intelligence is crucial for effective decision-making and operational efficiency in modern businesses.