Understanding the Relevance of Big Data Analytics in Healthcare

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/50

flashcard set

Earn XP

Description and Tags

Vocabulary flashcards covering key terms, technologies, data types, AI techniques, workflow steps, and challenges discussed in Module 1 of the Certificate Course on Artificial Intelligence in Medicine (Big Data Analytics in Healthcare).

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

51 Terms

1
New cards

Data

Raw facts, figures, or observations that can be processed to generate insights.

2
New cards

Structured Data

Organized data stored in predefined formats such as tables, spreadsheets, or relational databases.

3
New cards

Unstructured Data

Data without a fixed format, e.g., emails, physician notes, images, audio, and video.

4
New cards

Healthcare Data

Medical and health-related information collected to improve patient care, decision-making, and operations.

5
New cards

Patient Data

Demographics, medical history, diagnoses, and treatment records about an individual patient.

6
New cards

Clinical Data

Lab results, imaging reports, and physician notes gathered during clinical care.

7
New cards

Operational Data

Information on hospital resources, staff schedules, and workflow efficiency.

8
New cards

Genomic Data

DNA sequencing information used for precision and personalized medicine.

9
New cards

Wearable & IoT Data

Real-time physiological readings from smart devices and sensors (e.g., heart rate, glucose).

10
New cards

Importance of Healthcare Data

Enhances diagnosis, improves outcomes, supports prediction, optimizes resources, and fuels AI innovation.

11
New cards

Big Data

Extremely large, complex datasets that traditional systems cannot process efficiently.

12
New cards

Volume (Big Data V)

The massive quantity of data generated and stored.

13
New cards

Velocity (Big Data V)

The speed at which data is produced, transmitted, and processed.

14
New cards

Variety (Big Data V)

The diversity of data types—structured, semi-structured, and unstructured.

15
New cards

Veracity (Big Data V)

The reliability, accuracy, and trustworthiness of data.

16
New cards

Value (Big Data V)

The actionable benefits and insights derived from data.

17
New cards

Electronic Health Records (EHR)

Digital patient charts that consolidate medical histories across care settings.

18
New cards

Medical Imaging Data

Large unstructured files from X-rays, MRIs, CT scans used for diagnostics.

19
New cards

Clinical Trials Data

Structured and unstructured information collected during medical research studies.

20
New cards

Social Media & Patient Feedback

Unstructured posts, reviews, and comments reflecting patient experience and public health trends.

21
New cards

Health Insurance & Billing Data

Structured claims and payment records used for reimbursement and fraud detection.

22
New cards

Kilobyte (KB)

≈1,024 bytes; a small storage unit for simple text files.

23
New cards

Megabyte (MB)

≈1,024 KB; stores images or small datasets.

24
New cards

Gigabyte (GB)

≈1,024 MB; typical size for hospital databases or imaging studies.

25
New cards

Terabyte (TB)

≈1,024 GB; capacity for years of EHR or imaging archives.

26
New cards

Petabyte (PB)

≈1,024 TB; scale of nationwide health systems or genomic repositories.

27
New cards

Small Data in Healthcare

KB–MB datasets processed locally with tools like Excel or Access.

28
New cards

Medium Data in Healthcare

GB–TB datasets managed with enterprise SQL databases or cloud solutions.

29
New cards

Big Data in Healthcare

TB–PB+ datasets requiring distributed storage and parallel processing frameworks.

30
New cards

Hadoop

Open-source framework with HDFS & MapReduce for distributed big data storage and processing.

31
New cards

Hive

SQL-like data warehouse on Hadoop enabling easy querying with HiveQL.

32
New cards

Pig

High-level scripting platform (Pig Latin) that simplifies complex MapReduce tasks.

33
New cards

Spark

In-memory big data engine offering fast batch, streaming, ML, and SQL analytics.

34
New cards

HBase

NoSQL database on Hadoop providing real-time read/write access to massive tables.

35
New cards

Machine Learning (ML)

AI technique that learns patterns from data to make predictions (e.g., readmission risk).

36
New cards

Natural Language Processing (NLP)

AI method for extracting insights from unstructured text like physician notes.

37
New cards

Deep Learning

Neural-network approach (e.g., CNNs) for tasks such as tumor detection in MRI.

38
New cards

Predictive Analytics

Using historical data and models to forecast outcomes (e.g., disease outbreaks).

39
New cards

Real-Time Analytics

Instant analysis of streaming data, enabling immediate clinical interventions.

40
New cards

Data Collection (Workflow)

Gathering EHRs, lab results, demographics, and admission details for analysis.

41
New cards

Data Preprocessing

Cleaning, normalizing, and anonymizing raw healthcare data before modeling.

42
New cards

Feature Engineering

Creating meaningful variables (e.g., age, medication adherence) to improve model performance.

43
New cards

Model Training & Selection

Applying algorithms like Random Forest or XGBoost to learn from training data.

44
New cards

Model Evaluation

Assessing accuracy, precision, and recall to validate predictive performance.

45
New cards

Deployment

Integrating an analytic model into clinical systems for real-time decision support.

46
New cards

Data Privacy & Security

Protecting sensitive health data and complying with HIPAA, GDPR, and related laws.

47
New cards

Data Integration

Combining disparate data sources into a unified, analyzable format.

48
New cards

Scalability

Ability of systems to handle growing data volumes without performance loss.

49
New cards

Interoperability

Seamless exchange of information across different healthcare IT systems.

50
New cards

MIMIC Database

Public ICU dataset widely used for AI research in intensive care.

51
New cards

PhysioNet

Repository of physiological signals and time-series data for health research.