Big Data and Cloud Computing

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/14

flashcard set

Earn XP

Description and Tags

These flashcards cover key concepts from the lecture on Big Data and Cloud Computing, focusing on Data Science, Cloud computing, Hypervisors, and HDFS.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

15 Terms

1
New cards

What is Data Science?

Data Science is the technology of handling and extracting value from big data and combines mathematics, statistics, programming, and domain expertise.

2
New cards

What are the key skills a Data Scientist must have?

Key skills include mathematics and statistics, programming and database knowledge, domain knowledge and soft skills, and communication and visualization.

3
New cards

Name two goals of Data Science.

Prediction and Classification are two goals of Data Science.

4
New cards

What is Cloud Computing?

Cloud Computing describes a class of network-based computing that occurs over the internet, providing integrated hardware, software, and networking services to clients.

5
New cards

List two major Cloud Computing Service Providers.

Microsoft Azure and Amazon Web Services (AWS) are two major Cloud Computing Service Providers.

6
New cards

What is Hypervisor?

A Hypervisor is software that creates and runs virtual machines by virtualizing hardware resources.

7
New cards

What is the difference between Type 1 and Type 2 Hypervisors?

Type 1 Hypervisors run directly on the hardware, whereas Type 2 Hypervisors run on an operating system.

8
New cards

What does HDFS stand for?

HDFS stands for Hadoop Distributed File System.

9
New cards

What is one challenge of traditional systems for data storage?

One challenge is high cost, with traditional systems costing $10,000 to $14,000 per terabyte.

10
New cards

What is a benefit of using HDFS over a regular file system?

HDFS reads huge data sequentially in a single seek operation, eliminating I/O problems.

11
New cards

What is one characteristic of HDFS?

HDFS has high fault tolerance.

12
New cards

Name a type of cloud deployment model.

Public Cloud is one type of cloud deployment model.

13
New cards

What is a key benefit of cloud computing architecture?

It reduces IT operating costs and provides easy accessibility to data and digital tools.

14
New cards

What are the primary components of Cloud Computing Architecture?

Primary components include cloud infrastructure, front end, back end, and security.

15
New cards

What is one application area of Data Science?

Healthcare is one application area of Data Science.