These flashcards cover key concepts from the lecture on Big Data and Cloud Computing, focusing on Data Science, Cloud Computing, Hypervisors, and HDFS.
What is Data Science?
Data Science is the technology of handling big data and extracting value from it, combining mathematics, statistics, programming, and domain expertise.
What are the key skills a Data Scientist must have?
Key skills include mathematics and statistics, programming and database knowledge, domain knowledge and soft skills, and communication and visualization.
Name two goals of Data Science.
Prediction and Classification are two goals of Data Science.
What is Cloud Computing?
Cloud Computing describes a class of network-based computing that occurs over the internet, providing integrated hardware, software, and networking services to clients.
List two major Cloud Computing Service Providers.
Microsoft Azure and Amazon Web Services (AWS) are two major Cloud Computing Service Providers.
What is a Hypervisor?
A Hypervisor is software that creates and runs virtual machines by virtualizing hardware resources.
What is the difference between Type 1 and Type 2 Hypervisors?
Type 1 Hypervisors run directly on the hardware, whereas Type 2 Hypervisors run on an operating system.
What does HDFS stand for?
HDFS stands for Hadoop Distributed File System.
What is one challenge of traditional systems for data storage?
One challenge is high cost, with traditional systems costing $10,000 to $14,000 per terabyte.
What is a benefit of using HDFS over a regular file system?
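HDFS is designed to read very large files sequentially, with a single seek followed by a long streaming read, which greatly reduces the seek overhead that slows I/O on a regular file system.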
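As a rough illustration of why one seek followed by a long sequential read helps, the Python sketch below estimates how much of the total read time goes to seeking when a 1 GiB file is read in small blocks versus large ones. The 10 ms seek time, 100 MB/s transfer rate, and both block sizes are illustrative assumptions, not figures from the lecture, and the function name is hypothetical.

```python
def seek_overhead_fraction(file_bytes, block_bytes,
                           seek_s=0.010, transfer_bps=100e6):
    """Fraction of total read time spent seeking, assuming one seek per block.

    seek_s and transfer_bps are assumed disk characteristics, not measured values.
    """
    blocks = -(-file_bytes // block_bytes)   # ceiling division
    seek_time = blocks * seek_s              # total time spent seeking
    transfer_time = file_bytes / transfer_bps  # total time spent streaming data
    return seek_time / (seek_time + transfer_time)


GB = 1024 ** 3

# Assumed small block size, typical of a regular file system (4 KiB).
small = seek_overhead_fraction(GB, 4 * 1024)

# Assumed large block size in the style of HDFS (128 MiB).
large = seek_overhead_fraction(GB, 128 * 1024 ** 2)

print(f"4 KiB blocks:   {small:.1%} of read time spent seeking")
print(f"128 MiB blocks: {large:.1%} of read time spent seeking")
```

Under these assumptions, seeking dominates the read time with small blocks and becomes a minor cost with large blocks. This is the worst case of one seek per block; a regular file system that lays blocks out contiguously would fare better than the small-block figure suggests.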
What is one characteristic of HDFS?
HDFS has high fault tolerance.
Name a type of cloud deployment model.
Public Cloud is one type of cloud deployment model.
What is a key benefit of cloud computing architecture?
It reduces IT operating costs and provides easy accessibility to data and digital tools.
What are the primary components of Cloud Computing Architecture?
Primary components include cloud infrastructure, front end, back end, and security.
What is one application area of Data Science?
Healthcare is one application area of Data Science.