AWS Analytics In-depth - CSAA Part I

0.0(0)
Studied by 3 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/28

flashcard set

Earn XP

Description and Tags

Last updated 3:43 AM on 12/30/22
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

29 Terms

1
New cards
Athena
________ is integrated with AWS Glue Data Catalog, which allows you to create a unified metadata repository across various services and maintain schema versioning.
2
New cards
Amazon EMR
Overall, ________ is a powerful and cost- effective tool for processing large datasets in the cloud, and is well- suited for a wide range of data- intensive workloads, including log analysis, financial analysis, and ETL (extract, transform, and load) tasks.
3
New cards
Kibana
________ is a visualization tool that allows you to create dashboards and charts based on data stored in Elasticsearch.
4
New cards
Elasticsearch
________ is a distributed search and analytics engine.
5
New cards
Python
Steps can be written in a variety of programming languages, such as ________ or Scala, and can be used to perform a wide range of tasks, including data transformation, filtering, and aggregation.
6
New cards
CSV
It supports a wide range of data formats, including ________, JSON, ORC, Avro, and Parquet, and can handle complex analysis, such as large joins, window functions, and arrays.
7
New cards
ETL engine
The ________ is built on top of Apache Spark and can scale out to process large data sets efficiently.
8
New cards
OpenSearch
________ provides a variety of features and tools to help users search and analyze their data, including:
9
New cards
ELK stack
The ________ is a popular tool for centralized logging, analytics, and visualization of data.
10
New cards
Security
________: OpenSearch includes built- in ________ features, such as encryption at rest and in transit, to help protect your data.
11
New cards
Logstash
________ is a data processing pipeline that can ingest data from a variety of sources, transform it, and then send it to Elasticsearch or other destinations.
12
New cards
Security Assertion Markup Language
OpenSearch Service supports HTTP basic authentication, as well as authentication through SAML (________) and Amazon Cognito.
13
New cards
Amazon OpenSearch Service
________ is a managed service that is built on top of the Elasticsearch open- source search and analytics engine.
14
New cards
Real time
________ search and analytics: With OpenSearch, you can search and analyze data in ________, using the full- text search capabilities of Elasticsearch.
15
New cards
Amazon QuickSight
It is integrated with ________ for visualizing streaming data, and can be used with Kinesis Data Streams or Kinesis Data Firehose as the data source.
16
New cards
Easy setup
________ and management: OpenSearch simplifies the process of setting up and managing Elasticsearch clusters in the cloud, including automatic software updates and patches.
17
New cards
S3
________ is designed for 99.999999999 % durability on a per object basis.
18
New cards
Scalability
________: OpenSearch can automatically ________ your Elasticsearch cluster to meet the demands of your application, without the need for manual intervention.
19
New cards
Amazon ES
________ also provides integrations with other AWS services, such as Amazon VPC, to allow you to secure your data and control access to your Elasticsearch clusters.
20
New cards
Amazon Kinesis
Integration with other AWS services: OpenSearch integrates with other AWS services, such as ________, Amazon S3, and Amazon CloudWatch, allowing you to easily ingest, process, and analyze data from these sources.
21
New cards
Athena
________ uses Presto, an open- source, distributed SQL query engine optimized for low latency and interactive data analysis, to execute queries.
22
New cards
Amazon OpenSearch Service
________ is a fully managed search service that offers a number of security features to help protect your data.
23
New cards
Athena
________ integrates directly with Identity and Access Management (IAM) and you can leverage the use of bucket policies within S3 to control access to data and restrict users from querying it using ________.
24
New cards
AWS Glue
________ also includes an ETL engine that can generate Scala or Python code to transform your data.
25
New cards
Athena
________ is out- of- the- box integrated with business intelligence and SQL development applications through its JDBC and ODBC drivers, and can be accessed through the ________ console, API, CLI, AWS SDK, or through these drivers.
26
New cards
AWS Glue
________ provides a number of tools and features to help users build and maintain their ETL pipelines, including the ability to schedule crawlers to run on a regular basis to ensure that the Data Catalog remains up- to- date, and the ability to use machine learning transforms to clean and prepare data for analysis.
27
New cards
Athena
________ is highly scalable and reliable, and is hosted in a multi- tenant environment that is designed to maintain high availability.
28
New cards
AWS Glue
________ is a fully managed ETL service that makes it easy for users to discover, transform, and prepare data for analytics.
29
New cards
AWS Glue
________ is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics.