Amazon Web Services Bootcamp
上QQ阅读APP看书,第一时间看更新

Analytics

AWS offers an analytics compute engine that performs complex algorithms on data on demand:

  • Athena: AWS Athena is a query service that analyzes data on AWS S3. Athena supports standard queries over S3 data. It doesn't have any infrastructure to be managed, and we just need to pay for the queries being executed.
  • Elastic MapReduce (EMR): AWS Elastic MapReduce is based on the Hadoop framework, which provides fast processing of a large set of data stores using EC2 instances. AWS EMR supports a wide range of frameworks, such as HBase, Apache Spark, Presto, and Flink. AWS EMR can be integrated with various AWS services, such as S3 and DynamoDB.
  • CloudSearch: AWS CloudSearch is a search service that can be integrated with applications. CloudSearch can search a large set of data from various sources. AWS CloudSearch also scales as per the load, so users don't need to do any manual actions in order to achieve scalability.
  • ElasticSearch Service: AWS ElasticSearch Service is a managed service that allows users to create an ElasticSearch cluster in AWS. ElasticSearch is an open source search engine used for searching and analytics.
  • Kinesis: AWS Kinesis provides streams that can be used for processing real-time data streaming. It also provides Firehose, which is responsible for delivering data records received on streams to respective AWS services. It also provides Kinesis Analytics, which can be used via standard SQL.
  • Data Pipeline: AWS Data Pipeline is an automated service that processes data from one source and moves it to destinations on AWS. It has the ability to perform complex processing, it can handle workloads, it has highly availability of resources, and more.
  • Quicksight: AWS Quicksight is a business analytics tool that generates visual reports based on data. It automatically generates several visual graphs and analysis reports from static and dynamic data sources.
  • AWS Glue: AWS Glue is an extract, transform, and load (ETL) service that is used for analytics purposes. AWS Glue takes a data source as input and creates the table definition automatically in the AWS Glue Data Catalog. Then, the user can query the data over AWS Glue.