Query Your Data in S3 with SQL and Optimize for Cost and Performance

August 30, 2019
Streaming services allow you to ingest and analyze events continuously in real time. One of Big Data's principles is to store raw data as long as possible - to be able to answer future questions. If the data is permanently stored in Amazon Simple Storage Service (S3), it can be queried at any time with Amazon Athena without spinning up a database. This session shows step by step how the data should be structured so that both costs and response times are reduced when using Athena. The details and effects of compression, partitions, and column storage formats are compared. Finally, the CTAS feature of Amazon Athena is used to derive optimized views from the raw data for frequently issued queries. Speaker: Steffen Grunwald, Solutions Architect, AWS Level: 400 (Expert)
Previous Video
Visualize Data Stored in Data Lakes
Visualize Data Stored in Data Lakes

Storing data in S3 data lakes opens up door for enormous opportunities including analytics and AI/ML. Many ...

Next Video
Modern Data Platform - Rethinking Data
Modern Data Platform - Rethinking Data

A Modern Data Platform combines Traditional Business Intelligence, Big Data and Machine Learning. It includ...