Amazon Athena : Serverless Architecture and Troubleshooting

  IJCTT-book-cover
 
         
 
© 2023 by IJCTT Journal
Volume-71 Issue-5
Year of Publication : 2023
Authors : Amol Kulkarni
DOI :  10.14445/22312803/IJCTT-V71I5P110

How to Cite?

Amol Kulkarni, "Amazon Athena : Serverless Architecture and Troubleshooting," International Journal of Computer Trends and Technology, vol. 71, no. 5, pp. 57-61, 2023. Crossref, https://doi.org/10.14445/22312803/IJCTT-V71I5P110

Abstract
Data Analysis is of topmost import important for the survival of any organization in today's digital and competitive world. In order to store enormous amounts of data, different OLTP ERP Systems like SAP S/4HANA, Amazon S3, Oracle Net Suite etc. are available, but accessing that data in a simple and easiest way is tricky due to the heavy cost involved in Extraction, transformation and loading ETL operations, maintenance, monitoring and data visualization in data warehousing. Serverless Systems come into the picture for data analysis due to their features like simplicity, ease of operation, automatic scalability and less cost for accessing huge data. There are various serverless systems available in the market, like Amazon Athena, Amazon Glue, Microsoft Azure, and Google Big Query. In all these serverless systems, Amazon Athena is best used for cost savings, easy access to huge amounts of data, tight integration with Amazon S3, scalability, security and simplicity. This paper aims to explain the serverless architecture of Amazon Athena and how to troubleshoot different issues in Amazon Athena while accessing huge data.

Keywords
Amazon S3, AWS Glue, Business Intelligence(BI), ETL, Machine Learning(ML), SQL.

Reference

[1] Alexander S Gills, Amazon Athena . [Online] Available: https://www.techtarget.com/searchaws/definition/Amazon-Athena
[2] Anthony Virtuoso, Mert Turkay Hocanin, and Aaron Wishnick, Serverless Analytics with Amazon Athena, 2021. [Online] Available: https://www.oreilly.com/library/view/serverless-analytics-with/9781800562349/
[3] Mert Hocanin, and Pathik Shah, Top 10 Performance Tuning Tips for Amazon Athena. [Online] Available: https://aws.amazon.com/blogs/big-data/top-10-performance-tuning-tips-for-amazon-athena/
[4] Dhiraj Thakur, Building AWS Data Lake visualizations with Amazon Athena and Tableau. [Online] Available: https://aws.amazon.com/blogs/big-data/building-aws-data-lake-visualizations-with-amazon-athena-and-tableau/
[5] Amazon QuickSight. [Online] Available: https://aws.amazon.com/quicksight/
[6] Amazon Athena features. [Online] Available: https://aws.amazon.com/athena/features/
[7] Ernesto Marquez, How to use Athena to troubleshoot AWS operations issues. [Online] Available: https://www.techtarget.com/searchaws/tip/How-to-use-Athena-to-troubleshoot-AWS-operations-issues
[8] Noreen Hasan, Amazon Athena, [Online] Available: https://acloudguru.com/blog/engineering/amazon-athena-explained-what-is-it-and-when-should-i-use-it
[9] Amazon RedShift Spectrum vs Amazon Athena. [Online] Available: https://digitalcloud.training/amazon-redshift-spectrum-vs-amazon-athena/
[10] Troubleshooting in Athena. [Online] Available: https://docs.aws.amazon.com/athena/latest/ug/troubleshooting-athena.html
[11] Workgroups in Athena. [Online] Available: https://docs.aws.amazon.com/athena/latest/ug/troubleshooting-athena.html#troubleshooting-athena-workgroups
[12] Using Athena ACID transactions. [Online] Available: https://docs.aws.amazon.com/athena/latest/ug/acid-transactions.html
[13] Data Architecture for AWS Athena. [Online] Available: https://www.upsolver.com/blog/data-architecture-aws-athena-examples
[14] Amazon Athena Architecture, Why Athena with QuickSight. [Online] Available: https://www.xenonstack.com/blog/amazon-athena-quicksight
[15] Invoking Machine Learning Models with Amazon Athena using SQL Queries, Machine Learning with Amazon Sage Maker Cookbook. [Online]. Available: https://subscription.packtpub.com/book/data/9781800567030/4/ch04lvl1sec43/invoking-machine-learning-models-with-amazon-athena-using-sql-queries
[16] Work with Amazon Athena Data in Apache Spark Using SQL. [Online] Available: https://www.cdata.com/kb/tech/athena-jdbc-apache-spark.rst
[17] Amazon Athena for Apache Spark. [Online] Available: https://aws.amazon.com/athena/spark/
[18] Blokdyk Gerardus, Amazon Athena a Clear and Concise Reference.
[19] Haritha Chatradipalli, Blog Amazon Athena.
[20] Integration with AWS Glue. [Online] Available: https://docs.aws.amazon.com/athena/latest/ug/glue-athena.html
[21] AWS Athena and Glue: Querying S3 data. [Online] Available: https://towardsdatascience.com/aws-athena-and-glue-querying-s3-data-ce83f1ba9f9f
[22] Building a DynamoDB to Athena Data Pipeline with AWS Glue and CDK. [Online] Available: https://docs.getcommandeer.com/docs/Glue/building-a-dynamodb-to-athena-data-pipeline-with-aws-glue-and-cdk/#the-problem
[23] Athena SQL basics – How to Write SQL against Files. [Online] Available: https://www.obstkel.com/amazon-athena-sql
[24] Amazon Athena Connection with Tableau [Online] Available: https://help.tableau.com/current/pro/desktop/en-us/examples_amazonathena.htm
[25] Configure and Optimize Performance of Amazon Athena Federation with Amazon Redshift. [Online] Available: https://aws.amazon.com/blogs/big-data/configure-and-optimize-performance-of-amazon-athena-federation-with-amazon-redshift/