Big Data Analytics in Cloud – Comparative Study

  IJCTT-book-cover
 
         
 
© 2023 by IJCTT Journal
Volume-71 Issue-12
Year of Publication : 2023
Authors : Naresh Kumar Miryala, Divit Gupta
DOI :  10.14445/22312803/IJCTT-V71I12P107

How to Cite?

Naresh Kumar Miryala, Divit Gupta, "Big Data Analytics in Cloud – Comparative Study," International Journal of Computer Trends and Technology, vol. 71, no. 12, pp. 30-34, 2023. Crossref, https://doi.org/10.14445/22312803/IJCTT-V71I12P107

Abstract
In the dynamic landscape of information technology, the convergence of Big Data Analytics and Cloud Computing stands out as a powerful paradigm reshaping the way organizations extract insights from massive datasets. This abstract encapsulates the essence of Big Data Analytics in the cloud, illustrating its broad significance and impact. It also highlights the inherent advantages of leveraging cloud infrastructure for Big Data Analytics, including scalability, flexibility, and cost-effectiveness. However, it acknowledges the challenges related to privacy, security, and ethical considerations in handling large datasets. By providing a general overview, this abstract aims to convey the transformative potential of integrating Big Data Analytics with Cloud Computing, ushering in a new era of data-driven innovation and insights. In a world where data is abundant and diverse, the scalability and flexibility offered by cloud platforms enable efficient storage, processing, and analysis of vast datasets. This abstract highlights the significance of leveraging cloud infrastructure for Big Data Analytics, facilitating real-time insights, informed decision-making, business intelligence, optimizing healthcare practices, and revolutionizing financial strategies and innovation. The paper addresses the synergy between Big Data and the cloud, emphasizing the role of distributed computing and parallel processing in handling large volumes of information.
Cloud computing emerges as a potent technology for large-scale and intricate computing, offering a solution that negates the need to maintain costly hardware, dedicated space, and software infrastructure. The growth in the volume of data, often referred to as Big Data facilitated by cloud computing, has been substantial. The research also investigates challenges related to scalability, availability, data integrity, transformation, quality, heterogeneity, privacy, legal and regulatory matters, and governance. Additionally, the paper delves into various Big Data processing techniques from both system and application perspectives, presenting a structured overview of challenges faced by application developers and database management system (DBMS) designers in developing and deploying internet-scale applications. Big Data Analytics in the Cloud represents a paradigm shift in the way organizations handle and derive value from massive datasets. This abstract explores the convergence of Big Data Analytics with cloud computing, showcasing its transformative impact on businesses across various sectors. It also discusses the challenges and opportunities associated with this integration, including considerations of data security, privacy, and the ethical use of analytics. By examining the broader implications, this abstract aims to provide a general understanding of the dynamic intersection between Big Data Analytics and cloud technologies, driving advancements in the data-driven landscape.

Keywords
Cloud Computing, Big Data, Data Processing, Data Analysis, Data Management, Data Privacy, Infrastructure as a Service (IaaS), Platform as a Service (PaaS), Software as a Service (SaaS), Structured Data, Semi-Structured Data, Unstructured Data, Snowflake, Google Bi Query, MySQL Heatwave, Amazon Redshift.

Reference

[1] Manoj Muniswamaiah, Tilak Agerwala, and Charles Tappert, “Big Data in Cloud Computing Review and Opportunities,” arXiv, pp. 1- 15, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Neelay Jagani, Parthil Jagani, and Suril Shah, “Big Data in Cloud Computing: A Literature Review,” International Journal of Engineering Applied Sciences and Technology, vol. 5, no. 11, pp. 185-191, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Mounika Narang, What is Big Data: Types, Characteristics and Benefits, 2023. [Online]. Available: https://www.knowledgehut.com/blog/big-data/types-of-big-data
[4] Alan Litchfield, and Jacqui Althouse, “A Systematic Review of Cloud Computing, Big Data and Databases on the Cloud,” Twentieth Americas Conference on Information Systems, Savannah, pp. 1-19, 2014.
[Google Scholar] [Publisher Link]
[5] Bala M. Balachandran, and Shivika Prasad, “Challenges and Benefits of Deploying Big Data Analytics in the Cloud for Business Intelligence,” Procedia Computer Science, vol. 112, pp. 1112-1122, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Blend Berisha, Endrit Meziu, and Isak Shabani, “Big Data Analytics in Cloud Computing: An Overview,” Journal of Cloud Computing, vol. 11, no. 1, pp. 1-10, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Farhan Aslam, “Role of Cloud Computing for Big Data,” International Journal of Innovative Science and Research Technology, vol. 8, no. 8, pp. 1-5, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Mohaiminul Islam, and Shamim Reza, “The Rise of Big Data and Cloud Computing,” Internet of Things and Cloud Computing, vol. 7, no. 2, pp. 45-53, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Izhar Alam, Structured, Semi Structured and Unstructured Data, k21Academy, 2023. [Online]. Available: https://k21academy.com/microsoft-azure/dp-900/structured-data-vs-unstructured-data-vs-semi-structured-data/
[10] Venkatesh H, Shrivatsa D Perur, and Nivedita Jalihal, “A Study on Use of Big Data in Cloud Computing Environment,” International Journal of Computer Science and Information Technologies, vol. 6, no. 3, pp. 2076-2078, 2015.
[Google Scholar] [Publisher Link]
[11] Saqib Luqman, “Big Data Processing in the Cloud: Scalable and Real-time Data,” OSF Preprints, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Mustapha Malami Idina, “The Concept of Big Data and Solutions of Cloud Computing,” International Journal of Advanced Engineering and Management Research, vol. 8, no. 2, pp. 99-106, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Shahad Alghamdi et al., “Big Data Management and Analytics as a Cloud Service,” International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence, vol. 2, no. 1, pp. 1-19, 2023.
[CrossRef] [Publisher Link]
[14] Anurag Gupta et al., “Amazon Redshift and the Case for Simpler Data Warehouses,” SIGMOD ’15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne Victoria Australia, pp. 1917-1923, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Satyabrata Jena, Difference between Batch Processing and Stream Processing, Geeksforgeeks. [Online]. Available: https://www.geeksforgeeks.org/difference-between-batch-processing-and-stream-processing/
[16] What are the 5 V’s of Big Data?, Teradata. [Online]. Available: https://www.teradata.com/glossary/what-are-the-5-v-s-of-big-data#:~:text=Big%20data%20is%20a%20collection,variety%2C%20velocity%2C%20and%20veracity.
[17] Big Data: The 3V’s Explained, Bigdataldn News, 2022. [Online]. Available: https://bigdataldn.com/news/big-data-the-3-vs-explained/
[18] Raktim Singh, What is Big Data and why it is so Important, Medium. [Online]. Available: https://raktimsingh.medium.com/what-is-big-data-9ccc2cf002b0#:~:text=Big%20Data%20combines%20structured%20and,How%20Big%20Is%20Big%20Data%3F
[19] Husen Ali, Sarwar Hosain, and Anwar Hossain, “Big Data Analysis using Bigquery on Cloud Computing Platform,” Australian Journal of Engineering and Innovative Technology, vol. 3, no. 1, pp. 1-9, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Mohamed Benjelloun, Mohamed El Merouani, and El Amin Aoulad Abdelouarit, “Using Snowflake Schema and Bitmap Index for Big Data Warehouse Volume,” International Journal of Computer Applications, vol. 180, no. 8, pp. 30-32, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Elvin Li, Redshift vs BigQuery vs Snowflake: A Comparison of the Most Popular Data Warehouse for Data-Driven Digital Transformation and Data Analytics within Enterprises, Medium, 2020. [Online]. Available: https://medium.com/2359media/redshift-vs-bigquery-vs-snowflake-a-comparison-of-the-most-popular-data-warehouse-for-data-driven-cb1c10ac8555
[22] Athira Nambiar, and Divyansh Mundra, “An Overview of Data Warehouse and Data Lake in Modern Enterprise Data Management, Big Data Cognitive Computing, vol. 6, no. 4, pp. 1-24, 2022.
[CrossRef] [Google Scholar] [Publisher Link]