Data Governance and Quality Management in Data Engineering

© 2023 by IJCTT Journal
Volume-71 Issue-11
Year of Publication : 2023
Authors : Alekhya Achanta, Roja Boina
DOI : 10.14445/22312803/IJCTT-V71I11P106

How to Cite?

Alekhya Achanta, Roja Boina, "Data Governance and Quality Management in Data Engineering," International Journal of Computer Trends and Technology, vol. 71, no. 11, pp. 40-45, 2023. Crossref, https://doi.org/10.14445/22312803/IJCTT-V71I11P106

Abstract
Data has become one of the most valuable assets for organizations today. With the exponential growth in data volumes, governing data effectively and managing its quality are critical for gaining business insights and maintaining regulatory compliance. This paper examines the importance of data governance and quality management in data engineering. It outlines the fundamental principles, processes, and best practices for implementing robust data governance frameworks and quality management programs, and discusses the roles of key stakeholders such as data owners, stewards, and engineers. It also explores challenges such as an inadequate data quality culture and a lack of executive support, and considers how new technologies, such as machine learning and automation, could improve data governance and quality. The paper concludes by emphasizing the need for a holistic strategy, strong leadership, and a collaborative culture for successful data governance and quality management outcomes.

Keywords
Data governance frameworks, Data profiling and monitoring, Data validation and standards, Data quality assurance, Metadata management.
