FiLeD: File Level Deduplication Approach

Jyoti Malhotra; Jagdish Bakal

doi:https://doi.org/10.14445/22312803/IJCTT-V44P113

Research Article | Open Access

Volume 44 | Number 1 | Year 2017 | Article Id. IJCTT-V44P113 | DOI : https://doi.org/10.14445/22312803/IJCTT-V44P113

Citation :

Jyoti Malhotra, Jagdish Bakal, "FiLeD: File Level Deduplication Approach," International Journal of Computer Trends and Technology (IJCTT), vol. 44, no. 1, pp. 74-79, 2017. Crossref, https://doi.org/10.14445/22312803/IJCTT-V44P113

Abstract

In the digital era, uncontrolled data growth is a serious problem. This paper surveys the data storage media and backup patterns that end users adopt for their personal data. For an individual user, growth in personal data translates directly into storage space problems; we therefore focus on an implementation of file-level deduplication, which avoids storing duplicate files. This frees storage capacity and makes room for new data. The paper also compares compression, deduplication, and deduplication combined with compression. We conclude that data will continue to grow and that users should adopt intelligent methods to reduce the storage space they consume.
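The file-level idea summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a post-process scan over files already on disk, uses SHA-256 over the whole file as the fingerprint, and the names `fingerprint`, `find_duplicates`, and `reclaimable_bytes` are hypothetical.

```python
import hashlib
from pathlib import Path


def fingerprint(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a whole file (assumed fingerprint)."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in fixed-size blocks so large files are not loaded into memory at once.
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def find_duplicates(root: Path) -> dict[str, list[Path]]:
    """Group files under root by fingerprint; keep only groups with more than one file."""
    groups: dict[str, list[Path]] = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            groups.setdefault(fingerprint(path), []).append(path)
    return {fp: paths for fp, paths in groups.items() if len(paths) > 1}


def reclaimable_bytes(duplicates: dict[str, list[Path]]) -> int:
    """Bytes that would be freed if only one copy per fingerprint were kept."""
    return sum(
        paths[0].stat().st_size * (len(paths) - 1)
        for paths in duplicates.values()
    )
```

Calling `find_duplicates(Path("some_folder"))` returns a mapping from each fingerprint to the files sharing it, and `reclaimable_bytes` estimates the space recovered by keeping a single copy. Whole-file hashing detects only exact duplicates; files that differ by a single byte get different fingerprints.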

Keywords

Compression, Deduplication, Post-Process, Backup, Fingerprints
