Improving Performance of Cloud Storage Systems Using Support-Based Replication Algorithm
Abstract
Data replication is a mechanism for creating copies of the same file block on multiple sites. Cloud storage systems use it to improve data availability and performance. Current replication techniques let users fix the number of replicas required. The problem with these approaches is that the user cannot determine which files should be replicated or how many copies are necessary, so the performance of the cloud storage system suffers. To address this, our proposed replication method determines the replication factor from the support values of the file blocks, which gives the precise number of replicas to create. We also propose an efficient technique that places the replicas based on local support values to increase the performance of the cloud storage system. Our results indicate that the proposed replication algorithm performs better than the algorithm used in the Hadoop Distributed File System.
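The abstract does not give the formulas behind the method, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' algorithm. It assumes that the support of a block is the fraction of access sessions containing it (the usual data-mining definition), that the replication factor grows linearly with support between a minimum and a maximum, and that replicas go to the nodes with the highest local support for the block. The function names, the linear mapping, and the bounds of 1 and 5 replicas are all hypothetical.

```python
from collections import Counter
from math import ceil


def block_support(sessions):
    """Support of a block = fraction of access sessions that contain it.

    `sessions` is a list of lists of block ids (an assumed log format).
    """
    counts = Counter()
    for session in sessions:
        counts.update(set(session))  # count each block once per session
    total = len(sessions)
    if total == 0:
        return {}
    return {block: n / total for block, n in counts.items()}


def replication_factor(support, min_replicas=1, max_replicas=5):
    """Map a support value in [0, 1] to a replica count.

    The linear mapping and the bounds are assumptions for illustration.
    """
    return max(min_replicas, min(max_replicas, ceil(support * max_replicas)))


def place_replicas(local_support, factor):
    """Choose the `factor` nodes with the highest local support.

    `local_support` maps node name -> support of the block on that node.
    Ties are broken by node name so the result is deterministic.
    """
    ranked = sorted(local_support, key=lambda node: (-local_support[node], node))
    return ranked[:factor]


if __name__ == "__main__":
    sessions = [["b1", "b2"], ["b1"], ["b1", "b3"], ["b2"]]
    for block, s in sorted(block_support(sessions).items()):
        print(block, s, replication_factor(s))
```

Under these assumptions, frequently accessed blocks receive more replicas than rarely accessed ones, whereas the HDFS default applies a single fixed replication factor to every block of a file.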
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.