Improving Performance of Cloud Storage Systems Using Support-Based Replication Algorithm

Mohammed Sharfuddin
Thirumalaisamy Ragunathan

Abstract

Data replication is a mechanism for creating copies of the same file block at multiple sites. Cloud storage systems use it to improve data availability. Current replication techniques let users fix the number of replicas required. The problem with these approaches is that the user cannot determine which files should be replicated or how many copies are necessary, so the performance of the cloud storage system suffers. Our proposed replication method therefore determines the replication factor from the support values of the file blocks, which gives the precise number of replicas to create. We also propose an efficient technique that places the replicas based on local support values to increase the performance of the cloud storage system. Our results indicate that the proposed replication algorithm performs better than the algorithm used in the Hadoop Distributed File System.
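
The abstract does not give the formulas behind the method, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' algorithm. It assumes that the support of a block is the fraction of access sessions in which the block appears (the usual data-mining sense of support), that the replication factor grows linearly with support between a lower and an upper bound, and that replicas go to the nodes where the block's local support is highest. The function names, the bounds, and the sample sessions are all hypothetical; the upper bound of 3 merely mirrors the default replication factor of HDFS.

```python
from collections import Counter
from math import ceil


def support_values(sessions):
    """Return, for each block, the fraction of sessions that access it.

    Assumption: one session is a list of block ids read together.
    """
    counts = Counter()
    for blocks in sessions:
        counts.update(set(blocks))  # count a block once per session
    total = len(sessions)
    return {block: n / total for block, n in counts.items()} if total else {}


def replication_factor(support, min_replicas=1, max_replicas=3):
    """Hypothetical linear mapping from a support in [0, 1] to a replica count."""
    span = max_replicas - min_replicas
    return min_replicas + ceil(support * span)


def place_replicas(block, sessions_by_node, replicas):
    """Pick the nodes with the highest local support for the block.

    Assumption: local support is the support computed only from the
    sessions that originated at a given node. Ties break by node name.
    """
    local = {
        node: support_values(sessions).get(block, 0.0)
        for node, sessions in sessions_by_node.items()
    }
    ranked = sorted(local, key=lambda node: (-local[node], node))
    return ranked[:replicas]


# Hypothetical access log: node name -> sessions observed at that node.
sessions_by_node = {
    "node-a": [["b1", "b2"], ["b1"], ["b1", "b3"]],
    "node-b": [["b2"], ["b2", "b3"]],
    "node-c": [["b1"], ["b3"]],
}
all_sessions = [s for sessions in sessions_by_node.values() for s in sessions]
global_support = support_values(all_sessions)

for block in sorted(global_support):
    factor = replication_factor(global_support[block])
    print(block, factor, place_replicas(block, sessions_by_node, factor))
```

In this toy log, block `b1` appears in 4 of the 7 sessions, so it receives more replicas than `b2` or `b3`, and its first replicas land on the nodes that read it most often. A real system would also need to account for node capacity, rack topology, and changes in access patterns over time, none of which this sketch models.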

How to Cite
M. Sharfuddin and T. Ragunathan, “Improving Performance of Cloud Storage Systems Using Support-Based Replication Algorithm,” ECTI-CIT Transactions, vol. 17, no. 1, pp. 14–26, Nov. 2022.
Section
Research Article
