Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks

Conventional approaches, such as static load testing and synthetic monitoring, typically evaluate system performance under controlled conditions but do not fully capture the unpredictable scenarios encountered in real-world operations. For instance, static load testing involves applying a predetermi...

Full description

Bibliographic Details
Main Authors: Weiyuan, Yu, Osman, Mohd Hafeez, Atan, Rodziah, Wan Ab Rahman, Wan Nurhayati
Format: Article
Language:English
Published: Insight Society 2024
Online Access:http://psasir.upm.edu.my/id/eprint/117862/
http://psasir.upm.edu.my/id/eprint/117862/1/117862.pdf
_version_ 1848867363196239872
author Weiyuan, Yu
Osman, Mohd Hafeez
Atan, Rodziah
Wan Ab Rahman, Wan Nurhayati
author_facet Weiyuan, Yu
Osman, Mohd Hafeez
Atan, Rodziah
Wan Ab Rahman, Wan Nurhayati
author_sort Weiyuan, Yu
building UPM Institutional Repository
collection Online Access
description Conventional approaches, such as static load testing and synthetic monitoring, typically evaluate system performance under controlled conditions but do not fully capture the unpredictable scenarios encountered in real-world operations. For instance, static load testing involves applying a predetermined load to the system to measure performance metrics like response time and throughput, which may not reflect the variability and chaos of actual usage. Similarly, synthetic monitoring uses scripted transactions to check system availability and performance, but these scripts often lack the complexity and variability of real-world interactions. This research aims to overcome these limitations by utilizing advanced chaos engineering techniques to simulate a range of faults, including network latency, service crashes, resource exhaustion, message loss, and security attacks. The proposed tool integrates components for data generation, fault injection, storage, monitoring, and visualization, allowing for a thorough evaluation of system robustness. The methodology involves conducting controlled experiments within an AWS-based cloud-native IoT environment to assess the tool's effectiveness. These experiments demonstrate that the tool effectively identifies weaknesses in system resilience and improves overall robustness. By replicating real-world disruptions and analyzing system responses, the tool provides critical insights into the behavior of IoT devices under stress. The study concludes that this chaos engineering tool significantly enhances the ability to detect and address vulnerabilities, supporting creating more resilient IoT systems. Future work will expand the range of simulated faults, validate the tool across various cloud platforms, and incorporate additional real-time analysis features.
first_indexed 2025-11-15T14:35:18Z
format Article
id upm-117862
institution Universiti Putra Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T14:35:18Z
publishDate 2024
publisher Insight Society
recordtype eprints
repository_type Digital Repository
spelling upm-1178622025-06-13T08:06:57Z http://psasir.upm.edu.my/id/eprint/117862/ Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks Weiyuan, Yu Osman, Mohd Hafeez Atan, Rodziah Wan Ab Rahman, Wan Nurhayati Conventional approaches, such as static load testing and synthetic monitoring, typically evaluate system performance under controlled conditions but do not fully capture the unpredictable scenarios encountered in real-world operations. For instance, static load testing involves applying a predetermined load to the system to measure performance metrics like response time and throughput, which may not reflect the variability and chaos of actual usage. Similarly, synthetic monitoring uses scripted transactions to check system availability and performance, but these scripts often lack the complexity and variability of real-world interactions. This research aims to overcome these limitations by utilizing advanced chaos engineering techniques to simulate a range of faults, including network latency, service crashes, resource exhaustion, message loss, and security attacks. The proposed tool integrates components for data generation, fault injection, storage, monitoring, and visualization, allowing for a thorough evaluation of system robustness. The methodology involves conducting controlled experiments within an AWS-based cloud-native IoT environment to assess the tool's effectiveness. These experiments demonstrate that the tool effectively identifies weaknesses in system resilience and improves overall robustness. By replicating real-world disruptions and analyzing system responses, the tool provides critical insights into the behavior of IoT devices under stress. The study concludes that this chaos engineering tool significantly enhances the ability to detect and address vulnerabilities, supporting creating more resilient IoT systems. Future work will expand the range of simulated faults, validate the tool across various cloud platforms, and incorporate additional real-time analysis features. Insight Society 2024-12-31 Article PeerReviewed text en cc_by_sa_4 http://psasir.upm.edu.my/id/eprint/117862/1/117862.pdf Weiyuan, Yu and Osman, Mohd Hafeez and Atan, Rodziah and Wan Ab Rahman, Wan Nurhayati (2024) Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks. International Journal on Advanced Science, Engineering and Information Technology, 14 (6). pp. 2059-2067. ISSN 2088-5334; eISSN: 2460-6952 https://ijaseit.insightsociety.org/index.php/ijaseit/article/view/15956 10.18517/ijaseit.14.6.15956
spellingShingle Weiyuan, Yu
Osman, Mohd Hafeez
Atan, Rodziah
Wan Ab Rahman, Wan Nurhayati
Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks
title Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks
title_full Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks
title_fullStr Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks
title_full_unstemmed Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks
title_short Enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native IoT networks
title_sort enhanced chaos-driven automation: a unique resilience testing toolkit for cloud-native iot networks
url http://psasir.upm.edu.my/id/eprint/117862/
http://psasir.upm.edu.my/id/eprint/117862/
http://psasir.upm.edu.my/id/eprint/117862/
http://psasir.upm.edu.my/id/eprint/117862/1/117862.pdf