Fine-tuned RetinaNet models for Vision-based Human Presence Detection

Moving towards Industry 4.0, the idea of human-robot interaction (HRI) and human-robot collaboration (HRC) has been popularized. To introduce more robots into the industries, risk-correlated issues would be always on the hook as robots are not as flexible as human. In fact, although robots can repla...

Full description

Bibliographic Details
Main Authors: Tang, Jin Cheng, Ahmad Fakhri, Ab. Nasir, Anwar P. P., Abdul Majeed, Thai, Li Lim, Mohd Azraai, Mohd Razman, Ismail, Mohd Khairuddin
Format: Article
Language:English
Published: Mekatronika 2022
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/37065/
http://umpir.ump.edu.my/id/eprint/37065/1/Fine-tuned%20RetinaNet%20models%20for%20Vision-based%20Human%20Presence%20Detection.pdf
_version_ 1848825157385191424
author Tang, Jin Cheng
Ahmad Fakhri, Ab. Nasir
Anwar P. P., Abdul Majeed
Thai, Li Lim
Mohd Azraai, Mohd Razman
Ismail, Mohd Khairuddin
author_facet Tang, Jin Cheng
Ahmad Fakhri, Ab. Nasir
Anwar P. P., Abdul Majeed
Thai, Li Lim
Mohd Azraai, Mohd Razman
Ismail, Mohd Khairuddin
author_sort Tang, Jin Cheng
building UMP Institutional Repository
collection Online Access
description Moving towards Industry 4.0, the idea of human-robot interaction (HRI) and human-robot collaboration (HRC) has been popularized. To introduce more robots into the industries, risk-correlated issues would be always on the hook as robots are not as flexible as human. In fact, although robots can replace human workers in some of the dangerous tasks, still human safety is always the top priority for all industries. The most common way to safeguard the human was to isolate the working space of human workers and robots. To realize the idea of Industry 4.0, it is postulated to have the robots and cobots out of the cage to maximize productivity and efficiency. Hence, studies have been conducted with the attempts to free the robots from the isolated working space while preserve the safety of human operators. The present study seeks to explore the feasibility of transfer learning strategy — fine-tuning to human presence detection tasks as the base of practicing safe HRI. A custom image dataset with 1463 images was collected and separated into train, validation, and test set with a ratio of 70:20:10. Three RetinaNet object detection models with different backbone networks were fine-tuned with the acquired dataset to transfer the knowledge learned from source domain to the target domain, which is the human presence detection tasks. The result has shown that the RetinaNet_ResNet152-V1-FPN has the highest test AP of 74.4% with an inference speed of 13.09 FPS, suggesting that it is the best fine-tuned RetinaNet models. This study has demonstrated the feasibility of using fine-tuning as the strategy to train the object detection models, which can possibly act as the base for improving HRI applications via a deep learning visual-based method. In summary, the research has signified the uses of deep learning models to perform human presence detections and can be further extended for HRI safety applications
first_indexed 2025-11-15T03:24:28Z
format Article
id ump-37065
institution Universiti Malaysia Pahang
institution_category Local University
language English
last_indexed 2025-11-15T03:24:28Z
publishDate 2022
publisher Mekatronika
recordtype eprints
repository_type Digital Repository
spelling ump-370652023-03-09T04:12:29Z http://umpir.ump.edu.my/id/eprint/37065/ Fine-tuned RetinaNet models for Vision-based Human Presence Detection Tang, Jin Cheng Ahmad Fakhri, Ab. Nasir Anwar P. P., Abdul Majeed Thai, Li Lim Mohd Azraai, Mohd Razman Ismail, Mohd Khairuddin QA76 Computer software T Technology (General) TK Electrical engineering. Electronics Nuclear engineering Moving towards Industry 4.0, the idea of human-robot interaction (HRI) and human-robot collaboration (HRC) has been popularized. To introduce more robots into the industries, risk-correlated issues would be always on the hook as robots are not as flexible as human. In fact, although robots can replace human workers in some of the dangerous tasks, still human safety is always the top priority for all industries. The most common way to safeguard the human was to isolate the working space of human workers and robots. To realize the idea of Industry 4.0, it is postulated to have the robots and cobots out of the cage to maximize productivity and efficiency. Hence, studies have been conducted with the attempts to free the robots from the isolated working space while preserve the safety of human operators. The present study seeks to explore the feasibility of transfer learning strategy — fine-tuning to human presence detection tasks as the base of practicing safe HRI. A custom image dataset with 1463 images was collected and separated into train, validation, and test set with a ratio of 70:20:10. Three RetinaNet object detection models with different backbone networks were fine-tuned with the acquired dataset to transfer the knowledge learned from source domain to the target domain, which is the human presence detection tasks. The result has shown that the RetinaNet_ResNet152-V1-FPN has the highest test AP of 74.4% with an inference speed of 13.09 FPS, suggesting that it is the best fine-tuned RetinaNet models. This study has demonstrated the feasibility of using fine-tuning as the strategy to train the object detection models, which can possibly act as the base for improving HRI applications via a deep learning visual-based method. In summary, the research has signified the uses of deep learning models to perform human presence detections and can be further extended for HRI safety applications Mekatronika 2022-11-20 Article PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/37065/1/Fine-tuned%20RetinaNet%20models%20for%20Vision-based%20Human%20Presence%20Detection.pdf Tang, Jin Cheng and Ahmad Fakhri, Ab. Nasir and Anwar P. P., Abdul Majeed and Thai, Li Lim and Mohd Azraai, Mohd Razman and Ismail, Mohd Khairuddin (2022) Fine-tuned RetinaNet models for Vision-based Human Presence Detection. Journal of Mechatronics and Intelligent Manufacturing (Mekatronika), 4 (2). pp. 16-23. ISSN 2637-0883 (Online). (Published) https://doi.org/10.15282/mekatronika.v4i2.8850 DOI: 10.15282/mekatronika.v4i2.8850
spellingShingle QA76 Computer software
T Technology (General)
TK Electrical engineering. Electronics Nuclear engineering
Tang, Jin Cheng
Ahmad Fakhri, Ab. Nasir
Anwar P. P., Abdul Majeed
Thai, Li Lim
Mohd Azraai, Mohd Razman
Ismail, Mohd Khairuddin
Fine-tuned RetinaNet models for Vision-based Human Presence Detection
title Fine-tuned RetinaNet models for Vision-based Human Presence Detection
title_full Fine-tuned RetinaNet models for Vision-based Human Presence Detection
title_fullStr Fine-tuned RetinaNet models for Vision-based Human Presence Detection
title_full_unstemmed Fine-tuned RetinaNet models for Vision-based Human Presence Detection
title_short Fine-tuned RetinaNet models for Vision-based Human Presence Detection
title_sort fine-tuned retinanet models for vision-based human presence detection
topic QA76 Computer software
T Technology (General)
TK Electrical engineering. Electronics Nuclear engineering
url http://umpir.ump.edu.my/id/eprint/37065/
http://umpir.ump.edu.my/id/eprint/37065/
http://umpir.ump.edu.my/id/eprint/37065/
http://umpir.ump.edu.my/id/eprint/37065/1/Fine-tuned%20RetinaNet%20models%20for%20Vision-based%20Human%20Presence%20Detection.pdf