New hybrid deep learning method to recognize human action from video

Bibliographic Details
Main Authors: Md Shofiqul Islam, Sunjida Sultana, Md Jabbarul Islam
Format: Article
Language: English
Published: Universitas Ahmad Dahlan 2021
Subjects: QA76 Computer software
Online Access: http://umpir.ump.edu.my/id/eprint/33225/
http://umpir.ump.edu.my/id/eprint/33225/1/New%20hybrid%20deep%20learning%20method%20to%20recognize_FULL.pdf
building UMP Institutional Repository
collection Online Access
description Internet use and available bandwidth have grown tremendously in recent years. Because Internet connectivity is so inexpensive, sharing information (text, audio, and video) has become faster and more popular. This video content must be analyzed so that it can be classified for different user purposes. Several machine learning approaches to video classification have been developed to save users time and effort. Using deep neural networks to recognize human behavior has become a popular research topic in recent years. Although significant progress has been made in video recognition, numerous challenges remain. Convolutional neural networks (CNNs) are known to require a fixed-size image input, which constrains the network topology and reduces identification accuracy. Although this problem has been solved for still images, it has yet to be solved for video. We present a ten-layer stacked three-dimensional (3D) convolutional network based on spatial pyramid pooling to handle the fixed-size input problem for video frames in video recognition. As the name suggests, the network structure is made up of three parts: a ten-layer stacked 3DCNN, DenseNet, and SPPNet. The KTH dataset was used to test our algorithms. The experimental results showed that our model outperformed existing models in video-based behavior identification by a 2% margin in accuracy.
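The fixed-size input problem mentioned in the description can be illustrated with a minimal sketch of spatial pyramid pooling extended to three dimensions. This is not the authors' implementation: the function names `adaptive_bins`, `spp3d`, and `make_clip`, the pyramid levels `(1, 2, 4)`, and the choice to pool over the time axis as well as height and width are all assumptions made for illustration. The point it shows is that clips of different sizes map to feature vectors of the same length.

```python
from itertools import product


def adaptive_bins(size, n_bins):
    """Split range(size) into n_bins contiguous, non-empty windows.

    Windows may overlap when size is not a multiple of n_bins.
    """
    return [
        ((i * size) // n_bins, ((i + 1) * size + n_bins - 1) // n_bins)
        for i in range(n_bins)
    ]


def spp3d(volume, levels=(1, 2, 4)):
    """Max-pool a T x H x W nested list into a fixed-length vector.

    For each pyramid level n (assumed values, not taken from the paper),
    the volume is divided into n*n*n bins and the maximum of each bin is
    kept. The output length is sum(n**3 for n in levels), independent of
    T, H, and W.
    """
    t, h, w = len(volume), len(volume[0]), len(volume[0][0])
    features = []
    for n in levels:
        for (t0, t1), (h0, h1), (w0, w1) in product(
            adaptive_bins(t, n), adaptive_bins(h, n), adaptive_bins(w, n)
        ):
            features.append(
                max(
                    volume[i][j][k]
                    for i in range(t0, t1)
                    for j in range(h0, h1)
                    for k in range(w0, w1)
                )
            )
    return features


def make_clip(t, h, w):
    """Build a hypothetical single-channel clip whose values are running indices."""
    return [[[(i * h + j) * w + k for k in range(w)] for j in range(h)] for i in range(t)]


small = spp3d(make_clip(5, 7, 9))
large = spp3d(make_clip(8, 12, 16))
print(len(small), len(large))
```

Both clips produce vectors of length 1 + 8 + 64 = 73, so a fully connected classifier placed after such a layer would not need the input video to be resized or cropped to one fixed shape.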
id ump-33225
institution Universiti Malaysia Pahang
institution_category Local University
recordtype eprints
repository_type Digital Repository
Md Shofiqul Islam, Sunjida Sultana and Md Jabbarul Islam (2021) New hybrid deep learning method to recognize human action from video. Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (JITEKI), 7 (2). pp. 306-313. ISSN 2338-3070. (Published, peer reviewed) http://10.26555/jiteki.v7i2.21499
topic QA76 Computer software
url http://umpir.ump.edu.my/id/eprint/33225/