New hybrid deep learning method to recognize human action from video

There has been a tremendous increase in internet users and enough bandwidth in recent years. Because Internet connectivity is so inexpensive, information sharing (text, audio, and video) has become more popular and faster. This video content must be examined in order to classify it for different pur...

Full description

Bibliographic Details
Main Authors:	Md Shofiqul, Islam, Sunjida, Sultana, Md Jabbarul, Islam
Format:	Article
Language:	English
Published:	Universitas Ahmad Dahlan 2021
Subjects:	QA76 Computer software
Online Access:	http://umpir.ump.edu.my/id/eprint/33225/ http://umpir.ump.edu.my/id/eprint/33225/1/New%20hybrid%20deep%20learning%20method%20to%20recognize_FULL.pdf

_version_	1848824204789547008
author	Md Shofiqul, Islam Sunjida, Sultana Md Jabbarul, Islam
author_facet	Md Shofiqul, Islam Sunjida, Sultana Md Jabbarul, Islam
author_sort	Md Shofiqul, Islam
building	UMP Institutional Repository
collection	Online Access
description	There has been a tremendous increase in internet users and enough bandwidth in recent years. Because Internet connectivity is so inexpensive, information sharing (text, audio, and video) has become more popular and faster. This video content must be examined in order to classify it for different purposes for users. Several machine learning approaches for video classification have been developed to save users time and energy. The use of deep neural networks to recognize human behavior has become a popular issue in recent years. Although significant progress has been made in the field of video recognition, there are still numerous challenges in the realm of video to be overcome. Convolutional neural networks (CNNs) are well-known for requiring a fixedsize image input, which limits the network topology and reduces identification accuracy. Despite the fact that this problem has been solved in the world of photos, it has yet to be solved in the area of video. We present a ten stacked three-dimensional (3D) convolutional network based on the spatial pyramidbased pooling to handle the input problem of fixed size video frames in video recognition. The network structure is made up of three sections, as the name suggests: a ten-layer stacked 3DCNN, DenseNet, and SPPNet. A KTH dataset was used to test our algorithms. The experimental findings showed that our model outperformed existing models in the area of video-based behavior identification by 2% margin accuracy.
first_indexed	2025-11-15T03:09:19Z
format	Article
id	ump-33225
institution	Universiti Malaysia Pahang
institution_category	Local University
language	English
last_indexed	2025-11-15T03:09:19Z
publishDate	2021
publisher	Universitas Ahmad Dahlan
recordtype	eprints
repository_type	Digital Repository
spelling	ump-332252024-01-15T02:49:05Z http://umpir.ump.edu.my/id/eprint/33225/ New hybrid deep learning method to recognize human action from video Md Shofiqul, Islam Sunjida, Sultana Md Jabbarul, Islam QA76 Computer software There has been a tremendous increase in internet users and enough bandwidth in recent years. Because Internet connectivity is so inexpensive, information sharing (text, audio, and video) has become more popular and faster. This video content must be examined in order to classify it for different purposes for users. Several machine learning approaches for video classification have been developed to save users time and energy. The use of deep neural networks to recognize human behavior has become a popular issue in recent years. Although significant progress has been made in the field of video recognition, there are still numerous challenges in the realm of video to be overcome. Convolutional neural networks (CNNs) are well-known for requiring a fixedsize image input, which limits the network topology and reduces identification accuracy. Despite the fact that this problem has been solved in the world of photos, it has yet to be solved in the area of video. We present a ten stacked three-dimensional (3D) convolutional network based on the spatial pyramidbased pooling to handle the input problem of fixed size video frames in video recognition. The network structure is made up of three sections, as the name suggests: a ten-layer stacked 3DCNN, DenseNet, and SPPNet. A KTH dataset was used to test our algorithms. The experimental findings showed that our model outperformed existing models in the area of video-based behavior identification by 2% margin accuracy. Universitas Ahmad Dahlan 2021 Article PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/33225/1/New%20hybrid%20deep%20learning%20method%20to%20recognize_FULL.pdf Md Shofiqul, Islam and Sunjida, Sultana and Md Jabbarul, Islam (2021) New hybrid deep learning method to recognize human action from video. Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (JITEKI), 7 (2). pp. 306-313. ISSN 2338-3070. (Published) http://10.26555/jiteki.v7i2.21499 http://10.26555/jiteki.v7i2.21499
spellingShingle	QA76 Computer software Md Shofiqul, Islam Sunjida, Sultana Md Jabbarul, Islam New hybrid deep learning method to recognize human action from video
title	New hybrid deep learning method to recognize human action from video
title_full	New hybrid deep learning method to recognize human action from video
title_fullStr	New hybrid deep learning method to recognize human action from video
title_full_unstemmed	New hybrid deep learning method to recognize human action from video
title_short	New hybrid deep learning method to recognize human action from video
title_sort	new hybrid deep learning method to recognize human action from video
topic	QA76 Computer software
url	http://umpir.ump.edu.my/id/eprint/33225/ http://umpir.ump.edu.my/id/eprint/33225/ http://umpir.ump.edu.my/id/eprint/33225/ http://umpir.ump.edu.my/id/eprint/33225/1/New%20hybrid%20deep%20learning%20method%20to%20recognize_FULL.pdf

New hybrid deep learning method to recognize human action from video

Similar Items