Vision-based violence detection through deep learning

In the present society, video surveillance systems have continued to develop and incorporate more sophisticated video analysis to enhance security and public safety. With increasing demand, the need for accurate and efficient violence detection in video footage has become more critical. However, det...

Full description

Bibliographic Details
Main Author:	Koh, Wei Zhe
Format:	Final Year Project / Dissertation / Thesis
Published:	2024
Subjects:	Q Science (General) QA75 Electronic computers. Computer science T Technology (General)
Online Access:	http://eprints.utar.edu.my/6826/ http://eprints.utar.edu.my/6826/1/2004757_KOH_WEI_ZHE.pdf

_version_	1848886777808420864
author	Koh, Wei Zhe
author_facet	Koh, Wei Zhe
author_sort	Koh, Wei Zhe
building	UTAR Institutional Repository
collection	Online Access
description	In the present society, video surveillance systems have continued to develop and incorporate more sophisticated video analysis to enhance security and public safety. With increasing demand, the need for accurate and efficient violence detection in video footage has become more critical. However, detecting violence in video footage remains challenging due to varying lighting conditions and data quality. While advancements in deep learning techniques can improve the accuracy and robustness of violence detection, they often require extensive datasets, leading to overloaded training processes. This research focuses on advancing and utilizing deep learning models for violence detection in surveillance videos, with particular emphasis on varying lighting conditions. A dataset of 2,000 videos mostly in normal lighting conditions is used to train a hybrid deep learning model combining MobileNet-v2, a lightweight Convolutional Neural Network (CNN), with BiLSTM (Bidirectional Long Short-Term Memory). This hybrid model seeks to employ MobileNet-v2 for feature extraction and BiLSTM for temporal analysis in video datasets. To enhance detection accuracy under different lighting conditions, histogram equalization is integrated into the video prediction process alongside the trained base model. The approach is designed to optimize video-based violence detection without overwhelming the model with large datasets and excessive training times. The base model (MobileNet-v2 and BiLSTM) performed well in normal light conditions (96.33%). While the base model with histogram equalization achieved higher accuracy (98.91%) and the model trained on varying lighting conditions further improved to (99.15%). On the other hand, the base model performed poorly in very dark conditions (24.89%) but showed significant improvement with histogram equalization (92.21%), nearly matching the performance of the base model trained on varying lighting conditions (99.97%). This result highlights the benefit of the proposed histogram equalization method, which achieves high detection accuracy without relying on extensive datasets and overloaded training resources, making it a potential solution for real-time violence detection in diverse lighting scenarios.
first_indexed	2025-11-15T19:43:53Z
format	Final Year Project / Dissertation / Thesis
id	utar-6826
institution	Universiti Tunku Abdul Rahman
institution_category	Local University
last_indexed	2025-11-15T19:43:53Z
publishDate	2024
recordtype	eprints
repository_type	Digital Repository
spelling	utar-68262024-11-21T05:56:11Z Vision-based violence detection through deep learning Koh, Wei Zhe Q Science (General) QA75 Electronic computers. Computer science T Technology (General) In the present society, video surveillance systems have continued to develop and incorporate more sophisticated video analysis to enhance security and public safety. With increasing demand, the need for accurate and efficient violence detection in video footage has become more critical. However, detecting violence in video footage remains challenging due to varying lighting conditions and data quality. While advancements in deep learning techniques can improve the accuracy and robustness of violence detection, they often require extensive datasets, leading to overloaded training processes. This research focuses on advancing and utilizing deep learning models for violence detection in surveillance videos, with particular emphasis on varying lighting conditions. A dataset of 2,000 videos mostly in normal lighting conditions is used to train a hybrid deep learning model combining MobileNet-v2, a lightweight Convolutional Neural Network (CNN), with BiLSTM (Bidirectional Long Short-Term Memory). This hybrid model seeks to employ MobileNet-v2 for feature extraction and BiLSTM for temporal analysis in video datasets. To enhance detection accuracy under different lighting conditions, histogram equalization is integrated into the video prediction process alongside the trained base model. The approach is designed to optimize video-based violence detection without overwhelming the model with large datasets and excessive training times. The base model (MobileNet-v2 and BiLSTM) performed well in normal light conditions (96.33%). While the base model with histogram equalization achieved higher accuracy (98.91%) and the model trained on varying lighting conditions further improved to (99.15%). On the other hand, the base model performed poorly in very dark conditions (24.89%) but showed significant improvement with histogram equalization (92.21%), nearly matching the performance of the base model trained on varying lighting conditions (99.97%). This result highlights the benefit of the proposed histogram equalization method, which achieves high detection accuracy without relying on extensive datasets and overloaded training resources, making it a potential solution for real-time violence detection in diverse lighting scenarios. 2024 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6826/1/2004757_KOH_WEI_ZHE.pdf Koh, Wei Zhe (2024) Vision-based violence detection through deep learning. Final Year Project, UTAR. http://eprints.utar.edu.my/6826/
spellingShingle	Q Science (General) QA75 Electronic computers. Computer science T Technology (General) Koh, Wei Zhe Vision-based violence detection through deep learning
title	Vision-based violence detection through deep learning
title_full	Vision-based violence detection through deep learning
title_fullStr	Vision-based violence detection through deep learning
title_full_unstemmed	Vision-based violence detection through deep learning
title_short	Vision-based violence detection through deep learning
title_sort	vision-based violence detection through deep learning
topic	Q Science (General) QA75 Electronic computers. Computer science T Technology (General)
url	http://eprints.utar.edu.my/6826/ http://eprints.utar.edu.my/6826/1/2004757_KOH_WEI_ZHE.pdf

Vision-based violence detection through deep learning

Similar Items