Reduced complexity on video encoding for wireless multimedia sensor networks

A wireless multimedia sensor network (WMSN) requires a video encoding system that should be energy-efficient because of its special characteristics: limited power capacity but long service life without maintenance. Besides, video is one of the most important information contents that a WMSN delivers...

Full description

Bibliographic Details
Main Author:	YAN, Zhuge
Format:	Thesis (University of Nottingham only)
Language:	English
Published:	2020
Subjects:	wireless multimedia sensor network video processing
Online Access:	https://eprints.nottingham.ac.uk/60558/

_version_	1848799777994571776
author	YAN, Zhuge
author_facet	YAN, Zhuge
author_sort	YAN, Zhuge
building	Nottingham Research Data Repository
collection	Online Access
description	A wireless multimedia sensor network (WMSN) requires a video encoding system that should be energy-efficient because of its special characteristics: limited power capacity but long service life without maintenance. Besides, video is one of the most important information contents that a WMSN delivers. Motion estimation plays an important role in predictive coding, which is the key part of video encoding and requires large amount of computation. In order to reduce the computational complexity of motion estimation, the block-matching search algorithm is able to find the motion vectors using fewer search points. An algorithm which adds a predictive search technique to the enhanced modified orthogonal search (EMOS) has been proposed. It improves the efficiency of block-matching search algorithms, and has two self-adapting search patterns for large and small movements. The proposed algorithm requires less search points to work out the movement of blocks and provides acceptable image quality. This algorithm was also tested on field programmable gate array (FPGA) and Arduino platforms. Moreover, a back propagation neural network model is introduced for predictive block-matching. The proposed back propagation neural network has very simple structure with only 5 inputs, 5 hidden neurons and 1 output architecture. Because of its simplicity, it requires very little computational power which is negligible compared with existing computation complexity. The test results show the prediction accuracy in 10 - 30\% higher then the competing algorithms with a peak signal-to-noise ratro (PSNR) improvement up to 0.3 dB. The above advantages make it a feasible replacement of the current solution. With the information technology developing dramatically, there is reason to believe that the next generation video encoding standard HEVC will soon be able to run on very cheap platforms. Therefore, a prospective study on HEVC inter prediction acceleration was also carried out. We extracted specific image features that represent prediction unit texture, incorporated a machine learning technique, namely random forest, in HEVC intra prediction mode selection, to improve the performance of inter coding of HEVC. Benchmarking with other existing algorithms, our method extracts very specific features of image texture changes in terms of angle. Therefore the proposed method can achieve very high prediction accuracy. Having similar reduction in complexity, the proposed algorithm will be demonstrated to have a higher video quality compared with similar algorithms.
first_indexed	2025-11-14T20:41:04Z
format	Thesis (University of Nottingham only)
id	nottingham-60558
institution	University of Nottingham Malaysia Campus
institution_category	Local University
language	English
last_indexed	2025-11-14T20:41:04Z
publishDate	2020
recordtype	eprints
repository_type	Digital Repository
spelling	nottingham-605582025-02-28T14:54:37Z https://eprints.nottingham.ac.uk/60558/ Reduced complexity on video encoding for wireless multimedia sensor networks YAN, Zhuge A wireless multimedia sensor network (WMSN) requires a video encoding system that should be energy-efficient because of its special characteristics: limited power capacity but long service life without maintenance. Besides, video is one of the most important information contents that a WMSN delivers. Motion estimation plays an important role in predictive coding, which is the key part of video encoding and requires large amount of computation. In order to reduce the computational complexity of motion estimation, the block-matching search algorithm is able to find the motion vectors using fewer search points. An algorithm which adds a predictive search technique to the enhanced modified orthogonal search (EMOS) has been proposed. It improves the efficiency of block-matching search algorithms, and has two self-adapting search patterns for large and small movements. The proposed algorithm requires less search points to work out the movement of blocks and provides acceptable image quality. This algorithm was also tested on field programmable gate array (FPGA) and Arduino platforms. Moreover, a back propagation neural network model is introduced for predictive block-matching. The proposed back propagation neural network has very simple structure with only 5 inputs, 5 hidden neurons and 1 output architecture. Because of its simplicity, it requires very little computational power which is negligible compared with existing computation complexity. The test results show the prediction accuracy in 10 - 30\% higher then the competing algorithms with a peak signal-to-noise ratro (PSNR) improvement up to 0.3 dB. The above advantages make it a feasible replacement of the current solution. With the information technology developing dramatically, there is reason to believe that the next generation video encoding standard HEVC will soon be able to run on very cheap platforms. Therefore, a prospective study on HEVC inter prediction acceleration was also carried out. We extracted specific image features that represent prediction unit texture, incorporated a machine learning technique, namely random forest, in HEVC intra prediction mode selection, to improve the performance of inter coding of HEVC. Benchmarking with other existing algorithms, our method extracts very specific features of image texture changes in terms of angle. Therefore the proposed method can achieve very high prediction accuracy. Having similar reduction in complexity, the proposed algorithm will be demonstrated to have a higher video quality compared with similar algorithms. 2020-05-28 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en arr https://eprints.nottingham.ac.uk/60558/1/Zhuge%20PhD.pdf YAN, Zhuge (2020) Reduced complexity on video encoding for wireless multimedia sensor networks. PhD thesis, University of Nottingham. wireless multimedia sensor network video processing
spellingShingle	wireless multimedia sensor network video processing YAN, Zhuge Reduced complexity on video encoding for wireless multimedia sensor networks
title	Reduced complexity on video encoding for wireless multimedia sensor networks
title_full	Reduced complexity on video encoding for wireless multimedia sensor networks
title_fullStr	Reduced complexity on video encoding for wireless multimedia sensor networks
title_full_unstemmed	Reduced complexity on video encoding for wireless multimedia sensor networks
title_short	Reduced complexity on video encoding for wireless multimedia sensor networks
title_sort	reduced complexity on video encoding for wireless multimedia sensor networks
topic	wireless multimedia sensor network video processing
url	https://eprints.nottingham.ac.uk/60558/

Reduced complexity on video encoding for wireless multimedia sensor networks

Similar Items