Semantic image understanding: from pixel to word

The aim of semantic image understanding is to reveal the semantic meaning behind the image pixel. This thesis investigates problems related to semantic image understanding, and have made the following contributions. Our first contribution is to propose the usage of histogram matching in Multiple Ke...

Full description

Bibliographic Details
Main Author: Fu, Hao
Format: Thesis (University of Nottingham only)
Language:English
Published: 2012
Subjects:
Online Access:https://eprints.nottingham.ac.uk/12847/
_version_ 1848791592317485056
author Fu, Hao
author_facet Fu, Hao
author_sort Fu, Hao
building Nottingham Research Data Repository
collection Online Access
description The aim of semantic image understanding is to reveal the semantic meaning behind the image pixel. This thesis investigates problems related to semantic image understanding, and have made the following contributions. Our first contribution is to propose the usage of histogram matching in Multiple Kernel Learning. We treat the two-dimensional kernel matrix as an image and transfer the histogram matching algorithm in image processing to kernel matrix. Experiments on various computer vision and machine learning datasets have shown that our method can always boost the performance of state of the art MKL methods. Our second contribution is to advocate the segment-then-recognize strategy in pixel-level semantic image understanding. We have developed a new framework which tries to integrate semantic segmentation with low-level segmentation for proposing object consistent regions. We have also developed a novel method trying to integrate semantic segmentation with interactive segmentation. We found this segment-then-recognize strategy also works well on medical image data, where we designed a novel polar space random field model for proposing gland-like regions. In the realm of image-level semantic image understanding, our contribution is a novel way to utilize the random forest. Most of the previous works utilizing random forest store the posterior probabilities at each leaf node, and each random tree in the random forest is considered to be independent from each other. In contrast, we store the training samples instead of the posterior probabilities at each leaf node. We consider the random forest as a whole and propose the concept of semantic nearest neighbor and semantic similarity measure. Based on these two concepts, we devise novel methods for image annotation and image retrieval tasks.
first_indexed 2025-11-14T18:30:57Z
format Thesis (University of Nottingham only)
id nottingham-12847
institution University of Nottingham Malaysia Campus
institution_category Local University
language English
last_indexed 2025-11-14T18:30:57Z
publishDate 2012
recordtype eprints
repository_type Digital Repository
spelling nottingham-128472025-02-28T11:21:40Z https://eprints.nottingham.ac.uk/12847/ Semantic image understanding: from pixel to word Fu, Hao The aim of semantic image understanding is to reveal the semantic meaning behind the image pixel. This thesis investigates problems related to semantic image understanding, and have made the following contributions. Our first contribution is to propose the usage of histogram matching in Multiple Kernel Learning. We treat the two-dimensional kernel matrix as an image and transfer the histogram matching algorithm in image processing to kernel matrix. Experiments on various computer vision and machine learning datasets have shown that our method can always boost the performance of state of the art MKL methods. Our second contribution is to advocate the segment-then-recognize strategy in pixel-level semantic image understanding. We have developed a new framework which tries to integrate semantic segmentation with low-level segmentation for proposing object consistent regions. We have also developed a novel method trying to integrate semantic segmentation with interactive segmentation. We found this segment-then-recognize strategy also works well on medical image data, where we designed a novel polar space random field model for proposing gland-like regions. In the realm of image-level semantic image understanding, our contribution is a novel way to utilize the random forest. Most of the previous works utilizing random forest store the posterior probabilities at each leaf node, and each random tree in the random forest is considered to be independent from each other. In contrast, we store the training samples instead of the posterior probabilities at each leaf node. We consider the random forest as a whole and propose the concept of semantic nearest neighbor and semantic similarity measure. Based on these two concepts, we devise novel methods for image annotation and image retrieval tasks. 2012-12-13 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en arr https://eprints.nottingham.ac.uk/12847/1/HaoFU-Thesis.pdf Fu, Hao (2012) Semantic image understanding: from pixel to word. PhD thesis, University of Nottingham. semantic image understanding pixels multiple kernel learning mkl rkdi relative kernel distribution invariance semantic nearest neighbours snn semantic similarity measure ssm
spellingShingle semantic
image
understanding
pixels
multiple kernel learning
mkl
rkdi
relative kernel distribution invariance
semantic nearest neighbours
snn
semantic similarity measure
ssm
Fu, Hao
Semantic image understanding: from pixel to word
title Semantic image understanding: from pixel to word
title_full Semantic image understanding: from pixel to word
title_fullStr Semantic image understanding: from pixel to word
title_full_unstemmed Semantic image understanding: from pixel to word
title_short Semantic image understanding: from pixel to word
title_sort semantic image understanding: from pixel to word
topic semantic
image
understanding
pixels
multiple kernel learning
mkl
rkdi
relative kernel distribution invariance
semantic nearest neighbours
snn
semantic similarity measure
ssm
url https://eprints.nottingham.ac.uk/12847/