A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification
Unlike standard object classification, where the image to be classified contains one or multiple instances of the same object, indoor scene classification is quite different since the image consists of multiple distinct objects. Furthermore, these objects can be of varying sizes and are present across numerous spatial locations in different layouts. For automatic indoor scene categorization, large-scale spatial layout deformations and scale variations are therefore two major challenges, and the design of rich feature descriptors which are robust to these challenges is still an open problem. This paper introduces a new learnable feature descriptor called 'spatial layout and scale invariant convolutional activations' to deal with these challenges. For this purpose, a new convolutional neural network architecture is designed which incorporates a novel 'spatially unstructured' layer to introduce robustness against spatial layout deformations. To achieve scale invariance, we present a pyramidal image representation. To make training of the proposed network feasible for indoor-scene images, this paper proposes a methodology which efficiently adapts a network model, trained on large-scale data, to our task with only a limited amount of available training data. The efficacy of the proposed approach is demonstrated through extensive experiments on a number of data sets, including the MIT-67, Scene-15, Sports-8, Graz-02, and NYU data sets.
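The abstract mentions a pyramidal image representation to achieve scale invariance. Below is a minimal, illustrative sketch of that general idea, not the paper's implementation: convolutional activations are computed at several image scales and pooled into a single scale-robust descriptor. The trunk architecture, the scale set, and the max-pooling across scales are assumptions made here for illustration only.

```python
# Illustrative sketch (not the authors' code): a pyramidal, multi-scale image
# representation. The image is rescaled to several sizes, convolutional
# activations are computed at each scale, and the per-scale descriptors are
# pooled into one scale-robust feature vector.
import torch
import torch.nn as nn
import torch.nn.functional as F

class PyramidalDescriptor(nn.Module):
    def __init__(self, scales=(1.0, 0.75, 0.5), feat_dim=128):
        super().__init__()
        self.scales = scales
        # Stand-in convolutional trunk; the paper uses a deep CNN instead.
        self.trunk = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),            # global pooling -> (B, feat_dim, 1, 1)
        )

    def forward(self, img):                     # img: (B, 3, H, W)
        feats = []
        for s in self.scales:
            scaled = F.interpolate(img, scale_factor=s, mode="bilinear",
                                   align_corners=False)
            feats.append(self.trunk(scaled).flatten(1))     # (B, feat_dim)
        # Element-wise max over scales: the descriptor changes little when the
        # same content appears at a different size.
        return torch.stack(feats, dim=0).max(dim=0).values

if __name__ == "__main__":
    model = PyramidalDescriptor()
    x = torch.randn(2, 3, 224, 224)             # two dummy RGB images
    print(model(x).shape)                       # torch.Size([2, 128])
```

The choice of max-pooling over scales is one common way to merge multi-scale activations; the paper's exact pooling and its 'spatially unstructured' layer are not reproduced here.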
| Main Authors: | Hayat, M., Khan, S., Bennamoun, M., An, Senjian |
|---|---|
| Format: | Journal Article |
| Published: | IEEE, 2016 |
| Online Access: | http://hdl.handle.net/20.500.11937/69931 |
| _version_ | 1848762170495467520 |
|---|---|
| author | Hayat, M. Khan, S. Bennamoun, M. An, Senjian |
| author_facet | Hayat, M. Khan, S. Bennamoun, M. An, Senjian |
| author_sort | Hayat, M. |
| building | Curtin Institutional Repository |
| collection | Online Access |
| description | Unlike standard object classification, where the image to be classified contains one or multiple instances of the same object, indoor scene classification is quite different since the image consists of multiple distinct objects. Furthermore, these objects can be of varying sizes and are present across numerous spatial locations in different layouts. For automatic indoor scene categorization, large-scale spatial layout deformations and scale variations are therefore two major challenges, and the design of rich feature descriptors which are robust to these challenges is still an open problem. This paper introduces a new learnable feature descriptor called 'spatial layout and scale invariant convolutional activations' to deal with these challenges. For this purpose, a new convolutional neural network architecture is designed which incorporates a novel 'spatially unstructured' layer to introduce robustness against spatial layout deformations. To achieve scale invariance, we present a pyramidal image representation. To make training of the proposed network feasible for indoor-scene images, this paper proposes a methodology which efficiently adapts a network model, trained on large-scale data, to our task with only a limited amount of available training data. The efficacy of the proposed approach is demonstrated through extensive experiments on a number of data sets, including the MIT-67, Scene-15, Sports-8, Graz-02, and NYU data sets. |
| first_indexed | 2025-11-14T10:43:19Z |
| format | Journal Article |
| id | curtin-20.500.11937-69931 |
| institution | Curtin University Malaysia |
| institution_category | Local University |
| last_indexed | 2025-11-14T10:43:19Z |
| publishDate | 2016 |
| publisher | IEEE |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | curtin-20.500.11937-699312019-01-24T02:47:44Z A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification Hayat, M. Khan, S. Bennamoun, M. An, Senjian Unlike standard object classification, where the image to be classified contains one or multiple instances of the same object, indoor scene classification is quite different since the image consists of multiple distinct objects. Furthermore, these objects can be of varying sizes and are present across numerous spatial locations in different layouts. For automatic indoor scene categorization, large-scale spatial layout deformations and scale variations are therefore two major challenges and the design of rich feature descriptors which are robust to these challenges is still an open problem. This paper introduces a new learnable feature descriptor called 'spatial layout and scale invariant convolutional activations' to deal with these challenges. For this purpose, a new convolutional neural network architecture is designed which incorporates a novel 'spatially unstructured' layer to introduce robustness against spatial layout deformations. To achieve scale invariance, we present a pyramidal image representation. For feasible training of the proposed network for images of indoor scenes, this paper proposes a methodology, which efficiently adapts a trained network model (on a large-scale data) for our task with only a limited amount of available training data. The efficacy of the proposed approach is demonstrated through extensive experiments on a number of data sets, including MIT-67, Scene-15, Sports-8, Graz-02, and NYU data sets. 2016 Journal Article http://hdl.handle.net/20.500.11937/69931 10.1109/TIP.2016.2599292 IEEE restricted |
| spellingShingle | Hayat, M. Khan, S. Bennamoun, M. An, Senjian A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification |
| title | A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification |
| title_full | A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification |
| title_fullStr | A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification |
| title_full_unstemmed | A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification |
| title_short | A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification |
| title_sort | spatial layout and scale invariant feature representation for indoor scene classification |
| url | http://hdl.handle.net/20.500.11937/69931 |
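The description field also mentions efficiently adapting a network model, trained on large-scale data, to indoor scene classification with only limited training data. A minimal, hypothetical fine-tuning sketch of that general strategy follows; freezing the trunk and training only a new head is an assumption for illustration, not the paper's exact adaptation methodology.

```python
# Illustrative sketch (assumptions, not the paper's training code): adapting a
# network pre-trained on large-scale data to a small indoor-scene dataset by
# freezing the convolutional trunk and training only a new classifier head.
import torch
import torch.nn as nn
from torchvision import models

NUM_SCENE_CLASSES = 67                      # e.g. MIT-67 has 67 indoor categories

# weights=None keeps the sketch runnable offline; in practice the pre-trained
# weights (e.g. ImageNet) would be loaded here.
net = models.resnet18(weights=None)

for p in net.parameters():                  # freeze the pre-trained trunk
    p.requires_grad = False

# Replace the final layer with a new, trainable head for scene categories.
net.fc = nn.Linear(net.fc.in_features, NUM_SCENE_CLASSES)

optimizer = torch.optim.SGD(net.fc.parameters(), lr=1e-2, momentum=0.9)
criterion = nn.CrossEntropyLoss()

# One dummy training step on random tensors, standing in for the limited
# amount of labelled indoor-scene images mentioned in the abstract.
images = torch.randn(4, 3, 224, 224)
labels = torch.randint(0, NUM_SCENE_CLASSES, (4,))
loss = criterion(net(images), labels)
loss.backward()
optimizer.step()
print(float(loss))
```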