Enabling multimodal interaction in web-based personal digital photo browsing

Retrieval process of both digital photos and physical photos has not been easy, especially when the collections grow into thousands. In this paper, we describe an interactive web-based photo retrieval system that enables personal digital photo users to accomplish photo browsing by using multimodal i...

Full description

Bibliographic Details
Main Authors: Ismail, N. A, O'Brien, E. A
Format: Conference or Workshop Item
Language:English
Published: 2008
Subjects:
Online Access:http://eprints.utm.my/5732/
http://eprints.utm.my/5732/1/ICCCE2008_preprint_version_UTM_IR.pdf
_version_ 1848891111187152896
author Ismail, N. A
O'Brien, E. A
author_facet Ismail, N. A
O'Brien, E. A
author_sort Ismail, N. A
building UTeM Institutional Repository
collection Online Access
description Retrieval process of both digital photos and physical photos has not been easy, especially when the collections grow into thousands. In this paper, we describe an interactive web-based photo retrieval system that enables personal digital photo users to accomplish photo browsing by using multimodal interaction. This system not only enables users to use mouse clicks input modalities but also speech input modality to browse their personal digital photos in the World Wide Web (WWW) environment. The prototype system and it architecture utilize web technology which was build using web programming scripting (JavaScript, XHTML, ASP, XML based markup language) and image database in order to achieve its objective. All prototype programs and data files including the user’s photo repository, profiles, dialogues, grammars, prompt, and retrieval engine are stored and located in the web server. Our approach also consists of human-computer speech dialogue based on photo browsing of image content by four main categories (Who? What? When? and Where?). Our user study with 20 digital photo users showed that the participants reacted positively to their experience with the system interactions.
first_indexed 2025-11-15T20:52:46Z
format Conference or Workshop Item
id utm-5732
institution Universiti Teknologi Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T20:52:46Z
publishDate 2008
recordtype eprints
repository_type Digital Repository
spelling utm-57322011-06-08T05:06:25Z http://eprints.utm.my/5732/ Enabling multimodal interaction in web-based personal digital photo browsing Ismail, N. A O'Brien, E. A T Technology (General) TR Photography QA76 Computer software Retrieval process of both digital photos and physical photos has not been easy, especially when the collections grow into thousands. In this paper, we describe an interactive web-based photo retrieval system that enables personal digital photo users to accomplish photo browsing by using multimodal interaction. This system not only enables users to use mouse clicks input modalities but also speech input modality to browse their personal digital photos in the World Wide Web (WWW) environment. The prototype system and it architecture utilize web technology which was build using web programming scripting (JavaScript, XHTML, ASP, XML based markup language) and image database in order to achieve its objective. All prototype programs and data files including the user’s photo repository, profiles, dialogues, grammars, prompt, and retrieval engine are stored and located in the web server. Our approach also consists of human-computer speech dialogue based on photo browsing of image content by four main categories (Who? What? When? and Where?). Our user study with 20 digital photo users showed that the participants reacted positively to their experience with the system interactions. 2008-05-13 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/5732/1/ICCCE2008_preprint_version_UTM_IR.pdf Ismail, N. A and O'Brien, E. A (2008) Enabling multimodal interaction in web-based personal digital photo browsing. In: International Conference on Computer and Communication Engineering 2008, 13 to 15 May 2008 , Kuala Lumpur. (Unpublished)
spellingShingle T Technology (General)
TR Photography
QA76 Computer software
Ismail, N. A
O'Brien, E. A
Enabling multimodal interaction in web-based personal digital photo browsing
title Enabling multimodal interaction in web-based personal digital photo browsing
title_full Enabling multimodal interaction in web-based personal digital photo browsing
title_fullStr Enabling multimodal interaction in web-based personal digital photo browsing
title_full_unstemmed Enabling multimodal interaction in web-based personal digital photo browsing
title_short Enabling multimodal interaction in web-based personal digital photo browsing
title_sort enabling multimodal interaction in web-based personal digital photo browsing
topic T Technology (General)
TR Photography
QA76 Computer software
url http://eprints.utm.my/5732/
http://eprints.utm.my/5732/1/ICCCE2008_preprint_version_UTM_IR.pdf