Enhancing composite Digital Documents Using XML-based Standoff Markup

Document representations can rapidly become unwieldy if they try to encapsulate all possible document properties, ranging from abstract structure to detailed rendering and layout. We present a composite document approach wherein an XML-based document representation is linked via a shadow tree of b...

Bibliographic Details
Main Authors: Thomas, Peter L., Brailsford, David F.
Other Authors: King, Peter R.
Format: Conference or Workshop Item
Published: ACM Press 2005
Subjects:
Online Access: https://eprints.nottingham.ac.uk/284/
_version_ 1848790386317721600
author Thomas, Peter L.
Brailsford, David F.
author2 King, Peter R.
author_facet King, Peter R.
Thomas, Peter L.
Brailsford, David F.
author_sort Thomas, Peter L.
building Nottingham Research Data Repository
collection Online Access
description Document representations can rapidly become unwieldy if they try to encapsulate all possible document properties, ranging from abstract structure to detailed rendering and layout. We present a composite document approach wherein an XML-based document representation is linked via a shadow tree of bi-directional pointers to a PDF representation of the same document. Using a two-window viewer, any material selected in the PDF can be related back to the corresponding material in the XML, and vice versa. In this way the treatment of specialist material such as mathematics, music or chemistry (e.g. via read aloud or play aloud) can be activated via standard tools working within the XML representation, rather than requiring that application-specific structures be embedded in the PDF itself. The problems of textual recognition and tree pattern matching between the two representations are discussed in detail. Comparisons are drawn between our use of a shadow tree of pointers to map between document representations and the use of a code-replacement shadow tree in technologies such as XBL.
first_indexed 2025-11-14T18:11:47Z
format Conference or Workshop Item
id nottingham-284
institution University of Nottingham Malaysia Campus
institution_category Local University
last_indexed 2025-11-14T18:11:47Z
publishDate 2005
publisher ACM Press
recordtype eprints
repository_type Digital Repository
spelling nottingham-284 2020-05-04T20:31:03Z https://eprints.nottingham.ac.uk/284/ Thomas, Peter L. and Brailsford, David F. (2005) Enhancing composite Digital Documents Using XML-based Standoff Markup. In: ACM Symposium on Document Engineering (DocEng05), 2-4 November 2005, Bristol, UK. ACM Press. Other author: King, Peter R. Conference or Workshop Item, PeerReviewed. Keywords: XML, PDF, standoff markup, composite documents, MathML, MusicXML, XBL.
title Enhancing composite Digital Documents Using XML-based Standoff Markup
title_sort enhancing composite digital documents using xml-based standoff markup
topic XML
PDF
standoff markup
composite documents
MathML
MusicXML
XBL
url https://eprints.nottingham.ac.uk/284/