Peter L. Thomas
Enhancing composite Digital Documents Using XML-based Standoff Markup
Thomas, Peter L.; Brailsford, David F.
Authors
David F. Brailsford
Contributors
Peter R. King
Editor
Abstract
Document representations can rapidly become unwieldy if they try to encapsulate all possible document properties, ranging from abstract structure to detailed rendering and layout.
We present a composite document approach wherein an XMLbased document representation is linked via a shadow tree of bi-directional pointers to a PDF representation of the same document. Using a two-window viewer any material selected in the PDF can be related back to the corresponding material in the XML, and vice versa. In this way the treatment of specialist material such as mathematics, music or chemistry (e.g. via read aloud or play aloud ) can be activated via standard tools working within the XML representation, rather than requiring that application-specific structures be embedded in the PDF itself.
The problems of textual recognition and tree pattern matching between the two representations are discussed in detail. Comparisons are drawn between our use of a shadow tree of pointers to map between document representations and the use of a code-replacement shadow tree in technologies such as XBL.
Citation
Thomas, P. L., & Brailsford, D. F. Enhancing composite Digital Documents Using XML-based Standoff Markup. Presented at ACM Symposium on Document Enginering (DocEng05)
Conference Name | ACM Symposium on Document Enginering (DocEng05) |
---|---|
End Date | Nov 4, 2005 |
Publication Date | Jan 1, 2005 |
Deposit Date | Nov 7, 2005 |
Publicly Available Date | Oct 9, 2007 |
Peer Reviewed | Peer Reviewed |
Keywords | XML, PDF, standoff markup, composite documents, MathML, MusicXML, XBL |
Public URL | https://nottingham-repository.worktribe.com/output/1020733 |
Files
standoffpaper.pdf
(781 Kb)
PDF
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search