David F. Brailsford
Automated conversion of Web-based marriage register data into a printed format with predefined layout
Brailsford, David F.
Authors
Abstract
The Phillimore Marriage Registers for England were published in the period 1896 to 1922 and have defined a standard layout format for the typesetting of marriage data. However, not all English parish churches had their marriage registers analysed and printed by the Phillimore organisation within this time period.
This paper tells the story of Wirksworth, a town in Derbyshire with a large church, licensed for marriages, yet whose marriage data was not released to the Phillimore organisation. Hence there is no printed Phillimore Marriages volume for Wirksworth. However, in recent years, a Wirksworth web site, created by John Palmer, has become famous as being probably the most comprehensive record of a parish’s activities anywhere on the Web.
Within a total of 120 MB of data on the web site, covering events in Wirksworth from medieval times to the present, is a set of data recording births, marriages and deaths transcribed from the original hand-written church register volumes.
The work described here covers the software tools and techniques that were used in creating a set of awk scripts to extract all the marriage records from the Wirksworth web site data. The extracted material was then automatically re-processed, typeset and indexed to form an entirely new Phillimore-style volume for Wirksworth marriages.
Citation
Brailsford, D. F. (2011). Automated conversion of Web-based marriage register data into a printed format with predefined layout.
Conference Name | ACM Symposium on Document Engineering (DocEng '11) |
---|---|
End Date | Sep 22, 2011 |
Publication Date | Sep 1, 2011 |
Deposit Date | Feb 24, 2015 |
Publicly Available Date | Feb 24, 2015 |
Peer Reviewed | Peer Reviewed |
Keywords | Re-typesetting, Web-to-Print, troff, genealogy, hyperlinking, indexing |
Public URL | https://nottingham-repository.worktribe.com/output/1009703 |
Publisher URL | http://dl.acm.org/citation.cfm?id=2034704 |
Additional Information | Published in: DocEng '11: proceedings of the 11th ACM Symposium on Document Engineering. New York : ACM, 2011, ISBN: 978-1-4503-0863-2. pp. 61-64, doi: 10.1145/2034691.2034704 |
Files
eprint-dfb2011.pdf
(85 Kb)
PDF
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search