Skip to main content

Research Repository

Advanced Search

All Outputs (10)

Automatically Labeling Cyber Threat Intelligence reports using Natural Language Processing (2023)
Conference Proceeding
Abdi, H., Bagley, S. R., Furnell, S., & Twycross, J. (2023). Automatically Labeling Cyber Threat Intelligence reports using Natural Language Processing. In DocEng ’23 : Proceedings of the 2023 ACM Symposium on Document Engineering. https://doi.org/10.1145/3573128.3609348

Attribution provides valuable intelligence in the face of Advanced Persistent Threat (APT) attacks. By accurately identifying the culprits and actors behind the attacks, we can gain more insights into their motivations, capabilities, and potential fu... Read More about Automatically Labeling Cyber Threat Intelligence reports using Natural Language Processing.

Generating summary documents for a variable-quality PDF document collection (2014)
Conference Proceeding
Hughes, J., Brailsford, D. F., Bagley, S. R., & Adams, C. E. (2014). Generating summary documents for a variable-quality PDF document collection.

The Cochrane Schizophrenia Group’s Register of studies details all aspects of the effects of treating people with schizophrenia. It has been gathered over the last 20 years and consists of around 20,000 documents, overwhelmingly in PDF. Document coll... Read More about Generating summary documents for a variable-quality PDF document collection.

Revisiting a summer vacation: digital restoration and typesetter forensics (2013)
Conference Proceeding
Bagley, S. R., Brailsford, D. F., & Kernighan, B. W. (2013). Revisiting a summer vacation: digital restoration and typesetter forensics.

In 1979 the Computing Science Research Center (‘Center 127’) at Bell Laboratories bought a Linotron 202 typesetter from the Mergenthaler company. This was a ‘third generation’ digital machine that used a CRT to image characters onto photographic pape... Read More about Revisiting a summer vacation: digital restoration and typesetter forensics.

No need to justify your choice: pre-compiling line breaks to improve eBook readability (2013)
Conference Proceeding
Pinkney, A. J., Bagley, S. R., & Brailsford, D. F. (2013). No need to justify your choice: pre-compiling line breaks to improve eBook readability.

Implementations of eBooks have existed in one form or another for at least the past 20 years, but it is only in the past 5 years that dedicated eBook hardware has become a mass-market item. New screen technologies, such as e-paper, provide a readi... Read More about No need to justify your choice: pre-compiling line breaks to improve eBook readability.

Reflowable documents composed from pre-rendered atomic components (2011)
Conference Proceeding
Pinkney, A. J., Bagley, S. R., & Brailsford, D. F. (2011). Reflowable documents composed from pre-rendered atomic components.

Mobile eBook readers are now commonplace in today’s society, but their document layout algorithms remain basic, largely due to constraints imposed by short battery life. At present, with any eBook file format not based on PDF, the layout of the docum... Read More about Reflowable documents composed from pre-rendered atomic components.

Optimized reprocessing of documents using stored processor state (2010)
Conference Proceeding
Ollis, J. A., Brailsford, D. F., & Bagley, S. R. (2010). Optimized reprocessing of documents using stored processor state.

Variable Data Printing (VDP) allows customised versions of material such as advertising flyers to be readily produced. However, VDP is often extremely demanding of computing resources because, even when much of the material stays invariant from one d... Read More about Optimized reprocessing of documents using stored processor state.

Tracking sub-page components in document workflows (2008)
Conference Proceeding
Ollis, J. A., Bagley, S. R., & Brailsford, D. F. (2008). Tracking sub-page components in document workflows. . https://doi.org/10.1145/1410140.1410156

Documents go through numerous transformations and intermediate formats as they are processed from abstract markup into final printable form. This notion of a document workflow is well established but it is common to find that ideas about document com... Read More about Tracking sub-page components in document workflows.

Extracting reusable document components for variable data printing (2007)
Conference Proceeding
Bagley, S. R., Brailsford, D. F., & Ollis, J. A. (2007). Extracting reusable document components for variable data printing.

Variable Data Printing (VDP) has brought new flexibility and dynamism to the printed page. Each printed instance of a specific class of document can now have different degrees of customized content within the document template. This flexibility co... Read More about Extracting reusable document components for variable data printing.

Page Composition using PPML as a Link-editing Script (2004)
Conference Proceeding
Bagley, S. R., & Brailsford, D. F. (2004). Page Composition using PPML as a Link-editing Script. In J. Vion-Dury (Ed.),

The advantages of a COG (Component Object Graphic) approach to the composition of PDF pages have been set out in a previous paper [1]. However, if pages are to be composed in this way then the individual graphic objects must have known bounding boxe... Read More about Page Composition using PPML as a Link-editing Script.

Creating reusable well-structured pdf as a sequence of component object graphic (cog) elements (2003)
Conference Proceeding
Bagley, S. R., Brailsford, D. F., & Hardy, M. R. B. (2003). Creating reusable well-structured pdf as a sequence of component object graphic (cog) elements. In C. Vanoirbeek, C. Roisin, & E. Munson (Eds.),

Portable Document Format (PDF) is a page-oriented, graphically rich format based on PostScript semantics and it is also the format interpreted by the Adobe Acrobat viewers. Although each of the pages in a PDF document is an independent graphic objec... Read More about Creating reusable well-structured pdf as a sequence of component object graphic (cog) elements.