Enhancing the searchability of page-image PDF documents using an aligned hidden layer from a truth text
(2016)
Conference Proceeding
The search accuracy achieved in a PDF image-plus-hidden- text (PDF-IT) document depends upon the accuracy of the optical character recognition (OCR) process that produced the searchable hidden text layer. In many cases recognising words in a blurred... Read More about Enhancing the searchability of page-image PDF documents using an aligned hidden layer from a truth text.