Enhancing the searchability of page-image PDF documents using an aligned hidden layer from a truth text
The search accuracy achieved in a PDF image-plus-hidden- text (PDF-IT) document depends upon the accuracy of the optical character recognition (OCR) process that produced the searchable hidden text layer. In many cases recognising words in a blurred area of a PDF page image may exceed the capabiliti...
| Main Authors: | Knight, Ian A., Brailsford, David F. |
|---|---|
| Format: | Conference or Workshop Item |
| Language: | English |
| Published: |
2016
|
| Subjects: | |
| Online Access: | https://eprints.nottingham.ac.uk/45753/ |
Similar Items
Generating summary documents for a variable-quality PDF document collection
by: Hughes, Jacob, et al.
Published: (2014)
by: Hughes, Jacob, et al.
Published: (2014)
A novel approach to handwritten character recognition
by: Clarke, Eddie
Published: (1995)
by: Clarke, Eddie
Published: (1995)
Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
by: Nazemi, Azadeh, et al.
Published: (2014)
by: Nazemi, Azadeh, et al.
Published: (2014)
Tracking sub-page components in document workflows
by: Ollis, James A., et al.
Published: (2008)
by: Ollis, James A., et al.
Published: (2008)
Subject to truth: before and after governmentality in Foucault’s 1970s
by: Legg, Stephen
Published: (2016)
by: Legg, Stephen
Published: (2016)
Ground truthing protocols for biomass estimation in rangeland environments
by: Mundava, C., et al.
Published: (2013)
by: Mundava, C., et al.
Published: (2013)
Using built-in functions of Adobe Acrobat Pro DC to help the selection process in systematic review of randomized trials
by: Nur, Selin, et al.
Published: (2016)
by: Nur, Selin, et al.
Published: (2016)
Page Composition using PPML as a Link-editing Script
by: Bagley, Steven R., et al.
Published: (2004)
by: Bagley, Steven R., et al.
Published: (2004)
Reflowable documents composed from pre-rendered atomic components
by: Pinkney, Alexander J., et al.
Published: (2011)
by: Pinkney, Alexander J., et al.
Published: (2011)
Nefarious presentism
by: Tallant, Jonathan, et al.
Published: (2015)
by: Tallant, Jonathan, et al.
Published: (2015)
Separate compilation of structured documents
by: Groves, Michael J., et al.
Published: (1993)
by: Groves, Michael J., et al.
Published: (1993)
Creating reusable well-structured pdf as a sequence of component object graphic (cog) elements
by: Bagley, Steven R., et al.
Published: (2003)
by: Bagley, Steven R., et al.
Published: (2003)
“Heroic Souls”: Representations of the Black Female Heroism of Harriet Tubman and Sojourner Truth
by: James, Charlotte
Published: (2022)
by: James, Charlotte
Published: (2022)
Laying out the future of final-form digital documents
by: Brailsford, David F.
Published: (2006)
by: Brailsford, David F.
Published: (2006)
Automated re-typesetting, indexing and content enhancement for scanned marriage registers
by: Brailsford, David F.
Published: (2009)
by: Brailsford, David F.
Published: (2009)
Reconstituting typeset Marriage Registers using simple software tools
by: Brailsford, David F.
Published: (2012)
by: Brailsford, David F.
Published: (2012)
Enhancing composite Digital Documents Using XML-based Standoff Markup
by: Thomas, Peter L., et al.
Published: (2005)
by: Thomas, Peter L., et al.
Published: (2005)
Truth maintenance in knowledge-based systems
by: Nguyen, Hai Hoang
Published: (2014)
by: Nguyen, Hai Hoang
Published: (2014)
Extracting reusable document components for variable data printing
by: Bagley, Steven R., et al.
Published: (2007)
by: Bagley, Steven R., et al.
Published: (2007)
Document analysis of PDF files: methods, results and implications
by: Lovegrove, William S., et al.
Published: (1995)
by: Lovegrove, William S., et al.
Published: (1995)
Using SVG as the Rendering Model for Structured and Graphically Complex Web Material
by: Mong, Julius, et al.
Published: (2003)
by: Mong, Julius, et al.
Published: (2003)
Text Categorization Using an Automatically Generated Labelled Dataset: An Evaluation Study
by: Zhu, Dengya, et al.
Published: (2014)
by: Zhu, Dengya, et al.
Published: (2014)
Creating Structured PDF Files Using XML Templates
by: Hardy, Matthew, et al.
Published: (2004)
by: Hardy, Matthew, et al.
Published: (2004)
Mapping and Displaying Structural Transformations between XML and PDF
by: Hardy, Matthew R. B., et al.
Published: (2002)
by: Hardy, Matthew R. B., et al.
Published: (2002)
Dynamic Link Inclusion in Online PDF Journals
by: Probets, Steve, et al.
Published: (1998)
by: Probets, Steve, et al.
Published: (1998)
A strategy for extracting information from semi-structured web pages.
by: Shaker, Mahmoud, et al.
Published: (2010)
by: Shaker, Mahmoud, et al.
Published: (2010)
State-of-the-art in techniques of text digital watermarking: challenges and limitations
by: Al-Maweri, Nasr Addin Ahmed Salem, et al.
Published: (2016)
by: Al-Maweri, Nasr Addin Ahmed Salem, et al.
Published: (2016)
Vector Graphics: From PostScript and Flash to SVG
by: Probets, Steve, et al.
Published: (2001)
by: Probets, Steve, et al.
Published: (2001)
Substituting outline fonts for bitmap fonts in archived PDF files
by: Probets, Steve, et al.
Published: (2003)
by: Probets, Steve, et al.
Published: (2003)
Literacy challenges faced by students using scientific texts
by: Thompson, Marilyn Joy
Published: (2011)
by: Thompson, Marilyn Joy
Published: (2011)
Hybrid texts and historical fiction
by: Nichols, Ian
Published: (2011)
by: Nichols, Ian
Published: (2011)
Adobe's Acrobat -- the Electronic Journal Catalyst?
by: Brailsford, David F.
Published: (1993)
by: Brailsford, David F.
Published: (1993)
CD-ROM Acrobat Journals Using Networks
by: Brailsford, David F., et al.
Published: (1994)
by: Brailsford, David F., et al.
Published: (1994)
Encapsulating and Manipulating Component Object Graphics (COGs) using SVG
by: Macdonald, Alexander J., et al.
Published: (2005)
by: Macdonald, Alexander J., et al.
Published: (2005)
Experience with the use of Acrobat in the CAJUN publishing project
by: Brailsford, David F.
Published: (1994)
by: Brailsford, David F.
Published: (1994)
Towards computation of novel ideas from corpora of scientific text
by: Liu, Haixia, et al.
Published: (2015)
by: Liu, Haixia, et al.
Published: (2015)
A driving simulator study to explore the effects of text size on the visual demand of in-vehicle displays
by: Crundall, Elizabeth, et al.
Published: (2016)
by: Crundall, Elizabeth, et al.
Published: (2016)
Visualization of extracted grammatical role of words using parse tree conversion to improve understanding of English texts
by: Mirzabeiki, Erfan
Published: (2014)
by: Mirzabeiki, Erfan
Published: (2014)
Adobe's Acrobat -- providing the missing link?
by: Brailsford, David F.
Published: (1994)
by: Brailsford, David F.
Published: (1994)
Towards structured, block-based PDF
by: Smith, Philip N., et al.
Published: (1995)
by: Smith, Philip N., et al.
Published: (1995)
Similar Items
-
Generating summary documents for a variable-quality PDF document collection
by: Hughes, Jacob, et al.
Published: (2014) -
A novel approach to handwritten character recognition
by: Clarke, Eddie
Published: (1995) -
Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
by: Nazemi, Azadeh, et al.
Published: (2014) -
Tracking sub-page components in document workflows
by: Ollis, James A., et al.
Published: (2008) -
Subject to truth: before and after governmentality in Foucault’s 1970s
by: Legg, Stephen
Published: (2016)