Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired

The use of electronic documents has rapidly increased in recent decades and the PDF is one the most commonly used electronic document formats. A scanned PDF is an image and does not actually contain any text. For the vision–impaired user who is dependent upon a screen reader to access this informati...

Full description

Bibliographic Details
Main Authors: Nazemi, Azadeh, Murray, Iain, McMeekin, David
Format: Journal Article
Published: SERSC Science & Engineering Research Support Society 2014
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/10775
_version_ 1848747625515319296
author Nazemi, Azadeh
Murray, Iain
McMeekin, David
author_facet Nazemi, Azadeh
Murray, Iain
McMeekin, David
author_sort Nazemi, Azadeh
building Curtin Institutional Repository
collection Online Access
description The use of electronic documents has rapidly increased in recent decades and the PDF is one the most commonly used electronic document formats. A scanned PDF is an image and does not actually contain any text. For the vision–impaired user who is dependent upon a screen reader to access this information, this format is not useful. Thus addressing PDF accessibility through assistive technology has now become an important concern. PDF layout analysis provides precious formatting information that supports PDF component classification. This classification facilitates the tag generation. Accurate tagging produces a searchable and navigable scanned PDF document. This paper describes several practical segmentation methods which are easy to implement and efficient for PDF layout analysis so that the scanned PDF document can be navigated or searched using assistive technologies.
first_indexed 2025-11-14T06:52:07Z
format Journal Article
id curtin-20.500.11937-10775
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T06:52:07Z
publishDate 2014
publisher SERSC Science & Engineering Research Support Society
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-107752017-09-13T16:04:16Z Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired Nazemi, Azadeh Murray, Iain McMeekin, David Optical character recognition (OCR) Vision-impaired PDF layout analysis The use of electronic documents has rapidly increased in recent decades and the PDF is one the most commonly used electronic document formats. A scanned PDF is an image and does not actually contain any text. For the vision–impaired user who is dependent upon a screen reader to access this information, this format is not useful. Thus addressing PDF accessibility through assistive technology has now become an important concern. PDF layout analysis provides precious formatting information that supports PDF component classification. This classification facilitates the tag generation. Accurate tagging produces a searchable and navigable scanned PDF document. This paper describes several practical segmentation methods which are easy to implement and efficient for PDF layout analysis so that the scanned PDF document can be navigated or searched using assistive technologies. 2014 Journal Article http://hdl.handle.net/20.500.11937/10775 10.14257/ijsip.2014.7.4.03 SERSC Science & Engineering Research Support Society restricted
spellingShingle Optical character recognition (OCR)
Vision-impaired
PDF layout analysis
Nazemi, Azadeh
Murray, Iain
McMeekin, David
Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
title Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
title_full Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
title_fullStr Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
title_full_unstemmed Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
title_short Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
title_sort practical segmentation methods for logical and geometric layout analysis to improve scanned pdf accessibility to vision impaired
topic Optical character recognition (OCR)
Vision-impaired
PDF layout analysis
url http://hdl.handle.net/20.500.11937/10775