Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired
The use of electronic documents has rapidly increased in recent decades and the PDF is one the most commonly used electronic document formats. A scanned PDF is an image and does not actually contain any text. For the vision–impaired user who is dependent upon a screen reader to access this informati...
| Main Authors: | , , |
|---|---|
| Format: | Journal Article |
| Published: |
SERSC Science & Engineering Research Support Society
2014
|
| Subjects: | |
| Online Access: | http://hdl.handle.net/20.500.11937/10775 |
| _version_ | 1848747625515319296 |
|---|---|
| author | Nazemi, Azadeh Murray, Iain McMeekin, David |
| author_facet | Nazemi, Azadeh Murray, Iain McMeekin, David |
| author_sort | Nazemi, Azadeh |
| building | Curtin Institutional Repository |
| collection | Online Access |
| description | The use of electronic documents has rapidly increased in recent decades and the PDF is one the most commonly used electronic document formats. A scanned PDF is an image and does not actually contain any text. For the vision–impaired user who is dependent upon a screen reader to access this information, this format is not useful. Thus addressing PDF accessibility through assistive technology has now become an important concern. PDF layout analysis provides precious formatting information that supports PDF component classification. This classification facilitates the tag generation. Accurate tagging produces a searchable and navigable scanned PDF document. This paper describes several practical segmentation methods which are easy to implement and efficient for PDF layout analysis so that the scanned PDF document can be navigated or searched using assistive technologies. |
| first_indexed | 2025-11-14T06:52:07Z |
| format | Journal Article |
| id | curtin-20.500.11937-10775 |
| institution | Curtin University Malaysia |
| institution_category | Local University |
| last_indexed | 2025-11-14T06:52:07Z |
| publishDate | 2014 |
| publisher | SERSC Science & Engineering Research Support Society |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | curtin-20.500.11937-107752017-09-13T16:04:16Z Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired Nazemi, Azadeh Murray, Iain McMeekin, David Optical character recognition (OCR) Vision-impaired PDF layout analysis The use of electronic documents has rapidly increased in recent decades and the PDF is one the most commonly used electronic document formats. A scanned PDF is an image and does not actually contain any text. For the vision–impaired user who is dependent upon a screen reader to access this information, this format is not useful. Thus addressing PDF accessibility through assistive technology has now become an important concern. PDF layout analysis provides precious formatting information that supports PDF component classification. This classification facilitates the tag generation. Accurate tagging produces a searchable and navigable scanned PDF document. This paper describes several practical segmentation methods which are easy to implement and efficient for PDF layout analysis so that the scanned PDF document can be navigated or searched using assistive technologies. 2014 Journal Article http://hdl.handle.net/20.500.11937/10775 10.14257/ijsip.2014.7.4.03 SERSC Science & Engineering Research Support Society restricted |
| spellingShingle | Optical character recognition (OCR) Vision-impaired PDF layout analysis Nazemi, Azadeh Murray, Iain McMeekin, David Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired |
| title | Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired |
| title_full | Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired |
| title_fullStr | Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired |
| title_full_unstemmed | Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired |
| title_short | Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired |
| title_sort | practical segmentation methods for logical and geometric layout analysis to improve scanned pdf accessibility to vision impaired |
| topic | Optical character recognition (OCR) Vision-impaired PDF layout analysis |
| url | http://hdl.handle.net/20.500.11937/10775 |