Using TEI XML Schema to Encode the Structures of Sarawak Gazette

Automatic extraction of information from old printed documents that have been digitised injudiciously ends up requiring a lot of human correction. To overcome the problem, one possible solution is to annotate the documents with markup. This paper presents the encoding of the digitised sampl...


Bibliographic Details
Main Authors: Tze-Min, Fong; Bali, Ranaivo-Malançon
Format: Article
Language: English
Published: IJSSH 2015
Subjects: T Technology (General)
Online Access: http://ir.unimas.my/id/eprint/12923/
http://ir.unimas.my/id/eprint/12923/1/Ranaivo.pdf
author Tze-Min, Fong
Bali, Ranaivo-Malançon
author_facet Tze-Min, Fong
Bali, Ranaivo-Malançon
author_sort Tze-Min, Fong
building UNIMAS Institutional Repository
collection Online Access
description Automatic extraction of information from old printed documents that have been digitised injudiciously ends up requiring a lot of human correction. To overcome the problem, one possible solution is to annotate the documents with markup. This paper presents the encoding of the digitised sample of Sarawak Gazette published from 1903 until 1939 using the standard TEI XML schema. The output of the work is a set of six TEI XML templates that are considered to represent the different layout structures found in the studied samples.
first_indexed 2025-11-15T06:37:33Z
format Article
id unimas-12923
institution Universiti Malaysia Sarawak
institution_category Local University
language English
last_indexed 2025-11-15T06:37:33Z
publishDate 2015
publisher IJSSH
recordtype eprints
repository_type Digital Repository
spelling unimas-12923 2021-06-22T16:18:17Z http://ir.unimas.my/id/eprint/12923/ Using TEI XML Schema to Encode the Structures of Sarawak Gazette Tze-Min, Fong Bali, Ranaivo-Malançon T Technology (General) Automatic extraction of information from old printed documents that have been digitised injudiciously ends up requiring a lot of human correction. To overcome the problem, one possible solution is to annotate the documents with markup. This paper presents the encoding of the digitised sample of Sarawak Gazette published from 1903 until 1939 using the standard TEI XML schema. The output of the work is a set of six TEI XML templates that are considered to represent the different layout structures found in the studied samples. IJSSH 2015 Article PeerReviewed text en http://ir.unimas.my/id/eprint/12923/1/Ranaivo.pdf Tze-Min, Fong and Bali, Ranaivo-Malançon (2015) Using TEI XML Schema to Encode the Structures of Sarawak Gazette. International Journal of Social Science and Humanity, 5 (10). ISSN 2010-3646 DOI: 10.7763/IJSSH.2015.V5.569
spellingShingle T Technology (General)
Tze-Min, Fong
Bali, Ranaivo-Malançon
Using TEI XML Schema to Encode the Structures of Sarawak Gazette
title Using TEI XML Schema to Encode the Structures of Sarawak Gazette
topic T Technology (General)
url http://ir.unimas.my/id/eprint/12923/
http://ir.unimas.my/id/eprint/12923/1/Ranaivo.pdf