Using TEI XML Schema to Encode the Structures of Sarawak Gazette
Automatic extraction of information from old printed documents which have been digitised injudiciously will end up with a lot human corrections. To overcome the problem, one possible solution is to annotate the documents with some markups. This paper presents the encoding of the digitised sampl...
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IJSSH
2015
|
| Subjects: | |
| Online Access: | http://ir.unimas.my/id/eprint/12923/ http://ir.unimas.my/id/eprint/12923/1/Ranaivo.pdf |
| _version_ | 1848837305817628672 |
|---|---|
| author | Tze-Min, Fong Bali, Ranaivo-Malançon |
| author_facet | Tze-Min, Fong Bali, Ranaivo-Malançon |
| author_sort | Tze-Min, Fong |
| building | UNIMAS Institutional Repository |
| collection | Online Access |
| description | Automatic extraction of information from old
printed documents which have been digitised injudiciously will
end up with a lot human corrections. To overcome the problem,
one possible solution is to annotate the documents with some
markups. This paper presents the encoding of the digitised
sample of Sarawak Gazette published from 1903 until 1939
using the standard TEI XML schema. The output of the work is
a set of six TEI XML templates that is considered to represent
the different layout structures found in the studied samples. |
| first_indexed | 2025-11-15T06:37:33Z |
| format | Article |
| id | unimas-12923 |
| institution | Universiti Malaysia Sarawak |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-15T06:37:33Z |
| publishDate | 2015 |
| publisher | IJSSH |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | unimas-129232021-06-22T16:18:17Z http://ir.unimas.my/id/eprint/12923/ Using TEI XML Schema to Encode the Structures of Sarawak Gazette Tze-Min, Fong Bali, Ranaivo-Malançon T Technology (General) Automatic extraction of information from old printed documents which have been digitised injudiciously will end up with a lot human corrections. To overcome the problem, one possible solution is to annotate the documents with some markups. This paper presents the encoding of the digitised sample of Sarawak Gazette published from 1903 until 1939 using the standard TEI XML schema. The output of the work is a set of six TEI XML templates that is considered to represent the different layout structures found in the studied samples. IJSSH 2015 Article PeerReviewed text en http://ir.unimas.my/id/eprint/12923/1/Ranaivo.pdf Tze-Min, Fong and Bali, Ranaivo-Malançon (2015) Using TEI XML Schema to Encode the Structures of Sarawak Gazette. International Journal of Social Science and Humanity, 5 (10). ISSN 2010-3646 DOI: 10.7763/IJSSH.2015.V5.569 |
| spellingShingle | T Technology (General) Tze-Min, Fong Bali, Ranaivo-Malançon Using TEI XML Schema to Encode the Structures of Sarawak Gazette |
| title | Using TEI XML Schema to Encode the Structures of
Sarawak Gazette |
| title_full | Using TEI XML Schema to Encode the Structures of
Sarawak Gazette |
| title_fullStr | Using TEI XML Schema to Encode the Structures of
Sarawak Gazette |
| title_full_unstemmed | Using TEI XML Schema to Encode the Structures of
Sarawak Gazette |
| title_short | Using TEI XML Schema to Encode the Structures of
Sarawak Gazette |
| title_sort | using tei xml schema to encode the structures of
sarawak gazette |
| topic | T Technology (General) |
| url | http://ir.unimas.my/id/eprint/12923/ http://ir.unimas.my/id/eprint/12923/ http://ir.unimas.my/id/eprint/12923/1/Ranaivo.pdf |