The development of Malaysian Corpus of Financial English (MaCFE)

This paper presents the processes involved in the design and development of the Malaysian Corpus of Financial English (MaCFE), a specialized corpus containing a wide range of online documents (i.e. communiqués) from various financial institutions in Malaysia. It describes in detail the pr...

Bibliographic Details
Main Authors: Roslan Sadjirin, Roslina Abdul Aziz, Noli Maishara Nordin, Mohd Rozaidi Ismail, Norzie Diana Baharum
Format: Article
Language: English
Published: Penerbit Universiti Kebangsaan Malaysia 2018
Online Access: http://journalarticle.ukm.my/17609/
http://journalarticle.ukm.my/17609/1/23571-82969-1-PB.pdf
building UKM Institutional Repository
collection Online Access
description This paper presents the processes involved in the design and development of the Malaysian Corpus of Financial English (MaCFE), a specialized corpus containing a wide range of online documents (i.e. communiqués) from various financial institutions in Malaysia. It describes in detail the collection and selection of data and the preprocessing of raw data, which includes digitizing, cleansing and tagging. The paper also introduces the MaCFE user interface and its built-in linguistic analysis features. MaCFE was designed and developed to give corpus linguistics researchers an avenue to explore the field and to give ESP/EAP practitioners in Malaysia a resource for developing locally based ESP/EAP curricula and teaching and learning materials. It is also intended to serve as a learning avenue for future financial professionals in their training. The MaCFE corpus contains approximately 4.3 million words from 1472 electronic documents retrieved from the official websites of banks and financial institutions. At present, users can query the MaCFE database using its built-in concordancer. In the future, its language-data-processing facilities will be expanded to include tools for keyword, wordlist and collocation queries.
first_indexed 2025-11-15T00:32:46Z
format Article
id oai:generic.eprints.org:17609
institution Universiti Kebangsaan Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T00:32:46Z
publishDate 2018
publisher Penerbit Universiti Kebangsaan Malaysia
recordtype eprints
repository_type Digital Repository
spelling oai:generic.eprints.org:17609 2021-11-22T04:59:50Z 2018-08 Article PeerReviewed application/pdf en Roslan Sadjirin, Roslina Abdul Aziz, Noli Maishara Nordin, Mohd Rozaidi Ismail and Norzie Diana Baharum (2018) The development of Malaysian Corpus of Financial English (MaCFE). GEMA Online Journal of Language Studies, 18 (3). pp. 73-100. ISSN 1675-8021 https://ejournal.ukm.my/gema/issue/view/1098