Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance

Named Entity Recognition (NER) is a crucial component in various domains, including medical and financial fields, as it helps identify text fragments belonging to predefined categories from unstructured text. Over time, NER algorithms have evolved from dictionary-based approaches to machine learning...

Full description

Bibliographic Details
Main Authors: Saidah Saad, Zikun, Huang
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2024
Online Access:http://journalarticle.ukm.my/23992/
http://journalarticle.ukm.my/23992/1/47%20%E2%80%93%2060.pdf
_version_ 1848815986835193856
author Saidah Saad,
Zikun, Huang
author_facet Saidah Saad,
Zikun, Huang
author_sort Saidah Saad,
building UKM Institutional Repository
collection Online Access
description Named Entity Recognition (NER) is a crucial component in various domains, including medical and financial fields, as it helps identify text fragments belonging to predefined categories from unstructured text. Over time, NER algorithms have evolved from dictionary-based approaches to machine learning and deep learning techniques. Transfer learning, a novel deep learning method, has shown impressive results in NER tasks. However, transfer learning models still face challenges, such as limited entity labels and the impact of noisy datasets. To address these challenges, this research aims to optimise the application of deep learning models for NER and achieve enhanced results. The research initially applied the BERT+CRF model to the WanChuang dataset, resulting in an F1-measure of 89.1%. This established the feasibility of using transfer learning models for NER on Chinese medical data and served as a baseline for comparison in the project. To address label-related issues in the baseline model, a scheme was proposed to improve the learning rate of the CRF layer, resulting in an increased F1 measure of 91.0%. Additionally, to mitigate the impact of noisy training data, a 10-fold retraining scheme was introduced to optimise the training set. By retraining the model using the optimised training set, an optimal F1 measure of 92.7% was achieved. The experiments demonstrated that the transfer learning model enhances NER entity extraction capabilities while the optimised CRF layer effectively captures the internal relationships of entity tags, thus improving overall performance. This research contributes to advancing NER techniques and their application in various domains.
first_indexed 2025-11-15T00:58:42Z
format Article
id oai:generic.eprints.org:23992
institution Universiti Kebangasaan Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T00:58:42Z
publishDate 2024
publisher Penerbit Universiti Kebangsaan Malaysia
recordtype eprints
repository_type Digital Repository
spelling oai:generic.eprints.org:239922024-08-12T03:26:57Z http://journalarticle.ukm.my/23992/ Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance Saidah Saad, Zikun, Huang Named Entity Recognition (NER) is a crucial component in various domains, including medical and financial fields, as it helps identify text fragments belonging to predefined categories from unstructured text. Over time, NER algorithms have evolved from dictionary-based approaches to machine learning and deep learning techniques. Transfer learning, a novel deep learning method, has shown impressive results in NER tasks. However, transfer learning models still face challenges, such as limited entity labels and the impact of noisy datasets. To address these challenges, this research aims to optimise the application of deep learning models for NER and achieve enhanced results. The research initially applied the BERT+CRF model to the WanChuang dataset, resulting in an F1-measure of 89.1%. This established the feasibility of using transfer learning models for NER on Chinese medical data and served as a baseline for comparison in the project. To address label-related issues in the baseline model, a scheme was proposed to improve the learning rate of the CRF layer, resulting in an increased F1 measure of 91.0%. Additionally, to mitigate the impact of noisy training data, a 10-fold retraining scheme was introduced to optimise the training set. By retraining the model using the optimised training set, an optimal F1 measure of 92.7% was achieved. The experiments demonstrated that the transfer learning model enhances NER entity extraction capabilities while the optimised CRF layer effectively captures the internal relationships of entity tags, thus improving overall performance. This research contributes to advancing NER techniques and their application in various domains. Penerbit Universiti Kebangsaan Malaysia 2024-06-01 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/23992/1/47%20%E2%80%93%2060.pdf Saidah Saad, and Zikun, Huang (2024) Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance. Asia-Pacific Journal of Information Technology and Multimedia, 13 (1). pp. 47-60. ISSN 2289-2192 https://www.ukm.my/apjitm
spellingShingle Saidah Saad,
Zikun, Huang
Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance
title Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance
title_full Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance
title_fullStr Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance
title_full_unstemmed Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance
title_short Leveraging transfer learning and label optimization for enhanced traditional Chinese medicine ner performance
title_sort leveraging transfer learning and label optimization for enhanced traditional chinese medicine ner performance
url http://journalarticle.ukm.my/23992/
http://journalarticle.ukm.my/23992/
http://journalarticle.ukm.my/23992/1/47%20%E2%80%93%2060.pdf