A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports

In the last few years, text mining have become the area of interests in Natural Language Processing (NLP). They share a similar idea i.e. to extract important facts from unstructured text which later help to populate database entries. Name Entity Recognition (NER) is one of the main task needed to d...

Full description

Bibliographic Details
Main Authors: Sari, Y., Hassan, M.F., Zamin, N.
Format: Conference or Workshop Item
Published: 2009
Subjects:
Online Access:http://scholars.utp.edu.my/id/eprint/1777/
_version_ 1848659168261570560
author Sari, Y.
Hassan, M.F.
Zamin, N.
author_facet Sari, Y.
Hassan, M.F.
Zamin, N.
author_sort Sari, Y.
building UTP Institutional Repository
collection Online Access
description In the last few years, text mining have become the area of interests in Natural Language Processing (NLP). They share a similar idea i.e. to extract important facts from unstructured text which later help to populate database entries. Name Entity Recognition (NER) is one of the main task needed to develop text mining systems in which it is used to identify and classify entities in the text into predefined categories such as the names of persons, organizations, locations, dates, times, quantities, monetary values, percentages, etc. This paper focuses on studying the optimum solution to perform NER. To achieve our target, Health Safety and Environment (HSE) reports available from the Universiti Teknologi PETRONAS (UTP) are chosen as the case study. The UTP's HSE reports are the investigation reports which contain the information on incidents and accidents occurred during the daily operations. Many algorithms have been reported for NER ranging from simple statistical methods to advanced Natural language Processing (NLP) methods. This paper describes the possibility to apply Link Grammar (LG) and Basilisk Algorithm in NER.
first_indexed 2025-11-13T07:26:08Z
format Conference or Workshop Item
id oai:scholars.utp.edu.my:1777
institution Universiti Teknologi Petronas
institution_category Local University
last_indexed 2025-11-13T07:26:08Z
publishDate 2009
recordtype eprints
repository_type Digital Repository
spelling oai:scholars.utp.edu.my:17772010-05-09T17:16:09Z http://scholars.utp.edu.my/id/eprint/1777/ A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports Sari, Y. Hassan, M.F. Zamin, N. QA75 Electronic computers. Computer science QA76 Computer software In the last few years, text mining have become the area of interests in Natural Language Processing (NLP). They share a similar idea i.e. to extract important facts from unstructured text which later help to populate database entries. Name Entity Recognition (NER) is one of the main task needed to develop text mining systems in which it is used to identify and classify entities in the text into predefined categories such as the names of persons, organizations, locations, dates, times, quantities, monetary values, percentages, etc. This paper focuses on studying the optimum solution to perform NER. To achieve our target, Health Safety and Environment (HSE) reports available from the Universiti Teknologi PETRONAS (UTP) are chosen as the case study. The UTP's HSE reports are the investigation reports which contain the information on incidents and accidents occurred during the daily operations. Many algorithms have been reported for NER ranging from simple statistical methods to advanced Natural language Processing (NLP) methods. This paper describes the possibility to apply Link Grammar (LG) and Basilisk Algorithm in NER. 2009-04 Conference or Workshop Item PeerReviewed Sari, Y. and Hassan, M.F. and Zamin, N. (2009) A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports. In: International Conference on Future Computer and Communication (ICFCC 2009), 3-5 April 2009, Kuala Lumpur. http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=05189853
spellingShingle QA75 Electronic computers. Computer science
QA76 Computer software
Sari, Y.
Hassan, M.F.
Zamin, N.
A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports
title A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports
title_full A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports
title_fullStr A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports
title_full_unstemmed A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports
title_short A hybrid approach to semi-supervised named entity recognition in health, safety and environment reports
title_sort hybrid approach to semi-supervised named entity recognition in health, safety and environment reports
topic QA75 Electronic computers. Computer science
QA76 Computer software
url http://scholars.utp.edu.my/id/eprint/1777/
http://scholars.utp.edu.my/id/eprint/1777/