A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media

Text processing has been playing a great role in information retrieval to solve the problem of ambiguity in natural language processing, e.g., internet search, data mining, and social media. In semantic similarity, it will be used to analyze the relationships between Word-Pairs on social media. Orga...

Full description

Bibliographic Details
Main Authors: Hasan, Ali Muttaleb, Rassem, Taha H., Noorhuzaimi@Karimah, Mohd Noor, Hasan, Ahmed Muttaleb
Format: Conference or Workshop Item
Language:English
Published: Springer 2020
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/28455/
http://umpir.ump.edu.my/id/eprint/28455/1/A%20Review%20of%20Recent%20Trends.pdf
_version_ 1848823058431737856
author Hasan, Ali Muttaleb
Rassem, Taha H.
Noorhuzaimi@Karimah, Mohd Noor
Hasan, Ahmed Muttaleb
author_facet Hasan, Ali Muttaleb
Rassem, Taha H.
Noorhuzaimi@Karimah, Mohd Noor
Hasan, Ahmed Muttaleb
author_sort Hasan, Ali Muttaleb
building UMP Institutional Repository
collection Online Access
description Text processing has been playing a great role in information retrieval to solve the problem of ambiguity in natural language processing, e.g., internet search, data mining, and social media. In semantic similarity, it will be used to analyze the relationships between Word-Pairs on social media. Organizing a huge number of unstructured text documents into a small number of concepts of word sense disambiguation is essential so that the lexical source could incorporate the features for capturing more semantic evidence. Text mining involves the pre-processing of documents collections, text categorization and classification, and extracting information and terms from golden standard data sets. This work proposed the lexical sourced from the semantic representation. The paper contained an evaluation of the advanced measures, which include shortest path, depth, and information content measures. In this paper, we used the same set of measures as previous studies, but different methods such as taxonomy on social media by semantic similarities, such as Synonymy (https://github.com/alimuttaleb/Ali-Muttaleb/blob/master/Synonym.txt), Non-taxonomy, Hypernym, and Glosses. This paper has focused to address the synonymy and ambiguity by incorporating the knowledge in the lexical resources. Thus, each word in a document is linked to its corresponding concept in the lexical resources. To build the semantic representation, these approaches can be classified into two main approaches: knowledge-based and statistical approaches. The knowledge-based approaches depend on structured information that is normally available in forms of dictionaries, thesaurus, lexicons, WordNet 3.1, and ontologies. The statistical approaches are based on finding the semantic relations among words using the frequencies of words in a given corpus.
first_indexed 2025-11-15T02:51:06Z
format Conference or Workshop Item
id ump-28455
institution Universiti Malaysia Pahang
institution_category Local University
language English
last_indexed 2025-11-15T02:51:06Z
publishDate 2020
publisher Springer
recordtype eprints
repository_type Digital Repository
spelling ump-284552021-01-26T03:36:20Z http://umpir.ump.edu.my/id/eprint/28455/ A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media Hasan, Ali Muttaleb Rassem, Taha H. Noorhuzaimi@Karimah, Mohd Noor Hasan, Ahmed Muttaleb QA75 Electronic computers. Computer science QA76 Computer software Text processing has been playing a great role in information retrieval to solve the problem of ambiguity in natural language processing, e.g., internet search, data mining, and social media. In semantic similarity, it will be used to analyze the relationships between Word-Pairs on social media. Organizing a huge number of unstructured text documents into a small number of concepts of word sense disambiguation is essential so that the lexical source could incorporate the features for capturing more semantic evidence. Text mining involves the pre-processing of documents collections, text categorization and classification, and extracting information and terms from golden standard data sets. This work proposed the lexical sourced from the semantic representation. The paper contained an evaluation of the advanced measures, which include shortest path, depth, and information content measures. In this paper, we used the same set of measures as previous studies, but different methods such as taxonomy on social media by semantic similarities, such as Synonymy (https://github.com/alimuttaleb/Ali-Muttaleb/blob/master/Synonym.txt), Non-taxonomy, Hypernym, and Glosses. This paper has focused to address the synonymy and ambiguity by incorporating the knowledge in the lexical resources. Thus, each word in a document is linked to its corresponding concept in the lexical resources. To build the semantic representation, these approaches can be classified into two main approaches: knowledge-based and statistical approaches. The knowledge-based approaches depend on structured information that is normally available in forms of dictionaries, thesaurus, lexicons, WordNet 3.1, and ontologies. The statistical approaches are based on finding the semantic relations among words using the frequencies of words in a given corpus. Springer 2020-03-15 Conference or Workshop Item PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/28455/1/A%20Review%20of%20Recent%20Trends.pdf Hasan, Ali Muttaleb and Rassem, Taha H. and Noorhuzaimi@Karimah, Mohd Noor and Hasan, Ahmed Muttaleb (2020) A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media. In: Intelligent Computing and Innovation on Data Science: Proceedings of ICTIDS 2019 , 11-12 October 2019 , Kuala Lumpur, Malaysia. pp. 137-152., 118. ISBN 978-981-15-3284-9 (Published) https://doi.org/10.1007/978-981-15-3284-9_17 https://doi.org/10.1007/978-981-15-3284-9_17
spellingShingle QA75 Electronic computers. Computer science
QA76 Computer software
Hasan, Ali Muttaleb
Rassem, Taha H.
Noorhuzaimi@Karimah, Mohd Noor
Hasan, Ahmed Muttaleb
A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media
title A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media
title_full A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media
title_fullStr A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media
title_full_unstemmed A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media
title_short A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media
title_sort review of recent trends: text mining of taxonomy using wordnet 3.1 for the solution and problems of ambiguity in social media
topic QA75 Electronic computers. Computer science
QA76 Computer software
url http://umpir.ump.edu.my/id/eprint/28455/
http://umpir.ump.edu.my/id/eprint/28455/
http://umpir.ump.edu.my/id/eprint/28455/
http://umpir.ump.edu.my/id/eprint/28455/1/A%20Review%20of%20Recent%20Trends.pdf