A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media
Text processing has been playing a great role in information retrieval to solve the problem of ambiguity in natural language processing, e.g., internet search, data mining, and social media. In semantic similarity, it will be used to analyze the relationships between Word-Pairs on social media. Orga...
| Main Authors: | , , , |
|---|---|
| Format: | Conference or Workshop Item |
| Language: | English |
| Published: |
Springer
2020
|
| Subjects: | |
| Online Access: | http://umpir.ump.edu.my/id/eprint/28455/ http://umpir.ump.edu.my/id/eprint/28455/1/A%20Review%20of%20Recent%20Trends.pdf |
| _version_ | 1848823058431737856 |
|---|---|
| author | Hasan, Ali Muttaleb Rassem, Taha H. Noorhuzaimi@Karimah, Mohd Noor Hasan, Ahmed Muttaleb |
| author_facet | Hasan, Ali Muttaleb Rassem, Taha H. Noorhuzaimi@Karimah, Mohd Noor Hasan, Ahmed Muttaleb |
| author_sort | Hasan, Ali Muttaleb |
| building | UMP Institutional Repository |
| collection | Online Access |
| description | Text processing has been playing a great role in information retrieval to solve the problem of ambiguity in natural language processing, e.g., internet search, data mining, and social media. In semantic similarity, it will be used to analyze the relationships between Word-Pairs on social media. Organizing a huge number of unstructured text documents into a small number of concepts of word sense disambiguation is essential so that the lexical source could incorporate the features for capturing more semantic evidence. Text mining involves the pre-processing of documents collections, text categorization and classification, and extracting information and terms from golden standard data sets. This work proposed the lexical sourced from the semantic representation. The paper contained an evaluation of the advanced measures, which include shortest path, depth, and information content measures. In this paper, we used the same set of measures as previous studies, but different methods such as taxonomy on social media by semantic similarities, such as Synonymy (https://github.com/alimuttaleb/Ali-Muttaleb/blob/master/Synonym.txt), Non-taxonomy, Hypernym, and Glosses. This paper has focused to address the synonymy and ambiguity by incorporating the knowledge in the lexical resources. Thus, each word in a document is linked to its corresponding concept in the lexical resources. To build the semantic representation, these approaches can be classified into two main approaches: knowledge-based and statistical approaches. The knowledge-based approaches depend on structured information that is normally available in forms of dictionaries, thesaurus, lexicons, WordNet 3.1, and ontologies. The statistical approaches are based on finding the semantic relations among words using the frequencies of words in a given corpus. |
| first_indexed | 2025-11-15T02:51:06Z |
| format | Conference or Workshop Item |
| id | ump-28455 |
| institution | Universiti Malaysia Pahang |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-15T02:51:06Z |
| publishDate | 2020 |
| publisher | Springer |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | ump-284552021-01-26T03:36:20Z http://umpir.ump.edu.my/id/eprint/28455/ A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media Hasan, Ali Muttaleb Rassem, Taha H. Noorhuzaimi@Karimah, Mohd Noor Hasan, Ahmed Muttaleb QA75 Electronic computers. Computer science QA76 Computer software Text processing has been playing a great role in information retrieval to solve the problem of ambiguity in natural language processing, e.g., internet search, data mining, and social media. In semantic similarity, it will be used to analyze the relationships between Word-Pairs on social media. Organizing a huge number of unstructured text documents into a small number of concepts of word sense disambiguation is essential so that the lexical source could incorporate the features for capturing more semantic evidence. Text mining involves the pre-processing of documents collections, text categorization and classification, and extracting information and terms from golden standard data sets. This work proposed the lexical sourced from the semantic representation. The paper contained an evaluation of the advanced measures, which include shortest path, depth, and information content measures. In this paper, we used the same set of measures as previous studies, but different methods such as taxonomy on social media by semantic similarities, such as Synonymy (https://github.com/alimuttaleb/Ali-Muttaleb/blob/master/Synonym.txt), Non-taxonomy, Hypernym, and Glosses. This paper has focused to address the synonymy and ambiguity by incorporating the knowledge in the lexical resources. Thus, each word in a document is linked to its corresponding concept in the lexical resources. To build the semantic representation, these approaches can be classified into two main approaches: knowledge-based and statistical approaches. The knowledge-based approaches depend on structured information that is normally available in forms of dictionaries, thesaurus, lexicons, WordNet 3.1, and ontologies. The statistical approaches are based on finding the semantic relations among words using the frequencies of words in a given corpus. Springer 2020-03-15 Conference or Workshop Item PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/28455/1/A%20Review%20of%20Recent%20Trends.pdf Hasan, Ali Muttaleb and Rassem, Taha H. and Noorhuzaimi@Karimah, Mohd Noor and Hasan, Ahmed Muttaleb (2020) A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media. In: Intelligent Computing and Innovation on Data Science: Proceedings of ICTIDS 2019 , 11-12 October 2019 , Kuala Lumpur, Malaysia. pp. 137-152., 118. ISBN 978-981-15-3284-9 (Published) https://doi.org/10.1007/978-981-15-3284-9_17 https://doi.org/10.1007/978-981-15-3284-9_17 |
| spellingShingle | QA75 Electronic computers. Computer science QA76 Computer software Hasan, Ali Muttaleb Rassem, Taha H. Noorhuzaimi@Karimah, Mohd Noor Hasan, Ahmed Muttaleb A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media |
| title | A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media |
| title_full | A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media |
| title_fullStr | A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media |
| title_full_unstemmed | A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media |
| title_short | A Review of Recent Trends: Text Mining of Taxonomy Using WordNet 3.1 for the Solution and Problems of Ambiguity in Social Media |
| title_sort | review of recent trends: text mining of taxonomy using wordnet 3.1 for the solution and problems of ambiguity in social media |
| topic | QA75 Electronic computers. Computer science QA76 Computer software |
| url | http://umpir.ump.edu.my/id/eprint/28455/ http://umpir.ump.edu.my/id/eprint/28455/ http://umpir.ump.edu.my/id/eprint/28455/ http://umpir.ump.edu.my/id/eprint/28455/1/A%20Review%20of%20Recent%20Trends.pdf |