Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet

This study presents a sentence-to-sentence semantic relatedness measures for paraphrase detection. The proposed measures adopt the shortest path between synsets in WordNet as the core to measure the relatedness between two sentences. The interlinked synsets in WordNet are based on the conceptual-sem...

Full description

Bibliographic Details
Main Authors: Lee, Jun Choi, Cheah, Yu-N
Format: Proceeding
Language:English
Published: 2016
Subjects:
Online Access:http://ir.unimas.my/id/eprint/13239/
http://ir.unimas.my/id/eprint/13239/7/Paraphrase%20Detection%20using%20Semantic%20Relatedness%20%28abstract%29.pdf
_version_ 1848837364824145920
author Lee, Jun Choi
Cheah, Yu-N
author_facet Lee, Jun Choi
Cheah, Yu-N
author_sort Lee, Jun Choi
building UNIMAS Institutional Repository
collection Online Access
description This study presents a sentence-to-sentence semantic relatedness measures for paraphrase detection. The proposed measures adopt the shortest path between synsets in WordNet as the core to measure the relatedness between two sentences. The interlinked synsets in WordNet are based on the conceptual-semantic relation between two synsets. Thus the distance between two synsets in WordNet can be used to measure the semantic relatedness between two synsets. This study derived a sentence-level semantic relatedness using this feature to detect paraphrasing among sentences. The performance of the proposed semantic relatedness in paraphrasing is evaluated based on the accuracy and F-measures of the proposed measures in identifying paraphrase in Microsoft Research Paraphrase Corpus. The proposed method achieved 71.1% in accuracy and 81.8% in F-measures. The performance of the proposed method is compared with 6 paraphrase detection methods which include Salient Semantic Analysis and Second-order Co-occurrence Pointwise Mutual Information. In the comparison, the proposed method achieved the fourth highest accuracy and the second highest F-measure compare to other methods. This is a reasonable performance for the proposed semantic relatedness in paraphrase detection. Keywords— paraphrase detection, semantic relatedness, WordNet, synset shortest path.
first_indexed 2025-11-15T06:38:29Z
format Proceeding
id unimas-13239
institution Universiti Malaysia Sarawak
institution_category Local University
language English
last_indexed 2025-11-15T06:38:29Z
publishDate 2016
recordtype eprints
repository_type Digital Repository
spelling unimas-132392017-02-17T08:18:46Z http://ir.unimas.my/id/eprint/13239/ Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet Lee, Jun Choi Cheah, Yu-N QA75 Electronic computers. Computer science This study presents a sentence-to-sentence semantic relatedness measures for paraphrase detection. The proposed measures adopt the shortest path between synsets in WordNet as the core to measure the relatedness between two sentences. The interlinked synsets in WordNet are based on the conceptual-semantic relation between two synsets. Thus the distance between two synsets in WordNet can be used to measure the semantic relatedness between two synsets. This study derived a sentence-level semantic relatedness using this feature to detect paraphrasing among sentences. The performance of the proposed semantic relatedness in paraphrasing is evaluated based on the accuracy and F-measures of the proposed measures in identifying paraphrase in Microsoft Research Paraphrase Corpus. The proposed method achieved 71.1% in accuracy and 81.8% in F-measures. The performance of the proposed method is compared with 6 paraphrase detection methods which include Salient Semantic Analysis and Second-order Co-occurrence Pointwise Mutual Information. In the comparison, the proposed method achieved the fourth highest accuracy and the second highest F-measure compare to other methods. This is a reasonable performance for the proposed semantic relatedness in paraphrase detection. Keywords— paraphrase detection, semantic relatedness, WordNet, synset shortest path. 2016-08-17 Proceeding PeerReviewed text en http://ir.unimas.my/id/eprint/13239/7/Paraphrase%20Detection%20using%20Semantic%20Relatedness%20%28abstract%29.pdf Lee, Jun Choi and Cheah, Yu-N (2016) Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet. In: International Conference on Advanced Informatics: Concepts, Theory and Applications, 16-17 August 2016, Parkroyal Penang Resort. DOI: 10.1109/ICAICTA.2016.7803127
spellingShingle QA75 Electronic computers. Computer science
Lee, Jun Choi
Cheah, Yu-N
Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet
title Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet
title_full Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet
title_fullStr Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet
title_full_unstemmed Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet
title_short Paraphrase Detection using Semantic Relatedness based on Synset Shortest Path in WordNet
title_sort paraphrase detection using semantic relatedness based on synset shortest path in wordnet
topic QA75 Electronic computers. Computer science
url http://ir.unimas.my/id/eprint/13239/
http://ir.unimas.my/id/eprint/13239/
http://ir.unimas.my/id/eprint/13239/7/Paraphrase%20Detection%20using%20Semantic%20Relatedness%20%28abstract%29.pdf