Enhancing Match Detection Process Using Chi-Square Equation for Improving Type-3 and Type-4 Clones in Java Applications

Generic Code Clone Detection (GCCD) is a code clone detection model that use distance measure equation, enabling detection of all types of code clones, naming clone Type-1, Type-2, Type-3 and Type-4 in Java programming language applications. However, the detection process of GCCD did not focus on de...

Full description

Bibliographic Details
Main Authors: Noormaizzattul Akmaliza, Abdullah, Al Fahim, Mubarak Ali, Mohd Azwan, Mohamad Hamza, Siti Salwani, Yaacob
Format: Article
Language:English
Published: The Science and Information (SAI) Organization Limited 2025
Subjects:
Online Access:https://umpir.ump.edu.my/id/eprint/45327/
Description
Summary:Generic Code Clone Detection (GCCD) is a code clone detection model that use distance measure equation, enabling detection of all types of code clones, naming clone Type-1, Type-2, Type-3 and Type-4 in Java programming language applications. However, the detection process of GCCD did not focus on detecting clones of Type-3 and Type-4. Hence, this paper suggested two experiments to incorporate enhancements to the GCCD in order to improve the detection rate of clone Type-3 and clone Type-4. The implementation of Chi-square distance in the match detection process produced a significant result increase in the experiment specifically on clones Type-3 and Type-4, in comparison with the Euclidean distance in GCCD, which allows the increase of detection rate due to the dissimilarity of the distance measures. Based on the results, the suggested enhancement using Chi-square distance on match detection process outperforms GCCD in terms of improving code clone detection results based on clone Type-3 and Type-4, as the objectives for each experiment are carried, contributes to the research on improving the code clone detection result.