Sequential pattern mining using PrefixSpan with pseudoprojection and separator database

Sequential pattern mining is a new branch of data mining science that solves inter-transaction pattern mining problems. A comprehensive performance study has been reported that PrefixSpan, one of its algorithms, outperforms GSP, SPADE, as well as FreeSpan in most cases, and PrefixSpan integrated wit...

Full description

Bibliographic Details
Main Authors: Saputra, D., Rambli, D.R.A., Foong, Oi Mean
Format: Conference or Workshop Item
Language:English
Published: 2008
Subjects:
Online Access:http://scholars.utp.edu.my/id/eprint/222/
http://scholars.utp.edu.my/id/eprint/222/1/paper.pdf
_version_ 1848658934590603264
author Saputra, D.
Rambli, D.R.A.
Foong, Oi Mean
author_facet Saputra, D.
Rambli, D.R.A.
Foong, Oi Mean
author_sort Saputra, D.
building UTP Institutional Repository
collection Online Access
description Sequential pattern mining is a new branch of data mining science that solves inter-transaction pattern mining problems. A comprehensive performance study has been reported that PrefixSpan, one of its algorithms, outperforms GSP, SPADE, as well as FreeSpan in most cases, and PrefixSpan integrated with pseudoprojection technique is the fastest among those tested algorithms. Nevertheless, Pseudoprojection technique, which requires maintaining and visiting the in-memory sequence database frequently until all patterns are found, consumes a considerable amount of memory and induces the algorithm to undertake redundant and unnecessary checks to this copy of original database into memory when the candidate patterns are examined. In this paper, we propose Separator Database to improve PrefixSpan with pseudoprojection through early removal of uneconomical in-memory sequence database. The experimental results show that Separator Database improves PrefixSpan with pseudoprojection. Future research includes exploring the use of Separator Database in PrefixSpan with pseudoprojection to improve mining constrained sequential patterns. © 2008 IEEE.
first_indexed 2025-11-13T07:22:25Z
format Conference or Workshop Item
id oai:scholars.utp.edu.my:222
institution Universiti Teknologi Petronas
institution_category Local University
language English
last_indexed 2025-11-13T07:22:25Z
publishDate 2008
recordtype eprints
repository_type Digital Repository
spelling oai:scholars.utp.edu.my:2222017-01-19T08:26:44Z http://scholars.utp.edu.my/id/eprint/222/ Sequential pattern mining using PrefixSpan with pseudoprojection and separator database Saputra, D. Rambli, D.R.A. Foong, Oi Mean QA75 Electronic computers. Computer science Sequential pattern mining is a new branch of data mining science that solves inter-transaction pattern mining problems. A comprehensive performance study has been reported that PrefixSpan, one of its algorithms, outperforms GSP, SPADE, as well as FreeSpan in most cases, and PrefixSpan integrated with pseudoprojection technique is the fastest among those tested algorithms. Nevertheless, Pseudoprojection technique, which requires maintaining and visiting the in-memory sequence database frequently until all patterns are found, consumes a considerable amount of memory and induces the algorithm to undertake redundant and unnecessary checks to this copy of original database into memory when the candidate patterns are examined. In this paper, we propose Separator Database to improve PrefixSpan with pseudoprojection through early removal of uneconomical in-memory sequence database. The experimental results show that Separator Database improves PrefixSpan with pseudoprojection. Future research includes exploring the use of Separator Database in PrefixSpan with pseudoprojection to improve mining constrained sequential patterns. © 2008 IEEE. 2008 Conference or Workshop Item NonPeerReviewed application/pdf en http://scholars.utp.edu.my/id/eprint/222/1/paper.pdf Saputra, D. and Rambli, D.R.A. and Foong, Oi Mean (2008) Sequential pattern mining using PrefixSpan with pseudoprojection and separator database. In: International Symposium on Information Technology 2008, ITSim, 26 August 2008 through 29 August 2008, Kuala Lumpur.
spellingShingle QA75 Electronic computers. Computer science
Saputra, D.
Rambli, D.R.A.
Foong, Oi Mean
Sequential pattern mining using PrefixSpan with pseudoprojection and separator database
title Sequential pattern mining using PrefixSpan with pseudoprojection and separator database
title_full Sequential pattern mining using PrefixSpan with pseudoprojection and separator database
title_fullStr Sequential pattern mining using PrefixSpan with pseudoprojection and separator database
title_full_unstemmed Sequential pattern mining using PrefixSpan with pseudoprojection and separator database
title_short Sequential pattern mining using PrefixSpan with pseudoprojection and separator database
title_sort sequential pattern mining using prefixspan with pseudoprojection and separator database
topic QA75 Electronic computers. Computer science
url http://scholars.utp.edu.my/id/eprint/222/
http://scholars.utp.edu.my/id/eprint/222/1/paper.pdf