Infrequent Item mining in multiple data streams

The problem of extracting infrequent patterns from streams and building associations between these patterns is becoming increasingly relevant today as many events of interest such as attacks in network data or unusual stories in news data occur rarely. The complexity of the problem is compounded whe...

Full description

Bibliographic Details
Main Authors: Saha, Budhaditya, Lazarescu, Mihai, Venkatesh, Svetha
Other Authors: IEEE computer society
Format: Conference Paper
Published: IEEE computer society publishing services 2007
Online Access:http://hdl.handle.net/20.500.11937/8427
Description
Summary:The problem of extracting infrequent patterns from streams and building associations between these patterns is becoming increasingly relevant today as many events of interest such as attacks in network data or unusual stories in news data occur rarely. The complexity of the problem is compounded when a system is required to deal with data from multiple streams. To address these problems, we present a framework that combines the time based association mining with a pyramidal structure that allows a rolling analysis of the stream and maintains a synopsis of the data without requiring increasing memory resources. We apply the algorithms and show the usefulness of the techniques.