MFI-transSW+: Efficiently mining frequent itemsets in clickstreams

Franklin A. De Amorim, Bernardo Pereira Nunes*, Giseli Rabello Lopes, Marco A. Casanova

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Data stream mining is the process of extracting knowledge from massive real-time sequence of data items arriving at a very high data rate. It has several practical applications, such as user behavior analysis, software testing and market research. However, the large amount of data generated may offer challenges to process and analyze data at nearly real time. In this paper, we first present the MFI-TransSW+ algorithm, an optimized version of MFI-TransSW algorithm that efficiently processes clickstreams, that is, data streams where the data items are the pages of a Web site. Then, we outline the implementation of a news articles recommender system, called ClickRec, to demonstrate the efficiency and applicability of the proposed algorithm. Finally, we describe experiments, conducted with real world data, which show that MFI-TransSW+ outperforms the original algorithm, being up to two orders of magnitude faster when processing clickstreams.

Original languageEnglish
Title of host publicationE-Commerce and Web Technologies - 17th International Conference, EC-Web 2016, Revised Selected Papers
EditorsDerek Bridge, Heiner Stuckenschmidt
PublisherSpringer Verlag
Pages87-99
Number of pages13
ISBN (Print)9783319536750
DOIs
Publication statusPublished - 2017
Externally publishedYes
Event17th International Conference on E-Commerce and Web Technologies, EC-Web 2016 - Porto, Portugal
Duration: 5 Sept 20168 Sept 2016

Publication series

NameLecture Notes in Business Information Processing
Volume278
ISSN (Print)1865-1348

Conference

Conference17th International Conference on E-Commerce and Web Technologies, EC-Web 2016
Country/TerritoryPortugal
CityPorto
Period5/09/168/09/16

Fingerprint

Dive into the research topics of 'MFI-transSW+: Efficiently mining frequent itemsets in clickstreams'. Together they form a unique fingerprint.

Cite this