文章詳目資料

Journal of Computers EIMEDLINEScopus

  • 加入收藏
  • 下載文章
篇名 Approximate Data Mining for Sliding Window Based Data Streams
卷期 23:2
作者 Kuo-Cheng YinYu-Lung HsiehDon-Lin Yang
頁次 001-013
關鍵字 association ruleFP-treedata streamsliding windowapproximate miningEIMEDLINEScopus
出刊日期 201207

中文摘要

英文摘要

Abstract. In the sliding window model of continuous dynamic data streams, the real-time process and update is an important issue for association rule mining. The existing researches deal with the problem by using specific data structures to retain the scanned data. However, if the next window slot contains any new frequent items, all the data must be rescanned to generate itemsets containing the new frequents. It is prohibitive to read the data twice for time-critical mining of continuous data streams. In order to meet the requirement of scanning data only one time, we propose a new approximate data stream mining algorithm (ADSMiner) using an extended FP-tree (EFP-tree) to save the current frequent-patterns. The EFP-tree not only records the frequent itemsets, but also keeps the counts of each itemset in the panes. If any new 1-itemset becomes frequent after the old data is replaced by the new data, there is no need to re-read the data. Instead, it is just added to the EFP-tree. When the order of the frequent 1-itemsets sequence changes, we use the Longest Common Subsequence method to locate the nodes requiring adjustment and maintain the structure of EFP tree efficiently. The results of experiment show that our approach performs well as we expected on various datasets.

相關文獻