Indexed by:
Abstract:
Research on the anonymization of static data has made great progress in recent years. Generalization and suppression are two common technologies for quasi-identifiers' anonymization. However, the characteristics of data streams, such as potential infinity and high dynamicity, make the anonymization of data streams different from the anonymization of static data. The methods for static data anonymization cannot be directly applied to anonymizing data streams. In this paper, a novel k-anonymization approach for data streams based on clustering is proposed. In order to speed up the anonymization process and reduce the information loss, the new approach scans a stream in one turn to recognize and reuse the clusters satisfying the k-anonymity principle. The time constraints on tuple publication and cluster reuse, which are specific to data streams, are considered as well. Furthermore, the approach is improved to conform to the l-diversity principle. The experiments conducted on the real datasets show that the proposed methods are both efficient and effective. (C) 2013 Elsevier B.V. All rights reserved.
Keyword:
Reprint 's Address:
Email:
Version:
Source :
KNOWLEDGE-BASED SYSTEMS
ISSN: 0950-7051
Year: 2013
Volume: 46
Page: 95-108
3 . 0 5 8
JCR@2013
7 . 2 0 0
JCR@2023
ESI Discipline: COMPUTER SCIENCE;
JCR Journal Grade:1
CAS Journal Grade:2
Cited Count:
WoS CC Cited Count: 51
SCOPUS Cited Count: 69
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 0