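Journal of Shaanxi Normal University (Natural Science Edition)
Mathematics and Computer Science
An adaptive trajectory data publishing algorithm based on the differential privacy mechanism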
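ZHANG Shuangyue, TIAN Feng, WU Zhenqiang*
(School of Computer Science, Shaanxi Normal University, Xi'an 710119, Shaanxi, China)
WU Zhenqiang, male, professor. His main research interests are information security and computer networks. E-mail: zqiangwu@snnu.edu.cn
Abstract:
Publishing trajectory data can provide strong support for urban planning by government departments and for decision-making by commercial organizations, but it carries a serious risk of privacy leakage. Building on existing trajectory publishing techniques based on the differential privacy mechanism, this paper proposes the AC_TFIDF algorithm, which uses TF-IDF statistics as a reference indicator. The algorithm satisfies the definition of differential privacy and can dynamically determine the degree of generalization at different timestamps of a trajectory. During generalization, each cluster center is replaced by the valid point nearest to it, which further improves the utility of the published data. Verification and analysis on a real dataset show that the algorithm achieves good utility.
Keywords:
privacy preserving; trajectory data; data publication; differential privacy
Received:
2017-08-23
CLC number:
TP309.2
Document code:
A
Article ID:
1672-4291(2018)05-0009-07
Funding:
National Natural Science Foundation of China (61602290, 61173190); Fundamental Research Funds for the Central Universities (GK201501008, GK201603093); Natural Science Basic Research Plan of Shaanxi Province (2017JQ6038)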
An adaptive trajectory data publishing algorithm based on differential privacy
ZHANG Shuangyue, TIAN Feng, WU Zhenqiang*
(School of Computer Science, Shaanxi Normal University, Xi'an 710119, Shaanxi, China)
Abstract:
Releasing trajectory data can provide strong support for urban planning by government departments and for decision-making by commercial organizations, but it carries a serious risk of privacy disclosure. Building on existing differentially private trajectory publishing techniques, the AC_TFIDF algorithm is proposed, which uses TF-IDF statistics as a reference indicator. The algorithm follows the definition of differential privacy and can dynamically determine the degree of generalization at different moments in the trajectory. In the generalization process, each cluster center is replaced by the valid point nearest to it, to further improve the utility of the published data. Analysis and validation on a real dataset indicate that the proposed algorithm achieves good utility.
Keywords:
privacy preserving; trajectory data; data publication; differential privacy
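
The abstract's generalization step, in which a cluster center is replaced by the nearest valid point, can be illustrated with a minimal sketch. It is not the paper's AC_TFIDF implementation: the function names, the use of an arithmetic-mean centroid, and the Euclidean distance are all assumptions made for illustration, and the TF-IDF-driven choice of generalization level and the differential privacy noise are not shown.

```python
import math

# Hypothetical helpers for illustration only; not taken from the paper.

def centroid(points):
    """Arithmetic mean of a non-empty list of (x, y) points (assumed center)."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)

def nearest_real_point(points, center):
    """Return the member of `points` closest to `center` (Euclidean distance)."""
    return min(points, key=lambda p: math.dist(p, center))

def generalize_cluster(points):
    """Map every point in a cluster to the real point nearest the centroid.

    Publishing an actual recorded location, instead of a synthetic center
    that may fall somewhere no one visited, is the idea the abstract
    credits with improving utility.
    """
    rep = nearest_real_point(points, centroid(points))
    return [rep for _ in points]

# Locations of four trajectories at one timestamp, already grouped in one cluster.
cluster = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0), (1.0, 1.0)]
generalized = generalize_cluster(cluster)
```

Here the centroid of `cluster` is (1.25, 1.25), which is not a recorded location, so every point is generalized to (1.0, 1.0), the recorded point closest to it.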