登录    注册    忘记密码

详细信息

基于联合树的隐私高维数据发布方法  ( EI收录)  

Private High-Dimensional Data Publication with Junction Tree

文献类型:期刊文献

中文题名:基于联合树的隐私高维数据发布方法

英文题名:Private High-Dimensional Data Publication with Junction Tree

作者:张啸剑[1];陈莉[2];金凯忠[1];孟小峰[3]

第一作者:张啸剑

机构:[1]河南财经政法大学计算机与信息工程学院;[2]河南财经政法大学网络信息安全研究所;[3]中国人民大学信息学院

第一机构:河南财经政法大学计算机与信息工程学院

年份:2018

卷号:55

期号:12

起止页码:2794-2809

中文期刊名:计算机研究与发展

外文期刊名:Journal of Computer Research and Development

收录:CSTPCD;;EI(收录号:20191206653163);Scopus;北大核心:【北大核心2017】;CSCD:【CSCD2017_2018】;

基金:国家自然科学基金项目(61502146;91646203;91746115;61772131);河南省自然科学基金项目(162300410006);河南省科技攻关项目(172102310713);河南省教育厅高等学校重点科研项目(16A520002);河南财经政法大学青年拔尖人才资助计划项目~~

语种:中文

中文关键词:高维数据;差分隐私;Markov网;联合树;边缘分布

外文关键词:high-dimensional data; differential privacy; Markov network; junction tree; marginal distribution

摘要:基于差分隐私的数据发布已得到研究者的广泛关注.然而,现有的发布方法却不能有效地处理高维数据,其原因在于维度灾难和值域多样会引入极大的噪音值,进而使得发布结果的可用性比较低.基于此,提出一种基于联合树的隐私高维数据发布方法 PrivHD(differentially private high dimensional data release),该方法通过指数机制构造Markov网,引入满足差分隐私的高通滤波技术缩减指数机制搜索空间.结合充分三角化操作和顶点消除操作对Markov网分割来获得完全团图,采用最大生成树方法生成满足差分隐私的联合树.利用联合树中各个团后置处理之后的联合分布表合成最终的高维数据.基于真实的高维数据集比较PrivHD算法与PrivBayes(private Bayesian network),JTree(junction tree)算法的精度,实验结果表明:PrivHD算法的k-way查询和SVM(support vector machine)分类精度优于同类算法.
The problem of differentially private data publishing has attracted considerable research attention in recent years.The current existing solutions,however,cannot effectively handle the release of high-dimensional data.That is because these methods suffer from curse of dimensionality and various domain sizes,which will lead to the lower utility of publication.To address the problems,this paper presents PrivHD(differentially private high dimensional data release)with junction tree,a differentially private method for publishing high-dimensional data.PrivHD firstly generates a Markov network with exponential mechanism,which employs the high-pass filter technique to reduce the candidate space in the sampling process.After that,based on the network,PrivHD obtains a complete cluster graph in terms of full triangulation and node elimination,and then relies on the cluster graph and maximum spanning tree method to construct a differentially private junction tree.Finally,PrivHD uses the post-processing technique to boost the noisy counts of marginal tables in each cluster in junction tree,and based on the boosted result,PrivHD produces the high-dimensional synthetic dataset.PrivHD is compared with the existing approaches such as PrivBayes,JTree on the different real datasets.The experimental results show that PrivHD is better than its competitors on k-way query and SVM classification.

参考文献:

正在载入数据...

版权所有©河南财经政法大学 重庆维普资讯有限公司 渝B2-20050021-8 
渝公网安备 50019002500408号 违法和不良信息举报中心