Journal of Shenzhen University Science and Engineering

The online variational expectation maximization (onlineVEM) algorithm can quickly discover the clustering patterns of large-scale networks, but its results are less stable and less accurate when the network structure is complex. To identify clustering patterns faster and more accurately, an active semi-supervised online variational expectation maximization (ASonlineVEM) algorithm is proposed. The algorithm first selects representative nodes automatically, determines the number of clusters, and initializes the model from the representative nodes. It then iterates over three tasks: running the online algorithm onlineVEM, actively selecting nodes, and updating the model, until it reaches a preset accuracy threshold or converges. Experiments on artificial and real networks with different structures show that ASonlineVEM is more accurate and more efficient than comparable algorithms. By using the prior information of actively selected nodes, ASonlineVEM improves the stability and accuracy of clustering pattern discovery in networks and the running efficiency of the online algorithm.
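The control flow described in the abstract (initialize from representative nodes, then repeat three tasks until an accuracy threshold or convergence is reached) can be sketched as follows. This is only an illustrative skeleton: the function `as_online_loop`, its callable parameters, and the toy integer "model" are all hypothetical names introduced here, and none of the actual onlineVEM updates or node-selection criteria from the paper are reproduced.

```python
from typing import Callable, Dict, Tuple

Labels = Dict[int, int]  # node id -> known cluster label (hypothetical representation)


def as_online_loop(
    init: Callable[[], Tuple[object, Labels]],
    run_online: Callable[[object, Labels], object],
    select_nodes: Callable[[object, Labels], Labels],
    update_model: Callable[[object, Labels], object],
    accuracy: Callable[[object], float],
    converged: Callable[[object, object], bool],
    acc_threshold: float = 0.9,
    max_rounds: int = 50,
):
    """Skeleton of the loop; every callable is a placeholder supplied by the caller."""
    # Initialization: representative nodes, number of clusters, initial model.
    model, labeled = init()
    for rounds in range(1, max_rounds + 1):
        # Task 1: run the online algorithm using the currently labeled nodes.
        model = run_online(model, labeled)
        # Task 2: actively select further nodes and add their labels.
        labeled = {**labeled, **select_nodes(model, labeled)}
        # Task 3: update the model with the enlarged labeled set.
        new_model = update_model(model, labeled)
        # Stop at the preset accuracy threshold or on convergence.
        done = accuracy(new_model) >= acc_threshold or converged(model, new_model)
        model = new_model
        if done:
            return model, labeled, rounds
    return model, labeled, max_rounds


# Toy stand-ins so the skeleton runs: the "model" is an integer score out of 100.
final_model, final_labeled, rounds = as_online_loop(
    init=lambda: (50, {0: 0}),
    run_online=lambda m, lab: m + 10,
    select_nodes=lambda m, lab: {len(lab): 0},  # label one new node per round
    update_model=lambda m, lab: m + 5,
    accuracy=lambda m: m / 100,
    converged=lambda old, new: abs(new - old) < 1,
)
print(final_model, len(final_labeled), rounds)
```

Passing the three tasks in as callables keeps the stopping logic separate from the model-specific steps, which the abstract does not spell out.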

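Introduction
1 Active semi-supervised structure discovery strategy for large-scale networks
2 The ASonlineVEM algorithm
3 Experiments and result analysis
4 Conclusion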

Fig.1 NMI comparison results of three algorithms on GN networks

Fig.2 NMI comparison results of three algorithms on LFR networks

Table 1 Synthetic network dataset

Fig.3 NMI comparison results of three algorithms on synthetic networks

Fig.4 NMI comparison results of three algorithms on classic real networks

Fig.5 NMI comparison results of ASonlineVEM and ALISE algorithms on Facebook networks

Fig.6 Time comparison of ASonlineVEM and ALISE algorithms on synthetic networks

Fig.7 Time comparison of ASonlineVEM and ALISE algorithms on Facebook networks
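
Figs. 1 to 5 report normalized mutual information (NMI) between detected and reference clusterings. The snippet below is a minimal sketch of one common NMI variant, 2I(A;B)/(H(A)+H(B)); the page does not state which normalization the authors used, so this choice and the function name `nmi` are assumptions made for illustration.

```python
import math
from collections import Counter


def nmi(labels_a, labels_b):
    """NMI with arithmetic-mean normalization: 2*I(A;B) / (H(A) + H(B))."""
    n = len(labels_a)
    if n == 0 or n != len(labels_b):
        raise ValueError("label sequences must be non-empty and of equal length")
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    joint = Counter(zip(labels_a, labels_b))
    # Entropy of each partition (natural logarithm; the base cancels in the ratio).
    h_a = -sum(c / n * math.log(c / n) for c in count_a.values())
    h_b = -sum(c / n * math.log(c / n) for c in count_b.values())
    # Mutual information from the joint cluster-overlap counts.
    mi = sum(
        c / n * math.log(c * n / (count_a[a] * count_b[b]))
        for (a, b), c in joint.items()
    )
    if h_a + h_b == 0:
        # Both partitions put every node in one cluster; treat them as identical.
        return 1.0
    return 2 * mi / (h_a + h_b)


same_up_to_relabeling = nmi([0, 0, 1, 1], [1, 1, 0, 0])
independent = nmi([0, 0, 1, 1], [0, 1, 0, 1])
print(same_up_to_relabeling, independent)
```

NMI is invariant to how cluster labels are named, which is why the first call compares partitions whose labels are swapped.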

