An Effective Algorithm for Optimizing Surprise in Network Community Detection

Many methods have been proposed to detect communities/modules in various networks such as biological molecular networks and disease networks, while optimizing statistical measures for community structures is one of the most popular ways for community detection. Surprise, which is a statistical measu...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Yan-Ni Tang, Ju Xiang, Yuan-Yuan Gao, Zhi-Zhong Wang, Hui-Jia Li, Shi Chen, Yan Zhang, Jian-Ming Li, Yong-Hong Tang, Yong-Jun Chen
Formato: article
Lenguaje:EN
Publicado: IEEE 2019
Materias:
Acceso en línea:https://doaj.org/article/b962e79c8ed34d83b0db87ef3b15e781
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Sumario:Many methods have been proposed to detect communities/modules in various networks such as biological molecular networks and disease networks, while optimizing statistical measures for community structures is one of the most popular ways for community detection. Surprise, which is a statistical measure of interest for community detection, has good performance in many networks, but it still encounters the resolution limit in some cases and it is hard to be optimized due to its strong nonlinearity. Here, we discussed the resolution limit of Surprise by a phase diagram in community-partition transition, and then proposed an improved algorithm for Surprise optimization by introducing three effective strategies: a pre-processing of topological structure based on local random walks (Pre_TS), a pre-processing of community partition (Pre_CS), and a post-processing of community partition (Post_CS). By a series of experimental tests in various networks, we show that Pre_TS can effectively enhance the resolution of Surprise, Pre_CS and Post_CS can improve the optimization performance in different aspects, and as expected, the combination of these strategies can more effectively enhance the ability of Surprise to detect communities in complex networks. Finally, we displayed the effectiveness of the improved algorithm for Surprise optimization in several real-world networks, and applied the algorithm to the analysis of disease-related networks in computational biology.