Abstract
Recovering latent communities is a key unsupervised learning Recovering latent communities is a key unsupervised learning task in network data with applications spanning across a multitude of disciplines. For example, identifying communities in web pages can lead to faster search, classifying regions of the human brain in communities can be used to predict onset of psychosis, and identifying communities of assets can help investors manage risk by investing in different communities of assets. However, the scale of these massive networks has become so large that it is often impossible to work with the entire network data. In this talk, I will talk about some theoretical progress for community detection in a probabilistic set up especially when we have missing data about the network. Based on joint works with Julia Gaudio, Elchanan Mossel and Colin Sandon.