We introduce an analytical statistical method for characterizing the communities detected in heterogeneous complex systems. By proposing a suitable null hypothesis, our method makes use of the hypergeometric distribution to assess the probability that a given property is over-expressed in the elements of a community with respect to all the elements of the investigated set. We apply our method to two specific complex networks, namely a network of world movies and a network of physics preprints. The characterization of the elements and of the communities is done in terms of languages and countries for the movie network and of journals and subject categories for papers. We find that our method is able to characterize clearly the communities identified. Moreover our method works well both for large and for small communities.
Community characterization of heterogeneous complex systems
LILLO, FABRIZIO;
2011
Abstract
We introduce an analytical statistical method for characterizing the communities detected in heterogeneous complex systems. By proposing a suitable null hypothesis, our method makes use of the hypergeometric distribution to assess the probability that a given property is over-expressed in the elements of a community with respect to all the elements of the investigated set. We apply our method to two specific complex networks, namely a network of world movies and a network of physics preprints. The characterization of the elements and of the communities is done in terms of languages and countries for the movie network and of journals and subject categories for papers. We find that our method is able to characterize clearly the communities identified. Moreover our method works well both for large and for small communities.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.