Network motifs come in sets: Correlations in the randomization process
Source: Phys. Rev. E 82, 011921 (2010); doi:10.1103/PhysRevE.82.011921
Published 22 July 2010
The identification of motifs—subgraphs that appear significantly more often in a particular network than in an ensemble of randomized networks—has become a ubiquitous method for uncovering potentially important subunits within networks drawn from a wide variety of fields. We find that the most common algorithms used to generate the ensemble from the real network change subgraph counts in a highly correlated manner, such that one subgraph's status as a motif may not be independent from the statuses of the other subgraphs. We demonstrate this effect for the problem of three- and four-node motif identification in the transcriptional regulatory networks of E. coli and S. cerevisiae in which randomized networks are generated via an edge-swapping algorithm. We find strong correlations among subgraph counts; for three-node subgraphs these correlations are easily interpreted, and we present an information-theoretic tool that may be used to identify correlations among subgraphs of any size. Our results suggest that single-feature statistics such as Z scores that implicitly assume independence among subgraph counts constitute an insufficient summary of the network.
©2010 The American Physical Society
| History: | Received 25 January 2010; revised 5 May 2010; published 22 July 2010 |
| Permalink: |
http://link.aps.org/abstract/PRE/v82/e011921 |
ADVERTISEMENT


