On the surface, the protein universe seems dauntingly vast. The increasingly rapid accumulation of genomic sequences over the past few decades has yielded sequence data for several million gene products, leaving researchers struggling to keep up. Of the 10,000 protein families listed in the latest release from PFAM (http://pfam.sanger.ac.uk/), an online database that groups proteins base...