Explosive diversification explained by network analysis

Using genomic analyses of 100 cichlid species, scientists from Eawag and the University of Bern, together with co-workers in Australia, the UK, Tanzania, Uganda and the US, have investigated the striking variation observed in cichlid fish speciation rates. Their findings show that exchanges of genetic variants between existing species dramatically accelerate the development of new species – given favourable ecological conditions.

Evolution depends on genetic mutations: if numerous random mutations accumulate in a population, a new species may emerge. Speciation of this kind takes millions of years. If a young species develops under selection pressure, the process may be more rapid; in such cases, however, only two new species will usually be formed from one progenitor, as alternative genetic variants are eliminated in the process and the evolutionary potential is thus exhausted. But in the geologically young Lake Victoria, around 500 new cichlid species have evolved over the past 15,000 years. In a study just published in Nature, a group of Eawag and University of Bern scientists led by evolutionary and fish biologist Ole Seehausen explores how this is possible.

Explosive adaptive radiation

Cichlids are a textbook example of the process known as adaptive radiation, in which organisms rapidly diversify to form new species, adapted to different ecological niches. As the world’s second-largest vertebrate family, cichlids are clearly not averse to speciation. However, of the 1712 cichlid species that had been scientifically described by 2019, many developed less than rapidly or even extremely slowly. In other words, rapid speciation cannot be said to be simply a characteristic of cichlids.

Joana Meier, a postdoctoral researcher involved in the project, points out that ecological diversity can promote a high level of species diversity, but also emphasises: “Favourable ecological conditions are not sufficient to explain why adaptive radiation in Lake Victoria was 10,000 times more rapid than in most other lakes.”

Representative sample

In previous studies, Seehausen’s group showed that the rapid cichlid radiation was due to the earlier hybridisation of two distantly related lineages, which came into contact again around 150,000 years ago after several million years of geographical separation; subsequently, their descendants are believed to have been repeatedly isolated and brought together again in hybrid swarms during the turbulent period leading up to the formation of Lake Victoria and its modern fauna.

In the latest study, the researchers analysed the genomes of 100 cichlid species from the Lake Victoria radiation, representing all the various genera, feeding habits and morphologies. This data was compared with genomic data from 20 slowly or less rapidly speciating cichlids from eight other lakes, and with literature/digital repository data on all known cichlid species.

Indels in the spotlight

The new data showed a positive correlation between the speciation rate and the number of indels found in the genome. Indels are short DNA sequences containing base pair insertions or deletions, which may be absent in one species while one or more copies are present in closely related species; they are more difficult to detect than base pair replacements (i.e. point mutations) and have tended to play a minor role in evolutionary theory to date.

The researchers found thousands of indels between the genomes of any two species in the rapidly speciating Lake Victoria flock. In the “slower” lineages from other lakes, the density of indels that had accumulated between species per unit time was lower: the indels had evidently accumulated more slowly – and the rate of speciation was correspondingly slower.

“Use it or lose it” potential

Investigations of the function of Lake Victoria indels revealed that they were significantly associated, in many cases, with variation in diet, habitat and male nuptial coloration. This means that the Lake Victoria cichlids could draw on an indel pool which was not only large but also contained variants important for speciation – directly, through mate choice, or indirectly, through specialisations enabling the fish to occupy new ecological niches. Many of these indels are much older than Lake Victoria or its cichlids and probably arrived in the common ancestor of the radiation through ancient hybridisation with other cichlids. Together with the many point mutations that were also brought together by ancient hybridisation, this created an enormous genomic potential.

In a homogeneous environment, this potential would probably soon have disappeared; if genomic variants are not used, they are easily lost – either through chance or uniform selection pressure. But over the last 15,000 years, the large new Lake Victoria evidently offered a wide variety of ecological opportunities and, at a stroke, the fish were able to utilise hundreds of different variants for new – sometimes highly complex – specialisations.

For example, some specialised in scraping firmly attached algae from rocks in clear water close to the shore, while others developed an acute visual capacity enabling them to prey on other fish in the murky deep waters – thanks to gene variants that had been selected much earlier for these specialisations, copies of which were available “ready for use” by young species after the hybridisation.

As these genomic variants facilitated vital specialisations, complementary sets were maintained in the different species and could be recombined again after hybridisation, leading to the evolution of further new species.

A new model

Ole Seehausen says: “We’ve started to develop a model which uses networks rather than phylogenetic trees to describe the evolution of new species. In these networks, not only do new genomic variants arise, but new and old genetic material can be exchanged between species for a very long time. As a result, much more genetic material is available for evolution at all times.”

To what extent this model can also be used to explain different speciation rates in other fish is currently being investigated by scientists at Eawag and the University of Bern – for example, with regard to the whitefish radiation which occurred in prealpine lakes after the last ice age.

Using temporal information

For the reconstruction of “evolutionary networks”, the authors of the Nature study had to determine the age both of the species investigated and of the genetic information (primarily indels) exchanged.

Evolutionary relationships are usually determined on the basis of similarities in genome sequences. However, this approach is of limited value for closely related species that have evolved from a hybrid population. Nonetheless, relative species age in such cases may be estimated using IBD (identity by descent) segment size: whenever reproduction occurs, genetic material is recombined. As a result, the segments of DNA sequence remaining identical in all descendants become shorter from one generation to the next. In the published study, this natural “rate of decay” of IBD segments was used to determine the relative age of cichlid species: the shorter the IBD segments shared between two species, the longer the time since the species diverged.
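
The decay of IBD segments can be illustrated with a textbook approximation, which is not necessarily the calculation used in the Nature study. If two genomes trace back to a common ancestor g generations ago, roughly 2g rounds of recombination separate them, and the shared segments have an expected length of about 100/(2g) centimorgans. The Python sketch below applies that relation in both directions and ranks species pairs by relative age. The function names and the assumption of a constant recombination rate are illustrative only.

```python
def mean_ibd_length_cm(generations_since_split):
    """Expected IBD segment length (centimorgans) under the textbook
    approximation: 2 * g meioses separate the two genomes."""
    if generations_since_split <= 0:
        raise ValueError("generations_since_split must be positive")
    return 100.0 / (2.0 * generations_since_split)


def generations_from_ibd(mean_length_cm):
    """Invert the approximation: shorter shared segments imply an older split."""
    if mean_length_cm <= 0:
        raise ValueError("mean_length_cm must be positive")
    return 100.0 / (2.0 * mean_length_cm)


def rank_pairs_oldest_first(mean_length_by_pair):
    """Order species pairs from oldest to youngest split.

    mean_length_by_pair maps a pair label to its mean shared IBD
    segment length in centimorgans (hypothetical input format).
    """
    return sorted(mean_length_by_pair, key=lambda pair: mean_length_by_pair[pair])
```

Only the ordering is needed for relative ages, so the ranking function works even when the absolute number of generations is uncertain.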

The age of the indels in the genomes of the Lake Victoria cichlids was also estimated by comparison with the genomes of other cichlid species. If Lake Victoria cichlids share an indel with a species in another river system, to which they are only distantly related, then the indel must be ancient. Alternatively – and this would require further investigation – it may have been exchanged via hybridisation between the groups concerned.
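
The reasoning in this paragraph amounts to a simple decision rule, sketched here in Python. The function name and the category labels are hypothetical and only restate the logic described above; they are not taken from the study.

```python
def classify_indel_age(in_lake_victoria, in_distant_outgroup):
    """Label an indel by where it is observed.

    in_lake_victoria:    the indel is present in Lake Victoria cichlids
    in_distant_outgroup: the indel is also present in a distantly related
                         species from another river system
    """
    if not in_lake_victoria:
        return "not part of the radiation"
    if in_distant_outgroup:
        # Sharing with a distant relative implies an ancient origin,
        # unless the indel was exchanged later via hybridisation.
        return "ancient or exchanged via hybridisation"
    return "no evidence of ancient origin"
```

Telling the two explanations in the middle category apart is the step that, as noted above, would require further investigation.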


Matthew D McGee et al. (2020). The ecological and genomic basis of explosive adaptive radiation, Nature. https://doi.org/10.1038/s41586-020-2652-7

Source: Eawag