Large-scale comparative analysis of fish genomes in the era of biodiversity sequencing
Stage · Stage M2 · 6 mois Bac+5 / Master IBENS - CNRS · Paris (France) indemnités de stage
Date de prise de poste : 1 février 2024
Mots-Clés
comparative genomics - evolution - biodiversity
Description
Sujet de stage de M2
Equipe DYOGEN (https://www.biologie.ens.fr/dyogen)
Encadrant: H. Roest Crollius (DR1 CNRS)
Période février-juillet 2024 (flexible).
Large-scale comparative analysis of fish genomes in the era of biodiversity sequencing
The DYOGEN team at the Institute of Biology of the ENS (IBENS) seeks to make sense of the information contained in vertebrate genomes through the comparative study of their evolution, organization, gene content and regulatory sequence content. For several years now, the team has been particularly interested in fish genomes, which underwent a complete duplication before the great radiation of the teleosts, which has given rise to about 30,000 species today. The team is at the heart of the ATLASea project, which aims to sequence the genomes of 4500 marine species from the french coastline (https://www.atlasea.fr). This massive data is ideal material to identify genotype-phenotype relationships using either "Forward” or “Reverse” genomics strategies [1–3], where one seeks to associate signatures of phenotypic changes with genomic changes.
The internship will build on the team’s expertise in bioinformatics and fish comparative genomics to develop such strategies applied to known cases of convergent evolution in teleost fish. Convergent evolution, where the same phenotype has evolved more than once in independent lineages, is a powerful approach to identify genotype-phenotype relationships because it involves replicate observations. It has been documented numerous times in fish, such as for the emergence of amphibiousness, viviparity, osmoregulation, endothermy, extreme short or long lifespan, or the loss of the swim bladder, of the stomach, of scales, of fins. The approach will consist in building reference datasets for protein coding gene phylogenies and conserved non-coding element (CNE) families. More than 450 high-quality fish genomes are currently available, and will provide the necessary raw data from which relevant data subsets (gene families, multiple genome alignments, etc.) will be established. When no specific genomic elements (genes or CNEs) can be ascribed to a phenotype of interest through discrete retention or loss patterns, two strategies will be employed to broaden the range of the genotype to be considered. First, gene annotations (gene ontology, metabolic pathways, paralogy relationships) will serve to extend the biological context to consider. Second, a continuous signal of molecular evolutionary rate change will be measured across phylogenies, instead of a binary retention/loss of sequences.
The project is an exciting opportunity to leverage the abundance of high-quality genomic data that biodiversity sequencing projects are currently producing[4]. IBENS is equipped with a substantial bioinformatics platform providing access to a local high-performance computing cluster (>4500 CPUs, > 20 Tb RAM).
1. Chen HI, Turakhia Y, Bejerano G, Kingsley DM. Whole-genome Comparisons Identify Repeated Regulatory Changes Underlying Convergent Appendage Evolution in Diverse Fish Lineages. Molecular Biology and Evolution. 2023;40: msad188. doi:10.1093/molbev/msad188
2. Hiller M, Schaar BT, Indjeian VB, Kingsley DM, Hagey LR, Bejerano G. A “Forward Genomics” Approach Links Genotype to Phenotype using Independent Phenotypic Losses among Related Species. Cell Reports. 2012;2: 817–823. doi:10.1016/j.celrep.2012.08.032
3. Prudent X, Parra G, Schwede P, Roscito JG, Hiller M. Controlling for Phylogenetic Relatedness and Evolutionary Rates Improves the Discovery of Associations Between Species’ Phenotypic and Genomic Differences. Molecular Biology and Evolution. 2016;33: 2135–2150. doi:10.1093/molbev/msw098
4. Lewin HA, Richards S, Aiden EL, Allende ML, Archibald JM, Bálint M, et al. The Earth BioGenome Project 2020: Starting the clock. PNAS. 2022;119. doi:10.1073/pnas.2115635118
Candidature
Procédure : Envoyer un mail avec CV et lettre de motivation
Date limite : 15 janvier 2025
Contacts
Hugues Roest Crollius
hrNOSPAMc@bio.ens.psl.eu
Offre publiée le 18 novembre 2024, affichage jusqu'au 15 janvier 2025