Illustration prédire les phénotypes
Prediction

Prediction

Axis 2: Predicting phenotypes and their responses to changes in stress fields

This axis deals with the prediction of phenotypes from the cell to the individual and population levels, their functionalities and responses to changes in stress fields (biotic and abiotic environment, management methods, practices).

One of the challenges of digital biology is to develop, compare and improve learning methods, adapting them to integrate multi-source data (omics, sensors, environment, data from participatory approaches). The modelling of biological and physiological processes to develop predictive approaches and the simulation of complex biological systems are also major challenges. In particular, the robustness of the models built in axis 1 must be tested by subjecting them to fluctuating conditions (internal or external).

This research includes predictive approaches based on modelling, data assimilation and the coupling of "data-driven" and "concept-driven" approaches, taking into account the essential dimensions of prediction, i.e. planning, quantification of uncertainties and evaluation of forecasts.

Research themes

  • Genomic prediction;
  • Taking account of all transmissible factors in predicting an individual’s potential to transmit characteristics to its descendants;
  • Application of these methods to identify bio-markers for an organism’s state or potential (resistance or stress levels, etc.), and  genetic targets to optimize networks;
  • Predicting the impacts of complex chemical and biological exposure scenarios on physiology;
  • Mobilizing information on states and knowledge of processes, at organism (and sub-organism) levels, to predict the functioning and trajectories of populations at different spatial and/or temporal scales.

Methodological challenges

  • Select the learning methods best suited to a given problem, determining their contributions and limitations for detecting complex structures or weak signals in the data;
  • Build and simulate integrative models with good predictive capabilities, allowing the effects of confounding factors in the data to be identified;
  • Access unobservable quantities through model inversion (e.g. trait value estimation);
  • Improve the quality and robustness of predictions (underlying issues: planning and sampling, uncertainty assessment, taking account of changing environments).

In this folder

Illustration thèse confinancée

Thesis by Maxime Delmas (Toxalim, defended in 2022). Using Semantic Web technologies, the work presented in this thesis proposes the extraction and aggregation of a set of relationships between chemical compounds and biomedical concepts, to build an open knowledge graph : FORUM.

Illustration thèse confinancée

Thesis by Camille Juigné (PEGASE, defended in 2023). This thesis focuses on the development of an integrative computational method for analyzing massive and complex biological data to extract relevant knowledge. The resultant multi-layer graph provides multiple links between elements, allowing characterization of the relationships between specific molecules to determine variations in pig feed efficiency, a key phenotype of interest for sustainable animal production systems.

Illustration thèse confinancée

Thesis by Annaig de Walsche (GQE, 2022-2025). This thesis seeks to develop the set of methods necessary to detect QTLs (quantitative trait loci) from interdependent characterised panels involving heterogenous inter-environmental effects in natural biological networks. These innovative methods will combine computational and synthetic biology approaches.

Thesis by Clémentine Borrelli (SVQV, 2022-2025). French viticulture must currently deal with the effects of climate heating while reducing its use of phytosanitary products. One solution is to create new varieties that are disease resistant. The purpose of this thesis is to optimise predictive models relating to the genetic value of these resistant varieties.

Thesis by Felicia Maviane-Macia (LIMPE, 2022-2025). This thesis aims to develop highly effective and generic image-based computational tools to quantify and improve understanding of the dynamics of the manifestation of foliar symptoms caused by pathogenic microorganisms on Arabidopsis and grapevine.

Illustration thèse confinancée

Thesis by Elfried Salanon. This thesis sets out to study the developmental trajectories of metabolic syndrome (MetS). This syndrome is increasingly widespread, making it an important issue in public health, especially in older populations suffering from chronic illness and disease.

Thesis by Antoine Bourlier (PRC, 2022-2025). This thesis seeks to create new algorithms for anato-functional comparison of brain data from growing lambs using both traditional graph theory and new graph-based deep learning methods to study the differences between individuals and over time.

Thesis by Elise Jorge (GenPhyse, 2023-2025). This thesis, at the interface between statistics, molecular biology and functional genomics, seeks to develop a differential analysis method for Hi-C data to identify regions in the 3-D genome structure where modifications occur.

Thesis by Alexandre Asset (BREED /MIA-PS, 2024 - 2026). Building on the work of EPINUM, this thesis proposes to investigate the most appropriate AI approaches that integrate epigenetic data into phenotype prediction models.

OBAMA © Pexels Sarai Zuno

The genetic selection of animals has been revolutionised over the past years by the advent of genomics, making it easier to select for specific essential phenotypes. Nevertheless, the task of understanding the links between observed genetic variations and phenotypic characteristics of interest remains complex. The OBAMA interdisciplinary project proposes to combine AI with genomics to improve our understanding of the influence of genetic factors on phenotypes in pigs.

  • 1 (current)