Illustration prédire les phénotypes
Prediction

Prediction

Axis 2: predicting phenotypes and their responses to changes in stress fields

This axis deals with the prediction of phenotypes from the cell to the individual and population levels, their functionalities and responses to changes in stress fields (biotic and abiotic environment, management methods, practices).

One of the challenges of digital biology is to develop, compare and improve learning methods, adapting them to integrate multi-source data (omics, sensors, environment, data from participatory approaches). The modelling of biological and physiological processes to develop predictive approaches and the simulation of complex biological systems are also major challenges. In particular, the robustness of the models built in axis 1 must be tested by subjecting them to fluctuating conditions (internal or external).

This research includes predictive approaches based on modelling, data assimilation and the coupling of "data-driven" and "concept-driven" approaches, taking into account the essential dimensions of prediction, i.e. planning, quantification of uncertainties and evaluation of forecasts.

Research themes

  • Genomic prediction;
  • Taking account of all transmissible factors in predicting an individual’s potential to transmit characteristics to its descendants;
  • Application of these methods to identify bio-markers for an organism’s state or potential (resistance or stress levels, etc.), and  genetic targets to optimize networks;
  • Predicting the impacts of complex chemical and biological exposure scenarios on physiology;
  • Mobilizing information on states and knowledge of processes, at organism (and sub-organism) levels, to predict the functioning and trajectories of populations at different spatial and/or temporal scales.

Methodological challenges

  • Select the learning methods best suited to a given problem, determining their contributions and limitations for detecting complex structures or weak signals in the data;
  • Build and simulate integrative models with good predictive capabilities, allowing the effects of confounding factors in the data to be identified;
  • Access unobservable quantities through model inversion (e.g. trait value estimation);
  • Improve the quality and robustness of predictions (underlying issues: planning and sampling, uncertainty assessment, taking account of changing environments).

In this folder

The development of genomic selection - and other "omics" analyses such as metagenomics, transcriptomics, metabolomics and proteomics - now makes it possible to characterise animals using thousands of measurements. This massive data is integrated into models to predict production traits with the highest possible degree of accuracy.

A major issue in in vitro fertilisation (IVF) is the selection of the "best" embryo, i.e. the one most likely to implant in the uterus. The objective of the BovMovie2Pred consortium is to propose solutions to assist in the selection of bovine embryos in order to increase the percentage of viable births from in vitro produced embryos.

Jumeaux numériques de systèmes microbiens © katemangostar, Freepik

The Arte-mis consortium will bring together an interdisciplinary community of researchers working at the interface between the experimental and digital sciences to overcome methodological barriers to the creation of digital twins in microbial ecology.

EPINIUM © Wirestock, Freepik

Hit by the impacts of a changing climate, cattle farming must adapt to changes in agro-ecological practices. To meet these challenges, a new generation of finely tuned, rapid and minimally invasive phenotyping tools must be developed to ensure the continued compatibility of animal/environment pairings. The EPINUM consortium proposes to deploy machine learning approaches to improve phenotypic prediction based on epigenotyping data.

Fermentwin © Freepik

Gene transcription is an essential process in the adaptive response of plants to environmental constraints. The interdisciplinary scientific consortium PRECURSOR aims to investigate and better understand how this process takes place in the proximal regions of genes to ultimately improve the predictive power of selection models.

Photo vache dans une étable

In plant and animal genetics, selection programmes aim to identify individuals whose performance (yield, resistance to disease or environmental stress) meets previously defined criteria. This selection requires the acquisition of data, in the field or in breeding, which can be costly or time-consuming.

OBAMA © Pexels Sarai Zuno

The genetic selection of animals has been revolutionised over the past years by the advent of genomics, making it easier to select for specific essential phenotypes. Nevertheless, the task of understanding the links between observed genetic variations and phenotypic characteristics of interest remains complex. The OBAMA interdisciplinary project proposes to combine AI with genomics to improve our understanding of the influence of genetic factors on phenotypes in pigs.

To survive, plants must take up water and many nutrients from the soil. These resources are unevenly distributed and plants must explore the soil to find them. This exploration requires the extension of roots, which is a development that comes at a cost for the plant.

Photo Acyrthosiphon pisum

Today, agriculture faces many challenges, including to avoid the development of certain pathogens resulting from the reduction in the use of inputs with a view to sustainable agriculture as well as the effects of climate change. In this context, many questions arise in the short term about the adaptive capacities of these bio-aggressors. Will an insect pest resist the next heat wave? Or will it instead be greatly affected by rising temperatures and cease to be a threat ?

In biology, as in other scientific fields, the integration of multi-source data is more relevant than ever. Indeed, the data collected are increasingly complex and their volume is growing, due to the development of analytical platforms, imaging techniques, the rise of omics data, etc

  • 1 (current)