Exploratory project DINAMIC (2022 - 2023)

Analysing biological networks of mixed-type data with copula models

Integrative biology is based on the study of complex biological networks. Understanding the plasticity of biological interaction networks due to phenotypic, environmental or interventional variability is an important challenge in fields as diverse as genomics or human nutrition. Such studies often include comparisons between contrasting groups, including variables of various natures (continuous, counts, binary, etc.). These so-called "mixed-type" data can be difficult to analyse in a unified way. While multivariate probabilistic models provide a robust framework for inferring interrelationships among continuous variables, an analogous model for mixed-type data has yet to be defined.

Background and challenges

A particularly promising but as-yet unexplored approach for this purpose is the use of parametric copula models, which can be used to couple variables of disparate natures. The development of such a model in a computationally efficient graphical form thus represents an open methodological challenge for the inference of generic networks from mixed-type data.

Goals

The DINAMIC project aims to develop and implement an innovative and widely applicable multivariate framework based on copulas and random pairwise likelihood (Mazo et al., 2021) for the differential analysis of mixed-type networks.

These methodological developments will be based on a succession of three applications covering several research themes at INRAE:

  • cognitive health networks in seniors following the introduction of nutritional supplements;
  • phenotypic networks in response to thermal stress in maize lines structured according to their genetic proximity;
  • multi-omic networks in sperm from groups of bulls with contrasting fertility.

Each application will motivate a distinct facet of our approach, highlighting the added value of our interdisciplinary collaboration. To combine theoretically sound and computationally efficient statistical developments with relevant modelling assumptions aligned with the underlying biology, the DINAMIC project relies on a continuous cycle of interactions between methodologists and domain-specific experts.

Our multivariate mixed-type network model will represent a new approach to digital biology, with the potential to generate new insights into network plasticity in a wide variety of scientific disciplines.

Contact - coordination :

Andrea Rau, GABI

Partnerships

INRAE participants

Plant Biology and Breeding division

Expertise

UMR GABI

Biostatistics

 

UMR Transfrontalière BIoEcoAgro

Quantitative genetics, plant genomics

 

UMR GQE Le Moulon

Omic analysis

 

Mathematics and Digital Technologies division

UMR MaIAge

Statistics, mathematics

 

Animal physiology and Livestock system division

UMR BREED

Animal genomics

 

Nutrition, Chemical Food Safety and Consumer Behaviour division 

UMR NutriNeuro

Human nutrition

 

Partners

Greece

Expertise

Athens University of Economics and Business

Statistics and methodology

 

See also

References

Mazo, G., Karlis, D., and Rau, A. (2021) A randomized pairwise likelihood method for complex statistical inferences. Under review. hal-03126621