Illustration thèse confinancée
Thesis by Maxime Delmas (2019 - 2022)

Construire, exploiter et étendre un graphe de connaissances pour l’étude des liens entre métabolisme et santé

Thesis by Maxime Delmas (Toxalim, defended in 2022). Using Semantic Web technologies, the work presented in this thesis proposes the extraction and aggregation of a set of relationships between chemical compounds and biomedical concepts, to build an open knowledge graph : FORUM.

  • Date : 2019 - 2022
  • Research laboratory : Toxalim
  • Thesis director :   Fabien Jourdan (INRAE, UMR Toxalim)
  • Supervisors :  Clément FRAINAY
  • Metaprogramme axis : Axis 2 (Predicting phenotypes and their responses to changes in stress fields)

Summary

Metabolomics aims at determining metabolic profiles, describing the impact on metabolism of an experimental condition or a particular phenotype. In human health, the study and comparison of these profiles are instrumental for the characterization of pathological mechanisms, toxic effects, or to identify new targets for diagnosis. The interpretation of these profiles requires the recontextualization of the observations both in their biochemical context, but also more broadly on the biological level, by integrating external knowledge from the scientific literature. From chemical databases to the literature, all this knowledge is accessible, but its volume and growth limit its exploration. In response to this information overload, new approaches need to be developed to help the researcher exploit this knowledge. Using Semantic Web technologies, the work presented in this thesis proposes the extraction and aggregation of a set of relationships between chemical compounds and biomedical concepts, to build an open knowledge graph : FORUM. The associative links between concepts have been inferred from their co-mention frequency in the literature, and augmented by the semantic representation of entities provided by chemical and biomedical vocabularies. Beyond the support for the interpretation of metabolic profiles, the potential of FORUM to suggest new hypotheses and explore relationships through the Web of data has also been discussed. Despite its growth, the coverage of the literature on the metabolome remains partial. When interpreting metabolic profiles, the amount of available information for each observed compound can therefore be unequal. In order to extend the knowledge graph to metabolites neglected in the literature, a second approach combining network analysis and Bayesian statistics was also developed.

The new suggested relationships with diseases are derived from the literature of the metabolic neighbours of the compound, and provide a new level of relationship in the knowledge graph 

delmas

Contact

See also

Publications

Journal articles

  • Maxime Delmas, Olivier Filangi, Nils Paulhe, Florence Vinson, Christophe Duperier, William Garrier, P.-E. Saunier, Yoann Pitarch, Fabien Jourdan, Franck Giacomoni and Clément Frainay (2021). FORUM: Building a Knowledge Graph from public databases and scientific literature to extract associations between chemicals and diseasesBionformatics.https://hal.archives-ouvertes.fr/hal-03318295 https://dx.doi.org/10.1101/2021.02.12.430944
  • Juliette Cooke, Maxime Delmas, Cecilia Wieder, Pablo Rodríguez Mier, Clément Frainay, et al.. Genome scale metabolic network modelling for metabolic profile predictions. PLoS Computational Biology, 2024, 20 (2), pp.e1011381. ⟨10.1371/journal.pcbi.1011381⟩⟨hal-04596969⟩
  • M. Delmas, O. Filangi, C. Duperier, N. Paulhe, F. Vinson, P. Rodriguez-Mier, F. Giacomoni, F. Jourdan, C. Frainay (2022). Suggesting disease associations for overlooked metabolites using literature from metabolic neighbours.bioRxiv 2022.09.13.507596; doi: https://doi.org/10.1101/2022.09.13.507596