Pangenome analysis of Enterobacteria reveals richness of secondary metabolite gene clusters and their associated gene sets

Omkar S. Mohite, Colton J. Lloyd, Jonathan M. Monk, Tilmann Weber*, Bernhard O. Palsson

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In silico genome mining provides easy access to secondary metabolite biosynthetic gene clusters (BGCs) encoding the biosynthesis of many bioactive compounds, which are the basis for many important drugs used in human medicine. However, the association between BGCs and other functions encoded in the genomes of producers have remained elusive. Here, we present a systems biology workflow that integrates genome mining with a detailed pangenome analysis for detecting genes associated with a particular BGC. We analyzed 3,889 enterobacterial genomes and found 13,266 BGCs, represented by 252 distinct BGC families and 347 additional singletons. A pangenome analysis revealed 88 genes putatively associated with a specific BGC coding for the colon cancer-related colibactin that code for diverse metabolic and regulatory functions. The presented workflow opens up the possibility to discover novel secondary metabolites, better understand their physiological roles, and provides a guide to identify and analyze BGC associated gene sets.

Original languageEnglish
Pages (from-to)900-910
Number of pages11
JournalSynthetic and Systems Biotechnology
Volume7
Issue number3
DOIs
Publication statusPublished - 1 Sept 2022

Bibliographical note

Funding Information:
This work was supported by grants from the Novo Nordisk Foundation ( NNF20CC0035580 , NNF16OC0021746 ).

Publisher Copyright:
© 2022 The Authors

Other keywords

  • Colibactin
  • Enterobacteria
  • Pangenome analysis
  • Secondary metabolites
  • Secretion systems
  • Workflow

Fingerprint

Dive into the research topics of 'Pangenome analysis of Enterobacteria reveals richness of secondary metabolite gene clusters and their associated gene sets'. Together they form a unique fingerprint.

Cite this