MetaPGN: a pipeline for construction and graphical visualization of annotated pangenome networks.

1. School of Biology and Biological Engineering, South China University of Technology, Building B6, 382 Zhonghuan Road East, Guangzhou Higher Education Mega Center, Guangzhou 510006, China.
Authors
Peng Y^{1,

2}
Li J^{1,

2}
(2 authors)
2. BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian, Shenzhen 518083, China.
Authors
Peng Y^{1,

2}
Tang S²
Wang D²
Zhong H²
Jia H²
Cai X²
Zhang Z²
Xiao M²
Yang H²
Wang J²
Kristiansen K²
Xu X²
Li J^{1,

2}
(13 authors)

ORCIDs linked to this article

Gigascience, 01 Nov 2018, 7(11)
https://doi.org/10.1093/gigascience/giy121 PMID: 30277499 PMCID: PMC6251982

This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.

Free full text in Europe PMC

This article is based on a previously available preprint.

Abstract

Pangenome analyses facilitate the interpretation of genetic diversity and evolutionary history of a taxon. However, there is an urgent and unmet need to develop new tools for advanced pangenome construction and visualization, especially for metagenomic data. Here, we present an integrated pipeline, named MetaPGN, for construction and graphical visualization of pangenome networks from either microbial genomes or metagenomes. Given either isolated genomes or metagenomic assemblies coupled with a reference genome of the targeted taxon, MetaPGN generates a pangenome in a topological network, consisting of genes (nodes) and gene-gene genomic adjacencies (edges) of which biological information can be easily updated and retrieved. MetaPGN also includes a self-developed Cytoscape plugin for layout of and interaction with the resulting pangenome network, providing an intuitive and interactive interface for full exploration of genetic diversity. We demonstrate the utility of MetaPGN by constructing Escherichia coli pangenome networks from five E. coli pathogenic strains and 760 human gut microbiomes,revealing extensive genetic diversity of E. coli within both isolates and gut microbial populations. With the ability to extract and visualize gene contents and gene-gene physical adjacencies of a specific taxon from large-scale metagenomic data, MetaPGN provides advantages in expanding pangenome analysis to uncultured microbial taxa.

Free full text

Gigascience. 2018 Nov; 7(11): giy121.

Published online 2018 Oct 2. https://doi.org/10.1093/gigascience/giy121

PMCID: PMC6251982

PMID: 30277499

MetaPGN: a pipeline for construction and graphical visualization of annotated pangenome networks

Ye Peng,^1,^2,³ Shanmei Tang,^2,^3,⁴ Dan Wang,^2,^3,⁴ Huanzi Zhong,^2,^3,^4,⁵ Huijue Jia,^2,^3,⁴ Xianghang Cai,^2,³ Zhaoxi Zhang,^2,³ Minfeng Xiao,^2,³ Huanming Yang,^2,⁶ Jian Wang,^2,⁶ Karsten Kristiansen,^2,^3,⁵ Xun Xu,^2,³ and Junhua Li^1,^2,^3,⁴

Ye Peng

¹School of Biology and Biological Engineering, South China University of Technology, Building B6, 382 Zhonghuan Road East, Guangzhou Higher Education Mega Center, Guangzhou 510006, China