Skip to main content

Research Repository

Advanced Search

INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes

Cook, Ryan; Brown, Nathan; Redgwell, Tamsin; Rihtman, Branko; Barnes, Megan; Clokie, Martha; Stekel, Dov J.; Hobman, Jon; Jones, Michael A.; Millard, Andrew

Authors

Ryan Cook

Nathan Brown

Tamsin Redgwell

Branko Rihtman

Megan Barnes

Martha Clokie

Andrew Millard



Abstract

Background: With advances in sequencing technology and decreasing costs, the number of phage genomes that have been sequenced has increased markedly in the past decade. Materials and Methods: We developed an automated retrieval and analysis system for phage genomes (https://github.com/RyanCook94/inphared) to produce the INfrastructure for a PHAge REference Database (INPHARED) of phage genomes and associated metadata. Results: As of January 2021, 14,244 complete phage genomes have been sequenced. The INPHARED data set is dominated by phages that infect a small number of bacterial genera, with 75% of phages isolated on only 30 bacterial genera. There is further bias, with significantly more lytic phage genomes (∼70%) than temperate (∼30%) within our database. Collectively, this results in ∼54% of temperate phage genomes originating from just three host genera. With much debate on the carriage of antibiotic resistance genes and their potential safety in phage therapy, we searched for putative antibiotic resistance genes. Frequency of antibiotic resistance gene carriage was found to be higher in temperate phages than in lytic phages and again varied with host. Conclusions: Given the bias of currently sequenced phage genomes, we suggest to fully understand phage diversity, efforts should be made to isolate and sequence a larger number of phages, in particular temperate phages, from a greater diversity of hosts.

Citation

Cook, R., Brown, N., Redgwell, T., Rihtman, B., Barnes, M., Clokie, M., Stekel, D. J., Hobman, J., Jones, M. A., & Millard, A. (2021). INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes. PHAGE Therapy, Applications, and Research, 2(4), 214-223. https://doi.org/10.1089/phage.2021.0007

Journal Article Type Article
Acceptance Date Aug 14, 2021
Online Publication Date Dec 16, 2021
Publication Date Dec 1, 2021
Deposit Date Feb 24, 2025
Journal PHAGE: Therapy, Applications, and Research
Print ISSN 2641-6530
Electronic ISSN 2641-6549
Publisher Mary Ann Liebert
Peer Reviewed Peer Reviewed
Volume 2
Issue 4
Pages 214-223
DOI https://doi.org/10.1089/phage.2021.0007
Public URL https://nottingham-repository.worktribe.com/output/8636437
Publisher URL https://www.liebertpub.com/doi/10.1089/phage.2021.0007
Other Repo URL https://www.biorxiv.org/content/10.1101/2021.05.01.442102v1