
Antonio Pedro Camargo
Articles
-
Nov 21, 2024 |
biorxiv.org | Nishant Jha |Joshua Kravitz |Jacob West-Roberts |Antonio Pedro Camargo
AbstractProtein sequence similarity search is fundamental to genomics research, but current methods are typically not able to consider crucial genomic context information that can be indicative of protein function, especially in microbial systems. Here we present Gaia (Genomic AI Annotator), a sequence annotation platform that enables rapid, context-aware protein sequence search across genomic datasets.
-
Oct 1, 2024 |
biorxiv.org | Andre L Cornman |Jacob West-Roberts |Antonio Pedro Camargo |Simon Roux
AbstractBiological language model performance depends heavily on pretraining data quality, diversity, and size. While metagenomic datasets feature enormous biological diversity, their utilization as pretraining data has been limited due to challenges in data accessibility, quality filtering and deduplication.
-
Sep 20, 2024 |
nature.com | Antonio Pedro Camargo
Correction to: Nature Reviews Microbiology https://doi.org/10.1038/s41579-024-01093-3, published online 28 August 2024. In the version of this article initially published, the copyright line appeared incorrectly and has now been updated to “Lawrence Berkeley National Laboratory, under exclusive licence to Springer Nature Limited,” in the HTML and PDF versions of the article. About this articleCamargo, A.P. Publisher Correction: Unveiling plasmid diversity in nature. Nat Rev Microbiol (2024).
-
Aug 28, 2024 |
nature.com | Antonio Pedro Camargo
This Genome Watch highlights recent metagenomic surveys that have revealed the extensive prevalence and diversity of plasmids in the human gut microbiome and discusses the challenges in accurately reporting plasmid genomes identified from metagenomic data.
-
Aug 17, 2024 |
biorxiv.org | Andre L Cornman |Jacob West-Roberts |Antonio Pedro Camargo |Simon Roux
AbstractBiological language model performance depends heavily on pretraining data quality, diversity, and size. While metagenomic datasets feature enormous biological diversity, their utilization as pretraining data has been limited due to challenges in data accessibility, quality filtering and deduplication.
Try JournoFinder For Free
Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.
Start Your 7-Day Free Trial →