Articles

  • Dec 5, 2024 | nature.com | James Zou

    AbstractLarge-scale gene-expression data are being leveraged to pretrain models that implicitly learn gene and cellular functions. However, such models require extensive data curation and training. Here we explore a much simpler alternative: leveraging ChatGPT embeddings of genes based on the literature.

  • Nov 21, 2024 | nature.com | Mehran Karimzadeh |Helen Li |Olivier Elemento |James Zou |Fereydoun Hormozdiari |Babak Alipanahi

    AbstractLiquid biopsies have the potential to revolutionize cancer care through non-invasive early detection of tumors. Developing a robust liquid biopsy test requires collecting high-dimensional data from a large number of blood samples across heterogeneous groups of patients. We propose that the generative capability of variational auto-encoders enables learning a robust and generalizable signature of blood-based biomarkers.

  • Oct 29, 2024 | biorxiv.org | James Zou

    AbstractPredicting how perturbation of a target gene affects the expression of other genes is a critical component of understanding cell biology. This is a challenging prediction problem as the model must capture complex gene-gene relationships and the output is high-dimensional and sparse.

  • Jun 10, 2024 | cell.com | Moritz Gerstung |David C Liu |Marzyeh Ghassemi |James Zou

    Experts discuss the challenges and opportunities of using artificial intelligence (AI) to study the evolution of cancer cells and their microenvironment, improve diagnosis, predict treatment response, and ensure responsible implementation in the clinic.

  • May 17, 2024 | biorxiv.org | Kevin Wu |Howard Chang |James Zou

    AbstractLanguage models have enabled a new era of biological sequence modeling. However, extracting meaningful sequence-level embeddings from these models remains challenging. In this work, we introduce ProteinCLIP, which applies contrastive learning between a protein's amino acid sequence and curated text describing its function. ProteinCLIP thus learns to take a pre-trained protein language model's sequence embedding and refines it produce a function-centric embedding.

Contact details

Socials & Sites

Try JournoFinder For Free

Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.

Start Your 7-Day Free Trial →