Articles

  • Oct 29, 2024 | nature.com | Bin Shao |Jiawei Yan

    AbstractInspired by the success of large language models (LLMs), we develop a long-context generative model for genomes. Our multiscale transformer model, megaDNA, is pre-trained on unannotated bacteriophage genomes with nucleotide-level tokenization. We demonstrate the foundational capabilities of our model including the prediction of essential genes, genetic variant effects, regulatory element activity and taxonomy of unannotated sequences.

  • Oct 1, 2024 | biorxiv.org | Bin Shao

    AbstractWe introduce PlasmidGPT, a generative language model pretrained on 153k engineered plasmid sequences from Addgene. PlasmidGPT generates de novo sequences that share similar characteristics with engineered plasmids but show low sequence identity to the training data. We demonstrate its ability to generate plasmids in a controlled manner based on the input sequence or specific design constraint.

  • May 24, 2024 | onlinelibrary.wiley.com | Bin Shao |Jiabin Liu |Heng C. Su |Yingying Zong

    7075 aluminum alloy is widely used in the aerospace field because of its low density, high specific strength, high fracture toughness, and good machinability. This paper systematically studied the evolution of the microstructure and properties of 7075 aluminum alloy at different temperatures. Based on the rolling and heat treatment process, a short process of rolling and heat treatment is proposed, which can significantly improve the properties.

  • Mar 29, 2024 | link.aps.org | Haiping Li |Jian Zou |Bin Shao

    We consider a collision model representation of nonequilibrium dynamics for an externally driven open quantum system. Specifically we investigate the nondemolition quasiprobability distributions (QPDFs) of work and heat in both Markovian and non-Markovian regimes. In this model, work and heat correspond to different evolution processes, and their contributions can be distinguished.

  • Mar 4, 2024 | nature.com | Bin Shao

    AbstractTranslation elongation is essential for maintaining cellular proteostasis, and alterations in the translational landscape are associated with a range of diseases. Ribosome profiling allows detailed measurements of translation at the genome scale. However, it remains unclear how to disentangle biological variations from technical artifacts in these data and identify sequence determinants of translation dysregulation.

Contact details

Socials & Sites

Try JournoFinder For Free

Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.

Start Your 7-Day Free Trial →