
Dongyu Ru
Articles
-
Oct 29, 2024 |
amazon.science | Xiangkun Hu | Dongyu Ru | Tianhang Zhang | Zheng Zhang
Large Language Models (LLMs) have shown impressive capabilities but also a concerning tendency to hallucinate. This paper presents REFCHECKER, a framework that introduces claim-triplets to represent claims in LLM responses, aiming to detect fine-grained hallucinations. In REFCHECKER, an extractor generates claim-triplets from a response, which are then evaluated by a checker against a reference.
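Conceptually, the pipeline has two stages. Below is a minimal Python sketch of the extract-then-check idea; the function names, the toy extractor, and the string-matching checker are illustrative placeholders, not the actual RefChecker API.

```python
# Minimal sketch of an extract-then-check hallucination pipeline in the
# spirit of REFCHECKER. The extractor and checker below are hypothetical
# stand-ins: in the real framework both steps are performed by LLMs.
from dataclasses import dataclass

@dataclass
class ClaimTriplet:
    subject: str
    predicate: str
    object: str

def extract_triplets(response: str) -> list[ClaimTriplet]:
    """Hypothetical extractor: decomposes a response into claim-triplets.
    A real extractor would prompt an LLM to do this."""
    return [ClaimTriplet("Paris", "is the capital of", "France")]

def check_claim(triplet: ClaimTriplet, reference: str) -> str:
    """Hypothetical checker: labels a triplet against the reference.
    String matching here is a toy stand-in for an LLM/NLI judgment."""
    surface = f"{triplet.subject} {triplet.predicate} {triplet.object}"
    return "Entailment" if surface in reference else "Neutral"

response = "Paris is the capital of France."
reference = "Paris is the capital of France."
for t in extract_triplets(response):
    print(t, "->", check_claim(t, reference))
```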
-
Mar 27, 2024 |
amazon.science | Shuai Zhang | Huijun Yu | Xiangkun Hu | Dongyu Ru
We introduce POMP, a prompt pre-training method that, for the first time, enables prompt learning on large-scale datasets such as ImageNet-21K, with over twenty thousand classes. POMP is memory- and computation-efficient: compared with previous methods such as CoOp, it achieves comparable accuracy on ImageNet-1K with only 19% of the GPU memory and 50% of the training time. POMP sets new states of the art on a variety of open-vocabulary visual recognition datasets and tasks.
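To make "prompt learning" concrete, here is a minimal CoOp-style sketch: a shared learnable context is combined with class embeddings and trained contrastively against image features. The per-step class subsampling shown is an assumption about how a method like POMP could keep 20K+ classes in memory; all names, sizes, and the pooled-context shortcut are illustrative, not the paper's implementation.

```python
import torch
import torch.nn.functional as F

# Illustrative CoOp-style prompt learning with a sampled class subset.
# Assumption: scoring against a per-step subset of classes (rather than
# all 21K) is what keeps memory and compute low at this scale.
num_classes, dim, n_ctx, subset = 21000, 512, 16, 1000

ctx = torch.randn(n_ctx, dim, requires_grad=True)       # learnable context
class_emb = torch.randn(num_classes, dim)               # frozen class-name embeddings
image_feat = F.normalize(torch.randn(8, dim), dim=-1)   # batch of image features
labels = torch.randint(0, num_classes, (8,))

# Sample a candidate class subset that always contains the true labels.
cand = torch.unique(torch.cat([labels, torch.randint(0, num_classes, (subset,))]))
local_labels = (cand.unsqueeze(0) == labels.unsqueeze(1)).float().argmax(dim=1)

# Crude stand-in for a text encoder: pool the context into each class
# embedding (a real method would prepend context tokens and re-encode).
text_feat = F.normalize(ctx.mean(0, keepdim=True) + class_emb[cand], dim=-1)

logits = image_feat @ text_feat.t() / 0.07              # temperature-scaled similarity
loss = F.cross_entropy(logits, local_labels)
loss.backward()                                         # gradients flow only into ctx
print(loss.item())
```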
-
Jan 25, 2024 |
amazon.science | Michael Kearns | Aaron Thomas Roth | Xiangkun Hu | Dongyu Ru
Project Description
This is a test framework for the bias bounties project.

Getting Started as a Bounty Hunter
If you are interacting with this codebase as a "bounty hunter", you'll need a way to run Jupyter notebooks. The easiest way to do this is to download Anaconda, which will also manage all of your Python packages for you. See here for installation instructions: https://docs.anaconda.com/anaconda/install/index.html.
-
Jan 23, 2024 |
amazon.science | Yu Liu | Huijun Yu | Xiangkun Hu | Dongyu Ru
This package contains the hierarchical Bayesian model used to predict sample size for online activity. It builds on the bang package (https://cran.rstudio.com/web/packages/bang/index.html), which was accelerated by modifying it to use sufficient statistics and to simulate only from the posterior over the hyper-parameters. "misc.R", "beta_prior.R", "binom_beta.R", "hef.R" and "set_and_check_prior.R" are source files from the bang package.
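The acceleration hinges on a standard fact about beta-binomial hierarchies: the likelihood depends on each group only through its success/trial counts, so the group-level probabilities can be integrated out analytically and the sampler only needs to target the hyper-parameters. The Python sketch below illustrates that idea; it is not the bang package's code, and the toy data, weak prior, and Metropolis sampler are assumptions.

```python
import numpy as np
from scipy.special import betaln

# Beta-binomial hierarchy: p_j ~ Beta(alpha, beta), y_j ~ Binom(n_j, p_j).
# The sufficient statistics per group are (y_j, n_j); integrating out p_j
# gives a marginal likelihood in (alpha, beta) alone, so the sampler
# never touches the latent p_j. Toy data and sampler are illustrative.
y = np.array([12, 30, 7, 45])    # successes per group
n = np.array([50, 80, 20, 100])  # trials per group

def log_marginal(alpha, beta):
    """log p(y | alpha, beta) with p_j integrated out; binomial
    coefficients omitted since they are constant in (alpha, beta)."""
    return np.sum(betaln(alpha + y, beta + n - y) - betaln(alpha, beta))

def log_post(theta):
    a, b = np.exp(theta)                  # theta = (log alpha, log beta)
    return log_marginal(a, b) - 0.001 * (a + b)  # weak prior keeps it proper

rng = np.random.default_rng(0)
theta, lp, samples = np.zeros(2), log_post(np.zeros(2)), []
for _ in range(5000):                     # random-walk Metropolis on hyper-params
    prop = theta + 0.3 * rng.standard_normal(2)
    lp_prop = log_post(prop)
    if np.log(rng.random()) < lp_prop - lp:
        theta, lp = prop, lp_prop
    samples.append(np.exp(theta))

alpha, beta = np.mean(samples[1000:], axis=0)
# Group-level draws, if needed, follow conditionally:
# p_j | y_j, alpha, beta ~ Beta(alpha + y_j, beta + n_j - y_j)
print(f"posterior mean alpha={alpha:.2f}, beta={beta:.2f}")
```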
-
Jan 17, 2024 |
amazon.science | Xiangkun Hu | Dongyu Ru | Tamer H.M. Soliman
For all their remarkable abilities, large language models (LLMs) have an Achilles heel: their tendency to hallucinate, or make assertions that sound plausible but are factually inaccurate. Sometimes these hallucinations can be quite subtle: an LLM might, for instance, make an assertion that's mostly accurate but gets a date wrong by just a year or two.