
Stefano Ermon
Articles
-
Nov 15, 2024 |
flipboard.com | Eric Nguyen |Michael Poli |Matthew Durrant |Brian Kang |Dhruva Katrekar |Liam Bartie | +14 more
2 hours agoHere’s a straightforward list, in no certain order, of Mac apps that I’m currently using. Rocket for easy emoji access• Rocket Typist for expanding text snippets• Just Press Record for accessing recordings from iPhone on the Mac• Audio Hijack for recording podcast appearances and recording live …
-
May 29, 2023 |
arxiv.org | Archit Sharma |Eric Mitchell |Stefano Ermon
[Submitted on 29 May 2023 ( v1 ), last revised 13 Dec 2023 (this version, v2)] Title:Direct Preference Optimization: Your Language Model is Secretly a Reward Model Download a PDF of the paper titled Direct Preference Optimization: Your Language Model is Secretly a Reward Model, by Rafael Rafailov and 5 other authors Download PDF HTML (experimental) Abstract:While large-scale unsupervised language models (LMs) learn broad world knowledge and some reasoning skills, achieving precise control of...
-
May 29, 2023 |
arxiv.org | Archit Sharma |Eric Mitchell |Stefano Ermon
[Submitted on 29 May 2023 ( v1 ), last revised 13 Dec 2023 (this version, v2)] Title:Direct Preference Optimization: Your Language Model is Secretly a Reward Model Download a PDF of the paper titled Direct Preference Optimization: Your Language Model is Secretly a Reward Model, by Rafael Rafailov and 5 other authors Download PDF HTML (experimental) Abstract:While large-scale unsupervised language models (LMs) learn broad world knowledge and some reasoning skills, achieving precise control of...
Try JournoFinder For Free
Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.
Start Your 7-Day Free Trial →