
Steven Byrnes
Articles
-
Jan 16, 2025 |
ivoox.com | Steven Byrnes
Descripción de “Gaming TruthfulQA: Simple Heuristics Exposed Dataset Weaknesses” by TurnTrout This is a link post.(Explanation. Also I have no reason to think they hate me.)https://turntrout.com/original-truthfulqa-weaknesses --- First published: January 16th, 2025 Source: https://www.lesswrong.com/posts/57k6xNcWtAtsSTcor/gaming-truthfulqa-simple-heuristics-exposed-dataset --- Narrated by TYPE III AUDIO.
-
Jan 15, 2025 |
ivoox.com | Steven Byrnes
TLDR: There is a potential issue with the multiple-choice versions of our TruthfulQA benchmark (a test of truthfulness in LLMs), which could lead to inflated model scores. This issue was analyzed by a helpful post by Alex Turner (@TurnTrout). We created a new multiple-choice version of TruthfulQA that fixes the issue. We compare models on the old and new versions and find very similar performance.
-
Jan 15, 2025 |
ivoox.com | Steven Byrnes
Descripción de “What Is The Alignment Problem?” by johnswentworth So we want to align future AGIs. Ultimately we’d like to align them to human values, but in the shorter term we might start with other targets, like e.g. corrigibility. That problem description all makes sense on a hand-wavy intuitive level, but once we get concrete and dig into technical details… wait, what exactly is the goal again? When we say we want to “align AGI”, what does that mean?
-
Jan 15, 2025 |
ivoox.com | Steven Byrnes
Table of Contents Man With a Plan. Oh the Pain. Actual Proposals. For AI Builders. Think of the Children. Content Identification. Infrastructure Week. Paying Attention. Man With a Plan The primary Man With a Plan this week for government-guided AI prosperity was UK Prime Minister Keir Starmer, with a plan coming primarily from Matt Clifford. I’ll be covering that soon. Today I will be covering the other Man With a Plan, Sam Altman, as OpenAI offers its Economic Blueprint.
-
Jan 15, 2025 |
ivoox.com | Steven Byrnes
Descripción de “Lecture Series on Tiling Agents” by abramdemski For my AISC, I'll[1] be presenting more details about the research every Thursday for approximately the next three months. If you are interested in listening in, here is a calendar link. ^Maybe there will be guest speakers at some point, EG, the AISC mentees. The original text contained 1 footnote which was omitted from this narration.
Try JournoFinder For Free
Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.
Start Your 7-Day Free Trial →