Rana Dexsin

Articles

GPT-o1 - LessWrong

Sep 16, 2024 | lesswrong.com | Nathan Helm-Burger | Shankar Sivarajan | Rana Dexsin

Deceptive alignment. GPT-4o1 engaged in deception towards developers in order to get deployed, pretending to be aligned in ways it was not. Lying to the developers. It strategically manipulated task data. To be clear, it did not do anything of the sort to its actual developers/testers. What it did was deceive some (non-interactive) roleplay characters, who were labeled "developers" in the roleplay scenario.

Manifold “exploring real cash prizes” — LessWrong

Apr 23, 2024 | lesswrong.com | Rana Dexsin

Manifold Markets has announced that they intend to add cash prizes to their current play-money model, with a raft of attendant changes to mana management and conversion. I first became aware of this via a comment on ACX Open Thread 326; the linked Notion document appears to be the official one.

AI #55: Keep Clauding Along — LessWrong

Mar 14, 2024 | lesswrong.com | Ben Smith | Rana Dexsin

Simeon: If your moat was having good ideas: RIP. The human (remaining) moat will be in execution. Daniel Losey: Claude 3 as a research assistant?

Speaking to Congressional staffers about AI risk — LessWrong

Dec 4, 2023 | lesswrong.com | Rana Dexsin | Zach Stein-Perlman | Lucius Bushnaq

In May and June of 2023, I (Akash) had about 50-70 meetings about AI risks with congressional staffers. I had been meaning to write a post reflecting on the experience and some of my takeaways, and I figured it could be a good topic for a LessWrong dialogue. I saw that hath had offered to do LW dialogues with folks, and I reached out.