
Rana Dexsin
Articles
-
Sep 16, 2024 |
lesswrong.com | Nathan Helm-Burger |Shankar Sivarajan |Rana Dexsin
Deceptive alignment. GPT-4o1 engaged in deception towards developers in order to get deployed, pretending to be aligned in ways it was not. Lying to the developers. It strategically manipulated task data. To be clear, it did not do anything of the sort to its actual developers/testers. What it did was deceive some (non-interactive) roleplay characters, who were labeled "developers" in the roleplay scenario.
-
Apr 23, 2024 |
lesswrong.com | Rana Dexsin
Manifold Markets has announced that they intend to add cash prizes to their current play-money model, with a raft of attendant changes to mana management and conversion. I first became aware of this via a comment on ACX Open Thread 326; the linked Notion document appears to be the official one.
-
Mar 14, 2024 |
lesswrong.com | Ben Smith |Rana Dexsin
Simeon: If your moat was having good ideas: RIP. The human (remaining) moat will be in execution. Daniel Losey: Claude 3 as a research assistant?
-
Dec 4, 2023 |
lesswrong.com | Rana Dexsin |Zach Stein-Perlman |Lucius Bushnaq
In May and June of 2023, I (Akash) had about 50-70 meetings about AI risks with congressional staffers. I had been meaning to write a post reflecting on the experience and some of my takeaways, and I figured it could be a good topic for a LessWrong dialogue. I saw that hath had offered to do LW dialogues with folks, and I reached out.
Try JournoFinder For Free
Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.
Start Your 7-Day Free Trial →