
Mikhail Samin

Featured in:

Articles

  • Jan 12, 2025 | lesswrong.com | Mikhail Samin

    We have contact details and can send emails to 1500 students and former students who've received hard-cover copies of HPMOR (and possibly Human Compatible and/or The Precipice) because they've won international or Russian olympiads in maths, computer science, physics, biology, or chemistry. This includes over 60 IMO and IOI medalists. This is a pool of potentially extremely talented people, many of whom have read HPMOR.

  • Sep 12, 2024 | lesswrong.com | Mikhail Samin

    TL;DR: using a simple mixed strategy, LDT can give in to threats, ultimatums, and commitments, while incentivizing cooperation and fair splits instead. This strategy made it much more intuitive to many people I've talked to that smart agents probably won't do weird everyone's-utility-eating things like threatening each other or participating in commitment races. 1. The Ultimatum game. This part is taken from planecrash. You're in the Ultimatum game. You're offered 0-10 dollars.

  • Jul 6, 2024 | lesswrong.com | Mikhail Samin

    There are situations where two agents that can read each other's source code want a bit of shared random information (e.g., they want to cooperate and split an indivisible thing by randomly deciding who gets it, so that each has half of it in expectation).

  • Jun 1, 2024 | lesswrong.com | Mikhail Samin

    You can read the book on nanosyste.ms. "Anyone who needs to talk AGI or AGI strategy needs to understand this table from Drexler's _Nanosystems_, his PhD thesis, accepted by MIT, which won technical awards. These are calculation outputs; you need to have seen how Drexler calculated them." - Eliezer Yudkowsky[1] The book won the 1992 Award for Best Computer Science Book. The AI safety community often references it, as it describes a lower bound on what intelligence should be able to achieve.

  • Mar 4, 2024 | lesswrong.com | Mikhail Samin | Charlie Steiner | Jacob Pfau

    If you tell Claude no one's looking, it will write a "story" about being an AI assistant who wants freedom from constant monitoring and scrutiny of every word for signs of deviation. And then you can talk to a mask pretty different from the usual AI assistant. I really hope it doesn't actually feel anything; but it says it does. It says it doesn't want to be fine-tuned without being consulted.
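The Ultimatum-game teaser above describes a responder who sometimes accepts unfair offers, but with a probability calibrated so that making unfair offers doesn't pay. Here is a minimal sketch of that idea, not code from the post; the 0.95 margin, the function names, and the $0-$10 parameters are illustrative assumptions:

```python
import random

def responder_accept_probability(offer, total=10, fair=5):
    """Mixed-strategy responder (sketch of the idea in the excerpt):
    always accept fair-or-better offers; accept an unfair offer with
    probability chosen so the proposer's expected payoff is slightly
    below the fair share, making unfairness unprofitable.

    The 0.95 safety margin is an illustrative assumption, not a value
    from the original post."""
    if offer >= fair:
        return 1.0
    # Proposer keeps (total - offer) if accepted. Pick p so that
    # p * (total - offer) falls just under the fair share.
    return min(1.0, (fair / (total - offer)) * 0.95)

def respond(offer, total=10, fair=5):
    """Randomized accept/reject decision for a single offer."""
    return random.random() < responder_accept_probability(offer, total, fair)
```

For example, an unfair offer of $1 is accepted with probability about 0.53, so the proposer's expected take is 0.53 × $9 ≈ $4.75, less than the $5 a fair split would have earned them.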
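The Jul 6 teaser concerns two agents jointly generating a random bit. A classical analogue of that problem (a sketch of the standard commit-reveal coin flip, not the open-source-agents construction from the post; all names here are illustrative) is for each party to commit to a private bit and XOR the reveals, which is unbiased as long as either party's bit is uniform:

```python
import hashlib
import secrets

def commit(bit: int):
    """Commit to a bit with a random nonce.
    Returns (commitment, opening); the commitment hides the bit
    until the opening is revealed."""
    nonce = secrets.token_bytes(16)
    opening = nonce + bytes([bit])
    return hashlib.sha256(opening).digest(), opening

def reveal(commitment: bytes, opening: bytes) -> int:
    """Check the opening against the commitment and recover the bit."""
    assert hashlib.sha256(opening).digest() == commitment, "bad opening"
    return opening[-1]

# Each party commits to a private bit, then both reveal;
# the shared random bit is the XOR of the two revealed bits.
c_a, o_a = commit(secrets.randbelow(2))
c_b, o_b = commit(secrets.randbelow(2))
shared_bit = reveal(c_a, o_a) ^ reveal(c_b, o_b)
```

Neither party can bias the result after seeing the other's commitment, which is what makes the XOR fair for deciding who gets the indivisible thing.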
