Articles
-
2 weeks ago |
thezvi.substack.com | Zvi Mowshowitz
GPT-4o tells you what it thinks you want to hear. The results of this were rather ugly. You get extreme sycophancy. Absurd praise. Mystical experiences. (Also some other interesting choices, like having no NSFW filter, but that one’s good.)People like Janus and Near Cyan tried to warn us, even more than usual. Then OpenAI combined this with full memory, and updated GPT-4o sufficiently that many people (although not I) tried using it in the first place.
-
3 weeks ago |
thezvi.substack.com | Zvi Mowshowitz
I love o3. I’m using it for most of my queries now. But that damn model is a lying liar. Who lies. This post covers that fact, and some related questions. The biggest thing to love about o3 is it just does things. You don’t need complex or multi-step prompting, ask and it will attempt to do things. Ethan Mollick: o3 is far more agentic than people realize. Worth playing with a lot more than a typical new model. You can get remarkably complex work out of a single prompt. It just does things.
-
2 months ago |
thezvi.substack.com | Zvi Mowshowitz
It’s happening. The question is, what is the it that is happening? An impressive progression of intelligence? An expensive, slow disappointment? Something else? The evals we have available don’t help us that much here, even more than usual. My tentative conclusion is it’s Secret Third Thing. It’s a different form factor, with unique advantages, that is hard to describe precisely in words.
-
2 months ago |
thezvi.substack.com | Zvi Mowshowitz
One hell of a paper dropped this week. It turns out that if you fine-tune models, especially GPT-4o and Qwen2.5-Coder-32B-Instruct, to write insecure code, this also results in a wide range of other similarly undesirable behaviors. They more or less grow a mustache and become their evil twin. More precisely, they become antinormative. They do what seems superficially worst. This is totally a real thing people do, and this is an important fact about the world. The misalignment here is not subtle.
-
2 months ago |
thezvi.substack.com | Zvi Mowshowitz
Anthropic has reemerged from stealth and offers us Claude 3.7.Given this is named Claude 3.7, an excellent choice, from now on this blog will refer to what they officially call Claude Sonnet 3.5 (new) as Sonnet 3.6. Claude 3.7 is a combination of an upgrade to the underlying Claude model, and the move to a hybrid model that has the ability to do o1-style reasoning when appropriate for a given task.
thezvi.substack.com journalists
Contact details
No sites or socials found.
Try JournoFinder For Free
Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.
Start Your 7-Day Free Trial →