
Gerald M. Monroe
Articles
-
Apr 4, 2024 |
lesswrong.com | Gerald M. Monroe | Victor Ashioya | M. Y. Zuo | Nora Belrose
Summary: the moderators appear to be soft-banning users with 'rate limits' without feedback. A careful review of each banned user reveals it's common to be banned despite earnestly attempting to contribute to the site. Some of the most intelligent banned users hold mainstream rather than EA views on AI. Note how the punishment lengths are all the same; I think it was a mass ban-wave of 3-week bans. Gears to Ascension was here but is no longer; I guess she convinced them it was a mistake.
-
Mar 13, 2024 |
lesswrong.com | Gerald M. Monroe | Nathan Helm-Burger | Akram Akhtar Choudhary
State monopoly: The Song Dynasty (960-1279) established a state monopoly over saltpeter production, a critical ingredient in gunpowder. The government appointed officials to oversee the collection and refinement of saltpeter. During the Ming Dynasty (1368-1644), the government further tightened its control over saltpeter production, with the "Saltpeter Censorate" responsible for managing the state monopoly. Limiting knowledge: Chinese officials kept the recipe for gunpowder a closely guarded secret.
-
Mar 5, 2024 |
lesswrong.com | Gerald M. Monroe | Daniel Kokotajlo | Donald Hobson
There’s a basic high-level story that worries a lot of people. The story goes like this: as AIs become more capable, the default outcome of AI training is the development of a system which, unbeknownst to us, is using its advanced capabilities to scheme against us. This process likely concludes in AI takeover, and thence our death. We are not currently dead. So any argument for our death by way of AI must offer us a causal story.
-
Feb 28, 2024 |
lesswrong.com | Neel Nanda | Matt Goldenberg | Ben Pace | Gerald M. Monroe
(Also announcing: annual review prediction markets & full-height table of contents. If you're looking for this year's review results, you can find them here.) The top 50 posts of each of LessWrong’s annual reviews have a new home: The LeastWrong. What will I see when I click that link? You will find the posts organized into six “books”: Rationality, Optimization, World, Practical, AI Strategy, and Technical AI Safety. Each square on the grid is a post that made the top 50 of the review in some year.
-
Feb 26, 2024 |
lesswrong.com | Paul Colognese | Gerald M. Monroe
Thanks to Johannes Treutlein for discussions and feedback. An AI may be able to hide cognition that leads to negative outcomes (such as deceptive alignment/scheming) from certain oversight processes. Without being able to detect this hidden cognition, an overseer may not be able to prevent the associated negative outcomes or include this information as part of the training signal.