Articles

  • 1 month ago | interconnects.ai | Nathan Lambert |Ethan Mollick

    As GPT-4.5 was being released, the first material the public got access to was OpenAI’s system card for the model that details some capability evaluations and mostly safety estimates. Before the live stream and official blog post, we knew things were going to be weird because of this line:GPT-4.5 is not a frontier model. The updated system card in the launch blog post does not have this.

  • Oct 16, 2024 | interconnects.ai | Nathan Lambert

    Other than the pace of progress on individual evaluations being extremely high, the landscape of evaluating leading language models has not changed substantially in the last year. The biggest change is the required level of detail in reporting results, where more information must be communicated to contextualize the capabilities of your models. The new types of models that OpenAI's o1 heralds welcome a new axis to this, evaluation-time compute.

  • Sep 9, 2024 | interconnects.ai | Nathan Lambert

    The central point of AI regulation and policy over the last few years, everything from the Biden Executive Order to California’s SB 1047 bill, has been model size. The most common tool for AI enforcement proportional to model size has been thresholds that kick in once an AI system uses more than a certain amount of compute (or money) to be trained. The use of thresholds for regulation is the subject of substantial pushback and is likely fading in relevance due to it.

  • Aug 16, 2024 | interconnects.ai | Nathan Lambert

    Edit 1, 16 August: I made a few errors in this original post, which makes the Nous Research team, and by extension the EluetherAI evaluation harness, look bad. This isn’t my intention. As a summary:The Nous report is clear on their evaluation recipe. The open question of how best to evaluate Llama 3.1 models still holds. I’ve changed the title of the post. I’ve toned down the text in a few places to better reflect this. Sorry for the error.

  • Jun 14, 2024 | interconnects.ai | Nathan Lambert

    By being quiet in the race for the biggest and most rad foundation models, many people assumed that Apple didn’t have anything to contribute to the AI race. After their new announcements, many in the heart of the AI labs still will argue this, forecasting that . Apple is betting that AI follows paths paved by previous technological revolutions: incremental progress to transformational results.

interconnects.ai journalists

Contact details

No sites or socials found.

Try JournoFinder For Free

Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.

Start Your 7-Day Free Trial →