August 28th, 2024

OpenAI shows 'Strawberry' to feds, races to launch it

OpenAI is developing the "Strawberry" AI model to generate synthetic data for its Orion LLM, improving accuracy and reducing errors, with a potential ChatGPT integration by fall 2024.

Read original articleLink Icon
OpenAI shows 'Strawberry' to feds, races to launch it

OpenAI is racing to launch its new AI model, code-named "Strawberry," which has been demonstrated to U.S. national security officials. Strawberry is designed to generate high-quality synthetic data for OpenAI's upcoming flagship large language model (LLM), Orion. While Strawberry is more expensive and slower in inference time, it excels at solving complex problems accurately on the first attempt, reducing the likelihood of hallucinations—errors common in AI outputs. OpenAI is also working on a distilled version of Strawberry to integrate its capabilities into ChatGPT, potentially launching this fall. The model can tackle math and programming problems and is capable of answering subjective questions when given additional time to process. Strawberry's development is rooted in research initiated by Ilya Sutskever, who has since left OpenAI. The model aims to enhance the training data quality for Orion, addressing challenges in acquiring sufficient high-quality data from real-world sources. OpenAI's efforts reflect the competitive landscape in AI development, as they seek to improve the performance of their models while minimizing errors.

- OpenAI's new model, Strawberry, aims to generate synthetic data for the upcoming Orion LLM.

- Strawberry can solve complex problems accurately and reduce hallucinations in AI outputs.

- A distilled version of Strawberry may be integrated into ChatGPT by fall 2024.

- The model has been demonstrated to U.S. national security officials, highlighting its potential applications.

- Strawberry's development is part of OpenAI's strategy to enhance training data quality and model performance.

Link Icon 10 comments
By @nuz - 6 months
OpenAI are so good at fake leaking things to build hype
By @lm28469 - 6 months
> Unnamed source says "They showed it to feds"

I can't think of a more vague way to bring this up

By @thesz - 6 months

  > Its main purpose is to produce synthetic data for Orion, their next big LLM
https://arxiv.org/abs/2404.03502

"This is generally useful, but widespread reliance on recursive AI systems could lead to a process we define as "knowledge collapse", and argue this could harm innovation and the richness of human understanding and culture... In our default model, a 20% discount on AI-generated content generates public beliefs 2.3 times further from the truth than when there is no discount."

By @jexe - 6 months
Missed opportunity to name it Strawbery
By @skywhopper - 6 months
Lots of red flags here. Besides synthetic training data (huge red flag), this new model is going to somehow generate “correct” (red flag) training data. Also, this new model will be much more expensive to run (red flag), so they are distilling it (back down to lower quality — red flag) and need to figure out how to make it work in ChatGPT (not ready for release — red flag), and the true impact of it anyway is for the next generation (huge red flag) model.

We’re in a constant cycle of “sure the current stuff doesn’t live up to the hype but the next release that’s just on the horizon, that one will blow you away!”

This leak is clearly targeted at creating the buzz necessary to raise the money to keep the pipe dream flowing for the time being. But just thinking through the steps outlined here, it should be clear none of this makes sense. Any “correct” synthetic training data is either going to be badly biased, or be limited to such banal “logical” output that the model it trains will be entirely unable to process natural language input and you’ll need to specify your questions in something more precise. At which point, we’re back to traditional programming languages, only with several layers of unnecessary and expensive processing going on in between.

By @qwertox - 6 months
OpenAI has become the new vaporware announcer. No memory, no advanced voice, no SearchGPT.

GPT-4o is a dumbed-down version of GPT-4, GPT-4o mini is even more useless. Might be good for the API, but in ChatGPT?

GPT-4 is from 14. March 2023, since then nothing has improved.

By @seydor - 6 months
I m not holding my breath for it. It will be good at solving "complex" high-school problems but nothing more. OpenAI has a history of overhyped launches now. Trying to keep those investors to their toes.

If I 'm wrong, i dare openAI to release it early

By @franze - 6 months
totally an AI written article, could all be an hallucination after all.
By @hprotagonist - 6 months
Strawberry? With two “r”s?