A mysterious AI chatbot labelled ‘gpt2-chatbot’ was briefly obtainable on-line earlier than subsequently disappearing once more. The chatbot quietly made its debut on the web site LMSYS Chatbot Area — a web site which is used to benchmark, evaluate, and rank completely different AI techniques.
Based mostly on its title, some are speculating that the instrument could be an earlier model of OpenAI‘s chatbot language mannequin, GPT-2. However customers have famous that the language mannequin appears equally as highly effective — or extra highly effective than — GPT-4, OpenAI’s more moderen and superior language mannequin.
In reality, some netizens discovered that the language mannequin carried out higher than GPT-4 on sure checks. This has led to hypothesis that the “gpt2-chatbot” might be an early prototype of GPT-5, or maybe a extra up to date, superior model of GPT-4 which, for all intents and functions, could be thought of GPT-4.5.
Thanks for the unimaginable enthusiasm from our group! We actually did not see this coming.
Simply a few issues to clear up:
– Consistent with our coverage, we have labored with a number of mannequin builders previously to supply group entry to unreleased fashions/checkpoints (e.g.,…
— lmsys.org (@lmsysorg) April 30, 2024
However customers who managed to check the mannequin earlier than it was taken offline famous that there was surprisingly little details about what the language mannequin was and the place it got here from. Nonetheless, it wasn’t lengthy till the language mannequin was taken again offline, with LMSYS saying in a tweet: “Consistent with our coverage, we’ve labored with a number of mannequin builders previously to supply group entry to unreleased fashions/checkpoints (e.g., mistral-next, gpt2-chatbot) for preview testing.”
The web site then went on so as to add that it needed to “briefly” take down the gpt2-chatbot on account of “excessive visitors and capability restrict.”
Hypothesis grows over ‘gpt2-chatbot’
Due to a subsequent tweet by OpenAI CEO Sam Altman, it looks as if the language mannequin is extra more likely to be one thing new relatively than an earlier mannequin of GPT-2.
Altman wrote: “I do have a delicate spot for GPT-2,” earlier than later enhancing the tweet in order that it appeared as “gpt-2.” And including additional gasoline to the hearth, OpenAI employees member Steven Heidel wrote a tweet saying: “when gpt-2.”
Based mostly on these responses, it appears extra doubtless than not that, as hinted by LMSYS, that is an unreleased mannequin of some variety.
Featured Picture: Franz Bachinger from Pixabay