
On X, regular AI experimenter Ethan Mollick composed, “Been playing with o1 and o1-pro for bit. They are very good & a little weird. They are also not for most people most of the time. You really need to have particular hard problems to solve in order to get value out of it. But if you have those problems, this is a very big deal.”
OpenAI declares enhanced dependability
OpenAI is promoting professional mode’s enhanced dependability, which is assessed internally based upon whether it can fix a concern properly in 4 out of 4 efforts instead of simply a single effort.
“In evaluations from external expert testers, o1 pro mode produces more reliably accurate and comprehensive responses, especially in areas like data science, programming, and case law analysis,” OpenAI composes.
Even without professional mode, OpenAI pointed out considerable boosts in efficiency over the o1 sneak peek design on popular mathematics and coding standards (AIME 2024 and Codeforces), and more limited enhancements on a “PhD-level science” standard (GPQA Diamond). The boost in ratings in between o1 and o1 professional mode were far more minimal on these criteria.
We’ll likely have more protection of the complete variation of o1 once it presents extensively– and it’s expected to release today, available to ChatGPT Plus and Team users internationally. Business and Edu users will have gain access to next week. At the minute, the ChatGPT Pro membership is not yet readily available on our test account.
Find out more
As an Amazon Associate I earn from qualifying purchases.