OpenAI’s new “Deep Research” blows ChatGPT o3-mini and DeepSeek out of the water with 26.6% accuracy in the world’s hardest “AI exam” — but it skipped the line
Deep Research holds a significant lead ahead of ChatGPT o3-mini and DeepSeek’s R1 V3-powered model in the world’s hardest AI exam, with a 26.6% accuracy score.