"Grok 5 might end up being nearly perfect on the Humanity's Last Exam and probably point out errors in the question. Even Grok 4 which is primitive at this point got I think 52% excluding visual questions." 一 Elon Musk