Musk has also apparently used the Grok chatbots as an automated extension of his trolling habits, showing examples of Grok 3 producing “based” opinions that criticized the media in February. In May, Grok on X began repeatedly generating outputs about white genocide in South Africa, and most recently, we have seen the Grok Nazi output debacle. It is admittedly difficult to take Grok seriously as a technical product when it is linked to so many examples of unserious and capricious applications of the technology.
Still, the technical achievements xAI claims for various Grok 4 models seem to stand out. The Arc Prize organization reported that Grok 4 Thinking (with simulated reasoning enabled) achieved a score of 15.9 percent on its ARC-AGI-2 test, which the organization says nearly doubles the previous commercial best and tops the current Kaggle competition leader.
“With respect to academic questions, Grok 4 is better than PhD level in every subject, no exceptions,” Musk claimed during the livestream. We have previously covered nebulous claims about “PhD-level” AI, finding them to be generally specious marketing talk.
Premium pricing amid controversy
During Wednesday’s livestream, xAI also announced plans for an AI coding model in August, a multi-modal agent in September, and a video generation model in October. The company also plans to make Grok 4 available in Tesla vehicles next week, further expanding Musk’s AI assistant across his various companies.
Despite the recent turmoil, xAI has moved forward with an aggressive pricing strategy for “premium” versions of Grok. Alongside Grok 4 and Grok 4 Heavy, xAI launched “SuperGrok Heavy,” a $300-per-month subscription that makes it the most expensive AI service among major providers. Subscribers will get early access to Grok 4 Heavy and upcoming features.
Whether users will pay xAI’s premium pricing remains to be seen, particularly given the AI assistant’s tendency to periodically generate politically motivated outputs. These incidents represent fundamental management and implementation issues that, so far, no fancy-looking test-taking benchmarks have been able to capture.

