Musk: Massive Gap Between Grok V9 and V8; Latest V9 Training Run Already Shows Superior Performance
Elon Musk posted on X that the latest completed training run of Grok V9 has "performed very well," and that this result does not yet include the supplementary training on Cursor data. V9, the base model currently in internal development, has approximately 1.5 trillion parameters. Compared with V8, it brings significant improvements in data cleaning, training methods, and model scale, and it has been optimized for the Blackwell architecture to improve computational efficiency.
Musk emphasized that, by contrast, the current public-facing version, v4.2, is built on the V8 base model with approximately 0.5 trillion parameters and running on the Hopper architecture, and that it still has limitations in training data quality and coverage. According to Musk, the performance gap between Grok V8 and V9 is massive, with the new-generation model delivering a major leap in overall capabilities.
