The billionaire businessman has boasted that the new language model for generative AI will be a "big deal on several fronts".
Speaking at the firm's third quarterly earnings call, he said: “I expect that the smaller Llama 4 models will be ready first, and we expect [them] sometime early next year, and I think that they’re going to be a big deal on several fronts — new modalities, capabilities, stronger reasoning, and much faster."
The social media and tech mogul also revealed that they are training it on a GPU cluster "bigger than anything" his rivals are working on, to his knowledge.
He said: "We’re training the Llama 4 models on a cluster that is bigger than 100k H100s or bigger than anything that I’ve seen reported for what others are doing."
However, Elon Musk had already said his AI startup, xAI, has worked with Nvidia on a cluster of the same amount.
Clusters are used to train and run complex AI models, such as those used in machine learning, deep learning, natural language processing (NLP), and computer vision.