Nvidia's closest rival once again obliterates cloud giants in AI performance; Cerebras Inference is 75x faster than AWS, 32x faster than Google on Llama 3.1 405B

Cerebras hits 969 tokens/second on Llama 3.1 405B, 75x faster than AWS. Claims industry-low 240ms latency, twice as fast as Google Vertex. Cerebras Inference runs on the CS-3 with the WSE-3 AI…


Google denies it offered hundreds of millions of dollars support to continue Microsoft cloud probes

Google Cloud offered CISPE at least €114 million in financial incentives. Despite perfect timing, Google says it's unrelated to the Microsoft complaint. Microsoft is accusing Google of fabricating the Open Cloud Coalition…
