July 27th, 2024

Ex-Twitter dev reminisces about finding 700 unused Nvidia GPUs after takeover

Tim Zaman, a former Twitter engineer, revealed that 700 Nvidia V100 GPUs were sitting idle in Twitter's data center after Elon Musk's acquisition, highlighting inefficiencies in resource management amid rising AI demands.

Tim Zaman, a former Twitter engineer now at Google DeepMind, recently shared his experience of discovering 700 unused Nvidia V100 GPUs in Twitter's data center shortly after Elon Musk's acquisition in 2022. The GPUs were powered on but idle, and Zaman described them as "the forgotten remains of an honest attempt to make a cluster within Twitter 1.0." The revelation points to a significant waste of computing resources, as these powerful GPUs had sat without purpose for years. Zaman's post was inspired by news of xAI's Memphis Supercluster, which uses 100,000 Nvidia H100 accelerators for AI training. He noted the irony of Twitter having so much GPU power sitting idle while tech companies race to build ever-larger AI training clusters. Zaman also pointed out that the GPUs were the PCIe variety, which has lower bandwidth than the NVLink-interfaced SXM2 form factor, raising questions about Twitter's procurement decisions. He reflected on the challenges of managing large-scale GPU operations, emphasizing the importance of graceful failure management in such setups. The discovery serves as a reminder of potential inefficiencies in tech companies' resource management, especially given the growing demand for AI capabilities.

Related

Intel's Gaudi 3 will cost half the price of Nvidia's H100

Intel's Gaudi 3 AI processor is priced at $15,650, half of Nvidia's H100. Intel aims to compete in the AI market dominated by Nvidia, facing challenges from cloud providers' custom AI processors.

Sohu AI chip claimed to run models 20x faster and cheaper than Nvidia H100 GPUs

The startup Etched introduces the Sohu AI chip, which is specialized for transformer models and is claimed to outperform Nvidia's H100 GPUs in LLM inference. Sohu aims to revolutionize AI processing efficiency, potentially reshaping the industry.

AI's $600B Question

The AI industry's revenue growth and market dynamics reveal a significant gap between expectations and actual growth. Nvidia's milestone as the most valuable company prompts a reevaluation, highlighting a $600 billion challenge. Various factors like GPU supply, revenue distribution, and new technologies impact the industry's financial landscape. Speculative investment and pricing dynamics raise concerns, emphasizing a cautious approach to AI industry complexities.

Can the climate survive the insatiable energy demands of the AI arms race?

Google's emissions spike 50% in 5 years due to AI energy needs, posing climate challenges. Tech firms invest in renewables, but face infrastructure hurdles. AI advancements may paradoxically drive energy consumption.

XAI's Memphis Supercluster has gone live, with up to 100,000 Nvidia H100 GPUs

Elon Musk launches xAI's Memphis Supercluster with 100,000 Nvidia H100 GPUs for AI training, aiming for advancements by December. Its online status is unclear; SemiAnalysis estimates that 32,000 GPUs are operational. Plans for a 150MW data center expansion are pending utility agreements. xAI is partnering with Dell and Supermicro and targeting full operation by fall 2025. Musk's humorous choice of launch time was also noted.
