Sohu AI chip claimed to run models 20x faster and cheaper than Nvidia H100 GPUs
Startup Etched has introduced Sohu, an AI chip specialized for transformer models that the company claims outperforms Nvidia's H100 GPUs at LLM inference. Etched aims to make AI processing more efficient, which could reshape the industry.
Etched, a startup, has developed the Sohu AI chip, an ASIC designed for LLM inference that it claims outperforms Nvidia's H100 GPUs. Sohu is specialized for transformer models, so it can allocate more of its transistors to AI compute than general-purpose AI chips like the H100. According to Etched, this specialization lets Sohu run models 20 times faster and cheaper than the H100, which could challenge Nvidia's dominance in the AI space. By focusing on the transformer architecture popularized by models like ChatGPT, Etched aims to provide more efficient AI processing and reduce power consumption in AI data centers. Concerns over the power consumption of AI infrastructure persist, and hardware optimized for specific AI tasks could offer a more sustainable approach. If Sohu succeeds, the industry could shift towards specialized AI chips tailored to specific model architectures, reshaping the landscape of AI computing.
Related
TSMC experimenting with rectangular wafers vs. round for more chips per wafer
TSMC is developing an advanced chip packaging method to address AI-driven demand for computing power. Intel and Samsung are also exploring similar approaches to boost semiconductor capabilities amid the AI boom.
Intel's Gaudi 3 will cost half the price of Nvidia's H100
Intel's Gaudi 3 AI processor is priced at $15,650, half of Nvidia's H100. Intel aims to compete in the AI market dominated by Nvidia, facing challenges from cloud providers' custom AI processors.
Finnish startup says it can speed up any CPU by 100x
A Finnish startup, Flow Computing, introduces the Parallel Processing Unit (PPU) chip promising 100x CPU performance boost for AI and autonomous vehicles. Despite skepticism, CEO Timo Valtonen is optimistic about partnerships and industry adoption.
Etched Is Making the Biggest Bet in AI
Etched is betting on Sohu, a chip specialized for transformers, which have overtaken earlier model types such as DLRMs and CNNs. Sohu is optimized for transformer models like ChatGPT, with the stated ambition of powering AI superintelligence.
Sohu: The First Transformer ASIC
Etched secures $120 million to develop Sohu, a transformer ASIC enhancing AI model performance. Sohu enables real-time voice agents, rapid text processing, and trillion-parameter models, revolutionizing AI processing with advanced features.