September 27th, 2024

How AlphaChip transformed computer chip design

AlphaChip, developed by Google DeepMind, optimizes chip design using reinforcement learning, significantly reducing layout time and influencing the industry, with future iterations promising enhanced efficiency and performance across applications.

AlphaChip, developed by Google DeepMind, has significantly transformed the field of computer chip design through its innovative use of reinforcement learning. Initially introduced in 2020, AlphaChip optimizes chip layouts, producing superhuman designs in a fraction of the time it traditionally takes human engineers. This AI-driven method has been instrumental in designing the Tensor Processing Units (TPUs) that power Google's AI systems, including large language models and generative AI applications. By employing a novel edge-based graph neural network, AlphaChip learns the intricate relationships between chip components, allowing it to improve with each design iteration. Its impact extends beyond Google, influencing the broader chip design industry, with companies like MediaTek adopting and adapting AlphaChip for their advanced chip development. The technology has sparked a new wave of research into AI applications in chip design, promising to enhance efficiency and performance across various stages of the design cycle. Future iterations of AlphaChip aim to further revolutionize chip design, making it faster, cheaper, and more power-efficient for a wide range of applications, from smartphones to medical devices.

- AlphaChip uses reinforcement learning to optimize computer chip layouts.

- It has significantly reduced design time for chips, producing layouts in hours instead of weeks.

- The technology has been applied in Google's Tensor Processing Units and adopted by external companies like MediaTek.

- AlphaChip has inspired new research in AI for chip design, impacting various design stages.

- Future developments aim to enhance chip design efficiency and performance across multiple industries.
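
To make the reinforcement-learning formulation above concrete, here is a purely illustrative toy sketch of sequential macro placement: an agent places one macro at a time on a discretized grid and receives a reward based on a proxy metric. The grid size, five-macro netlist, and random stand-in policy are assumptions for illustration, not details of Google's system, which uses an edge-based graph neural network policy trained with RL.

    # Toy sketch of sequential macro placement (illustrative only; the grid
    # size, netlist, and random policy are assumptions, not Google's design).
    import random

    GRID = 16                                # discretized placement canvas
    macros = list(range(5))                  # a tiny toy set of 5 macros
    nets = [{0, 1, 2}, {1, 3}, {2, 3, 4}]    # nets = groups of connected macros

    def hpwl(positions):
        """Half-perimeter wirelength: a standard proxy for routed wirelength."""
        total = 0
        for net in nets:
            xs = [positions[m][0] for m in net]
            ys = [positions[m][1] for m in net]
            total += (max(xs) - min(xs)) + (max(ys) - min(ys))
        return total

    def random_policy(m, positions, used):
        """Stand-in for the learned policy: pick any free grid cell."""
        while True:
            cell = (random.randrange(GRID), random.randrange(GRID))
            if cell not in used:
                return cell

    def episode(policy):
        """Place macros one at a time; reward is -HPWL of the finished layout."""
        positions, used = {}, set()
        for m in macros:
            cell = policy(m, positions, used)
            positions[m] = cell
            used.add(cell)
        return -hpwl(positions)

    # Crude stand-in for training: keep the best of many random rollouts.
    best = max(episode(random_policy) for _ in range(1000))
    print("best reward (negative HPWL):", best)

In the real system the placement decisions come from a learned network that is updated from the reward signal rather than found by brute-force sampling; the toy above only shows the shape of the decision loop.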

31 comments
By @vighneshiyer - 7 months
This work from Google (original Nature paper: https://www.nature.com/articles/s41586-021-03544-w) has been credibly criticized by several researchers in the EDA CAD discipline. These papers are of interest:

- A rebuttal by a researcher within Google, written while the "AlphaChip" work was going on ("Stronger Baselines for Evaluating Deep Reinforcement Learning in Chip Placement"): http://47.190.89.225/pub/education/MLcontra.pdf

- The 2023 ISPD paper from a group at UCSD ("Assessment of Reinforcement Learning for Macro Placement"): https://vlsicad.ucsd.edu/Publications/Conferences/396/c396.p...

- A paper from Igor Markov which critically evaluates the "AlphaChip" algorithm ("The False Dawn: Reevaluating Google's Reinforcement Learning for Chip Macro Placement"): https://arxiv.org/pdf/2306.09633

In short, the Google authors did not fairly evaluate their RL macro placement algorithm against other state-of-the-art (SOTA) algorithms: rather, they claim to perform better than a human at macro placement, which falls far short of what mixed-size placement algorithms are capable of today. The RL technique also requires significantly more compute than other algorithms, and ultimately it learns a surrogate function for placement iteration rather than any novel representation of the placement problem itself.
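
For context on what these evaluations measure: macro placement is typically scored with proxy metrics such as half-perimeter wirelength (HPWL), and the "Stronger Baselines" rebuttal argued that classical methods like simulated annealing, given comparable compute, compare favorably on such metrics. Below is a minimal, purely illustrative simulated-annealing sketch; the toy netlist, grid size, and cooling schedule are assumptions of ours, and real SA placers are far more engineered (this toy even ignores macro overlap).

    # Minimal simulated-annealing macro placer (toy baseline sketch only).
    import math
    import random

    GRID = 16
    macros = list(range(5))
    nets = [{0, 1, 2}, {1, 3}, {2, 3, 4}]

    def hpwl(pos):
        """Half-perimeter wirelength proxy cost."""
        total = 0
        for net in nets:
            xs = [pos[m][0] for m in net]
            ys = [pos[m][1] for m in net]
            total += (max(xs) - min(xs)) + (max(ys) - min(ys))
        return total

    # Start from a random placement; accept cost-increasing moves with
    # probability exp(-delta/T) on a geometric cooling schedule.
    pos = {m: (random.randrange(GRID), random.randrange(GRID)) for m in macros}
    cost, T = hpwl(pos), 10.0
    for _ in range(20000):
        m = random.choice(macros)
        old = pos[m]
        pos[m] = (random.randrange(GRID), random.randrange(GRID))
        new_cost = hpwl(pos)
        if new_cost <= cost or random.random() < math.exp((cost - new_cost) / T):
            cost = new_cost        # accept the move
        else:
            pos[m] = old           # reject the move and restore
        T *= 0.9995                # cool down
    print("final HPWL:", cost)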

In full disclosure, I am quite skeptical of their work and wrote a detailed post on my website: https://vighneshiyer.com/misc/ml-for-placement/

By @lordswork - 7 months
Some interesting context on this work: two researchers were bullied by a senior researcher (who has since been terminated himself) to the point of leaving Google for Anthropic: https://www.wired.com/story/google-brain-ai-researcher-fired...

They must feel vindicated now that their work has turned out to be so fruitful.

By @hinkley - 7 months
TSMC made a point of calling out that their latest generation of chip-design automation software has features that let you select logic designs for TDP over raw speed. I think that's our answer to keeping Dennard scaling alive in spirit if not in body. The speed of light is still going to matter, so the physical proximity of communicating components will always matter, but I wonder how many of the wins will come from this versus from avoiding thermal throttling.
By @pfisherman - 7 months
Questions for those in the know about chip design. How are they measuring the quality of a chip design? Does the metric that Google is reporting make sense? Or is it just something to make themselves look good?

Without knowing much, my guess is that “quality” of a chip design is multifaceted and heavily dependent on the use case. That is the ideal chip for a data center would look very different from those for a mobile phone camera or automobile.

So again: what does "better" mean in the context of this particular problem/task?

By @thesz - 7 months
Eurisko [1], if I remember correctly, was once used to perform a placement-and-route task and was pretty good at it.

[1] https://en.wikipedia.org/wiki/Eurisko

What's more, Eurisko was later used to design a fleet of battle spaceships for the Traveller TCS game, and it applied symmetry-based placement heuristics learned from VLSI design to the fleet's design.

Can AlphaChip's heuristics be used anywhere else?

By @AshamedCaptain - 7 months
What is Google doing here? At best, the quality of their "computer chip design" work can be described as "controversial": https://spectrum.ieee.org/chip-design-controversy. What is there to gain from putting out a PR piece now without doing anything new?
By @yeahwhatever10 - 7 months
Why do they keep saying "superhuman"? Algorithms are used for these tasks, humans aren't laying out trillions of transistors by hand.
By @Upvoter33 - 7 months
To me, there is an underlying issue: why are so many DeepX papers being sent to Nature instead of appropriate CS venues? If you are doing better work in chip design, send it to ISPD or ISCA or whatever, and then you will get the kinds of reviews this work needs. I have no idea what Nature does with a paper like this.
By @negativeonehalf - 7 months
Chips are the limiting factor for AI, and now we have AIs making chips better than human engineers. This feels like an infinite compute cheat code, or at least a way to get us very, very quickly to the physical optimum.
By @cobrabyte - 7 months
I'd love a tool like this for PCB design/layout
By @dreamcompiler - 7 months
Looks like this is only about placement. I wonder if it can be applied to routing?
By @ilaksh - 7 months
How far are we from memory-based computing going from research into competitive products? I get the impression that we are already well past the point where it makes sense to invest very aggressively in scaling up experiments with things like memristors, given that they are talking about how many new nuclear reactors they are going to need just for the AI datacenters.
By @ninetyninenine - 7 months
What occupation is there that is purely intellectual that has no chance of an AI ever progressing to a point where it can take it over?
By @mirchiseth - 7 months
I must be old, because the first thing I thought on reading "AlphaChip" was: why is DeepMind talking about DEC Alpha chips? :-) https://en.wikipedia.org/wiki/DEC_Alpha
By @red75prime - 7 months
I hope I'll still be alive when they announce AlephZero.
By @QuadrupleA - 7 months
How good are TPUs in comparison with state of the art Nvidia datacenter GPUs, or Groq's ASICs? Per watt, per chip, total cost, etc.? Is there any published data?
By @FrustratedMonky - 7 months
So AI is designing its own chips. Now that is moving towards exponential growth, like at the end of the movie "Colossus".

Forget LLMs. What DeepMind is doing seems more like how an AI will come to rule the world: building real-world models and applying game-playing logic to win.

LLMs will just be the text/voice interface to what DeepMind is building.

By @ur-whale - 7 months
Seems to me the article is claiming a lot of things but is very light on the comparisons that actually matter to you and me, namely: how do these fabled AI-designed chips compare to their competition?

For example, how much better are these latest-gen TPUs compared to Nvidia's equivalent offering?

By @colesantiago - 7 months
A marvellous achievement from DeepMind as usual. I am quite surprised that Google acquired them at such a significant discount, around $400M, when I would have expected something in the range of $20BN; but then again, DeepMind wasn't making any money back then.
By @loandbehold - 7 months
Every generation of chips is used to design the next generation. That seems to be the root of the exponential growth in Moore's law.
By @bankcust08385 - 7 months
The technological singularity is around the corner as soon as chips (mostly) design themselves. There will be a few engineers, zillions of semi-skilled maintenance people making a pittance, and most of the world will be underemployed or unemployed. Technical people had better understand this and unionize, or they will find themselves going the way of piano tuners and Russian physicists. Slow-boiling frog...
By @amelius - 7 months
Can this be abstracted and generalized into a more generally applicable optimization method?
By @kayson - 7 months
I'm pretty sure Cadence and Synopsys have both released reinforcement-learning-based placing and floor planning tools. How do they compare...?
By @bachback - 7 months
DeepMind is producing science vapourware while OpenAI is changing the world.
By @idunnoman1222 - 7 months
So one other designer plus Google is using AlphaChip for their layouts? Not sure about that title; call me when AMD and Nvidia are using it.
By @7e - 7 months
Did it, though? Google’s chips still aren’t very good compared with competitors.
By @negativeonehalf - 7 months
There's a lot of... passionate discussion in this thread, but we shouldn't lose sight of the big picture -- Google has used AlphaChip in multiple generations of TPU, their flagship AI accelerator. This is a multi-billion dollar project that is strategically critical for the success of the company. The idea that they're secretly making TPUs worse in order to prop up a research paper is just absurd. Google has even expanded their use of AlphaChip to other chips (e.g. Axion).

Meanwhile, MediaTek built on AlphaChip and is using it widely, and announced that it was used to help design the Dimensity 5G (on a 4nm technology node).

I can understand that, when this open-source method first came out, there were some who were skeptical, but we are way beyond that now -- the evidence is just overwhelming.

I'm going to paste here the quotes from the bottom of the blog post, as it seems like a lot of people have missed them:

“AlphaChip’s groundbreaking AI approach revolutionizes a key phase of chip design. At MediaTek, we’ve been pioneering chip design’s floorplanning and macro placement by extending this technique in combination with the industry’s best practices. This paradigm shift not only enhances design efficiency, but also sets new benchmarks for effectiveness, propelling the industry towards future breakthroughs.” --SR Tsai, Senior Vice President of MediaTek

“AlphaChip has inspired an entirely new line of research on reinforcement learning for chip design, cutting across the design flow from logic synthesis to floor planning, timing optimization and beyond. While the details vary, key ideas in the paper including pretrained agents that help guide online search and graph network based circuit representations continue to influence the field, including my own work on RL for logic synthesis. If not already, this work is poised to be one of the landmark papers in machine learning for hardware design.” --Siddharth Garg, Professor of Electrical and Computer Engineering, NYU

"AlphaChip demonstrates the remarkable transformative potential of Reinforcement Learning (RL) in tackling one of the most complex hardware optimization challenges: chip floorplanning. This research not only extends the application of RL beyond its established success in game-playing scenarios to practical, high-impact industrial challenges, but also establishes a robust baseline environment for benchmarking future advancements at the intersection of AI and full-stack chip design. The work's long-term implications are far-reaching, illustrating how hard engineering tasks can be reframed as new avenues for AI-driven optimization in semiconductor technology." --Vijay Janapa Reddi, John L. Loeb Associate Professor of Engineering and Applied Sciences, Harvard University

“Reinforcement learning has profoundly influenced electronic design automation (EDA), particularly by addressing the challenge of data scarcity in AI-driven methods. Despite obstacles including delayed rewards and limited generalization, research has proven reinforcement learning's capability in complex electronic design automation tasks such as floorplanning. This seminal paper has become a cornerstone in reinforcement learning-electronic design automation research and is frequently cited, including in my own work that received the Best Paper Award at the 2023 ACM Design Automation Conference.” --Professor Sung-Kyu Lim, Georgia Institute of Technology

"There are two major forces that are playing a pivotal role in the modern era: semiconductor chip design and AI. This research charted a new path and demonstrated ideas that enabled the electronic design automation (EDA) community to see the power of AI and reinforcement learning for IC design. It has had a seminal impact in the field of AI for chip design and has been critical in influencing our thinking and efforts around establishing a major research conference like IEEE LLM-Aided Design (LAD) for discussion of such impactful ideas." --Ruchir Puri, Chief Scientist, IBM Research; IBM Fellow

By @DrNosferatu - 7 months
Yet, their “frontier” LLM lags all the others…
By @abc-1 - 7 months
Why aren’t they using this technique to design better transformer architectures or completely novel machine learning architectures in general? Are plain or mostly plain transformers really peak? I find that hard to believe.
By @mikewarot - 7 months
I understand the achievement, but can't square it with my belief that uniform systolic arrays will prove to be the best general purpose compute engine for neural networks. Those are almost trivial to route, by nature.