Leak claims RTX 5090 has 600W TGP, RTX 5080 hits 400W
Preliminary specifications for Nvidia's RTX 5080 and RTX 5090 graphics cards suggest significant performance advancements, with the RTX 5090 featuring 600W TGP and 21,760 CUDA cores, potentially launching in early 2025.
Read original articlePreliminary specifications for Nvidia's upcoming GeForce RTX 5080 and RTX 5090 graphics cards have been leaked, suggesting significant advancements in performance. The RTX 5090 is expected to feature a total graphics power (TGP) of 600W, utilizing the GB202 graphics processor with 21,760 CUDA cores and 32GB of GDDR7 memory on a 512-bit bus. This positions the RTX 5090 as a high-performance option, potentially doubling the performance of the RTX 4080. In contrast, the RTX 5080 is projected to have a TGP of 400W, powered by the GB203 GPU, with 10,752 CUDA cores and 16GB of GDDR7 memory on a 256-bit interface. The performance gap between the two models is notable, with the RTX 5080 offering roughly half the specifications of the RTX 5090. Speculation suggests that Nvidia may be adopting a multi-chiplet design for the RTX 5090, similar to its datacenter GPUs, although a monolithic design is also possible. The release date for these cards remains uncertain, with some sources indicating a potential launch in early 2025. Until Nvidia provides official confirmation, these details should be viewed with caution.
- Nvidia's RTX 5090 may have a TGP of 600W and 21,760 CUDA cores.
- The RTX 5080 is expected to have a TGP of 400W with 10,752 CUDA cores.
- The performance gap between the RTX 5090 and RTX 5080 is significant.
- Speculation exists regarding a multi-chiplet design for the RTX 5090.
- The release date for the RTX 50-series is uncertain, possibly in early 2025.
Related
Nvidia launches a new RTX 4070
Nvidia has introduced an updated RTX 4070 graphics card with GDDR6 memory to enhance supply and availability, maintaining similar performance to the previous model, available globally from September 2024.
Benchmarks show even an old Nvidia RTX 3090 is enough to serve LLMs to thousands
An analysis by Backprop shows the Nvidia RTX 3090 can effectively serve large language models to thousands of users, achieving 12.88 tokens per second for 100 concurrent requests.
Nvidia GeForce 256 is 25 years old
NVIDIA celebrated the 25th anniversary of the GeForce 256, the first GPU, which revolutionized graphics processing by integrating T&L features, leading to significant advancements in modern GPUs and performance.
RTX 5080 rumored to have 400W TGP
Nvidia plans to finalize designs for the RTX 50-series, including the RTX 5080 with a 400-watt TGP, by September 2024, with a potential launch at CES 2025.
Leak claims RTX 5090 has 600W TGP, 32GB VRAM, 512-bit bus
Preliminary specs for Nvidia's RTX 5080 and RTX 5090 suggest significant performance advancements, with the 5090 featuring 600W TGP and 21,760 CUDA cores, potentially launching in early 2025.
Considered getting a new GPU earlier this year but then realized 5xxx was "around the corner", but now it seems they're pushed back to next year. And with AI being what it is, I'm guessing prices won't drop significantly.
Would be nice if AMD could get their GPU act together so it was a more viable alternative, NVIDIA could do with some competition.
edit: just recalled I had a dual-chip GPU back in the day, the ATI 4870X2[2]. Though that was more like two GPUs glued to one PCB, so effectively "single card SLI".
Hopefully the 5090 would be a better experience, as my 4870X2 never quite lived up to what it could theoretically do.
[1]: https://www.pugetsystems.com/labs/articles/llm-inference-con...
[2]: https://www.techpowerup.com/gpu-specs/radeon-hd-4870-x2.c236
Gamers don't need high end video cards, they want high end video cards. In general, the high marginal price for low marginal value of high end video cards prevents most gamers from acting on their desires.
But this generation of video cards provides a couple of other justifications for the purchase:
- it will allow them to run uncensored ML stacks locally - it will allow the buyers to train themselves on the hottest new career path.
A large number of people who use these excuses to justify the purchases to themselves or their loved ones will only use it for gaming, but those excuses will fuel a lot of sales.
This seems like the wrong generation for AMD to skip the halo tier of gaming cards.
From a gaming perspective, technicality is not anything like the ending days of Voodoo cards, but somehow it reminds me of the same feeling.
[0] https://www.razer.com/mena-en/gaming-laptops/razer-core-x
Their highest efficiency is at 80-90% utilization, but the efficiency drops off when underutilized.
Which has been a huge factor in data center servers from what i know for a long time. Somehow that has been forgotten
The price divide between “desktop” and “datacenter” GPUs is artificial and no doubt there is a collusion of some kind.
Related
Nvidia launches a new RTX 4070
Nvidia has introduced an updated RTX 4070 graphics card with GDDR6 memory to enhance supply and availability, maintaining similar performance to the previous model, available globally from September 2024.
Benchmarks show even an old Nvidia RTX 3090 is enough to serve LLMs to thousands
An analysis by Backprop shows the Nvidia RTX 3090 can effectively serve large language models to thousands of users, achieving 12.88 tokens per second for 100 concurrent requests.
Nvidia GeForce 256 is 25 years old
NVIDIA celebrated the 25th anniversary of the GeForce 256, the first GPU, which revolutionized graphics processing by integrating T&L features, leading to significant advancements in modern GPUs and performance.
RTX 5080 rumored to have 400W TGP
Nvidia plans to finalize designs for the RTX 50-series, including the RTX 5080 with a 400-watt TGP, by September 2024, with a potential launch at CES 2025.
Leak claims RTX 5090 has 600W TGP, 32GB VRAM, 512-bit bus
Preliminary specs for Nvidia's RTX 5080 and RTX 5090 suggest significant performance advancements, with the 5090 featuring 600W TGP and 21,760 CUDA cores, potentially launching in early 2025.