Nvidia GeForce RTX 3080 Founders Edition Review: A Huge Generational Leap in Performance
When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works.
The GeForce RTX 3080 delivers big gains over Turing and the RTX 2080 Ti, but at a lower price.
Editor’s Choice
(Image: © Tom’s Hardware)
Tom’s Hardware Verdict
Pros
- +
Fastest Graphics Card (until 3090 arrives)
- +
Priced the same as RTX 2080 Super
- +
Delivers true 4K 60+ fps gaming
- +
Runs reasonably cool and quiet
- +
Major architectural updates for future games
Cons
- –
Highest TDP for a single GPU (until 3090)
- –
Potential for CPU bottlenecks
- –
Needs a 1440p or 4K display to shine
- –
We’re not sold on the 12-pin power adapter
- –
No RGB bling (but this is a pro for many!)
Why you can trust Tom’s Hardware
Our expert reviewers spend hours testing and comparing products and services so you can choose the best for you. Find out more about how we test.
Nvidia’s GeForce RTX 3080 Founders Edition is here, claiming the top spot on our GPU benchmarks hierarchy, and ranking as the best graphics card currently available — provided you’re after performance first, with price and power being lesser concerns. After months of waiting, we finally have independent benchmarks and testing data. Nvidia has thrown down the gauntlet, clearly challenging AMD’s Big Navi to try and match or beat what the Ampere architecture brings to the table.
We’re going to hold off on a final verdict for now, as we have other third-party RTX 3080 cards to review, which will begin as soon as tomorrow. That’s good news, as it means customers won’t be limited to Nvidia’s Founders Edition for the first month or so like we were with the RTX 20-series launch. Another piece of good news is that there’s no Founders Edition ‘tax’ this time: The RTX 3080 FE costs $699, direct from Nvidia, and that’s the base price of RTX 3080 cards for the time being. The bad news is that we fully expect supply to be insufficient to keep up with what we expect to be exceptionally high demand.
The bottom line, if you don’t mind spoilers, is that the RTX 3080 FE is 33% faster than the RTX 2080 Ti, on average. Or, if you prefer other points of comparison, it’s 57% faster than the RTX 2080 Super, 69% faster than the RTX 2080 FE — heck, it’s even 26% faster than the Titan RTX!
But there’s a catch: We measured all of those ‘percent faster’ results across our test suite running at 4K ultra settings. The lead narrows if you drop down to 1440p, and it decreases even more at 1080p. It’s still 42% faster than a 2080 FE at 1080p ultra, but this is very much a card made for higher resolutions. Also, you might need a faster CPU to get the full 3080 experience — check out our companion GeForce RTX 3080 CPU Scaling article for the full details.
- Nvidia GeForce RTX 3080 at Amazon for $899
Swipe to scroll horizontally
Graphics Card | RTX 3080 FE | RTX 2080 Super FE | RTX 2080 FE | |
---|---|---|---|---|
Architecture | GA102 | TU104 | TU104 | |
Process (nm) | Samsung 8N | TSMC 12FFN | TSMC 12FFN | |
Transistors (Billion) | 28. |
628.4 | 545 | 545 |
GPCs | 6 | 6 | 6 | |
SMs | 68 | 48 | 46 | |
FP32 CUDA Cores | 8704 | 3072 | 2944 | |
Tensor Cores | 272 | 384 | 368 | |
RT Cores | 68 | 48 | 46 | |
Boost Clock (MHz) | 1710 | 1815 | 1800 | |
VRAM Speed (Gbps) | 19 | 15.5 | 14 | |
VRAM (GB) | 10 | 8 | 8 | |
VRAM Bus Width | 320 | 256 | 256 | |
ROPs | 96 | 64 | 64 | |
TPCs | 34 | 24 | 23 | |
TMUs | 272 | 192 | 184 | |
GFLOPS FP32 | 29768 | 11151 | 10598 | |
Tensor TFLOPS FP16 (Sparsity) | 119 (238) | 89 | 85 | |
RT TFLOPS | 58 | 26 | 25 | |
Bandwidth (GBps) | 760 | 496 | 448 | |
TDP (watts) | 320 | 250 | 225 | |
Dimensions (mm) | 285x112x38 | 267x116x38 | 267x116x38 | |
Weight (g) | 1355 | 1278 | 1260 | |
Launch Date | Sep-20 | Jul-19 | Sep-18 | |
Launch Price | $699 | $699 | $799 |
Meet GA102: The Heart of the Beast
(Image credit: Tom’s Hardware)
We have a separate article going deep into the Ampere architecture that powers the GeForce RTX 3080 and other related GPUs. If you want the full rundown of everything that’s changed compared to the Turing architecture, we recommend starting there. But here’s the highlight reel of the most important changes:
The GA102 is the first GPU from Nvidia to drop into the single digits on lithography, using Samsung’s 8N process. The general consensus is that TSMC’s N7 node is ‘better’ overall, but it also costs more and is currently in very high demand — including from Nvidia’s own A100. Could the consumer Ampere GPUs have been even better with 7nm? Perhaps. But they might have cost more, only been available in limited quantities, or maybe they would have been delayed a few more months. Regardless, GA102 is still a big and powerful chip, boasting 28.3 billion transistors packed into a 628.4mm square die. If you’re wondering, that’s 52% more transistors than the TU102 chip used in RTX 2080 Ti, but in a 17% smaller area.
Ampere ends up as a split architecture, with the GA100 taking on data center ambitions while the GA102 and other consumer chips have significant differences. The GA100 focuses far more on FP64 performance for scientific workloads, as well as doubling down on deep learning hardware. Meanwhile, the GA102 drops most of the FP64 functionality and instead includes ray tracing hardware, plus some other architectural enhancements. Let’s take a closer look at the Ampere SM found in the GA102 and GA104.
(Image credit: Tom’s Hardware)
Nvidia GPUs consist of several GPCs (Graphics Processing Clusters), each of which has some number of SMs (Streaming Multiprocessors). Nvidia splits each SM into four partitions that can operate on separate sets of data. With Ampere, each SM partition now has 16 FP32 CUDA cores, 16 FP32/INT CUDA cores, a third-gen Tensor core, load/store units, and a special function unit. The whole SM has access to shared L1 cache and memory, and there’s a single second-gen RT core. In total, that means 64 FP32 cores and 64 FP32/INT cores, four Turing cores, and one RT core. Let’s break that down a bit more.
The Turing GPUs added support for concurrent FP32 (32-bit floating point) and INT (32-bit integer) operations. FP32 tends to be the most important workload for graphics and games, but there’s still a decent amount of INT operations — for things like address calculations, texture lookups, and various other types of code. With Ampere, the INT datapath is upgraded to support INT or FP32, but not at the same time.
If you look at the raw specs, Ampere appears to be a far bigger jump in performance than the 70% we measured. 30 TFLOPS! But it generally won’t get anywhere near that high because the second datapath is an either/or situation: It can’t do both types of instructions on the pipeline in the same cycle. Nvidia says around 35% of gaming calculations are INT operations, which means you’ll end up with something more like 20 TFLOPS of FP32 and 10 TOPS of INT on the RTX 3080.
While we’re on the subject, let’s also point out that a big part of the increased performance comes from increased power limits. RTX 2080 was a 225W part (for the Founders Edition), and RTX 3080 basically adds 100W to that. That’s half again more power for 70% more performance. It’s technically a win in overall efficiency, but in the pursuit of performance, Nvidia had to move further to the right on the voltage and frequency curve. Nvidia says RTX 3080 can deliver a 90% improvement in performance-per-watt if you limit performance to the same level on both the 2080 and 3080 … but come on, who wants to limit performance that way? Well, maybe laptops, but let’s not go there.
(Image credit: Tom’s Hardware)
One thing that hasn’t changed much is the video ports. Okay, that’s only partially true. First, there’s a single HDMI port, but it’s HDMI 2.1 instead of Turing’s HDMI 2.0b, but the three DisplayPort connections remain 1.4a. And last but not least, there’s no VirtualLink port this round — apparently, VirtualLink is dead. RIP. The various ports are all capable of 8K60 using DSC (Display Stream Compression), a “visually lossless” technique that’s actually not really visually lossless. But you might not notice at 8K.
Getting back to the cores, Nvidia’s third-gen tensor cores in GA102 work on 8x4x4 FP16 matrices, so up to 128 matrix operations per cycle. (Turing’s tensor cores used 4x4x4 matrices, while the GA100 uses 8x4x8 matrices.) With FMA (fused multiply-add), that’s 256 FP operations per cycle, per tensor core. Multiply by the 272 total tensor cores and clock speed, and that gives you 119 TFLOPS of FP16 compute. However, Ampere’s tensor cores also add support for fine-grained sparsity — basically, it eliminates wasting time doing multiplications by 0, since the answer is always 0. Sparsity can provide up to twice the FP16 performance in applications that can use it.
The RT cores receive similar enhancements, with up to double the ray/triangle intersection calculations per clock. The RT cores also support a time variable, which is useful for calculating things like motion blur. All told, Nvidia says the 3080’s new RT cores are 1.7 times faster than the RTX 2080’s, and they can be up to five times as fast for motion blur.
There are plenty of other changes as well. The L1 cache/shared memory capacity and bandwidth has been increased to better feed the cores (8704KB vs. 4416KB), and the L2 cache is also 25% larger than before (5120KB vs. 4096KB). The L1 cache can also be configured as varying amounts of L1 vs. shared memory, depending on the needs of the application. Register file size is also nearly 50% larger (17408KB vs. 11776KB) with the RTX 3080. GA102 can also do concurrent RT + graphics + DLSS (previously, using the RT cores would stop the CUDA cores).
Finally, the raster operators (ROPS) have been moved out of the memory controllers and into the GPCs. Each GPC has two ROP partitions of eight ROP units each. This provides more flexibility in performance, so where the GA102 has up to 112 ROPS total, the RTX 3080 disables two memory controllers but only one GPC and ends up with 96 ROPS. This is more critical for the RTX 3070 / GA104, however, which still has 96 ROPS even though it only has eight memory controllers. Each GPC also includes six TPCs (Texture Processing Clusters) with eight TMUs (Texture Mapping Units) and a polymorph engine, though Nvidia only enables 34 TPCs for the 3080.
(Image credit: Tom’s Hardware)
With the core enhancements out of the way, let’s also quickly discuss the memory subsystem. GA102 supports up to twelve 32-bit memory channels, of which ten are enabled on the RTX 3080. Nvidia teamed up with Micron to use its GDDR6X memory, which uses PAM4 signaling to boost data rates even higher than before. Where the RTX 20-series cards topped out at 15.5 Gbps in the 2080 Super and 14 Gbps in the other RTX cards, GDDR6X runs at 19 Gbps in the RTX 3080. Combined with the 320-bit interface, that yields 760 GBps of bandwidth – a 70% improvement over RTX 2080.
The RTX 3080’s memory controller has also been improved, with a new feature called EDR: Error Detection and Replay. When the memory detects a failed transmission, rather than crashing or corrupting data, it simply tries again. It will do this until it’s successful, though it’s still possible to cause a crash with memory overclocking. The interesting bit is that with EDR, higher memory clocks might be achievable, but still result in lower performance. That’s because the EDR ends up reducing memory performance when failed transmissions occur. We’ll have more to say on this in the overclocking section.
(Image credit: Tom’s Hardware)
Nvidia has radically altered the design of its Founders Edition cards for the RTX 30-series. The new design still includes two axial fans, but Nvidia heavily redesigned the PCB and shortened it so that the ‘back’ of the card (away from the video ports) consists of just a fan, heatpipes, radiator fins, and the usual graphics card shroud. Nvidia says the new design delivers substantial improvements in cooling efficiency, while at the same time lowering noise levels. We’ll see the fruits of the design later.
Aesthetics are highly subjective, and we’ve heard plenty of people like the new design, while others think it looks boring. There’s no RGB bling if that’s your thing, and the only lighting consists of a white GeForce RTX logo on the top of the card with subtle lighting around the ‘X’ on both sides of the card (but only half of the ‘X’ is lit up on the side with the “RTX 3080” logo).
Personally, I think the new card looks quite nice, and it feels very solid in the hand. It’s actually about 100g heavier than the previous RTX 2080 design, and as far as I’m aware, it’s the heaviest single-GPU card Nvidia has ever created. It’s also about 2cm longer than the previous generation cards and uses the typical two-slot width. (The GeForce RTX 3090 is about ready to make the 3080 FE look puny, though, with its massive three-slot cooler.)
Image 1 of 6
(Image credit: Tom’s Hardware)(Image credit: Tom’s Hardware)(Image credit: Tom’s Hardware)(Image credit: Tom’s Hardware)(Image credit: Tom’s Hardware)(Image credit: Tom’s Hardware)
Nvidia provided the above images of the teardown of the RTX 3080 Founders Edition. We’re not ready to attempt disassembly of our card yet — and frankly, we’re out of time — but we may return to the subject soon. We’re told getting the card apart is a bit trickier this round, mostly because Nvidia has hidden the screws behind tiny covers.
The main board looks far more densely populated than previous GPUs, with the 10 GDDR6X memory chips surrounding the GPU in the center. You can also see the angled 12-pin power connector and the funky-looking cutout at the end of the PCB. Power delivery is obviously important with a 320W TGP, and you can see all the solid electrolytic capacitors placed to the left and right of the memory chips.
The memory arrangement is also interesting, with four chips on the left and right sides of the GPU, up to three chips above the GPU (two mounting positions are empty for the RTX 3080), and a final single chip below the GPU. Again, Nvidia clearly spent a lot of effort to reduce the size of the board and other components to accommodate the new and improved cooling design. Spoiler: It works very well.
One interesting thing is that the ‘front’ fan (near the video ports) spins in the usual direction — counterclockwise. The ‘back’ fan, which will typically face upward when you install the card in an ATX case, spins clockwise. If you look at the fins, that means the back fan spins the opposite direction from what we normally expect. The reason is that Nvidia found this arrangement pulls air through the radiator better and generates less noise. Also note that the back fan is slightly thicker, and the integrated ring helps increase static pressure on both fans while keeping RPMs low.
If you don’t like the look of the Founders Edition, rest assured there will be plenty of other options. We have a few third-party RTX 3080 cards in for testing, all of which naturally include RGB lighting. None of the third party cards use the 12-pin power connector, either — not that it really matters, since the required adapter comes with the card. Still, that vertically-mounted 12-pin port just looks a bit less robust if you happen to swap GPUs on a regular basis. I plan to leave the adapter permanently connected and just connect or disconnect the normal 8-pin PEG cables. The 12-pin connector appears to be rated for 25 ‘cycles,’ and I’ve already burned through half of those (not that I expect it to fail any time soon).
Nvidia GeForce RTX 3080: Price Comparison
$749.99
View
$1,069
View
$1,199.95
$1,099.95
View
Show More Deals
powered by
- 1
Current page:
GeForce RTX 3080 Founders Edition: Hail to the King!
Next Page GeForce RTX 3080: Initial Overclocking Results
Jarred Walton is a senior editor at Tom’s Hardware focusing on everything GPU. He has been working as a tech journalist since 2004, writing for AnandTech, Maximum PC, and PC Gamer. From the first S3 Virge ‘3D decelerators’ to today’s GPUs, Jarred keeps up with all the latest graphics trends and is the one to ask about game performance.
Nvidia RTX 4070 vs RTX 3080: Should you upgrade?
❘ Published: 2023-06-23T16:33:07
❘ Updated: 2023-06-23T16:33:19
Nvidia’s RTX 4070 has been unveiled, yet for proud owners of the RTX 3080, you’d be surprised to find the performance battle closer than ever. We dive deep on the RTX 4070 vs RTX 3080.
The latest crop of Nvidia’s RTX 40-series GPUs have hit the market, with the RTX 4070 proving to be one of the most user-friendly options from the Green Team’s repertoire. But how does it measure up to the already-iconic RTX 3080 when it comes to cost-effectiveness, value for money, and sheer performance? We’re on a mission to find out.
Article continues after ad
Our comprehensive benchmarks are here to help you decide on which graphics card should be your prime pick. If you’re sitting on an RTX 3080 and the thought of a shiny new RTX 4070 has you itching to switch, we’ve got all the information you need to make an informed decision.
RTX 4070 vs RTX 3080: Specifications
Dexerto
GPU | Nvidia RTX 4070 | Nvidia RTX 3080 |
CUDA cores | 5888 | 9728 |
Base clock | 1920 MHz | 1440 MHz |
Boost clock | 2475 MHz | 1710 MHz |
Memory | 12GB GDDR6X | 10GB GDDR6X |
TGP | 200W | 320W |
Release date | April 13, 2023 | September 17, 2020 |
MSRP | $599 | $699 |
Buy the Nvidia RTX 4070
Buy the Nvidia RTX 3080
When it comes to looking at pure specifications from both GPUs, we can see that the RTX 3080 has significantly more CUDA cores when compared to the newer-generation RTX 4070. However, the 4070 also comes with an increase in clock speeds when compared to the RTX 3080, which will help bridge the performance gap left by the CUDA cores.
Article continues after ad
We can also observe a slightly different memory configuration, with the RTX 4070 having slightly more VRAM when compared to the RTX 3080. The 4070 also houses a slightly increased cache, allowing the memory to go slightly further with a total bandwidth of 504 GB/s. However, the RTX 3080 has a larger overall bandwidth of 760 GB/s. The RTX 4070’s increased cache size however allows for an improvement in performance, power efficiency, and more reliance on data that can exist on-chip, rather than calling for more performance via bandwidth.
RTX 4070 vs RTX 3080: Price
The RTX 4070 retails for just $599, while the RTX 3080 retails for $699. However, the humble aged RTX 3080 can now be had for as low as $500, if you catch it on sale.
Related:
Article continues after ad
The baseline MSRP of the RTX 4070 is $599, but many AIB partners have also released more expensive variants of the GPU with slightly different power configurations, too. However, at current MSRP prices, the RTX 4070 edges out the 3080 in value, which is a good thing, especially as the humble Ampere GPU is now almost three years old.
Subscribe to our newsletter for the latest updates on Esports, Gaming and more.
RTX 4070 vs RTX 3080: Performance
The RTX 3080 is slightly faster than the RTX 4070 across all of our benchmarks by a factor of 0.89%. The GPUs were benchmarked using the exact same testbench which houses current GPU parts. However, despite this seemingly tiny difference in performance, there are a few highlights that you need to know about their performance with a baseline test bench ahead of the official launch of the RTX 4070. Here’s what we’ve found.
Article continues after ad
Benchmark | GeForce RTX 4070 Founders Edition | Geforce RTX 3080 Founders Edition |
Time Spy Extreme | 8608 | 8766 |
Port Royal | 11293 | 11444 |
Speed Way | 4520 | 4582 |
4K Forza | 89 FPS | 92 FPS |
4K Cyberpunk 2077 | 17 FPS | 13 FPS |
4K Cyberpunk DLSS (3&2) | 72 FPS | 50 FPS |
4K Overwatch 2 | 157 FPS | 148 FPS |
4K CS:GO | 204 FPS | 201 FPS |
1440p Forza | 132 FPS | 120 FPS |
1440p Cyberpunk | 39 FPS | 38 FPS |
1440p Cyberpunk DLSS (3&2) | 126 FPS | 85 FPS |
1440p Overwatch 2 | 356 FPS | 334 FPS |
1440p CS:GO | 607 FPS | 582 FPS |
There are several points in which the RTX 4070 edges out the RTX 3080, and that’s in ray-traced performance, and when DLSS 3 comes into play. This is easily observable in the benchmarks of Cyberpunk 2077, where the newer RTX 4070 clearly has an advantage when it comes to DLSS 3-based workloads.
RTX 4070 vs RTX 3080: Which one should you buy?
The adoption of DLSS 3 is only going to get wider, and we think you should hold on to your RTX 3080 if you already have one. You should choose the RTX 4070 over the RTX 3080 if purchasing a brand-new GPU because of the availability of features like DLSS 3, in addition to having a 120W lower TGP, making the graphics card much more efficient than the RTX 3080.
Article continues after ad
With that in mind, there’s no reason to pick up an RTX 3080, unless you can grab it at a bargain price when compared to the RTX 4070.
If you click on a product link on this page we may earn a small affiliate commission.
org/Article”> |
NVIDIA GeForce RTX ™ 3080Ti FOUNDER’S EDITION – Tallinn
Ad is not active, product or service is discontinued
2 199 €
€ 2199.