Blackwell GPU Family Specifications and Benchmarks made it clear that all 50-series positions, with the exception of the GeForce RTX 5090, are little more than a soft upgrade of the corresponding models of the previous generation. The situation was aggravated by the shortage of new video cards and, as a result, sky-high prices.
For most buyers, the top-end devices that NVIDIA monopolizes are of only theoretical interest. But one step down, the technological stagnation of "green" graphics has created a situation where AMD could make up for the lag in such problematic aspects of the RDNA architecture as ray tracing and upscaling in one go. However, we have been hearing the prophecy that AMD will restore parity with its competitor (even without claims to the flagship sector) for a long time, and each time it sounds less and less convincing, so the announcement of the Radeon RX 9000 video cards did not attract much attention against the backdrop of NVIDIA's vigorous activity. But AMD has carried out extensive work on errors and, if not eliminated, then at least compensated for the shortcomings of past GPUs.
We present the Radeon RX 9070 XT review in an unusual comparison format for 3DNews, since the “red” new product and its main competitor, the GeForce RTX 5070 Ti, fell into our hands at the same time, and one device cannot be considered in isolation from the other, since in Russia they are sold for about the same money.
![]() | ![]() |
Navi 48 GPU and RDNA 4 architecture
Only the lazy have not mentioned that NVIDIA has become a monopolist in the high-performance GPU market and behaves accordingly. Less often, attention is paid to the fact that all companies designing large processors are forced to use the services of one contractor for their production - Taiwanese TSMC. The demand for TSMC's capacity was probably one of the reasons why AMD stopped developing large consumer GPUs, and in general, in the line of "red" graphics processors of the new generation, there is only one crystal - Navi 48.
| Manufacturer | AMD | ||
|---|---|---|---|
| Name | Navi 32 | Navi 31 | Navi 48 |
| Where to use | Radeon RX 7700 XT; Radeon RX 7800 XT | Radeon RX 7900 GRE; Radeon RX 7900 XT; Radeon RX 7900 XTX | Radeon RX 9070? Radeon RX 9070 XTX |
| Architecture | RDNA3 | RDNA3 | RDNA4 |
| Process technology, nm | TSMC N5/N6 | TSMC N5/N6 | TSMC N4P |
| Number of transistors, billion | 28,1 | 57,7 | 53,9 |
| Chip area, mm2 | 346 | 522 | 357 |
| Number of CU/WGP/SE | |||
| Compute Units (CU) | 60 | 96 | 64 |
| Workgroup Processors (WGP) | 30 | 48 | 32 |
| Shader Arrays (SA) | No | No | No |
| Shader Engines (SE) | 4? | 6 | 4 |
| Compute Unit Configuration | |||
| Vector ALUs (FP32) | 2 × 32 | 2 × 32 | 2 × 32 |
| Vector ALUs (FP32/INT32) | 2 × 32 | 2 × 32 | 2 × 32 |
| Scalar ALUs | 2 | 2 | 2 |
| Special Purpose Unit (SFU) ALU | 2 × 8 | 2 × 8 | 2 × 8 |
| Ray Accelerators | 1 | 1 | 1 |
| Texture Mapping Units (TMU) | 4 | 4 | 4 |
| Vector/scalar registers, KB | 384/10 | 384/10 | 384/16 |
| L0 cache size, KB | 32 | 32 | 32 |
| Workgroup Processor (WGP) Configuration | |||
| Shared Memory Volume, KB | 128 | 128 | 128 |
| Instruction cache size, KB | 32 | 32 | 32 |
| Scalar cache size, KB | 16 | 16 | 16 |
| GPU Computing Units | |||
| Vector ALUs (FP32) | 3 840 (7 680) | 6 144 (12 288) | 4 096 (8 192) |
| Texture Mapping Units (TMU) | 240 | 384 | 256 |
| Raster Operation Blocks (ROPs) | 96 | 192 | 128 |
| Ray Accelerators | 60 | 96 | 64 |
| Memory Configuration | |||
| L1 cache size, KB | 2 048 | 3 072 | 2 048 |
| L2 cache size, MB | 4 | 6 | 8 |
| Infinity Cache Volume, MB | 64 | 96 | 64 |
| VRAM bus width, bits | 256 | 384 | 256 |
| VRAM chip type | GDDR6 | GDDR6 | GDDR6 |
| PCI Express interface | 4.0 x16 | 4.0 x16 | 5.0 x16 |
Among the older chips, it most closely resembles the second-tier model, Navi 32, in its core specs, but contains almost as many transistors as Navi 31 (53,9 and 57,7 billion, respectively). Navi 48 is built using TSMC's N4P technology, which belongs to the same 5 nm photolithographic node that was mastered two years ago. However, Navi 48 takes up only 68% of the area of Navi 31, thanks to AMD's abandonment of the chiplet design, since MCD chiplets, which contain the third-level cache and memory controllers, are manufactured using the coarser N6 process technology.

The architecture of the new graphics processor, RDNA 4, is structurally no different from previous iterations. Navi 48 contains 64 Compute Units, or 4 FP096-compatible shader ALUs, which is a fairly typical formula for a second-tier GPU - as is the 32-bit video memory bus. However, AMD also compares Navi 256 in performance estimates to the flagship Navi 48, which has 31 standard-precision real-number shader ALUs. The difference should be more than compensated for by multiple logic improvements, and to understand them, we recommend refreshing your memory of the RDNA 6 architecture, which we discussed in detail in the review .
Compared to green architecture RDNA 4 is a larger and deeper upgrade of previous developments. However, the description of all the innovations surprisingly folds into a short list due to the fact that NVIDIA traditionally pays increased attention to complex software or semi-software solutions that require detailed comments, while AMD's innovations, as a rule, begin and end in hardware.

Fundamental changes have occurred in the Compute Unit device — an indivisible computing block, which is an analogue of SM in NVIDIA and CPU cores. Thus, the instruction scheduler now supports the so-called divided barriers, which allow instructions to be issued for execution during the action period. Prefetching of instructions from the cache has also been optimized, and scalar ALUs, which are used for economical execution of conditional jump and branching operations, have inherited the ability to work with real-valued data from the RDNA 3.5 mobile architecture. At the same time, the volume of scalar registers has also been increased: from 10 to 16 KB.

One of the most important innovations of RDNA 4 is the doubled execution rate of WMMA matrix instructions, which are used in machine learning tasks and data processing using neural networks. In addition, the RDNA 4 instruction set contains commands for calculations on structurally sparse matrices and 8-bit floating-point data (FP8 and BF8).

However, the 1 FLOPs per clock means that Navi chips are only on par with the previous green architecture, Ampere. In addition, while NVIDIA and Intel chips use separate ALU arrays for matrix operations, in the RNDA 024 architecture this work still falls on shader SIMDs.
However, let's not forget that in real applications, RDNA 3-based GPUs never even come close to full load. The declared teraflops in general-purpose FP32 calculations are achievable only if the VOPD (Vector Operation Dual) format is used, which allows two independent vector instructions to be packed together. And this format itself imposes a lot of limitations, so half of the 32-bit ALUs inside the Compute Unit are most often idle. In a mixed load (such as a game with upscaling), matrix operations will, of course, take away some computing resources from shaders, but these losses will not be as great as it seems at first glance.
| Compute Unit (AMD RDNA 3) | Compute Unit (AMD RDNA 4) | Streaming Multiprocessor (NVIDIA Ada Lovelace) | Streaming Multiprocessor (NVIDIA Blackwell) | Xe-core (Intel Xe-HPG) | Xe-core (Intel Xe2) | |
|---|---|---|---|---|---|---|
| Execution blocks | 2 × SIMD32 (FP32/INT32); 2 × SIMD32 (FP32); 2 × SIMD2 (FP64); 2×SIMD8 (SFU); 2 × scalar ALUs | 2 × SIMD32 (FP32/INT32); 2 × SIMD32 (FP32); 2 × SIMD2 (FP64); 2×SIMD8 (SFU); 2 × scalar ALUs | 4 × SIMD16 (FP32/INT32); 4 × SIMD16 (FP32); 2 × SISD? (FP64); 4×SIMD4 (SFU); 4 × scalar ALUs; 4 × tensor cores | 8 × SIMD16 (FP32/INT32); 2 × SISD? (FP64); 4×SIMD4 (SFU); 4 × scalar ALUs; 4 × tensor cores | 16 × SIMD8 (FP32); 16 × SIMD8 (INT32); 16×SISD (FP64); 16×SIMD2 (SFU); 16×XXX | 8 × SIMD16 (FP32); 8 × SIMD16 (INT32); 8 × SIMD2 (FP64); 8×SIMD4 (SFU); 8×XXX |
| SIMD Line Operations Per Clock | 128×FP32; 64×INT32; 256×FP16; 4×FP64; 16 × trans functions | 128×FP32; 64×INT32; 256×FP16; 4×FP64; 16 × trans functions | 128×FP32; 64×INT32; 128×FP16; 2×FP64; 16 × trans functions | 128×FP32; 128×INT32; 128×FP16; 2×FP64; 16 × trans functions | 128×FP32; 128×INT32; 256×FP16; 16×FP64; 32 × trans functions | 128×FP32; 128×INT32; 256×FP16; 16×FP64; 32 × trans functions |
| Matrix operations, FLOPs per clock (FP16) | 512 | 1 024 | 2 048 | 2 048 | 2 048 | 2 048 |
Frame scaling with a neural network is no longer a theoretical situation for Radeon accelerators. The new, fourth version of FSR finally relies on machine learning, including for frame generation, and the models used are a hybrid version of convolutional networks and transformers. Alas, unlike previous versions of the upscaler, which worked on almost any hardware, FSR 4 only supports RDNA 4 architecture GPUs (due to the expansion of the WMMA instruction set to FP8 data). A universal mode with simplified computing cores, like in Intel's XeSS, is not provided.

Another area of the RDNA architecture reform was ray tracing. The speed of testing ray intersections with the BVH box and with a triangle has doubled per Ray Accelerator (and therefore one Compute Unit): from 4 and 1 per clock to 8 and 2, respectively. However, the RT block of Intel Battlemage chips works at a rate of 18 and 2, and in Blackwell graphics processors it finds four ray intersections with a triangle per clock and an unpublished number of ray intersections with the BVH box. Moreover, competitors perform two types of testing (against the BVH box and triangle) simultaneously, while AMD still does not. To some extent, parallelism is possible, because each Ray Accelerator now contains two independent "tester", but not at full speed.

Additionally, the Ray Accelerator has been given the ability to process rays out of order. Rays that do not require access to slow levels of the memory stack are processed first, and the test results are saved so that the shader program receives them in the expected order.
The ray tracing logic now uses a denser BVH8 structure instead of BVH4 (eight instead of four nodes for each branch of the volume hierarchy), which increases the speed of passage and simultaneously reduces the pressure on video memory. The size of the BVH in memory has also decreased due to the new primitive compression algorithm. Finally, AMD proposes to untie the orientation of the BVH boxes from the absolute coordinates of the scene. Rotating the boxes in accordance with the geometry of objects allows reducing the number of steps in the search.

It's a shame that despite all these changes, RDNA 4 still does BVH on shader ALUs rather than dedicated logic. According to Imagination Technologies' classification of hardware ray tracing tools, AMD's solutions are stuck at level 3, Intel's GPUs are at level 4, while NVIDIA's hardware is already moving from level XNUMX to level XNUMX thanks to SER, a dynamic instruction stream grouping feature designed to improve memory access coherence and, as a result, reduce latency.

There is no analog of SER in the RDNA 4 architecture either, but there is a function called Out of Order Memory, which helps solve the same problem in a different way. Previous versions of RDNA, working with several groups of instructions (wavefronts in AMD terminology), return data from memory strictly in the same order in which the requests were made. When the data requested by a shader is in a high-speed cache, the return can still be delayed if another shader made its request earlier, and its data is in comparatively slow VRAM. To be honest, this limitation seems completely meaningless in the context of multi-threaded computing, but it certainly appeared in order to simplify the organization of the GPU and for the time being (read: in rasterized games) it remained a reasonable compromise. RDNA 4 allows groups of instructions to return data regardless of the order of requests.

Out of Order Memory is just one of the changes to the memory stack in the RDNA 4 architecture. Previously, Compute Unit register allocation to shader programs was static, based on maximum consumption during the lifetime of the shader, which meant that a certain portion of the register file was effectively empty. RDNA 4 allows shaders to request and free registers dynamically. This allows the register file to be used more economically.
It is interesting that Shader Engine — a large GPU structure that combines the CU array, rasterizer, primitive block and rasterization operation blocks — has lost its own first-level cache in RDNA 4. But each segment of L2 cache contains twice as many banks. Thus, not only its volume has increased, but also its throughput, and in both directions — to Shader Engine and to Infinity Cache (in other words, L3). The volume of Infinity Cache itself remained the same as in Navi 32 — 64 MB.
Navi 48 has 16 PCI Express fifth-generation bus lines, but in gaming tasks, of course, it cannot fully occupy such a fast interface.

Finally, AMD says that the hardware video codec in the next-generation GPU delivers improved encoding quality at the popular low bitrate for streaming, but nothing more. That means there's still no support for YUV 4:2:2 color subsampling, which NVIDIA and Intel GPUs have.
In terms of image output, the Navi 48 display controller is compatible with the latest DisplayPort and HDMI interfaces: 2.1a and 2.1b, respectively.
Technical characteristics, prices
The Radeon RX 9070 XT is equipped with a fully functional Navi 48 crystal with 64 Compute Units, which means 4 standard-precision real-valued ALUs. It is not easy to find a formal predecessor of the new product among the “red” accelerators of the previous generation, because both the model grid and the GPU numbering principle have changed (Navi chips have never reached eight before). In terms of the formula for computing units and the characteristics of the memory stack, the Radeon RX 096 XT is similar to the Radeon RX 9070 XT, but AMD positions it as a replacement for the Radeon RX 7800 XTX.
The GPU's Game Clock and Boost Clock are significantly higher than the previous-gen flagship, at 2,4GHz and 2,97GHz, respectively. The Radeon RX 7900 XTX's theoretical peak clock speed is 26% higher, but AMD is looking to make up for that with improvements to its RDNA 4 architecture.
The Radeon RX 9070 XT features 16GB of GDDR6 memory with a bandwidth of 20Gbps per pin on a 256-bit bus. AMD was rumored to have considered increasing the VRAM capacity to 32GB, but ultimately deemed such a configuration excessive for consumer solutions. It’s a shame, because gaming appetites are steadily growing, and the amount of video memory was an important competitive advantage for many of AMD’s previous devices.
The Radeon RX 9070 XT has a reference power consumption of 304W and a suggested retail price of $599 ($100 more than the Radeon RX 7800 XT).
| Manufacturer | AMD | NVIDIA | ||||||
|---|---|---|---|---|---|---|---|---|
| Model | Radeon RX 7900 XT | Radeon RX 7900 XTX | Radeon RX 9070 | Radeon RX 9070 XT | GeForce RTX 4070 Ti SUPER | RTX GeForce 4080 SUPER | GeForce RTX 5070 Ti | GeForce RTX 5080 |
| Graphics Processor | ||||||||
| Name | Ships 31 XT | Ships 31 XTX | Ships 48 XT | Ships 48 XTX | AD103 | AD103 | GB203 | GB203 |
| Architecture | RDNA3 | RDNA3 | RDNA4 | RDNA4 | Ada Lovelace | Ada Lovelace | Blackwell | Blackwell |
| Technical process | TSMC N5/N6 | TSMC N5/N6 | TSMC N4C | TSMC N4C | TSMC4N | TSMC4N | TSMC 4NP | TSMC 4NP |
| Number of transistors, billion | 57,7 | 57,7 | 53,9 | 53,9 | 45,9 | 45,9 | 45,6 | 45,6 |
| Clock frequency (Base Clock/Game Clock/Boost Clock), MHz | 1/500/2 | 1/855/2 | 1/330/2 | 1/660/2 | 2 310 / 2 610 | 2 205 / 2 550 | 2 295 / 2 452 | 2 295 / 2 617 |
| Shader ALUs (FP32) | 5 376 (10 752) | 6 144 (12 228) | 3 584 (7 168) | 4 096 (8 192) | 8 448 | 10 240 | 8 960 | 10 752 |
| Texture Mapping Units (TMU) | 336 | 384 | 224 | 256 | 264 | 320 | 280 | 336 |
| Raster Operation Blocks (ROPs) | 192 | 192 | 128 | 128 | 112 | 112 | 96 | 112 |
| Tensor cores | No | No | No | No | 264 | 320 | 280 | 336 |
| RT cores | 84 | 96 | 112 | 128 | 66 | 80 | 70 | 84 |
| Last level cache size, MB | 80 | 96 | 64 | 64 | 48 | 64 | 64 | 64 |
| RAM | ||||||||
| Bus width, bit | 320 | 384 | 256 | 256 | 256 | 256 | 256 | 256 |
| Chip type | GDDR6 SGRAM | GDDR6 SGRAM | GDDR6 SGRAM | GDDR6 SGRAM | GDDR6X SRAM | GDDR6X SRAM | GDDR7 SGRAM | GDDR7 SGRAM |
| Throughput per contact, Gbps | 20 | 20 | 20 | 20 | 21 | 23 | 28 | 30 |
| Total throughput, Gbps | 800 | 960 | 640 | 640 | 672 | 736 | 896 | 960 |
| Volume, GB | 20 | 24 | 16 | 16 | 16 | 16 | 16 | 16 |
| Performance | ||||||||
| Peak FP32 performance, TFLOPS | 25,7 (51,5) | 30,7 (61,1) | 18,1 (36,1) | 24,3 (48,7) | 44,1 | 52,2 | 43,9 | 56,3 |
| Performance FP64/FP32 | 1/32 (1/64) | 1/32 (1/64) | 1/32 (1/64) | 1/32 (1/64) | 1/64 | 1/64 | 1/64 | 1/64 |
| Performance FP16/FP32 | 2/1 | 2/1 | 2/1 | 2/1 | 1/1 | 1/1 | 1/1 | 1/1 |
| Other | ||||||||
| PCI Express bus | PCI Express 4.0 x16 | PCI Express 4.0 x16 | PCI Express 5.0 x16 | PCI Express 5.0 x16 | PCI Express 4.0 x16 | PCI Express 4.0 x16 | 5.0 x16 | 5.0 x16 |
| Image output interfaces | DisplayPort 2.1, HDMI 2.1a | DisplayPort 2.1, HDMI 2.1a | DisplayPort 2.1a, HDMI 2.1b | DisplayPort 2.1a, HDMI 2.1b | DisplayPort 1.4a, HDMI 2.1 | DisplayPort 1.4a, HDMI 2.1 | DisplayPort 2.1b, HDMI 2.1b | DisplayPort 2.1b, HDMI 2.1b |
| TDP/TBP, W | 315 | 335 | 220 | 304 | 285 | 320 | 300 | 360 |
| Retail price (USA), $ | 899 (recommended at release date) | 999 (recommended at release date) | 549 (recommended at release date) | 599 (recommended at release date) | 799 (recommended at release date) | 999 (recommended at release date) | 749 (recommended at release date) | 999 (recommended at release date) |
In turn, the GeForce RTX 5070 Ti is based on the same GB203 GPU used in the GeForce RTX 5080, but the younger model has lost 14 streaming multiprocessors, that is, 1 FP792-compatible shader ALUs, and the gaming clock frequency has been reduced. As a result, the GeForce RTX 32 is separated from the RTX 5080 Ti by 5070% in theoretical performance, and the GeForce RTX 27 Ti itself does not differ in this parameter from the RTX 5070 Ti SUPER. As in the case of the Radeon RX 4070 XT, the lack of raw computing power is intended to be compensated for by the new GPU architecture, and first of all, the function of generating multiple frames using DLSS (up to three generated frames in the interval between two "real" ones).
The GeForce RTX 5070 Ti is rated for 300W of power consumption and, with an MSRP of $749, is $50 cheaper than the GeForce RTX 4070 Ti SUPER.
The $150 difference in recommended currency value between the Radeon RX 9070 XT and GeForce RTX 5070 Ti is not noticeable at current Russian prices: we found the most affordable partner models in DNS for 99 and 999 rubles, respectively. These are the ones we will test in benchmarks. The Radeon RX 104 XT is represented by the GIGABYTE GAMING OC graphics card with a factory GPU overclocked to 999/9070 GHz, and the GeForce RTX 2,52 Ti is a Palit GamingPro accelerator with reference specifications.
GIGABYTE Radeon RX 9070 XT GAMING OC: design
The Radeon RX 9070 XT by GIGABYTE has rather modest dimensions for a video card with a power consumption of no less than (and, as tests will show, much more than) 300 W: 288 mm in length and three slots in width. The only LED backlight element is a short strip on the side. The nameplate with the manufacturer's logo is movable: if you move it to the LED strip, the logo starts to glow.

The cooling system is served by three fans with an impeller diameter of 87 mm. Air passes through a small section of the radiator thanks to a cutout in the backplate.

The graphics processor and video memory chips give off heat to the evaporation chamber, but the former directly, and the latter through a U-shaped metal spacer. Separate plates are attached to the radiator lamellas, which serve as heat sinks for the VRM components. There are only seven heat pipes, their diameter is 6 mm. Note that GIGABYTE preferred liquid thermal pads to regular ones. Perhaps this is the reason why the VRAM chips heat up under load (see the empirical section of the review).

The backplate is involved in heat dissipation from the printed circuit board using several thermal pads (here they are regular, not liquid).

GIGABYTE Radeon RX 9070 XT GAMING OC: PCB
Some Radeon RX 9070 XT variants draw power from a 12V-2×6 connector, but GIGABYTE has opted for three universal eight-pin connectors. In light of the graphics card’s high power consumption and the ongoing complaints about the new connector, this is a more reliable solution, albeit not as convenient or aesthetically pleasing. Of the 17 power phases, 14 are dedicated to the GPU, controlled by a Monolithic Power Systems MP2868A PWM controller, and three are dedicated to video memory. Both VRMs are equipped with MP87993 power stages with an estimated current rating of 90A.
VRAM is composed of SK hynix chips, their marking H56G42AS8DX-014 encodes a bandwidth of 20 Gbps.
In addition to the BIOS copy that is active by default, the video card has a backup “silent” firmware.

Palit GeForce RTX 5070 Ti GamingPro: Design
The Palit GeForce RTX 5070 Ti GamingPro measures 331,9 mm in length, but only has three expansion slots. Under the GamingPro logo and the adjacent segment of the front panel are the RGB LEDs. The LEDs can be synchronized with the motherboard via a standard ARGB connector, which is located next to the 12V-2×6 power input.

The side surfaces of the case are formed by a cast aluminum frame with ventilation slots on the long sides. The protective plate on the back of the PCB has a familiar grill for through-blowing the radiator. The fan impellers have a diameter of 94 mm.

At the base of the cooler is a vapor chamber large enough to cover the GPU die and the surrounding video memory chips, and the VRM components are provided with separate plate heat sinks. The radiator lamellas are strung on eight heat pipes with a diameter of 6 mm.

The metal backplate serves only a protective and decorative function, since there is not a single thermal pad between it and the PCB.

The video card comes with an adapter from two eight-pin power connectors to 12V-2×6 and a small fabric mouse pad.

Palit GeForce RTX 5070 Ti GamingPro PCB
Unlike a video card , which we reviewed in our recent GeForce RTX 5080 review, the next-oldest model in the series is assembled on a spacious PCB, but its power supply system is simpler. The GPU has a 15-phase VRM controlled by a Monolithic Power Systems MP29816 PWM controller. The GDDR7 memory chips are powered by a three-phase power supply based on the MP2988 controller. Both regulators are equipped with MP87993 transistor assemblies with an estimated current rating of 90 A.
The marking of the Samsung K4VAF325ZC-SC28 video memory chips indicates a throughput of 28 Gbps.
Like the Radeon RX 9070 XT from GIGABYTE, the Palit accelerator has two BIOS options - "performance" and "quiet".

Test stand, testing methodology
| Test stand | |
|---|---|
| CPU | AMD Ryzen 9 7950X3D (PBO +150 MHz, CU -20) |
| Motherboard | ASUS ROG Crosshair X670E Hero |
| RAM | G.Skill Trident Z5 Neo RGB (F5-6000J3040G32GX2-TZ5NR), 2 x 32 GB (6200 MT/s, CL30) |
| ROM | Solidigm P44 Pro, 2 TB |
| Power supply unit | Corsair AX1600i 1600W |
| CPU cooling system | Custom Liquid Cooling System (EK-Quantum Velocity² DDC 4.2 PWM D-RGB + EK-Quantum Surface X280M) |
| Chassis | Open stand |
| Operating system | Windows 11 Pro |
| AMD GPU software | |
| All video cards | AMD Software Adrenalin Edition 25.2.1/25.3.1 |
| NVIDIA GPU software | |
| All video cards | NVIDIA GeForce Game Ready Driver 532.60/532.70 |
| Games without ray tracing | |||
|---|---|---|---|
| Game | API | Test Method | Graphics settings |
| Alan wake 2 | DirectX 12 | OCAT, Bright Falls location | Max graphics quality |
| Black MythWukong | DirectX 12 | Built-in benchmark | Max graphics quality |
| Cyberpunk 2077 | DirectX 12 | Built-in benchmark | Max graphics quality |
| F1 24 | DirectX 12 | Built-in benchmark, Monaco track (rain) | Max graphics quality |
| Hogwarts legacy | DirectX 12 | OCAT, Trolley Ride in Path to Hogwarts | Max graphics quality |
| Horizon Zero Dawn Remastered | DirectX 12 | Built-in benchmark | Max graphics quality |
| Metro Exodus | DirectX 12 | Built-in benchmark | Max Graphics Quality; Shading Rate: 100% |
| Red Dead Redemption 2 | Vulkan | Built-in benchmark | Max graphics quality |
| return | DirectX 12 | Built-in benchmark | Max graphics quality |
| Total War: WARHAMMER III | DirectX 11 | Built-in benchmark (Mirrors of Madness Benchmark) | Max graphics quality |
| Ray Tracing Games | ||||||
|---|---|---|---|---|---|---|
| Game | API | Test Method | Graphics settings | Frame scaling | ||
| AMD | Intel | NVIDIA | ||||
| Alan wake 2 | DirectX 12 | OCAT, Bright Falls location | Max graphics quality, high quality ray tracing | FSR Balanced | FSR Balanced | DLSS Balanced + Ray Reconstruction (+ Frame Generation) |
| Black MythWukong | Built-in benchmark | Max Graphics Quality and Path Tracing | FSR Balanced (+ Frame Generation) | XeSS Balanced/FSR Balanced + Frame Generation | DLSS Balanced (+ Frame Generation) | |
| Cyberpunk 2077 | Built-in benchmark (OCAT for frame generation) | Max Graphics Quality and Path Tracing | FSR Balanced (+ Frame Generation) | XeSS Balanced/FSR Balanced + Frame Generation | DLSS Balanced (Transformer Model) + Ray Reconstruction (+ Frame Generation) | |
| F1 24 | Built-in benchmark, Monaco track (rain) | Max graphics quality and ray tracing | FSR Balanced (+ Frame Generation) | XeSS Balanced/FSR Balanced + Frame Generation | DLSS Balanced (+ Frame Generation) | |
| Hogwarts legacy | OCAT, Trolley Ride in Path to Hogwarts | Max graphics quality and ray tracing | FSR Balanced | XeSS Balanced | DLSS Balanced + Ray Reconstruction (+ Frame Generation) | |
| Indiana Jones and the Great Circle | OCAT, location Sukhothai | Max Graphics Quality, High Quality Path Tracing; Ray Traced Shadows: Only Sunlight | FSR Balanced (+ Frame Generation) | XeSS Balanced/FSR Balanced + Frame Generation | DLSS Balanced (Transformer Model) + Ray Reconstruction (+ Frame Generation) | |
| Metro Exodus Enchanted Edition | Built-in benchmark | Max graphics quality and ray tracing | N / A | N / A | DLSS Balanced | |
| return | Built-in benchmark (OCAT for frame generation) | Max graphics quality and ray tracing | FSR Balanced (+ Frame Generation) | XeSS Balanced/FSR Balanced + Frame Generation | DLSS Balanced (+ Frame Generation) | |
In most games, the average and minimum (we specify the 1st percentile of the distribution) frame rates are derived from an array of individual frame rendering times or instantaneous frame rates obtained using a built-in benchmark. The exceptions are games that do not have a built-in benchmark and tests that use frame generation: in these cases, we use OCAT to capture interframe intervals.
| Work applications | ||
|---|---|---|
| application | Benchmark | Setting |
| Adobe Premiere Pro 25.x | Standard (4K) | |
| Blender 4.x | Agent 327 Barbershop demo from Blender website | Renderer Cycles |
| Blackmagic Design DaVinci Resolve Studio 19.x | Standard (4K); H.264/HEVC Encoding Mode: Auto | |
| CAD applications | SPECviewperf 2020 v3.1 | Screen resolution: 3840 × 2160 |
| Video Decoding (ffmpeg 5.x) | |||
|---|---|---|---|
| Format | Resolution | Coding parameters | API |
| H.264 (YUV 4:2:0, 8 bits/channel) | 1920 × 1080 | High Profile, L4.1 | D3D11VA |
| 3840 × 2160 | High Profile, L5.1 | ||
| HEVC (YUV 4:2:0, 8 bits/channel) | 1920 × 1080 | Main Profile, L4.0 | |
| 3840 × 2160 | Main Profile, L5.0 | ||
| 7680 × 4320 | Main Profile, L6.0 | ||
| VP9 (YUV 4:2:0, 8 bits/channel) | 1920 × 1080 | N / A | |
| 3840 × 2160 | |||
| 7680 × 4320 | |||
| AV1 (YUV 4:2:0, 8 bits/channel) | 1920 × 1080 | Main Profile, L4.0 | |
| 3840 × 2160 | Main Profile, L5.0 | ||
| 7680 × 4320 | Main Profile, L6.0 | ||
| Video encoding (ffmpeg 5.x) | |||||||
|---|---|---|---|---|---|---|---|
| Format | Resolution | Coding parameters | API | ||||
| AMD | Intel | NVIDIA | AMD | Intel | NVIDIA | ||
| H.264 (YUV 4:2:0, 8 bits/channel) | 1920 × 1080 | -c:v h264_amf -quality speed -coder cabac -refs 1 -b:v 3M | -c:v h264_qsv -preset veryfast -profile:v main -level 4.1 -b:v 3M | -c:v h264_nvenc -preset fast -coder cabac -refs 1 -b:v 3M | AMF | oneVPL | NVENC |
| 3840 × 2160 | -c:v h264_amf -quality speed -coder cabac -refs 1 -b:v 7.5M | -c:v h264_qsv -preset veryfast -profile:v main -level 5.1 -b:v 7.5M | -c:v h264_nvenc -preset fast -coder cabac -refs 1 -b:v 7.5M | ||||
| HEVC (YUV 4:2:0, 8 bits/channel) | 1920 × 1080 | -c:v hevc_amf -quality speed -b:v 3M | -c:v hevc_qsv -preset veryfast -tier main -b:v 3M | -c:v hevc_nvenc -preset fast -b:v 3M | |||
| 3840 × 2160 | -c:v hevc_amf -quality speed -b:v 7.5M | -c:v hevc_qsv -preset veryfast -tier main -b:v 7.5M | -c:v hevc_nvenc -preset fast -b:v 7.5M | ||||
| 7680 × 4320 | -c:v hevc_amf -quality speed -b:v 20M | -c:v hevc_qsv -preset veryfast -tier main -b:v 20M | -c:v hevc_nvenc -preset fast -b:v 20M | ||||
| AV1 (YUV 4:2:0, 8 bits/channel) | 1920 × 1080 | -c:v hevc_amf -quality speed -b:v 3M | -c:v av1_qsv -preset veryfast -profile main -b:v 3M | -c:v hevc_nvenc -preset fast -b:v 3M | |||
| 3840 × 2160 | -c:v hevc_amf -quality speed -b:v 7.5M | -c:v av1_qsv -preset veryfast -profile main -b:v 7.5M | -c:v hevc_nvenc -preset fast -b:v 7.5M | ||||
| 7680 × 4320 | -c:v hevc_amf -quality speed -b:v 20M | -c:v av1_qsv -preset veryfast -profile main -b:v 20M | -c:v hevc_nvenc -preset fast -b:v 20M | ||||
The power of the graphics cards is recorded separately from the CPU and other PC components using the NVIDIA PCAT device. The load for the power and noise level tests is the Cyberpunk 2077 game at a resolution of 3840 × 2160 and maximum graphics quality settings (without ray tracing), as well as the FurMark stress test with the most aggressive settings (resolution 3840 × 2160, MSAA 8x). All parameters are measured after the graphics card has warmed up, when the GPU temperature and clock frequencies have stabilized.
Test participants
The following video cards took part in performance testing:
- AMD Radeon RX 9070 XT (1820/3060 MHz, 20 Gbps, 16 GB);
- NVIDIA GeForce RTX 5070 Ti (2295/2452 MHz, 28 Gbps, 16 GB);
- ;
- ;
- ;
- ;
- ;
- .
Note. The base and boost frequencies of the GPU are shown in brackets.
Clock speeds, power consumption, temperature, noise level and overclocking
The Navi 48 GPU loses the split clock domains into the frontend and CU array that AMD introduced in the previous generation of Navi chips. However, no matter which of the two clock rates of the outgoing line's flagship GPU (Navi 31) you take as a reference, Navi 48 has made a major step up to 2,9 GHz. The GeForce RTX 5070 Ti, on the other hand, is content with 2,8 GHz clock rate under gaming load.
| Performance under load (Cyberpunk 2077) | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Video card | Setting | GPU clock speed, MHz (shader domain) | GPU clock speed, MHz (front-end) | GPU supply voltage, V | Fan speed, rpm (% of max.) | Fan speed 2, rpm (% of max.) | |||
| Avg. | Max. | Avg. | Max. | Avg. | Max. | Avg. | Avg. | ||
| GIGABYTE Radeon RX 9070 XT GAMING OC (1820/3060 MHz, 20 Gbps, 16 GB) | Silent BIOS | 2900 | 2956 | N / A | N / A | 1,01 | 1,01 | 1479 (30%) | N / A |
| GIGABYTE Radeon RX 9070 XT GAMING OC (1820/3060 MHz, 20 Gbps, 16 GB) | Performance BIOS | 2915 | 2941 | N / A | N / A | 1,01 | 1,01 | 1696 (34%) | N / A |
| GIGABYTE Radeon RX 9070 XT GAMING OC (+300 MHz, 22 Gbps, 16 GB) | Performance BIOS, +10% TBP | 2980 | 3001 | N / A | N / A | 1,03 | 1,04 | 1880 (38%) | N / A |
| SAPPHIRE NITRO+ Radeon RX 7900 XTX (1720/2499 MHz, 20 Gbps, 24 GB) | Secondary BIOS | 2545 | 2585 | 2753 | 2785 | 0,91 | 0,93 | 1412 (34%) | N / A |
| Palit GeForce RTX 5070 Ti GamingPro (2295/2452 MHz, 28 Gbps, 16 GB) | Silent BIOS | 2816 | 2820 | N / A | N / A | 1,06 | 1,06 | 1357 (36%) | 1357 (36%) |
| Palit GeForce RTX 5070 Ti GamingPro (2295/2452 MHz, 28 Gbps, 16 GB) | Performance BIOS | 2827 | 2835 | N / A | N / A | 1,06 | 1,06 | 1859 (50%) | 1859 (50%) |
| Palit GeForce RTX 5070 Ti GamingPro (+400 MHz, 32 Gbps, 16 GB) | Performance BIOS | 3202 | 3210 | N / A | N / A | 1,05 | 1,05 | 2010 (55%) | 2010 (55%) |
| Palit GeForce RTX 4070 Ti GameRock OC Classic (2310/2610 MHz, 21 Gbps, 12 GB) | Silent BIOS | 2805 | 2805 | N / A | N / A | 1,10 | 1,10 | 1516 (39%) | 1516 (39%) |
| Palit GeForce RTX 4070 Ti SUPER JetStream OC (2340/2640 MHz, 21 Gbps, 16 GB) | 2673 | 2700 | N / A | N / A | 1,04 | 1,05 | 1417 (38%) | 1417 (38%) | |
| NVIDIA GeForce RTX 4080 FE (2205/2505 MHz, 22,4 Gbps, 16 GB) | 2775 | 2775 | N / A | N / A | 1,08 | 1,08 | 1383 (43%) | 1299 (39%) | |
| Palit GeForce RTX 4080 SUPER JetStream OC (2295/2580 MHz, 23 Gbps, 16 GB) | 2722 | 2745 | N / A | N / A | 1,04 | 1,07 | 1473 (39%) | 1473 (39%) | |
| Palit GeForce RTX 5080 GameRock (2295/2617 MHz, 30 Gbps, 16 GB) | Silent BIOS | 2790 | 2790 | N / A | N / A | 1,04 | 1,04 | 1490 (40%) | 1490 (40%) |
Although the Radeon RX 9070 XT is not the flagship model of its generation (which it will not be this time), the power consumption of the new product in the GIGABYTE version reaches 374 W - almost like the Radeon RX 7900 XTX. This contrasts with the surprisingly low power of the "red" video card in idle mode - only 7 W. The GeForce RTX 5070 Ti from Palit consumes no more than 318 W - at the level of the 80 models of the 40th series.

The choice of firmware — “quiet” or “productive” — has a noticeable impact on the operation of the GIGABYTE Radeon RX 9070 XT GAMING OC cooling system. In the “productive” mode, the GPU temperature does not exceed 62 °C (and 84 at the hot spot), but the memory chips heat up to 94 °C. In the “quiet” mode, the temperature indicators change to 67, 90 and 98 °C, respectively.
In turn, the cooling system of the Palit GeForce RTX 5070 Ti GamingPro keeps the temperature of the graphics processor within 68 or 76 °C (again, depending on the active BIOS version), and the temperature of the VRAM chips stopped at a much more acceptable level of 70 or 78 °C. The GeForce 50 video card driver does not report the GPU hotspot temperature to monitoring utilities.

The noise level of the cooling system of both new products changes dramatically, obeying the BIOS switch. In the "quiet" mode, the Palit GeForce RTX 5070 Ti GamingPro turned out to be the quietest of the compared devices, but the GIGABYTE Radeon RX 9070 XT GAMING also distinguished itself with a moderate sound pressure adjusted for the significant power consumption. On the contrary, under the control of the "productive" BIOS, the GIGABYTE video card maintains an acceptable noise level, but the Palit GamingPro reaches 45 dBA - and this is without overclocking.

AMD's new GPUs are overclocked in the same way as NVIDIA chips: now, instead of the upper limit of the clock frequency, the user specifies the desired increase. The GPU supply voltage is regulated only downwards. Our video card sample remained stable when the target clock frequency was increased by 300 MHz, but due to the fact that the power reserve can only be increased by 10%, the actual core frequency increased by a symbolic 65 MHz. In turn, the video memory reached a bandwidth of 22 instead of the original 20 Gbps. The power consumption of the GIGABYTE Radeon RX 9070 XT GAMING in overclocking exceeded the 400 W mark, due to which the component temperature increased by a couple of degrees, and the noise level reached 41 dBA at a distance of 30 cm from the fans.
In contrast, the Palit GeForce RTX 5070 Ti, like the GeForce RTX 5080, overclocks superbly. Although the non-OC version of the card does not allow the factory TBP to be exceeded at all, we were able to add 375 MHz to the actual GPU clock in the gaming test and increase the GDDR7 memory bandwidth from 28 to 32 Gbps. Overclocking did not have a major impact on power consumption and component temperatures, and the noise level remained the same only because it was already quite high with the “performance” firmware.
Gaming tests (1920×1080)
At 1080p, both the GeForce RTX 5070 Ti and Radeon RX 9070 XT deliver frame rates in excess of 60 FPS in most of our test titles, and at least XNUMX FPS in Black Myth: Wukong.
| 1920 × 1080 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 135 / 142 | 143 / 148 | 101 / 106 | 106 / 112 | 122 / 128 | 123 / 129 | 119 / 126 | 135 / 146 |
| Black MythWukong | 58 / 66 | 53 / 64 | 49 / 56 | 51 / 59 | 58 / 68 | 60 / 69 | 59 / 68 | 67 / 76 |
| Cyberpunk 2077 | 124 / 149 | 138 / 166 | 93 / 108 | 99 / 117 | 118 / 139 | 115 / 138 | 122 / 146 | 128 / 167 |
| F1 24 | 182 / 267 | 167 / 263 | 154 / 223 | 156 / 229 | 171 / 251 | 175 / 253 | 173 / 244 | 187 / 275 |
| Hogwarts legacy | 188 / 213 | 196 / 216 | 145 / 160 | 154 / 170 | 173 / 193 | 181 / 197 | 185 / 202 | 193 / 218 |
| Horizon Zero Dawn Remastered | 144 / 184 | 146 / 187 | 132 / 164 | 130 / 167 | 142 / 182 | 145 / 185 | 137 / 176 | 141 / 186 |
| Metro Exodus | 79 / 130 | 81 / 140 | 68 / 118 | 73 / 127 | 79 / 146 | 81 / 148 | 80 / 144 | 88 / 167 |
| Red Dead Redemption 2 | 128 / 139 | 122 / 128 | 96 / 103 | 104 / 109 | 119 / 126 | 119 / 127 | 119 / 128 | 125 / 132 |
| return | 97 / 177 | 134 / 211 | 75 / 150 | 101 / 155 | 100 / 179 | 91 / 178 | 112 / 176 | 105 / 199 |
| Total War: WARHAMMER III | 85 / 105 | 85 / 107 | 76 / 94 | 81 / 97 | 86 / 103 | 88 / 105 | 85 / 103 | 83 / 105 |
| Max. | + 19 % | −9% | −2% | + 12 % | + 14 % | + 11 % | + 28 % | |
| Avg. | + 4 % | −18% | −14% | −3% | −2% | −3% | + 7 % | |
| Min. | −8% | −28% | −22% | −10% | −9% | −11% | −5% | |
Although some games prefer the AMD or NVIDIA architecture, from a practical point of view, the new products have equal performance: the GeForce RTX 5070 Ti is only 9070% slower than the Radeon RX 3 XT in FPS. Compared to the previous generation devices, the Radeon RX 9070 XT is 3% slower than the Radeon RX 7900 XTX. The GeForce RTX 5070 Ti is equivalent to the GeForce RTX 4080 and RTX 4080 SUPER, and the difference between the RTX 4070 Ti SUPER and the RTX 5070 Ti is 13% in frame rate.

Gaming tests (2560×1440)
In most benchmarks at 1440p, the new AMD and NVIDIA models achieve frame rates of at least 80 FPS, but the average frame rate in Black Myth: Wukong dropped to 51-52 FPS.
| 2560 × 1440 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 99 / 104 | 103 / 107 | 73 / 77 | 77 / 82 | 90 / 95 | 92 / 97 | 88 / 93 | 104 / 109 |
| Black MythWukong | 45 / 51 | 43 / 50 | 38 / 42 | 39 / 45 | 46 / 52 | 47 / 54 | 46 / 52 | 54 / 61 |
| Cyberpunk 2077 | 80 / 92 | 90 / 103 | 53 / 63 | 54 / 65 | 67 / 80 | 67 / 80 | 75 / 87 | 89 / 102 |
| F1 24 | 159 / 219 | 162 / 228 | 125 / 176 | 139 / 194 | 148 / 211 | 154 / 218 | 155 / 216 | 161 / 231 |
| Hogwarts legacy | 134 / 154 | 141 / 160 | 102 / 114 | 115 / 128 | 124 / 139 | 125 / 142 | 128 / 144 | 141 / 165 |
| Horizon Zero Dawn Remastered | 133 / 161 | 130 / 160 | 111 / 132 | 114 / 140 | 128 / 156 | 127 / 158 | 122 / 148 | 124 / 163 |
| Metro Exodus | 69 / 112 | 72 / 120 | 60 / 98 | 64 / 106 | 69 / 124 | 75 / 126 | 70 / 122 | 85 / 145 |
| Red Dead Redemption 2 | 114 / 119 | 106 / 111 | 83 / 88 | 89 / 94 | 104 / 109 | 106 / 111 | 106 / 111 | 109 / 115 |
| return | 89 / 140 | 110 / 162 | 75 / 115 | 81 / 121 | 92 / 138 | 85 / 139 | 92 / 137 | 82 / 154 |
| Total War: WARHAMMER III | 80 / 97 | 79 / 97 | 53 / 69 | 59 / 76 | 70 / 88 | 70 / 90 | 72 / 89 | 83 / 97 |
| Max. | + 16 % | −13% | −5% | + 11 % | + 13 % | + 9 % | + 29 % | |
| Avg. | + 4 % | −22% | −17% | −4% | −3% | −4% | + 9 % | |
| Min. | −7% | −32% | −29% | −13% | −13% | −11% | −3% | |
The Radeon RX 9070 XT and GeForce RTX 5070 Ti are again separated by a virtually insignificant 4% frame rate. The “red” new-generation graphics card is also only 4% behind the former flagship, the Radeon RX 7900 XTX, while the GeForce RTX 5070 Ti remains a replacement for the 80s of the previous generation. Compared to the GeForce RTX 4070 Ti SUPER, the new NVIDIA model has become 15% faster.

Gaming tests (3840×2160)
The Radeon RX 9070 XT and GeForce RTX 5070 Ti are generally suitable for gaming on a 4K screen at maximum graphics quality. Only in resource-intensive titles such as Black Myth: Wukong and Cyberpunk 2077 will the user have to put up with a frame rate significantly below 60 FPS or use upscaling.
| 3840 × 2160 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 55 / 58 | 57 / 59 | 40 / 43 | 43 / 46 | 51 / 54 | 50 / 54 | 49 / 52 | 58 / 62 |
| Black MythWukong | 27 / 31 | 27 / 31 | 22 / 25 | 24 / 27 | 28 / 31 | 28 / 32 | 28 / 31 | 33 / 36 |
| Cyberpunk 2077 | 34 / 41 | 38 / 44 | 22 / 27 | 23 / 28 | 29 / 35 | 28 / 36 | 32 / 38 | 39 / 46 |
| F1 24 | 110 / 140 | 118 / 155 | 91 / 116 | 99 / 126 | 111 / 145 | 114 / 146 | 112 / 148 | 127 / 167 |
| Hogwarts legacy | 79 / 91 | 81 / 95 | 58 / 65 | 61 / 68 | 70 / 81 | 75 / 83 | 73 / 84 | 90 / 102 |
| Horizon Zero Dawn Remastered | 89 / 103 | 87 / 102 | 65 / 77 | 74 / 87 | 83 / 98 | 85 / 100 | 75 / 89 | 86 / 103 |
| Metro Exodus | 52 / 80 | 52 / 86 | 44 / 66 | 47 / 72 | 55 / 85 | 56 / 86 | 54 / 86 | 67 / 103 |
| Red Dead Redemption 2 | 79 / 83 | 78 / 81 | 54 / 59 | 60 / 64 | 70 / 76 | 68 / 76 | 69 / 77 | 78 / 83 |
| return | 55 / 84 | 68 / 98 | 49 / 69 | 44 / 72 | 62 / 86 | 57 / 86 | 57 / 86 | 68 / 99 |
| Total War: WARHAMMER III | 40 / 57 | 42 / 59 | 29 / 39 | 33 / 44 | 39 / 52 | 40 / 53 | 39 / 53 | 47 / 63 |
| Max. | + 17 % | −17% | −10% | + 6 % | + 8 % | + 8 % | + 29 % | |
| Avg. | + 5 % | −25% | −19% | −4% | −3% | −4% | + 12 % | |
| Min. | −2% | −34% | −32% | −15% | −12% | −14% | 0% | |
High resolution hasn't changed the positions of the main competitors: Radeon RX 9070 XT still outperforms GeForce RTX 5070 Ti by 4% in average frame rate, but at the same time is 5% behind Radeon RX 7900 XTX. GeForce RTX 5070 Ti itself still replaces GeForce RTX 4080 and RTX 4080 SUPER, but the advantage over RTX 4070 Ti SUPER has grown to 18% FPS.

Ray Tracing Gaming Tests
Ray tracing games are the most important test that AMD's new accelerators will have to pass after years of lagging behind NVIDIA's solutions in this area. Fortunately, there are big changes for the better. Although in rasterized games the Radeon RX 9070 XT is an analogue of the Radeon RX 7900 XTX and on average even lags behind the former flagship, in ray tracing the Radeon RX 9070 XT is already ahead with a gap of 20-21% FPS.
True, there are games where the difference is small (Metro Exodus and Returnal), and in Indiana Jones and the Great Circle the frame rate on the Radeon RX 9070 XT dropped to 1 FPS even with FSR, which is apparently due to a lack of VRAM. However, this game only opened path tracing to owners of "red" GPUs in the latest add-on. The necessary driver optimizations are probably missing, so we did not take into account the results of Indiana Jones and the Great Circle when calculating the percentage ratios between the test participants. By the way, among all the test participants, only the Radeon RX 7900 XTX has the necessary amount of video memory to run Indiana Jones and the Great Circle on a 4K screen with maximum graphics quality. Other devices do not have this even with the most aggressive frame scaling settings.
| 1920 × 1080 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 59 / 62 | 47 / 50 | 52 / 56 | 58 / 61 | 67 / 71 | 66 / 70 | 67 / 71 | 76 / 82 |
| Black MythWukong | 19 / 24 | 11 / 14 | 31 / 38 | 34 / 42 | 40 / 48 | 41 / 50 | 41 / 48 | 49 / 57 |
| Cyberpunk 2077 | 34 / 40 | 28 / 33 | 36 / 46 | 38 / 49 | 47 / 59 | 47 / 59 | 42 / 56 | 50 / 65 |
| F1 24 | 121 / 157 | 70 / 130 | 97 / 128 | 104 / 137 | 104 / 154 | 110 / 157 | 102 / 152 | 110 / 169 |
| Hogwarts legacy | 110 / 128 | 94 / 109 | 95 / 115 | 104 / 123 | 117 / 140 | 120 / 142 | 119 / 142 | 134 / 160 |
| Indiana Jones and the Great Circle | N / A | 16 / 17 | N / A | 41 / 43 | 46 / 48 | 46 / 49 | 45 / 47 | 51 / 54 |
| Metro Exodus Enchanted Edition | 73 / 105 | 68 / 104 | 60 / 94 | 62 / 99 | 71 / 114 | 69 / 117 | 70 / 109 | 71 / 125 |
| return | 75 / 137 | 89 / 132 | 91 / 130 | 92 / 134 | 91 / 154 | 92 / 151 | 90 / 151 | 101 / 169 |
| Max. | −1% | + 58 % | + 75 % | + 100 % | + 108 % | + 100 % | + 138 % | |
| Avg. | −16% | + 3 % | + 10 % | + 27 % | + 29 % | + 25 % | + 44 % | |
| Min. | −42% | −18% | −13% | −2% | 0% | −3% | + 8 % | |
| 2560 × 1440 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 38 / 41 | 30 / 33 | 34 / 37 | 38 / 40 | 46 / 48 | 45 / 48 | 45 / 48 | 54 / 56 |
| Black MythWukong | 12 / 16 | 6 / 9 | 20 / 24 | 22 / 27 | 26 / 32 | 27 / 33 | 26 / 31 | 32 / 37 |
| Cyberpunk 2077 | 21 / 24 | 17 / 20 | 23 / 27 | 24 / 29 | 30 / 35 | 31 / 36 | 28 / 34 | 35 / 42 |
| F1 24 | 92 / 111 | 61 / 91 | 77 / 90 | 78 / 93 | 90 / 112 | 92 / 114 | 89 / 111 | 99 / 127 |
| Hogwarts legacy | 74 / 90 | 63 / 78 | 67 / 83 | 72 / 87 | 82 / 100 | 83 / 101 | 82 / 100 | 97 / 116 |
| Indiana Jones and the Great Circle | N / A | 10 / 11 | N / A | 30 / 32 | 30 / 33 | 34 / 36 | 31 / 32 | 38 / 40 |
| Metro Exodus Enchanted Edition | 58 / 83 | 57 / 80 | 50 / 70 | 54 / 76 | 61 / 89 | 62 / 91 | 56 / 83 | 69 / 99 |
| return | 61 / 103 | 74 / 100 | 66 / 94 | 67 / 98 | 78 / 116 | 67 / 115 | 78 / 113 | 87 / 131 |
| Max. | −3% | + 50 % | + 69 % | + 100 % | + 106 % | + 94 % | + 131 % | |
| Avg. | −17% | 0% | + 8 % | + 28 % | + 30 % | + 25 % | + 48 % | |
| Min. | −44% | −19% | −16% | + 1 % | + 3 % | 0% | + 14 % | |
| 3840 × 2160 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 17 / 20 | 14 / 16 | 4 / 5 | 19 / 20 | 22 / 24 | 22 / 24 | 22 / 24 | 27 / 29 |
| Black MythWukong | 5 / 7 | 3 / 4 | 5 / 10 | 11 / 13 | 13 / 16 | 13 / 16 | 13 / 16 | 16 / 19 |
| Cyberpunk 2077 | 10 / 11 | 8 / 9 | 9 / 12 | 11 / 13 | 14 / 16 | 14 / 17 | 13 / 16 | 16 / 20 |
| F1 24 | 51 / 59 | 35 / 48 | 43 / 48 | 46 / 52 | 55 / 61 | 56 / 62 | 55 / 61 | 63 / 72 |
| Hogwarts legacy | 40 / 51 | 34 / 44 | 34 / 45 | 39 / 50 | 45 / 58 | 46 / 59 | 44 / 58 | 53 / 67 |
| Indiana Jones and the Great Circle | N / A | 5 / 6 | N / A | N / A | N / A | N / A | N / A | N / A |
| Metro Exodus Enchanted Edition | 38 / 50 | 35 / 46 | 32 / 41 | 35 / 45 | 41 / 53 | 34 / 55 | 38 / 50 | 45 / 60 |
| return | 38 / 59 | 44 / 59 | 38 / 52 | 39 / 54 | 48 / 66 | 47 / 67 | 47 / 65 | 51 / 76 |
| Max. | 0% | + 43 % | + 86 % | + 129 % | + 129 % | + 129 % | + 171 % | |
| Avg. | −17% | −12% | + 10 % | + 33 % | + 35 % | + 32 % | + 57 % | |
| Min. | −43% | −75% | −12% | + 3 % | + 5 % | 0% | + 20 % | |
In absolute frame rate terms, the Radeon RX 9070 XT has the performance needed for hybrid rendering titles at resolutions up to 1440p and, at a stretch, 4K.
The same goes for the GeForce RTX 5070 Ti, and yet the average score for NVIDIA's device is 25-32% ahead, mostly in fully traced games. In its own camp, the GeForce RTX 5070 Ti confirms its status as the successor to the GeForce RTX 4080 and RTX 4080 SUPER: the latter's advantage lies in the range of 1 to 5% of frame rate. Compared to the GeForce RTX 4070 Ti SUPER, performance has increased by 13 to 19%.

Game tests with ray tracing and frame scaling
Thanks to Balanced upscaling, the new generation of AMD and NVIDIA devices guarantee comfortable frame rates in “hybrid” games at resolutions up to 4K. However, due to the 20-26% advantage, the GeForce RTX 5070 Ti copes much better with fully traced titles.
| 1920 × 1080 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 99 / 105 | 81 / 85 | 97 / 104 | 101 / 109 | 114 / 122 | 114 / 123 | 116 / 123 | 134 / 141 |
| Black MythWukong | 40 / 49 | 24 / 31 | 61 / 73 | 64 / 79 | 73 / 90 | 74 / 90 | 73 / 87 | 83 / 98 |
| Cyberpunk 2077 | 73 / 84 | 61 / 70 | 82 / 96 | 87 / 100 | 104 / 116 | 105 / 117 | 94 / 110 | 110 / 126 |
| F1 24 | 165 / 220 | 92 / 190 | 113 / 174 | 115 / 177 | 114 / 200 | 116 / 203 | 116 / 201 | 121 / 218 |
| Hogwarts legacy | 177 / 203 | 161 / 189 | 140 / 160 | 158 / 176 | 171 / 183 | 171 / 182 | 172 / 183 | 173 / 183 |
| Indiana Jones and the Great Circle | N / A | 27 / 29 | N / A | 55 / 59 | 62 / 67 | 65 / 68 | 62 / 65 | 73 / 75 |
| Metro Exodus Enchanted Edition | N / A | N / A | 69 / 121 | 75 / 125 | 79 / 139 | 81 / 142 | 75 / 134 | 80 / 148 |
| return | 42 / 170 | 123 / 175 | 96 / 165 | 96 / 173 | 110 / 197 | 108 / 194 | 104 / 193 | 97 / 205 |
| Max. | + 3 % | + 49 % | + 61 % | + 84 % | + 84 % | + 78 % | + 100 % | |
| Avg. | −15% | + 3 % | + 9 % | + 22 % | + 23 % | + 20 % | + 32 % | |
| Min. | −37% | −21% | −20% | −10% | −10% | −10% | −10% | |
| 2560 × 1440 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 75 / 79 | 61 / 65 | 74 / 79 | 78 / 83 | 89 / 95 | 90 / 96 | 90 / 95 | 104 / 109 |
| Black MythWukong | 28 / 34 | 17 / 22 | 43 / 53 | 47 / 58 | 56 / 67 | 56 / 68 | 55 / 65 | 63 / 75 |
| Cyberpunk 2077 | 49 / 56 | 40 / 47 | 56 / 64 | 60 / 68 | 70 / 79 | 71 / 80 | 66 / 76 | 76 / 87 |
| F1 24 | 139 / 178 | 82 / 151 | 102 / 145 | 110 / 155 | 100 / 162 | 105 / 171 | 111 / 167 | 111 / 177 |
| Hogwarts legacy | 154 / 180 | 116 / 142 | 98 / 115 | 107 / 125 | 125 / 144 | 126 / 146 | 122 / 137 | 144 / 163 |
| Indiana Jones and the Great Circle | N / A | 19 / 22 | N / A | 45 / 48 | 54 / 56 | 53 / 56 | 49 / 51 | 58 / 60 |
| Metro Exodus Enchanted Edition | N / A | N / A | 65 / 104 | 70 / 110 | 77 / 124 | 79 / 128 | 77 / 119 | 75 / 134 |
| return | 29 / 142 | 93 / 151 | 93 / 140 | 74 / 145 | 110 / 165 | 109 / 164 | 102 / 164 | 96 / 182 |
| Max. | + 6 % | + 56 % | + 71 % | + 97 % | + 100 % | + 91 % | + 121 % | |
| Avg. | −17% | + 2 % | + 9 % | + 24 % | + 26 % | + 22 % | + 39 % | |
| Min. | −35% | −36% | −31% | −20% | −19% | −24% | −9% | |
| 3840 × 2160 | ||||||||
|---|---|---|---|---|---|---|---|---|
| AMD Radeon RX 9070XT | AMD Radeon RX 7900 XTX | NVIDIA GeForce RTX 4070 Ti | NVIDIA GeForce RTX 4070 Ti SUPER | NVIDIA GeForce RTX 4080 | NVIDIA GeForce RTX 4080 SUPER | NVIDIA GeForce RTX 5070 Ti | NVIDIA GeForce RTX 5080 | |
| Alan wake 2 | 42 / 46 | 34 / 37 | 43 / 46 | 47 / 50 | 54 / 58 | 54 / 59 | 54 / 57 | 62 / 66 |
| Black MythWukong | 15 / 19 | 8 / 11 | 25 / 30 | 28 / 33 | 33 / 39 | 33 / 40 | 32 / 38 | 39 / 45 |
| Cyberpunk 2077 | 25 / 28 | 20 / 24 | 29 / 33 | 31 / 35 | 37 / 43 | 37 / 43 | 35 / 41 | 41 / 48 |
| F1 24 | 91 / 112 | 63 / 95 | 76 / 91 | 80 / 98 | 90 / 113 | 91 / 114 | 90 / 112 | 99 / 128 |
| Hogwarts legacy | 85 / 101 | 72 / 88 | 56 / 67 | 60 / 70 | 70 / 82 | 71 / 83 | 67 / 78 | 82 / 93 |
| Indiana Jones and the Great Circle | N / A | 12 / 13 | N / A | N / A | N / A | N / A | N / A | N / A |
| Metro Exodus Enchanted Edition | N / A | N / A | 51 / 72 | 55 / 78 | 62 / 91 | 63 / 92 | 61 / 87 | 72 / 102 |
| return | 20 / 102 | 81 / 108 | 67 / 91 | 64 / 96 | 78 / 112 | 76 / 112 | 83 / 113 | 93 / 127 |
| Max. | + 6 % | + 58 % | + 74 % | + 105 % | + 111 % | + 100 % | + 137 % | |
| Avg. | −16% | + 2 % | + 10 % | + 29 % | + 31 % | + 26 % | + 47 % | |
| Min. | −42% | −34% | −31% | −19% | −18% | −23% | −8% | |
Compared to the Radeon RX 7900 XTX, the upgraded AMD chip architecture provided the Radeon RX 9070 XT with an 18-20% FPS performance boost. The GeForce RTX 5070 Ti, in turn, lags behind the GeForce RTX 4080 and RTX 4080 SUPER by up to 4% FPS, but outperforms the GeForce RTX 4070 Ti SUPER by 10-14%.

Gaming tests in overclocking
As you might expect from the meager GPU clock speed increase, overclocking the Radeon RX 9070 TX (at least in the GIGABYTE GAMING OC version) is a near-futile exercise, only allowing for a 4% frame rate gain. The Palit GeForce RTX 5070 Ti GamingPro, on the other hand, is 9% faster after overclocking.

Tests in production applications
In the Blender benchmark, the new AMD graphics card was able to extract a significantly greater advantage from hardware acceleration of ray tracing than the Radeon RX 7900 XT. However, the old flagship completed rendering faster due to its advantage in raw performance. As a result, there can be no talk of equal competition between the Radeon RX 9070 XT and the GeForce RTX 5070 Ti. The GeForce RTX 5070 Ti itself took an intermediate position between the RTX 4070 Ti SUPER and RTX 4080.

In transcoding tasks using Premiere Pro, the "red" accelerator of the new generation is on par with the Radeon RX 7900 XTX, but lags behind the "green" competitors in the overall assessment due to the relatively low speed of working with RAW files. GeForce RTX 5070 Ti, like the RTX 5080, uses an updated H.264 and HEVC decoder, so it confidently outperforms the 40-series video cards.

The Radeon RX 9070 XT takes the last place in the GPU effects rendering speed chart, while the GeForce RTX 5070 Ti follows the RTX 5080 and two versions of the RTX 4080 in this test.

DaVinci Resolve transcoding tests put the Radeon RX 9070 XT on par with the Radeon RX 7900 XTX, just behind the GeForce RTX 5080. The GeForce RTX 5070 Ti outperforms all older NVIDIA solutions and follows the Radeon RX 9070 XT.

In the GPU effects rendering benchmark, the Radeon RX 9070 XT again falls to the bottom of the chart. The GeForce RTX 5070 Ti, on the other hand, is ahead of even the Radeon RX 7900 XTX and is second only to the GeForce RTX 5080.

The GeForce RTX 5070 Ti is poorly suited for a number of CAD applications due to the lack of necessary driver optimizations. Here, the Radeon RX 9070 XT takes revenge.

Video encoding/decoding
The hardware decoder on the Navi 48 chip received a non-trivial increase in performance when working with AV1, but the “green” GPUs still cope with other video formats noticeably faster.

In turn, the video encoder has also gained speed and can now compete with Intel's QuickSync, although it lags behind the "green" NVENC.

Performance per Watt
Since our Radeon RX 9070 XT and Radeon RX 7900 XTX samples have almost the same power reserves, the difference in power efficiency between them is entirely determined by gaming performance. The old flagship leads with a 5% FPS advantage in specific frame rate in rasterization, but in traced games the Radeon RX 9070 XT made a 20% jump.
| Manufacturer | AMD | NVIDIA | ||||||
|---|---|---|---|---|---|---|---|---|
| Model | Radeon RX 9070 XT | Radeon RX 7900 XTX | GeForce RTX 4070 Ti | GeForce RTX 4070 Ti SUPER | GeForce RTX 4080 | RTX GeForce 4080 SUPER | GeForce RTX 5070 Ti | GeForce RTX 5080 |
| Graphics Processor | Ships 48 XTX | Ships 31 XTX | AD104 | AD103 | AD103 | AD103 | GB203 | GB203 |
| Microarchitecture | RDNA4 | RDNA3 | Ada Lovelace | Ada Lovelace | Ada Lovelace | Ada Lovelace | Blackwell | Blackwell |
| Process technology, nm | TSMC N4C | TSMC N5/N6 | TSMC4N | TSMC4N | TSMC4N | TSMC4N | TSMC 4NP | TSMC 4NP |
| Average power consumption (FurMark), W | 373 | 372 | 285 | 285 | 332 | 317 | 318 | 397 |
| Performance/W (without ray tracing) | 100% | + 5 % | −2% | + 6 % | + 8 % | + 14 % | + 13 % | + 5 % |
| Performance/W (with ray tracing) | 100% | −17% | + 15 % | + 44 % | + 50 % | + 59 % | + 55 % | + 48 % |
In turn, the GeForce RTX 5070 Ti here also turned out to be an analogue of the GeForce RTX 4080 SUPER, but this video card has a lower TBP compared to the Radeon RX 9070 XT and outperforms the "red" competitor in ray tracing. As a result, the energy efficiency of the GeForce RTX 5070 Ti is higher by 13 or 55%, depending on the rendering method.
Summary of gaming benchmark results without ray tracing

Ray Tracing Gaming Benchmarks Summary

Summary of gaming benchmarks with ray tracing and frame scaling

Conclusions
Of the two competing devices, the Radeon RX 9070 XT and the GeForce RTX 5070 Ti, the latter is the easiest to describe. The graphics card is a variant of the GeForce RTX 5080, cut down just enough to make it an analogue of the GeForce RTX 4080 or RTX 4080 SUPER, only with a lower MSRP. This in itself would be forgivable if it weren’t for the fact that the GeForce RTX 5070 Ti outperforms the RTX 4070 Ti SUPER by 19% FPS at best under the most favorable conditions, and officially costs only $50 less, not to mention the actual prices.
The Radeon RX 9070 XT takes the place of the Radeon RX 7900 XTX in rasterized games, despite a large gap in raw computing power, and most importantly, due to the improved architecture, it has pulled ahead by 21% FPS in ray tracing. It is nice to see that at least AMD's mid-range GPU has caught up and surpassed the old top-tier GPU. But alas, even this increase was not enough to put an end to the multi-year gap with NVIDIA. The Radeon RX 9070 XT has a symbolic victory over the GeForce RTX 5070 Ti in rasterization, but in games with ray tracing, the average advantage of the GeForce RTX 5070 Ti reaches 32%, and in fully traced games - even more. The recommended price of the Radeon RX 9070 XT is proportionally lower than that of the RTX 5070 Ti, which balances out the weaknesses of the RDNA 4 logic. However, in Russia, the minimum prices of the competitors are almost equal, so despite NVIDIA's stagnation and AMD's progress, the choice between the Radeon RX 9070 XT and GeForce RTX 5070 Ti is simple - at least for now.
Among the gaming functions of the new GPUs, upscaling takes first place — DLSS or FSR version 4. NVIDIA chips received the ability to generate multiple frames and transformable models, and AMD, in principle, began to use neural networks for this purpose. But the FSR XNUMX technology, like DLSS, is tied to the "native" hardware, so the frame scaling tools available to owners of "green" and "red" accelerators are divided, and it is no longer possible to say that GeForce video cards support both DLSS and FSR, and Radeon - only FSR. We will consider the topic of upscaling quality and related latency separately in the upcoming study.
The Radeon RX 9070 XT, like the GeForce RTX 5070 Ti, has 16 GB of video memory. This means that within the current generation, only the GeForce RTX 5090 offers more, while other models risk getting into a situation where the GPU could handle the maximum graphics quality in a game (even with the help of upscaling), but is limited by the amount of VRAM. So far, there is only one such game - Indiana Jones and the Great Circle, but this is just the first bell.
The Radeon RX 9070 XT is a pure gaming product, poorly suited to most of the workloads we tested (excluding CAD). The GeForce RTX 5070 Ti, on the other hand, performed well in video editing tasks thanks to its updated hardware codec, but again is unlikely to be of interest to professionals due to the lack of local memory.
Finally, let's say a few words about the video cards that represent the Radeon RX 9070 XT and GeForce RTX 5070 Ti in the review - and now these are the most affordable versions of both models on the Russian market. The "red" GIGABYTE accelerator and the "green" Palit have a higher power consumption than prescribed by the reference specifications (in the "Radeon" it has exceeded 360 W) and work quite quietly if the corresponding BIOS is active. At the same time, the Palit GeForce RTX 5070 Ti GamingPro overclocks much better, although it does not even allow you to increase the TBP, but the GIGABYTE Radeon RX 9070 XT GAMING OC unpleasantly surprised with a VRAM temperature above 90 ° C and liquid thermal pads, which make it difficult to service the video card.
Source: 3dnews.ru


