The Nvidia RTX A6000, one of the most eagerly anticipated workstation GPUs in recent years, is finally here. And it’s certainly been worth the wait, writes Greg Corke
Nvidia first announced its new Nvidia RTX A6000 workstation GPU back in October 2020. But like many IT products in the year of Covid-19, it was delayed and we’ve been eagerly awaiting its arrival.
And now it’s finally here – or rather, it should be, by the time you read this. UK firm Scan should have availability of the PNY Nvidia RTX A6000 from 3 March and sent us a sample to review inside its new 3XS GWP-ME N1-32T AMD Ryzen Threadripper workstation, which we take a closer look at here.
Tuned for visualisation workflows, the Nvidia RTX A6000 (A for Ampere) is Nvidia’s second-generation RTX GPU. With an emphasis on hardware-based ray tracing, it promises to deliver more than double the GPU rendering performance of its predecessor, the Turing-based Nvidia Quadro RTX 6000. And, of course, it also offers a significant boost for 3D graphics, VR and AI workflows. And all for £3,730 + VAT.
The Nvidia RTX A6000 is the first workstation GPU to be built on the Nvidia Ampere architecture, and also the first to support PCIe Gen 4. With double the bandwidth of PCIe Gen 3, it should mean data can move in and out of the GPU quicker, but it won’t benefit all workflows. PCIe Gen 4 is currently only available in workstations with the latest AMD CPUs.
What? No Quadro?
The observant among you will have noticed that the Nvidia RTX A6000 is missing something. Yes, after 17 years, it appears that Nvidia is retiring its Quadro brand.
For someone who has written about Nvidia professional GPUs since the early 2000s, it’s hard to get used to this change. And, it would seem, Nvidia is struggling too. For the October launch, product photos were sent out with ‘Nvidia Quadro RTX A6000’ in the file name, which suggests a last-minute rebranding.
But the Nvidia RTX A6000 is certainly a Quadro card in everything but name. It has certified drivers for pro applications, renowned reliability, ECC memory and plenty of niche features for pro visualisation, such as stereo and Frame Lock for viz clusters. There’s also support for Nvidia virtual GPU (vGPU) software, which allows a workstation to be repurposed into multiple high performance virtual workstation instances.
With 48 GB of VRAM, the Nvidia RTX A6000 has the most memory of any professional GPU – on par with the Quadro RTX 8000, and double that of the Quadro RTX 6000.
This is particularly relevant for those working with high poly count models and very high-resolution textures in GPU renderers or real-time visualisation/VR. But more on this later.
What’s more, two Nvidia RTX A6000s can be bridged together with an NVLink adapter to create a memory pool of 96 GB. However, sharing geometry between two GPUs can come with a significant performance hit.
Beauty and the beast
Nvidia’s product design team has really gone to town on the Nvidia RTX A6000 and the card is a thing of beauty, with a minimal angular design and black mirror finish. NVLink, Stereo and Sync connectors are all hidden away behind discrete panels.
The board is rated at 300W, so you’ll need a fairly hefty PSU in your workstation. It gets its power from a special 8-pin connector, which is connected to two standard 8-pin PSU connectors via an adapter. It is cooled by two fans, top and bottom. There are four DisplayPort 1.4a connectors nestled below a heat sink.
Testing the RTX A6000
We put the PNY Nvidia RTX A6000 through a series of real-world application benchmarks, both for GPU rendering and real-time visualisation.
The GPU is simply overkill for current generation CAD and BIM software, so we didn’t do any testing in that regard. However, it’s important to note that it will still be certified for the likes of Solidworks, PTC Creo and Siemens NX, which is useful if you plan to use those kinds of applications alongside more viz-focused tools like Chaos V-Ray, Luxion KeyShot and Solidworks Visualize.
The full spec of our Scan 3XS GWP-ME N1-32T test machine can be seen below.
You can read a full review here.
Scan 3XS GWP-ME N1-32T
Processor: AMD Ryzen Threadripper 3970X (32C/64T) (3.7GHz – 4.5GHz)
Graphics card: Nvidia RTX A6000 (48 GB) (461.09 driver)
Memory: 128 GB Corsair Vengeance LPX 3600MHz DDR4
System drive: 2TB WD Black SN850 PCIe 4.0 NVMe M.2 SSD
Storage drive: 4TB Samsung 860 EVO SSD
Motherboard: ASUS ROG STRIX TRX40-E
Case: Fractal Design Define 7
CPU cooler: Cooler Master MasterLiquid ML360
Power supply: Corsair RM1000i – 80PLUS Gold
Operating system: Microsoft Windows 10 Pro 64-bit
Price £7,500 + VAT
For comparison, we used a Quadro RTX 4000 and historic data from the Quadro RTX 6000 GPU (457.09 driver), tested in a Lenovo ThinkStation P620 workstation with AMD Ryzen Threadripper Pro 3995WX CPU and 128 GB RAM.
Rendering with RTX A6000
GPU rendering has now reached a stage where it is becoming ubiquitous. While renderers built into CAD applications still tend to rely on the CPU, most of the major design viz focused renderers now offer a GPU rendering capability. They can also take full advantage of Nvidia RTX technology, including the dedicated Ray Tracing cores and Tensor cores for AI denoising.
As a side note, it’s important to check that your chosen GPU rendering tool supports the new Nvidia RTX A6000. Users of Solidworks Visualize 2021, for example, will have to wait until Service Pack 3 in April 2021 before they can take advantage of any ‘Ampere’ GPU.
Chaos Group V-Ray
V-Ray is one of the most popular physically based rendering tools, especially in architectural visualisation. We put the Nvidia RTX A6000 through its paces using the new, freely downloadable V-Ray 5 benchmark, which has dedicated tests for Nvidia CUDA GPUs, Nvidia RTX GPUs, as well as CPUs.
The results were extremely compelling, with the Nvidia RTX A6000 showing itself to be 2.32 faster than the Nvidia Quadro RTX 6000 in the CUDA test and 1.85 times faster in the RTX test. Considering this is just a generation-ongeneration comparison, it’s a phenomenal leap in performance.
Luxion Keyshot
KeyShot, a CPU rendering stalwart, is a relative newcomer to the world of GPU rendering. But it’s one of the slickest implementations we’ve seen, allowing users to switch between CPU and GPU rendering at the click of a button.
In the Keyshot 10 benchmark, part of the free KeyShot Viewer, we saw similar results to V-Ray. The Nvidia RTX A6000 outperformed the Quadro RTX 6000 by a factor of 1.97.
GPU memory – why 48GB?
One aspect of technology that has long held back GPU rendering is the limited amount of GPU memory available on each card. When a scene, including geometry, materials and lighting, doesn’t fit entirely into GPU memory, the render can fail, slow down (if it runs ‘out of core’ using system memory), or simply fall back to the CPU.
There are many established workarounds to help users ensure a dataset fits entirely within GPU memory — such as resizing or optimising textures, simplifying or stripping out geometry, or rendering in separate passes — but all of this takes time to prep. What the user really wants is not to have to worry about GPU memory at all.
With 48 GB, the Nvidia RTX A6000 is arguably the GPU that will enable this. And while it’s not the first to offer that much memory (the Quadro RTX 8000 also came with 48 GB) it’s the first to do so at a sub £4,000 price point.
Of course, for some workflows 48 GB will be overkill. However, for those pushing the boundaries of realism, using very hi-fidelity textures (such as those captured from reallife scans) or colossal engineering accurate datasets, it should open up a wealth of opportunities.
Luxion, for example, has in the past reported ray tracing a KeyShot scene with 1.37 billion unique triangles using two Quadro RTX 5000 cards and NVLink for a combined 32 GB of memory.
Indeed, despite its designer-friendly workflows, KeyShot is doing much to break new ground. A new feature in KeyShot 10 called RealCloth 2.0, for example, allows users to generate visually accurate woven materials using thread geometry. However, this can be incredibly memory-hungry when using the option to represent each individual thread as a geometric entity, even with relatively small scenes.
Previously, we’ve explored this feature in our KeyShot 10 review. Using a 16 GB Nvidia Quadro RTX 5000, however, we quickly ran out of memory, with the scene falling back to the CPU to render.
However, we can report that things were entirely different with the Nvidia RTX A6000.
With 48 GB to play with, we were able to much better match the scale of the weave in KeyShot to that of the physical fabric.
As you will see from the images below, the difference in accuracy, in terms of how the material is represented, is quite incredible.
Of course, RealCloth 2.0 is quite a niche example. However, there will certainly be some architectural or automotive visualisers out there that would benefit from 48 GB (or even 96 GB with two RTX A6000s over NVlink) now or in the future. You’ll know who you are.
Real-time 3D
While GPU rendering is a major play for the Nvidia RTX A6000, real-time 3D using OpenGL and DirectX continues to be a very important part of visualisation, with applications including Autodesk VRED, Lumiscaphe, Unreal Engine, Unity and others. And, of course, the boundaries between real-time 3D and ray tracing continue to blur.
To test frame rates, we used FRAPS in combination with a 3DConnexion SpaceMouse to ensure the models moved in a consistent way every time. We only tested at 4K (3,840 x 2,160) resolution. At FHD (1,920 x 1,080) resolution, this level of GPU simply isn’t stressed enough.
Enscape
Enscape is a real-time viz and VR tool for architects that uses OpenGL and delivers very high quality graphics in the viewport.
Enscape has used elements of ray tracing in its software for some time. Newer versions of the software are RTX-enabled, so full ray tracing can be toggled on and off.
For our tests, Enscape provided a large architectural scene of a museum and its surrounding area. At 7.5GB, the GPU memory requirements of this model are relatively high, but Enscape models can take up much more and this is a drop in the ocean for the Quadro RTX A6000.
In terms of performance, the Nvidia RTX A6000 delivered a phenomenal 53 frames per second (FPS) for an incredibly smooth experience. This is around 1.39 times faster than the Nvidia Quadro RTX 6000 and 2.79 times faster than the Nvidia Quadro RTX 4000.
Autodesk VRED Professional
Autodesk VRED Professional is an automotive-focused 3D visualisation, virtual prototyping and VR tool. It uses OpenGL and delivers very high-quality visuals in the viewport. It offers several levels of real-time anti-aliasing (AA), which is important for automotive styling, as it smooths the edges of body panels.
However, AA calculations use a lot of GPU resources, both in terms of processing and memory. We tested our automotive model with AA set variously to ‘off’, ‘medium’ and ‘ultra-high’.
The Nvidia RTX A6000 was significantly faster than the Quadro RTX 6000 delivering between 1.46 and 1.50 more frames per second in all of our tests. Most notably, the card delivered a very smooth 25.15 FPS when AA set to ‘ultra high’, which is unheard for a model of this complexity. This is 2.97 times faster than a Quadro RTX 4000.
Conclusion
Nvidia has put a huge effort into developing its hardware-based ray tracing technology and it’s now really starting to see the fruits.
It’s quite incredible to think it has managed to double the GPU rendering performance, generation on generation (Turing to Ampere). And, in real-time 3D, with a 1.4 to 1.5 times boost, it’s hardly dragging its heels.
Rendering has become the big battleground for hardware manufacturers and, with the 64-core AMD Ryzen Threadripper CPU, competition between GPU and CPU has never been so fierce.
CPUs have always had the lead when it comes to addressable memory and there’s an ongoing debate about rendering performance versus accuracy. But with 48 GB per GPU, and the ability to scale up cards as and when required (the new Supermicro SuperWorkstation 5014A-TT can support up to four Nvidia RTX A6000s), the argument for GPU rendering has never been more compelling.
In many ways, the biggest competition for the Nvidia RTX A6000 is from Nvidia itself.
The consumer-focused Nvidia GeForce RTX 3090 should offer roughly the same performance for one third of the price. It does have half the memory (24 GB), but with support for NVLink, two can be linked together to create a virtual 48 GB render resource. The downside is that it’s nigh-on impossible to get hold of a board right now and prices are inflated accordingly.
Of course, for some firms, it will always need to be Quadro all the way. Or should we say Quadro in features, warranty, reliability and certification, but not in name. It’s going to take some time to get used to the rebranding.