Anyone running Resolve using RTX 2080ti already?

Get answers to your questions about color grading, editing and finishing with DaVinci Resolve.
  • Author
  • Message
Offline
User avatar

Jed Mitchell

  • Posts: 99
  • Joined: Tue Nov 03, 2015 11:04 pm
  • Location: New York, NY

Re: Anyone running Resolve using RTX 2080ti already?

PostWed Jan 02, 2019 5:53 pm

Piotr Wozniacki wrote:As long as RT cores and Tensor cores of Turing cards don't do anything special in Resolve, I guess that with 3x Titan Xp I'd get similar performance to 2x 2080 ti cards - or am I too optimistic?


After doing some more performance testing on the 2080 Ti and comparing it with some other numbers out there (and against my 2x 1080 Ti system) I'd actually expect equal or better performance with even 2x Titan Xp than 2x 2080 Ti, and 3x Titan Xp should be much faster.

I was starting a build from scratch so getting the 2080 Ti seemed like an acceptable gamble, but if I was upgrading from 2x 1080 Ti or (especially) 2x Titan Xp I'd wait until there was some use for the RT cores to make a qualitative difference. The VRAM size on the 2080 Ti is a major limiting factor too, I'd go with a single RTX Titan if I was looking at the price range of 2x 2080 Ti.
"It's amazing what you can do when you don't know you can't do it."

Systems:
DR15.2.1 | Win10 | i9 7940X | 128GB RAM | 2x 1080 Ti | 960Pro cache disk
DR15.2.2 | Win10 | i9 7940X | 128GB RAM | 1x 2080 Ti | 660p cache disk
Offline

MishaEngel

  • Posts: 385
  • Joined: Wed Aug 29, 2018 12:18 am
  • Real Name: Misha Engel

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 2:17 am

Piotr Wozniacki wrote:Hmm...for thermal reasons, I have also been considering buying 3x 2080 ti of the blower type, but only after having succeeded selling my current 2x Titan Xp. I start wondering whether - instead of selling my Titans cheap, and buying the 2080 tis expensive - I should buy a used Titan Xp as my 3rd GPU... As long as RT cores and Tensor cores of Turing cards don't do anything special in Resolve, I guess that with 3x Titan Xp I'd get similar performance to 2x 2080 ti cards - or am I too optimistic?

Piotr


A third GPU which is equal or faster than your current Titan Xp's will make your system as fast as with 3 Titan Xp's.

To speed things up a little more(in windows) look here:



here https://level1techs.com/article/unlocking-2990wx-less-numa-aware-apps

and here https://bitsum.com/portfolio/coreprio/
Offline
User avatar

Cary Knoop

  • Posts: 807
  • Joined: Sun Mar 12, 2017 6:35 pm
  • Location: Newark, CA USA

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 2:45 am

MishaEngel wrote:To speed things up a little more(in windows) look here:



here https://level1techs.com/article/unlocking-2990wx-less-numa-aware-apps

and here https://bitsum.com/portfolio/coreprio/

The cat is slowly coming out of the bag on this.
The eyes should now be on Microsoft!
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 6:26 am

MishaEngel wrote:
A third GPU which is equal or faster than your current Titan Xp's will make your system as fast as with 3 Titan Xp's.


That's what I said, Mischa. I'm just considering adding a used 3rd Titan Xp to my current system because selling my Titans only to get 3x 2080ti instead would be too high a waste of money with the prices people offer me for them.

Oh - and what you might have misunderstood in my previous post could be the question rather than statement, that hopefully my system with 3x Titan Xp would not be much slower than with 2x 280ti. Do you agree with that?

Piotr
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 7:50 am

Cary Knoop wrote:
MishaEngel wrote:To speed things up a little more(in windows) look here:



here https://level1techs.com/article/unlocking-2990wx-less-numa-aware-apps

and here https://bitsum.com/portfolio/coreprio/

The cat is slowly coming out of the bag on this.
The eyes should now be on Microsoft!


Running Coreprio, I somehow do not see much of a difference in my 2990WX performance; not in Resolve at least...

Piotr
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 10:19 am

Here are some results of the Prime 95 benchmark (all benchmark parameters default; vanilla hardware settings as well as those of Ryzen Master; the 2304K FFT length portion selected arbitrarily):

1. W/o CorePrio running:
Code: Select all
Timings for 2304K FFT length (32 cores, 1 worker):  2.15 ms.  Throughput: 464.06 iter/sec.
Timings for 2304K FFT length (32 cores, 8 workers):  8.51,  7.95,  8.52,  7.12, 10.42, 10.09, 10.71, 10.49 ms.  Throughput: 884.85 iter/sec.
Timings for 2304K FFT length (32 cores, 32 workers): 32.13, 35.52, 32.25, 31.60, 31.73, 31.71, 35.28, 34.44, 37.09, 36.85, 31.59, 29.70, 30.16, 30.64, 35.07, 31.57, 50.69, 50.93, 40.55, 46.59, 60.59, 49.36, 61.24, 60.89, 54.16, 56.31, 53.50, 55.36, 52.07, 52.04, 59.91, 50.88 ms.  Throughput: 790.51 iter/sec.
Timings for 2304K FFT length (32 cores hyperthreaded, 1 worker):  2.95 ms.  Throughput: 338.88 iter/sec.
Timings for 2304K FFT length (32 cores hyperthreaded, 8 workers):  9.72, 10.17,  9.83,  9.79, 11.08, 10.45, 10.60, 11.91 ms.  Throughput: 769.36 iter/sec.
Timings for 2304K FFT length (32 cores hyperthreaded, 32 workers): 41.13, 40.28, 38.02, 38.43, 40.35, 37.71, 36.50, 36.31, 39.26, 39.44, 37.71, 38.38, 38.64, 37.50, 37.43, 36.64, 46.14, 46.46, 46.10, 46.60, 46.98, 46.77, 44.73, 45.17, 46.72, 46.11, 47.54, 45.27, 48.15, 47.94, 48.96, 48.39 ms.  Throughput: 760.10 iter/sec.


2. Results with CorePrio running with default settings:
Code: Select all
Timings for 2304K FFT length (32 cores, 1 worker):  1.94 ms.  Throughput: 516.16 iter/sec.
Timings for 2304K FFT length (32 cores, 8 workers): 10.05,  9.98, 10.18,  9.93, 10.35, 10.64, 10.73, 10.34 ms.  Throughput: 779.15 iter/sec.
Timings for 2304K FFT length (32 cores, 32 workers): 37.51, 38.43, 37.78, 38.66, 38.22, 39.83, 37.26, 37.36, 42.17, 43.10, 42.55, 43.67, 41.21, 42.69, 40.33, 42.28, 44.09, 33.77, 45.05, 44.81, 43.30, 36.39, 38.55, 38.30, 46.28, 39.95, 45.99, 45.07, 36.47, 39.15, 40.15, 44.16 ms.  Throughput: 789.88 iter/sec.
Timings for 2304K FFT length (32 cores hyperthreaded, 1 worker):  2.99 ms.  Throughput: 333.93 iter/sec.
Timings for 2304K FFT length (32 cores hyperthreaded, 8 workers): 11.27, 11.45, 11.17, 11.13,  9.46,  9.12,  9.19,  9.45 ms.  Throughput: 785.50 iter/sec.
Timings for 2304K FFT length (32 cores hyperthreaded, 32 workers): 45.90, 43.68, 46.03, 43.71, 46.34, 42.61, 45.92, 45.70, 49.89, 49.09, 50.66, 39.03, 47.35, 38.33, 43.59, 39.14, 40.14, 37.70, 36.41, 44.99, 41.03, 42.38, 40.64, 42.90, 48.51, 40.21, 40.36, 40.24, 45.27, 49.75, 32.87, 41.65 ms.  Throughput: 748.21 iter/sec.


So, a very slight throughput increases are indeed visible, but maily for a single worker. I need to look at it closer - especially repeat with Ryzen Master set to Creator Mode (at least), and perhaps some CorePrio's parameter change.

Also - although HWInfo which I'm solely using as my system monitor can't say whether or not any given core throttled (like with Intel CPUs) - this has also been a conservative test from this point of view, as I never let my CPU cool down completely after I stopped the first benchmark (without Core Prio enabled), so a slight possibility exists my 2990WX was at the edge of throttling in the second benchmark (the one with CorePrio enabled)...

And last but not least: to draw any conclusions, some Excel work on larger outputs in both scenarios is required; this has just been a quick and dirty sneak peak :)

Piotr

PS. I forgot to state that one thing this little application changes for sure: instead of showing all 32 cores at 100% load and dark blue in Task Manager most of the time, with CorePrio it does it only in the "32 cores, 32 workers" test!

And here is the same benchmark with Dynamic Local Mode disabled, as it should for CorePrio's own Dynamic Local Mode disabled as it should:
Code: Select all
Timings for 2048K FFT length (32 cores, 1 worker):  2.74 ms.  Throughput: 365.15 iter/sec.
Timings for 2048K FFT length (32 cores, 8 workers):  6.16,  6.17,  5.96,  5.84, 10.02, 10.07,  9.71,  9.75 ms.  Throughput: 1068.13 iter/sec.
Timings for 2048K FFT length (32 cores, 32 workers): 27.94, 26.90, 28.50, 28.14, 28.80, 27.51, 28.03, 28.15, 26.45, 26.64, 26.88, 27.51, 26.56, 26.12, 26.49, 27.48, 50.98, 50.56, 52.79, 48.59, 50.50, 47.73, 51.07, 53.98, 49.38, 50.70, 49.49, 49.87, 50.18, 49.48, 50.05, 50.50 ms.  Throughput: 902.78 iter/sec.
Timings for 2048K FFT length (32 cores hyperthreaded, 1 worker):  2.89 ms.  Throughput: 346.00 iter/sec.
Timings for 2048K FFT length (32 cores hyperthreaded, 8 workers):  6.57,  6.47,  6.37,  6.30, 10.62, 11.16, 10.68, 10.70 ms.  Throughput: 993.29 iter/sec.
Timings for 2048K FFT length (32 cores hyperthreaded, 32 workers): 27.79, 24.90, 33.37, 25.01, 29.72, 25.72, 28.25, 25.54, 27.64, 25.94, 26.46, 25.68, 26.73, 25.60, 26.79, 26.10, 50.09, 48.69, 57.51, 50.57, 50.16, 47.68, 60.40, 49.63, 50.09, 49.49, 50.69, 50.50, 49.45, 49.23, 51.98, 50.22 ms.  Throughput: 911.44 iter/sec.


So now, for 32 cores hyperthreaded, 32 workers we have throughputs like this:
760.10 iter/sec
748.21 iter/sec
911.44 iter/sec - meaning also 32 workers get faster with CorePrio!
Last edited by Piotr Wozniacki on Thu Jan 03, 2019 1:25 pm, edited 5 times in total.
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 10:45 am

Piotr Wozniacki wrote:
MishaEngel wrote:
A third GPU which is equal or faster than your current Titan Xp's will make your system as fast as with 3 Titan Xp's.


That's what I said, Mischa. I'm just considering adding a used 3rd Titan Xp to my current system because selling my Titans only to get 3x 2080ti instead would be too high a waste of money with the prices people offer me for them.

Oh - and what you might have misunderstood in my previous post could be the question rather than statement, that hopefully my system with 3x Titan Xp would not be much slower than with 2x RTX 2080ti. Do you agree with that?

Piotr

Since the theory behind CorePrio utility might actually have something to it, my hopes for having a really fast system one day have increased again. In this context - please share your opinion on the rhetoric question in my own quote above (I bolded it out now). I wouldn't like my GPUs to become real bottleneck if some beautiful a patch from M$ (or more probably - an utility from some enthusiast gurus) unleashes the true potential of my TR :) TIA

Piotr
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 12:09 pm

And speaking of Microsoft in the context of the 2990WX NUMA, here's and interesting bit from tom's Hardware:

"AMD continues working with Microsoft to route threads to the die with direct-attached memory first, and then spill remaining threads over to the compute dies. Unfortunately, the scheduler currently treats all dies as equal, operating in Round Robin mode. As a result, even moderately-threaded applications can suffer at the hands of high memory latency and low throughput. This is further complicated by thread migration. According to AMD, Microsoft has not committed to a timeline for updating its scheduler."

Piotr
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline

Carsten Sellberg

  • Posts: 497
  • Joined: Fri Jun 16, 2017 9:13 am

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 12:56 pm

Cary Knoop wrote: The cat is slowly coming out of the bag on this.
The eyes should now be on Microsoft!


Hi.

3 months ago, I wrote this about the AMD 2990wx:

QUOTE 'My personal feeling was, that there is a problem in Windows, as some few Linux programs scaled much better. But only time can tell.'

From this link:

viewtopic.php?f=21&t=80200

Intel promised to deliver the 28 cores 56 threads Xeon W-3175X in 2018. But Intel is again delayed.
When we see the 28 cores Xeon W-3175X, will it be nice to know, how IT will handle programs with max numbers of threads?
Will it have the same problems as the AMD 2990wx?

Regards Carsten.
URSA Mini 4.6K
Offline

MishaEngel

  • Posts: 385
  • Joined: Wed Aug 29, 2018 12:18 am
  • Real Name: Misha Engel

Re: Anyone running Resolve using RTX 2080ti already?

PostThu Jan 03, 2019 1:44 pm

Piotr Wozniacki wrote:
MishaEngel wrote:
A third GPU which is equal or faster than your current Titan Xp's will make your system as fast as with 3 Titan Xp's.


That's what I said, Mischa. I'm just considering adding a used 3rd Titan Xp to my current system because selling my Titans only to get 3x 2080ti instead would be too high a waste of money with the prices people offer me for them.

Oh - and what you might have misunderstood in my previous post could be the question rather than statement, that hopefully my system with 3x Titan Xp would not be much slower than with 2x 280ti. Do you agree with that?

Piotr


3 Titan Xp's are faster than 2 RTX2080ti's for sure. On the otherhand you already have a super fast system. I would rather spend my money on lenses, light's, etc..

What kind of playback rates do you get with RED 8k.R3D files?
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostFri Jan 04, 2019 7:54 am

MishaEngel wrote:3 Titan Xp's are faster than 2 RTX2080ti's for sure. On the otherhand you already have a super fast system. I would rather spend my money on lenses, light's, etc..

Misha - this is important to me - could you elaborate on the above statement, please? Even assuming nothing very important is going to be implemented in Resolve which will heavily depend on the Turing new types of cores, I thought there was a consensus that the RTX 2080ti is indeed some 25% faster than my Titan Xp. So If I do sell my 2x Titans Xp now before they lose even more value, I'd be able to buy 2x RTX 2080tis for starters - and once BMD does introduce new RT core-based functionalities, I'd buy a 3rd one. While - if I keep my current cards, and even add a 3rd one (Titan Xp, used of course) - I will have this path closed for ever, as I expect my income to drastically drop starting next year. So as you can see, this is a real dilemma!

As to my shooting equipment - well, as a one man band I think I'm set for a long time to come (FS7, Inferno, A7Rii and GH5 with Metabones adapter and many Canon EF lenses as well as several native ones for both E and MFT mounts). I have only some basic lighting set - but shooting alone I'm not able to handle more anyway (for my classical music multi-camera projects, I team up with a larger production group anyway - with sound guys, camera assistants and lighting crew).

So could you please share the source you claims are based on? TIA,
Piotr
Last edited by Piotr Wozniacki on Fri Jan 04, 2019 4:29 pm, edited 1 time in total.
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline

MishaEngel

  • Posts: 385
  • Joined: Wed Aug 29, 2018 12:18 am
  • Real Name: Misha Engel

Re: Anyone running Resolve using RTX 2080ti already?

PostFri Jan 04, 2019 3:48 pm

Don't buy on promises from manufatures.
Never pre-order.

At the moment RTX brings nothing for Resolve, be happy with your system.
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostFri Jan 04, 2019 4:33 pm

MishaEngel wrote:Don't buy on promises from manufatures.
Never pre-order.
At the moment RTX brings nothing for Resolve, be happy with your system.


While I agree with those rules in general, I thought Puget concluded their Resolve-specific tests stating that the Turing potential advantages aside, the RTX 2080ti's raw speed is some 25% higher than that of the Titan Xp...

Piotr
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline

MishaEngel

  • Posts: 385
  • Joined: Wed Aug 29, 2018 12:18 am
  • Real Name: Misha Engel

Re: Anyone running Resolve using RTX 2080ti already?

PostFri Jan 04, 2019 5:06 pm

Piotr Wozniacki wrote:
MishaEngel wrote:Don't buy on promises from manufatures.
Never pre-order.
At the moment RTX brings nothing for Resolve, be happy with your system.


While I agree with those rules in general, I thought Puget concluded their Resolve-specific tests stating that the Turing potential advantages aside, the RTX 2080ti's raw speed is some 25% higher than that of the Titan Xp...

Piotr


16,49 = 16 and 19,51 = 20 in Puget language.

20/16=1.25 hence 25% faster in Puget language
19,51/16,49=1,18 hence 18% faster my language

Also puget tests with short files, 1 minute or shorter (so only at turbo speeds).
RTX2080ti turbo 1635MHz x 4352cores x 0.002=14.231 Tflops fp32
TitanXp turbo 1582MHz x 3840cores x 0.002=12.150 Tflops fp32

Real turbo difference is ~ 17.13%

Sustained speed(with base clock and long renders, NoiseReduction, effects, etc).
RTX2080ti base 1350MHZ x 4352cores x 0.002=11.750 Tflops fp32
TitanXp base 1480MHz x 3840cores x 0.002=11.366 Tflops fp32

Real base(sustained speed with real footage) difference is 3.38%

And the titanXp has 12 GB of VRAM and the RTX only 11 GB of VRAM.

Puget wants to sell(they are a commercial company) and are not an independent review site (hence colored information without strictly lying).
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostFri Jan 04, 2019 6:34 pm

Thanks for this enlightening analysis and your time, Misha :)

Piotr

PS. The RTX 2080ti models which I'm thinking of is either the Aorus from Gigabyte, or Blower from Zotac. Both are factory-overclocked, but have the "blower" cooling design (just like the Titans, and unlike the FE nVidia or any other of those many other models from different manufacturers which all have 2-3 fans and suffer from the "higher-shelf" card(s) throttling problem (both Gigabyte Aorus and Zotac blower cards expels the hot air through the back of the PC case). Also - even though I do not 6 or 8K with my own gear (my highest resolutions being 4k or UHD at 50p in Sony's XAVC-I flavor of H.264) - my classical music projects are usually up to 10 cameras, and when I do it with ad-hoc assembled teams one of more cameras may happen to be capable of up to 8K, and RAW (my own RAW possibilities are limited to the 4K@25p CDNG on my Shogun Inferno). And with so many cameras, often recording different formats (and I will always use one or two with the highest resolution if available, as this gives me great possibilities of zooming in. panning etc. for creative purposes) - editing material with so many different angles and close-ups of say a classic guitar player's fingers puts extreme requirements on the entire editing system speed and "snappiness" to maintain the A-V sync with millisecond precision (just one of many examples)... I've been doing that for many years, so believe me - I know what I'm talking about; the usual "1 frame precision" of the sync is not enough... And this is my main reason for the fastest possible system.
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostSun Jan 06, 2019 2:32 pm

Piotr Wozniacki wrote:And here is the same benchmark with Dynamic Local Mode disabled, as it should for CorePrio's own Dynamic Local Mode disabled as it should:
Code: Select all
Timings for 2048K FFT length (32 cores, 1 worker):  2.74 ms.  Throughput: 365.15 iter/sec.
Timings for 2048K FFT length (32 cores, 8 workers):  6.16,  6.17,  5.96,  5.84, 10.02, 10.07,  9.71,  9.75 ms.  Throughput: 1068.13 iter/sec.
Timings for 2048K FFT length (32 cores, 32 workers): 27.94, 26.90, 28.50, 28.14, 28.80, 27.51, 28.03, 28.15, 26.45, 26.64, 26.88, 27.51, 26.56, 26.12, 26.49, 27.48, 50.98, 50.56, 52.79, 48.59, 50.50, 47.73, 51.07, 53.98, 49.38, 50.70, 49.49, 49.87, 50.18, 49.48, 50.05, 50.50 ms.  Throughput: 902.78 iter/sec.
Timings for 2048K FFT length (32 cores hyperthreaded, 1 worker):  2.89 ms.  Throughput: 346.00 iter/sec.
Timings for 2048K FFT length (32 cores hyperthreaded, 8 workers):  6.57,  6.47,  6.37,  6.30, 10.62, 11.16, 10.68, 10.70 ms.  Throughput: 993.29 iter/sec.
Timings for 2048K FFT length (32 cores hyperthreaded, 32 workers): 27.79, 24.90, 33.37, 25.01, 29.72, 25.72, 28.25, 25.54, 27.64, 25.94, 26.46, 25.68, 26.73, 25.60, 26.79, 26.10, 50.09, 48.69, 57.51, 50.57, 50.16, 47.68, 60.40, 49.63, 50.09, 49.49, 50.69, 50.50, 49.45, 49.23, 51.98, 50.22 ms.  Throughput: 911.44 iter/sec.


So now, for 32 cores hyperthreaded, 32 workers we have throughputs like this:
760.10 iter/sec (no CorePrio; DLM running as implemented by AMD in Ryzen Pro)
748.21 iter/sec (CorePrio running but without disabling DLM first - which is not recommenced according to the Author; it may explain the little regression)
911.44 iter/sec - (CorePrio running with DLM off; quite obviously higher throughputalso noticeable with all 32 workers getting faster than with AMD implementation of DLM by almost 20% )


Of course, nothing conclusive in the Resolve context - but at least a small change in the right direction. I contused my quick and dirty tests a bit longer, but unfortunately no more good news for now:

1. The MSI version of CPU-Z bench(whatever it actually does):

CorePrio on Ryzen Master in Creator Mode - AMD DLM ON.PNG
CorePrio on Ryzen Master in Creator Mode - AMD DLM ON.PNG (29.37 KiB) Viewed 427 times

CorePrio after reboot.PNG
CorePrio after reboot.PNG (30.03 KiB) Viewed 420 times


Interestingly, 178%-187% over i9-7980XE has absolutely no pattern to it; I've seen what looked like random numbers (in this range) absolutely regardless of CorePrio running or not, or of Ryzen Master's current mode. What's more, while yesterday I tended to be getting the 182% or 183% numbers most of the time, today I didn't seem to be able get past 178% - until I... rebooted my machine! Bang - and 187% over the reference i9 again. So really, nothing to even speculate about.

2. But the most frustrating to me have been trying to find the sweet-spot for my scenario of Moldflow background analyses / foreground (interactive) Resolve sessions' in dependency on the combined Ryzen Master / CorePrio settings. As the general idea, this scenario of getting the most from the huge amount of threads on the 2990WX has more or less been working for me since day one in that I can run even a heavy analysis in the background, and edit or grade in Resolve at the same time without any noticeable impact on my fps - so yeah, the 2990WX makes my workstation work as two separate ones (and fast ones, too). However - no matter how I try - I cannot find any pattern to the influence of the CorePrio (or DLM in general) setting on the actual Moldflow analysis speed. And considering that a single analysis can require anything from several minutes to several days to complete - it is of paramount importance for me to accelerate them as much as only possible... Adding to it is the fact that - before actually launching any kind of a Moldflow analysis - a user can choose from among the following scaling (thread affinity) scenarios:

- automatic (where Moldflow decides how many cores to use and when)
- maximum thread number (32 in the case of 2990WX CPU)
- single thread (never used)
- a certain predefined number of threads (anything between 1 and 64 in the case of my CPU)

It then becomes obvious that- together with even a little help from tools like CorePrio - it would be great to find the optimum number of threads and stick to it, so that the analysis runs quickly while not stealing too much cycles from Resolve. It goes without saying that while I can wait for any analysis to complete a little longer, I absolutely require fluent editing/grading experience in Resolve....

And this is yet another reason why even slight differences in GPU subsystem speed is of such a great importance to me!


Piotr
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline

Carsten Sellberg

  • Posts: 497
  • Joined: Fri Jun 16, 2017 9:13 am

Re: Anyone running Resolve using RTX 2080ti already?

PostSun Jan 06, 2019 4:48 pm

Hi Piotr.

Thank for confirming my understanding of the 2990WX.

You write
QUOTE: 'can run even a heavy analysis in the background, and edit or grade in Resolve at the same time without any noticeable impact on my fps'

I remember an old post, where you come with a suggesten, from Thu Aug 23, 2018 12:32 pm: 'I wonder if BMD could make possible several export render jobs to be run simultaneously in future version?'

viewtopic.php?f=21&t=77989

So I wonder, if it is possible to run two Resolves simultaneously?
May be as two separate users?
And try out what kind of Resolve jobs you can run simultaneously. A export render can may be one of them.

I have never tried Resolve with two plug in Graphics Cards. So I don't know the possibilities.
But do any expect it will be possible to run two different users and set it up to use two different Graphics Cards?
Is it a good idea. And what is needed?

And do you expect any gain in total performance compared to only one user with a DUAL Graphics Cards setup.

Regards Carsten.
URSA Mini 4.6K
Offline
User avatar

Piotr Wozniacki

  • Posts: 960
  • Joined: Tue Aug 02, 2016 12:17 pm
  • Location: Poland

Re: Anyone running Resolve using RTX 2080ti already?

PostSun Jan 06, 2019 5:25 pm

Hi Carsten,

I never tried it (perhaps will) - but even if there is a way to run more than one instance of Resolve, the heavy dependence of the efficiency of each instance on the GPU subsystem makes simultaneous export of more than a single file a rather sub-optimal way to take advantage of the 2990WX'es plethora of cores/threads, I'm afraid...

On the other hand, the beauty of my usage scenario is that one of the number-crunching applications (Moldflow CAE system in this case) doesn't use the graphics subsystem at all, so it can be owned and managed by Resolve in the usual way - without any extra overhead or dividing the graphical subsystem in two parts - one for Resolve, the other for Moldflow... I've been working like this ever since I finished building my Threadripper workstation, and had zero problems with it! Yes - it's true that in the earlier years, Moldfllow's solvers not only scaled across the CPU's threads, but could also use CUDA cores of an nVidia, or OpenCL in the case of an AMD GPU card. But this has never been properly implemented, and so the nVidia Fermi architecture was the last one used by Moldflow for this type of CPU/GPU parallelization. BTW it never worked really well, so I'm glad Moldflow developers dropped GPU acceleration altogether.

In fact, what I do is setting up my Moldflow projects on an average laptop (the same one I would take with me to my Moldflow clients); I am accessing the laptop and the Moldflow GUI (pre- and post-processor for the FEM model and analysis results) from my main 2990WX workstation using Remote Desktop. After my FEM model is ready, I'll simply launch the solver on the workstation rather than the slowish laptop; for as long as the analysis lasts I can then minimize my Remote Desktop and with the Moldflow analysis running in the background, I start my usual interactive Resolve session... Works like a charm - but of course, I'd like to squeeze every little bit of computing power out of such a setup :)

That said, of course I do not always work like this; facing a complicated Moldflow problem with the part and mold models several million 3D elements each, I will never share my WS resources (and my own cramped CPU I have in my head :)) with anything else, not even mention Resolve. And vice-versa: when working on sub-frame A/V sync of a multicamera music video, I'll need each and every CPU and GPU core to do it properly in Resolve (not to mention RAM, of which only having 64GB I couldn't possibly accommodate two big projects in memory without disk trashing. Another example of my Resolve pursuits which I'd never even try to conduct with a Moldflow analysis running in the background is any project involving Fusion....But with all light-weight projects of both CAE and NLE nature, this is the way I'm tackling the 2990WX architectural quirks and sometimes unforeseeable behavior... Sometimes I'd even run more than one Moldflow analyses in the background, if for instance all I need to do in Resolve is some media management, timeline housekeeping and introductory cutting!

Piotr
AMD TR 2990WX CPU | MSI X399 CARBON AC | 64GB RAM@XMP3200 | 2x Titan Xp GPU | 4x WD Black 3TB RAID0 media drive | 3x 1TB NVMe RAID0 cache drive | SSD SATA system drive | AX1600i PSU | Decklink 12G Extreme | Samsung UHD reference monitor (calibrated)
Offline
User avatar

Christopher Dobey

  • Posts: 184
  • Joined: Fri Jul 18, 2014 5:58 pm
  • Location: San Francisco Bay Area, California

Re: Anyone running Resolve using RTX 2080ti already?

PostMon Jan 21, 2019 5:39 am

my findings:

RTX 2080 Ti renders ~33% faster than the WX9100.

Timeline: 4.6K RAW 4:1 60p
Only RAW adjustments and light Temporal Noise Reduction.

I only benchmarked the above settings as its my personal workflow most of the time.
See sig for computer specs. The NVIDIA was in bootcamp and the AMD in macOS as I could not get the AMD working correctly in Windows. Both in the Sonnet Tech eGPU 650.
URSA Mini 4.6K PL
Resolve Studio 15.2.3 | macOS 10.14.4
Intel i9 8950HK 6-core | 32GB 2.4GHz DDR4
AMD WX9100 16GB eGPU | AMD 560X 4GB discrete
Decklink Mini Monitor 4K | LG OLED B7A 65" | Micro Panel
Offline

MishaEngel

  • Posts: 385
  • Joined: Wed Aug 29, 2018 12:18 am
  • Real Name: Misha Engel

Re: Anyone running Resolve using RTX 2080ti already?

PostMon Jan 21, 2019 2:49 pm

Christopher Dobey wrote:my findings:

RTX 2080 Ti renders ~33% faster than the WX9100.

Timeline: 4.6K RAW 4:1 60p
Only RAW adjustments and light Temporal Noise Reduction.

I only benchmarked the above settings as its my personal workflow most of the time.
See sig for computer specs. The NVIDIA was in bootcamp and the AMD in macOS as I could not get the AMD working correctly in Windows. Both in the Sonnet Tech eGPU 650.


That is good news for the one looking for a new and faster GPU. The Radeon7/VEGA2, it is about 27% faster than the WX9100 (so around 6% behind the RTX2080ti) has 16 GB of VRAM and only costs $699 where the RTX2080ti is $1199 and only has 11 GB(out of memory problems with effects and NR). And the VEGA2 works in mojave for the apple users.
Previous

Return to DaVinci Resolve

Who is online

Users browsing this forum: MontgomerySutton and 34 guests