15b2 WHEA Errors with Alpha channel source


ChristopherSeguine

  • Posts: 89
  • Joined: Thu May 02, 2013 5:00 pm
  • Location: California

15b2 WHEA Errors with Alpha channel source

PostThu May 10, 2018 6:09 am

Resolve 15b2, Windows 10 build 17134, NVIDIA driver 397.55

Random WHEA/NVIDIA crashes that seem related to having 4 tracks: 2 R3D and 2 VFX with alpha channels.

No WHEA errors with Resolve 14 or other GPU apps.

Logs:

https://1drv.ms/u/s!AlJNJy9Y2vGakB-asJtlwqGxUzKZ

Errors:
The computer has rebooted from a bugcheck. The bugcheck was: 0x00000124 (0x0000000000000005, 0xffffa187f99d9028, 0x0000000000000000, 0x0000000000000000). A dump was saved in: C:\WINDOWS\MEMORY.DMP. Report Id: 6a96a1c4-de17-44a3-abc0-fdfbf86229f5.

Log Name: System
Source: Microsoft-Windows-WHEA-Logger
Date: 5/9/2018 10:04:55 PM
Event ID: 16
Task Category: None
Level: Error
Keywords:
User: LOCAL SERVICE
Computer: DESKTOP-986EBBK
Description:
A fatal hardware error has occurred.

Component: PCI Express Root Port
Error Source: Generic

Bus:Device:Function: 0x0:0x0:0x1
Vendor ID:Device ID: 0x10DE:0x10EF
Class Code: 0x403

The details view of this entry contains further information.
Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
<System>
<Provider Name="Microsoft-Windows-WHEA-Logger" Guid="{C26C4F3C-3F66-4E99-8F8A-39405CFED220}" />
<EventID>16</EventID>
<Version>0</Version>
<Level>2</Level>
<Task>0</Task>
<Opcode>0</Opcode>
<Keywords>0x8000000000000000</Keywords>
<TimeCreated SystemTime="2018-05-10T05:04:55.213028600Z" />
<EventRecordID>3426</EventRecordID>
<Correlation ActivityID="{5DC065F3-8BAC-4B41-BA14-617A49A23CAF}" />
<Execution ProcessID="4092" ThreadID="4976" />
<Channel>System</Channel>
<Computer>DESKTOP-986EBBK</Computer>
<Security UserID="S-1-5-19" />
</System>
<EventData>
<Data Name="ErrorSource">8</Data>
<Data Name="FRUId">{00000000-0000-0000-0000-000000000000}</Data>
<Data Name="FRUText">
</Data>
<Data Name="ValidBits">0xcf</Data>
<Data Name="PortType">4</Data>
<Data Name="Version">0x110</Data>
<Data Name="Command">0x4010</Data>
<Data Name="Status">0x146</Data>
<Data Name="Bus">0x0</Data>
<Data Name="Device">0x0</Data>
<Data Name="Function">0x1</Data>
<Data Name="Segment">0x0</Data>
<Data Name="SecondaryBus">0x0</Data>
<Data Name="Slot">0x0</Data>
<Data Name="VendorID">0x10de</Data>
<Data Name="DeviceID">0x10ef</Data>
<Data Name="ClassCode">0x403</Data>
<Data Name="DeviceSerialNumber">0x0</Data>
<Data Name="BridgeControl">0x0</Data>
<Data Name="BridgeStatus">0x0</Data>
<Data Name="UncorrectableErrorStatus">0x4000</Data>
<Data Name="CorrectableErrorStatus">0x2000</Data>
<Data Name="HeaderLog">00000000000000000000000000000000</Data>
<Data Name="Length">408</Data>
<Data Name="RawData">435045521002FFFFFFFF01000100000002000000980100000B3A04000A0512140000000000000000000000000000000000000000000000000000000000000000BDC407CF89B7184EB3C41F732CB5713167A4623E40AB9A40A698F362D464B38FBFB6EFCFD9E7D301000000004552000000000001000000000000000000000000C8000000D0000000000300000100000054E995D9C1BB0F43AD91B44DCB3C6F3500000000000000000000000000000000010000000000000000000000000000000000000000000000CF0000000000000004000000100100001040460100000000DE10EF10030400010000000000000000000000000000000000000000010510000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000010002000040000000000000201040000020000000A000000E0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000</Data>
</EventData>
</Event>
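
The status fields in the WHEA event above can be decoded by hand. A minimal sketch, assuming the `UncorrectableErrorStatus` and `CorrectableErrorStatus` values mirror the PCIe Advanced Error Reporting (AER) status registers; the function names and the trimmed `SAMPLE` event are illustrative only, not part of any Windows tooling:

```python
import xml.etree.ElementTree as ET

# Namespace used by Windows Event Log XML exports.
NS = {"ev": "http://schemas.microsoft.com/win/2004/08/events/event"}

# Bit positions of the PCIe AER Uncorrectable Error Status register.
UNCORRECTABLE_BITS = {
    4: "Data Link Protocol Error",
    5: "Surprise Down Error",
    12: "Poisoned TLP",
    13: "Flow Control Protocol Error",
    14: "Completion Timeout",
    15: "Completer Abort",
    16: "Unexpected Completion",
    17: "Receiver Overflow",
    18: "Malformed TLP",
    19: "ECRC Error",
    20: "Unsupported Request Error",
}

# Bit positions of the PCIe AER Correctable Error Status register.
CORRECTABLE_BITS = {
    0: "Receiver Error",
    6: "Bad TLP",
    7: "Bad DLLP",
    8: "REPLAY_NUM Rollover",
    12: "Replay Timer Timeout",
    13: "Advisory Non-Fatal Error",
}


def decode_bits(value, names):
    """Return the names of all set bits in a 32-bit status value."""
    return [
        names.get(bit, f"bit {bit} (unlisted)")
        for bit in range(32)
        if (value >> bit) & 1
    ]


def summarize_whea_event(event_xml):
    """Pull the device IDs and decoded AER status out of an event's XML."""
    root = ET.fromstring(event_xml)
    data = {
        d.get("Name"): (d.text or "").strip()
        for d in root.findall("ev:EventData/ev:Data", NS)
    }
    return {
        "vendor_id": data["VendorID"],
        "device_id": data["DeviceID"],
        "uncorrectable": decode_bits(
            int(data["UncorrectableErrorStatus"], 16), UNCORRECTABLE_BITS
        ),
        "correctable": decode_bits(
            int(data["CorrectableErrorStatus"], 16), CORRECTABLE_BITS
        ),
    }


# Trimmed copy of the event above, keeping only the fields the sketch reads.
SAMPLE = """<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
<EventData>
<Data Name="VendorID">0x10de</Data>
<Data Name="DeviceID">0x10ef</Data>
<Data Name="UncorrectableErrorStatus">0x4000</Data>
<Data Name="CorrectableErrorStatus">0x2000</Data>
</EventData>
</Event>"""

print(summarize_whea_event(SAMPLE))
```

Under that assumption, 0x4000 is bit 14 (Completion Timeout) and 0x2000 is bit 13 (Advisory Non-Fatal Error), reported against vendor 0x10DE (NVIDIA).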

Log Name: System
Source: nvlddmkm
Date: 5/9/2018 10:04:51 PM
Event ID: 14
Task Category: None
Level: Error
Keywords: Classic
User: N/A
Computer: DESKTOP-986EBBK
Description:
The description for Event ID 14 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event:

\Device\Video8
CMDre 00000000 00000080 00000000 00000005 00000010

The message resource is present but the message was not found in the message table

Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
<System>
<Provider Name="nvlddmkm" />
<EventID Qualifiers="49322">14</EventID>
<Level>2</Level>
<Task>0</Task>
<Keywords>0x80000000000000</Keywords>
<TimeCreated SystemTime="2018-05-10T05:04:51.482686100Z" />
<EventRecordID>3413</EventRecordID>
<Channel>System</Channel>
<Computer>DESKTOP-986EBBK</Computer>
<Security />
</System>
<EventData>
<Data>\Device\Video8</Data>
<Data>CMDre 00000000 00000080 00000000 00000005 00000010</Data>
<Binary>0000000002003000000000000E00AAC0000000000000000000000000000000000000000000000000</Binary>
</EventData>
</Event>
Primary: TRX40 Xtreme (3970x), Decklink Extreme 12G, 2x RTX3090, Areca Raid
Primary source: Red 8k .r3d, Phantom .cine, secondary footage: ArriRaw, DJI X5 CDNG

ChristopherSeguine

  • Posts: 89
  • Joined: Thu May 02, 2013 5:00 pm
  • Location: California

Re: 15b2 WHEA Errors with Alpha channel source

PostWed May 16, 2018 12:44 am

NVIDIA driver 397.64

Now Resolve 15b2 gives GPU error 77, then a WHEA hard crash.

CUDA error 77 = cudaErrorIllegalAddress

Rohit Singhal

Blackmagic Design

  • Posts: 822
  • Joined: Wed Aug 22, 2012 11:07 am

Re: 15b2 WHEA Errors with Alpha channel source

PostWed May 16, 2018 9:09 am

Could you please try beta3 and post new logs if the issue still exists?
Rohit Singhal
DaVinci Resolve Software Development
Blackmagic Design

ChristopherSeguine

  • Posts: 89
  • Joined: Thu May 02, 2013 5:00 pm
  • Location: California

Re: 15b2 WHEA Errors with Alpha channel source

PostWed May 23, 2018 8:53 pm

15b3 - same problems.

GPU error and WHEA hard crash; it seems to happen only when using alpha channels on multiple 4K+ clips.

Logs:
https://1drv.ms/u/s!AlJNJy9Y2vGakDHkgfPd5EAv0wf3

ChristopherSeguine

  • Posts: 89
  • Joined: Thu May 02, 2013 5:00 pm
  • Location: California

Re: 15b3 WHEA Errors with Alpha channel source

PostSat May 26, 2018 8:18 pm

I've tried switching from EXR sequences with alphas to QuickTime/CineForm with alphas; I still get WHEA crashes, always with the same CUDA/NVIDIA errors.

I think the problem may have to do with background processing.
I turned off Timeline thumbnails in the Clip view options, and turned off Project Settings / Enable background caching after XX.

Two days of the same workflow (4+ tracks, including two 4K to 8K VFX tracks with alphas) and no WHEA errors.

ChristopherSeguine

  • Posts: 89
  • Joined: Thu May 02, 2013 5:00 pm
  • Location: California

Re: 15b3 WHEA Errors with Alpha channel source

PostSun May 27, 2018 8:13 am

Definitely related to background caching.

I was not having crashes with auto cache turned off until I tried to render the 4K sequence.
WHEA error again. Reduced the render frame rate to 10 fps: dropped frames, then a crash.

When I render I turn monitor scaling off, which deletes all the caches.
During the delivery render I noticed file write activity by Resolve for files other than the delivery file: Resolve was rendering background auto user cache for transitions/composites/Fusion DURING the delivery render output.

Disabling the Clips view so it does not generate thumbnails, and disabling all auto user cache = no WHEA error or crash on the 4K sequence delivery render.

With multiple 8K R3D layers on my timeline, and Resolve trying to render a similar multi-layer 8K R3D background cache simultaneously, that seems like the problem: you're running out of heap space or VRAM or something, and that causes the NVIDIA WHEA error.
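
To put rough numbers on the VRAM theory, here is a back-of-the-envelope sketch. Everything in it is an assumption for illustration: frames held as uncompressed 32-bit float RGBA, 8192x4320 for the 8K R3D layers, 4096x2160 for the VFX layers, and one extra concurrent job for the background cache. It ignores decode buffers, node caches, and whatever else Resolve actually allocates, so treat it as a lower bound at best.

```python
MIB = 1024 ** 2


def frame_bytes(width, height, channels=4, bytes_per_channel=4):
    """Size of one uncompressed frame (default: RGBA, 32-bit float per channel)."""
    return width * height * channels * bytes_per_channel


# Hypothetical timeline matching the description: two 8K R3D layers
# plus two 4K VFX layers with alpha. Resolutions are assumed.
LAYERS = [
    ("8K R3D", 8192, 4320),
    ("8K R3D", 8192, 4320),
    ("4K VFX with alpha", 4096, 2160),
    ("4K VFX with alpha", 4096, 2160),
]


def timeline_frame_mib(layers, concurrent_jobs=1):
    """MiB needed to hold one frame of every layer, per concurrent render job."""
    total = sum(frame_bytes(w, h) for _, w, h in layers)
    return total * concurrent_jobs / MIB


for name, w, h in LAYERS:
    print(f"{name}: {frame_bytes(w, h) / MIB:.0f} MiB per frame")

print(f"Delivery render only: {timeline_frame_mib(LAYERS):.0f} MiB")
print(f"Delivery + background cache: {timeline_frame_mib(LAYERS, 2):.0f} MiB")
```

With these assumptions a single timeline frame is about 1350 MiB of source layers, and about 2700 MiB if a background cache job holds a second copy at the same time, before any intermediate buffers are counted.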

ChristopherSeguine

  • Posts: 89
  • Joined: Thu May 02, 2013 5:00 pm
  • Location: California

Re: 15b2 WHEA Errors with Alpha channel source

PostSun May 27, 2018 7:52 pm
