Trensharo wrote:
But it might also force GPU upgrades for some PC users (if they want to use it regularly).
That's what would be interesting to find out ,what is the minimum Vram requirement for not crashing and also not taking a ridiculous amount of time to generate the model. I"m unsure if Resolve scaled to the 24GB Vram of my GPU resulting in nearly 18GB VRAM use or if it needs that much. What happens with a 16/12/8 GB VRAM card, does the data fit within the VRAM or it doesn't causing slow processing and possibly crashing.
Also when people download and install the Voice Trainer consider restarting Resolve, the new engine will then be optimised. Optimisation I've read uses 'best tactics' for each GPU, it most likely relates to Tensor cores or lack of tensor cores but possibly considers the VRAM of the GPU (but probably not)
I tried again after Engine optimisation, not sure if the difference was related to optimisation or there was another reason for different figures but this time, using 9m20s of sample audio training took 8 minutes, and maximum VRAM was 12.2GB, Voice convert was mainly just under 8GB VRAM but with peaks over 10GB. The result of the new AI model was not very good, I had better results training the previous 5minute audio but different samples. The source's were of similar quality, not perfect with minor background hiss.