Thanks for the feedback! I really appreciate it!
There's a bunch of info to unpack, and I need to look over all your comments with the tool in front of me. It would be great if you guys could use the issues tab on the Github page since we can deal with them in order and benefit from the community over there. Some, have been debated there already I think.
Word based animation etc.Regarding the text+ feature, I see it more like a VFX upgrade / nice to have, rather than pure editing, so my instinct tells me that we should focus on editing features first, unless we find help. A big next step would be to push the AI search features as much as possible, since we're also directly benefiting from them on our current projects - I'd love to explain that more some time, btw.
About losing syncThe large model is better than the medium model (especially on non-english languages), but also has its biases here and there.
Another thing to mention is that the 23.976fps timelines are problematic for the Resolve API since we're getting either a 23fps or 24fps rounded integer from Resolve, instead of the correct float - see known issues on the Github page. I think I've reported this months ago on the forum here too and others have confirmed it.
But, this might also just be the AI getting lazy every now and then and just acting like a child
Currently, you can re-align phrases using shortcuts directly from the app and I think we could automate the re-alignment with another AI model soon - again, for feedback if this would be useful, it would be amazing to debate it on Github.
Another thing that helps in our editing room, is to select the segments that are off with V, and simply re-transcribe using key T - I would avoid however re-transcribing segments that are less than 20-30 sec long with anything less than the large model because the AI will be missing a lot of context to get to better results than in the first pass (I might be wrong though)