I have several tapes (yes actual cassette tapes) of my grandfather reading a novel.
Unfortunately a few of the tapes have degraded to the point that I cannot play them back.
I would love to recreate his voice, to “rerecord” the missing bits.
The recordings are in Danish.
Is this possible?
If it is, how can I go about it?
I’ve been able to generate very good results with this open source project. You need a pretty good nVidia GPU, and it takes some time and tedious work to get it working they way you want it to:
https://github.com/neonbjb/tortoise-tts
Some voices sound exactly right. Other sound like a broken robot. The main reason I like it is that I can run it local without having to sign up for some stupid cloud service.
Looks very cool. I was unable to see anything regarding languages. Is it completely language independent somehow, or is it English only?
I have only used it with American English. Oddly, it will sometimes slip into a British accent. I believe it is possible to retrain it on other languages, but I have not done the deep dive required to do so.