Originally Posted by Bass Thumper
So what if someone painstakingly copied a Sinatra sound, reproduced his voice and changed the lyrics for a single song.
In 2005, Paul Anka released on the album "Rock Swings". The album is covers of rock songs done in the big-band style.

The arrangement you're hearing in the video was created Brad Dechter, and played by studio musicians.

All that was done without AI.

The only thing that AI did was:

  • Split out the vocal from the backing track; and
  • Replace the timbre of Anka's voice with Sinatra's.


While that's technologically impressive, it's orders of magnitude less than the sort of thing you are describing.

Originally Posted by Bass Thumper
...and my prediction is that some company will provide exactly this within 5 years.
You've got expertise in the field to base this prediction on?

Because that's not the sort of predictions I hear from people who are actually trying to accomplish this. They're saying there's a world of difference between working with temporally stable pixels and time-changing audio.

Predicting that audio will follow the same timeline as AI generative graphics ignores the huge difference between them.


-- David Cuny
My virtual singer development blog

Vocal control, you say. Never heard of it. Is that some kind of ProTools thing?