Recording, Mixing, Performance and Production
Joined: Jan 2002
Posts: 7,913
Veteran
OP Offline
Fellow home recordists:

I found out about this project through a TED talk. It's an initiative to vastly expand the variety of, and access to, recorded human speech for people who rely on speech synthesis or assistance.

The main project page: https://vocalid.co/

The page where we can donate our voices: https://vocalid.co/voicebank

So many of us have better-than-average recording setups at home - let's band together to provide well-recorded speech to those who need it.

-Scott

Joined: Dec 2003
Posts: 8,987
Veteran
Offline
Thanks, Scott!

Folks marvel at the capabilities of my Dynavox Maestro® Speech Synthesis Device, but even with the recent advancements in DSP (Digital Speech Processing), we both know that the technology is still in its infancy and progressing exponentially. What is possible today will be outdone in a few months. Already, my Maestro can be completely controlled by eye movements without connections.

I am privileged to see some R&D projects currently under development and they are astonishing.

I demonstrate the device at VA Medical Centers for veterans who have become speech deprived. To think that we will soon SING is something so wonderful.

Joined: Dec 2003
Posts: 8,987
Veteran
Offline
I'm envisioning the impact that Scott, Dr Gannon, Matt Finley, David Cuny, and others would have on this technology. Wow! I might sing again!

Dr Donald K. Reynolds, Dean of the College of Engineering at UW, said that a primary purpose of higher education is to prevent us from "...trying to reinvent the wheel!" So it behooves us to know and understand the current state of the art. Perhaps Scott, as an Audio Engineer, would be best qualified for that role.

Joined: May 2015
Posts: 10
Newbie
Offline
Thanks for posting this. I put it on my Facebook page with a shout-out to the singers, actors and voiceover folks I know to get involved, which some have apparently done.

Cheers,

Ed


***********************
"We know what we are, but not what we may be"

- Noted bluesman Blind Will Shakespeare
***********************
Joined: Sep 2010
Posts: 8,187
Veteran
Offline
In this project, it looks like they are creating collections of phonetic units, and classifying the donor parameters to build collections. It seems like it would be pretty standard stuff, but there might be some "secret sauce" that I'm unaware of.

Along those lines, I've read a paper where the researcher replaced key phonemes in otherwise generic synthesized output, and was able to overlay the "personality" of the voice. So they might be taking advantage of that as well.

Vocaloid is still the "gold standard" of singing resynthesis, as far as I know, and the process takes months to create a good voice donor. It also helps if you've got native English speakers on the team.

If you're interested, Avanna is currently on sale for $49, and she's probably the best English Vocaloid on the market. It only comes with the "tiny" editor, which is limited to creating 18 bars at a time, so you'd have to stitch the output together to make a full song. Personally, that's not much of a limitation, because I prefer to build melodies one section at a time.

I've been working for the last couple of weeks trying to get synSinger to automatically extract parameters from audio samples, but have been getting mixed results. If there's anyone with some audio/programming background, I'd love to hear from them!

As far as BiaB goes for vocal synthesis, the MusicXML it creates is pretty broken. I reported and re-reported bugs (wrong pitches, rests between syllables) over a year ago, and have seen no fixes.

It's very frustrating, especially since I'm trying to use BiaB to generate output for synSinger.


-- David Cuny

My virtual singer development blog
Vocal control, you say. Never heard of it. Is that some kind of ProTools thing?

BiaB 2025 | Windows 11 | Reaper | Way too many VSTis.
Joined: Jun 2012
Posts: 3,891
Veteran
Offline
David, thanks for that info...Avanna is very interesting! Do you know if it is limited unless you buy Vocaloid 3 separately? Also, have you ever checked out this product, http://realitone.com/blue/ ? I was thinking of buying it during their sale.

Joined: Sep 2010
Posts: 8,187
Veteran
Offline
Originally Posted By: JohnJohnJohn
David, thanks for that info...Avanna is very interesting! Do you know if it is limited unless you buy Vocaloid 3 separately? Also, have you ever checked out this product, http://realitone.com/blue/ ? I was thinking of buying it during their sale.

Avanna comes with the free "tiny" editor by default. There are two limitations with the "tiny" editor:
  • You can only work with one vocal track at a time, and
  • You only get 18 bars per session

Since I work in a DAW, it's not really a problem for me to assemble the parts from pieces, and layer the tracks. You can also import a single audio track, so you can play the backing tracks as you're fiddling with the vocals.

I've not heard of Realivox. Props to the video demo - they do a good job explaining what the product can't do. Vocaloid doesn't have those limitations - it automatically anticipates the consonants. Plus, they state: "Finally, your search for a library that says "oosht" is over!"

The legato from pitch to pitch also isn't as nice as Vocaloid's. On the other hand, Realivox looks pretty fun to play.

So if you're looking for a nice background vocal, Realivox is probably a better choice than Vocaloid, as long as you're aware of the tool's limitations.


-- David Cuny

Joined: Jan 2002
Posts: 7,913
Veteran
OP Offline
I'd like to steer the conversation back to the voicebank project.

Joined: Dec 2003
Posts: 8,987
Veteran
Offline
I posted it among my HS Classmates and have been getting a lot of interest. I posted VocalID's promo today.

Sorry if I hijacked your thread, it was not intentional.

Joined: Jan 2002
Posts: 7,913
Veteran
OP Offline
Don, no hi-jack.

If you, the reader, are just arriving at this thread at this post, please follow the links in the first post and consider donating your voice to the voicebank.

If nothing else, watch the TED talk video: http://www.ted.com/talks/rupal_patel_synthetic_voices_as_unique_as_fingerprints?language=en

See if that doesn't inspire you...

Joined: Jan 2002
Posts: 7,913
Veteran
OP Offline
I recorded my first 500 sentences tonight. A few notes for anyone interested:

Make yourself comfortable so that you can read the sentences aloud while recording without much head movement. However, don't put the mic right in front of your monitor; if you do, some notch filtering can occur. The recording 'studio' that is part of the voicebank is quite simple to use.

It's no small task to record 500 sentences: it took me about an hour and a half. The overall amount is roughly 3500 sentences, so I'm looking at probably 8 hours of time that I'll be putting into this effort. The first 150 sentences or so took longer because of the way I was oriented; I had to turn away from the monitor for each sentence. The next 350 went by much faster.
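
For anyone budgeting their own time, here is a rough back-of-the-envelope sketch of the numbers in the post above. It assumes a constant pace, which the post itself says is not quite true (the first 150 sentences were slower), and the function names are made up purely for illustration.

```python
def estimate_total_hours(total_sentences, sentences_done, minutes_taken):
    """Project total recording time in hours, assuming the pace so far holds."""
    return total_sentences * minutes_taken / sentences_done / 60


def seconds_per_sentence(hours, sentences):
    """Average pace, in seconds per sentence, implied by a time budget."""
    return hours * 3600 / sentences


# Figures from the post: 500 sentences in about 90 minutes, roughly 3500 in total.
projected = estimate_total_hours(3500, 500, 90)
print(f"At the first session's average pace: {projected:.1f} hours")

# Compare the first session's pace with the pace an 8-hour total would require.
print(f"First session pace: {seconds_per_sentence(1.5, 500):.1f} s per sentence")
print(f"Pace needed for 8 hours: {seconds_per_sentence(8, 3500):.1f} s per sentence")
```

At the first session's average pace the full set works out to about 10.5 hours, so the 8-hour figure depends on keeping up the faster pace of the later sentences.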
