Previous Thread
Index
Next Thread
Print Thread
Go To
#374778 11/17/16 10:28 PM
Off-Topic
Joined: Sep 2010
Posts: 8,240
dcuny Offline OP
Veteran
OP Offline
Veteran
Joined: Sep 2010
Posts: 8,240
For those following the progress of this project, I'd hoped that I'd have a final version of my synthetic singer program completed back in March. However, it had a bad case of the mumbles, and I headed back to the drawing board to rewrite it. It's been eating my spare time since then.

I spent far too much time trying to get synthesized plosives and fricatives working. Then I listened to earlier versions of the program, and realized how much better they sounded using samples instead. So all that work got tossed out, and I ended up writing the code from scratch... again. cry

Just the other day, I got the code to where it is again reading MusicXML files and generating .wav files. There's still a lot of work to do, but the end is (hopefully) in sight.

I've been using Twinkle, Twinkle, Little Star" as my demo song, and here's the most current version, warts and all. For example, I haven't yet figured out why it can't say "world" correctly.

Twinkle, Twinkle, Little Star #15

As a change of pace, I decided to do my own version of Daisy Bell, one of the first examples of computer generated singing. It's still missing some phonemes, so I've cheated in spots. For example, the /G/ is actually /D SH/. The lyrics were automatically converted to phonemes, but I did some replacement by hand because the allophonic replacement code isn't working yet.

I also added some compression and reverb because everything sounds better with reverb. Just to let you know, it doesn't sound quite this good out of the box... But it sounds exactly as bad wink.

Anyway, here's synSinger singing "Daisy", as well as the 1961 version by Max Mathews, John Kelly, and Carol Lochbaum, which I found on Perry Cook's website. To make comparison easier, they're in the same key and tempo:

Daisy Bell (IBM 7094)
Daisy (synSinger)



I've always been curious about what software was used to generate the computer performance 55 years ago. It turns out that the data was hand-coded into the computer. That explains why I was never able to find any references to the text-to-singing program... It never existed! blush

As always, comments (positive or negative) are always appreciated.


-- David Cuny

My virtual singer development blog
Vocal control, you say. Never heard of it. Is that some kind of ProTools thing?

BiaB 2025 | Windows 11 | Reaper | Way too many VSTis.
dcuny #374803 11/18/16 02:57 AM
Off-Topic
Joined: Apr 2009
Posts: 10,883
Veteran
Offline
Veteran
Joined: Apr 2009
Posts: 10,883
It needs more Cowbell.


You can find my music at:
www.herbhartley.com
Add nothing that adds nothing to the music.
You can make excuses or you can make progress but not both.

The magic you are looking for is in the work you are avoiding.
Off-Topic
Joined: Sep 2010
Posts: 8,240
dcuny Offline OP
Veteran
OP Offline
Veteran
Joined: Sep 2010
Posts: 8,240
Originally Posted By: Guitarhacker
It needs more Cowbell.

I'll confess - that's one thing I hadn't considered. shocked

But that reminds me that I do need to add another feature - using another sound source instead of a glottal pitch. Remember the cellos voice on the MacInTalk?


-- David Cuny

My virtual singer development blog
Vocal control, you say. Never heard of it. Is that some kind of ProTools thing?

BiaB 2025 | Windows 11 | Reaper | Way too many VSTis.
dcuny #374861 11/18/16 07:42 AM
Off-Topic
Joined: Sep 2016
Posts: 408
W
PG Music Staff
Offline
PG Music Staff
W
Joined: Sep 2016
Posts: 408
I've never heard that original recording, absolutely insane that that was 1961... I also can't believe that you coded this yourself, that's crazy!! Amazing work. Your version is sounding very good. Keep us updated on how you progress with "world"!


Will - PG Music
dcuny #376488 11/24/16 02:43 PM
Off-Topic
Joined: Mar 2007
Posts: 1,342
Expert
Offline
Expert
Joined: Mar 2007
Posts: 1,342
Wow. Amazing you coded that yourself. I can't even begin grasping how you did that. Like how do you pick the right sample for a specific part of a word, or did you sample sounds and complete words? Individual letters can sound different when used in combination with other letters...

Can you please show some insight in that?

dcuny #376565 11/24/16 07:35 PM
Off-Topic
Joined: Sep 2010
Posts: 8,240
dcuny Offline OP
Veteran
OP Offline
Veteran
Joined: Sep 2010
Posts: 8,240
There are a couple different ways that vocal synthesis can be approached. The method that I'm using is called "formant synthesis", and is one of the oldest techniques that's been used for computer synthesis.

In English, there are approximately 40 distinct "sounds" that make up the language, are referred to as "phonemes".

There are different phonetic systems, but one of the simplest for American English is the "Arpabet", which uses plain text characters to represent phonemes. For example, the word "dictionary" would be written:

/D IH K SH AH N EH R IY/

You can play around with the the online CMU Pronouncing Dictionary to see how this works.

I use the CMU Dictionary to convert English into phonemes. If a word isn't found in the dictionary, I fall back to a public domain program called "Reciter" which guesses how to pronounce the word.

Phonemes are turned into sound by simulating the human vocal tract electronically. Before explaining that, let give give a (very simplified) explanation of how we create vocal sounds.

As air passes through the glottal folds, the folds vibrate and create sound. By controlling the tension (which in turn controls the length of the folds' opening), we can raise and lower the pitch we create. This pitch is called the fundamental frequency (F0), which we hear at the pitch of the voice.

This pitched glottal pulse (which resembled a kazoo sound) passes through our mouth. We use our tongue to create one or more resonating chambers that reinforce specific frequencies in the glottal pitch. These reinforced frequencies are called "resonances", and are what distinguishes one phonemes from another.

For example, (borrowing from the SoftVoice website), here are a number of vowel sounds, and the frequencies of resonance for the "average" male speaker in Hz:

/IY/ (beet): 270, 2300, 3000
/IH/ (bit): 400, 2000, 2550
/EH/ (bet): 530, 1850, 2500
/AE/ (bat): 660, 1700, 2400
/AH/ (but): 640, 1200, 2400
/UW/ (boot): 300, 870, 2250

In the phoneme /IY/ (as in beet), the first formant (F1) is at 270Hz, the second (F2) is at 2300Hz, and the third (F3) is at 3000Hz. Again, these formants don't alter the fundamental pitch, and remain fixed no matter what pitch you're singing.

Some phonemes are obviously more complex than that. For example, the phonemes /IY/ and /UW/ are diphthongs, and consist of two distinct targets. But I'm digressing...

To do this electronically, I generate a waveform that approximates a glottal pulse at the desired pitch, and pass it through a series of bandpass filters - one for each formant frequeny - to resonate at the desired frequencies. The output is a rough approximation of the sound.

Changing the pitch of the glottal pulse changes the pitch that's being sung. Changing the resonating filters to new values changes the phoneme that's being sung.

Some sounds (like the frication in the /F/ or the plosive in the /T/) are created by means different than described above. I used to synthesize them, but I now use digital samples because they give better results.

Very little work that I've done are my own ideas. I've borrowed heavily from the published work of Dennis Klatt, who wrote one of the first text-to-speech computer programs.

If you're curious, I'd highly recommend downloading this Formant Synthesis Demo. Click and drag in the area marked F1/F2 (formants one and two) and you'll get a good idea how this works.

Did that clear up some of the mystery?


-- David Cuny

My virtual singer development blog
Vocal control, you say. Never heard of it. Is that some kind of ProTools thing?

BiaB 2025 | Windows 11 | Reaper | Way too many VSTis.
dcuny #376694 11/25/16 07:19 AM
Off-Topic
Joined: Mar 2007
Posts: 1,342
Expert
Offline
Expert
Joined: Mar 2007
Posts: 1,342
Thanks for you answer, David. It did bring me more understanding of the subject of vocal synthesis. I will visit the sites you mentioned. Please keep us informed about your progress with this project. I find it very interesting.

dcuny #378406 11/29/16 10:11 PM
Off-Topic
Joined: Sep 2010
Posts: 8,240
dcuny Offline OP
Veteran
OP Offline
Veteran
Joined: Sep 2010
Posts: 8,240
Another update - the main change has been adding vibrato back into the code.

Twinkle, Twinkle, Little Star #16 (synSinger)

The various parameters of vibrato - depth, speed, minimum note length and delay before start - can be specified.

I've also worked on clearing up a number of phonemes, including the /ERL/ in "world", although the main problem with that word is the trailing /D/.

This version of synSinger is written in Lua, and isn't particularly fast - it renders audio about about half the speed of the song. I still haven't been able to spend time to figure out how to create a stand-alone executable. I need to spend some time with the squish documentation.

There are still instances where it will "squelch" when parameters change to quickly, and some of the phonemes still need more attention. But for the most part, the output seems to be fairly acceptable, although not always intelligible. Truth be told, it's only incrementally better than prior versions.

I've also created a "female" voice for synSinger by mapping phonemes formants from average male phoneme space to average female phoneme space. It also modifies some other parameters, such as raising the pitch up an octave (so "she" doesn't sound like a chain smoker), adding more breath noise, altering the glottal pulse, and modifying the formant bandwidths based on a shorter larynx. But it still sounds a bit cheesy, because female voices aren't something formant synthesis does that well:

Twinkle, Twinkle, Little Star (synSinger female voice)

As always, feedback is welcome. smile


-- David Cuny

My virtual singer development blog
Vocal control, you say. Never heard of it. Is that some kind of ProTools thing?

BiaB 2025 | Windows 11 | Reaper | Way too many VSTis.
Previous Thread
Next Thread
Go To

Link Copied to Clipboard
ChatPG

Ask sales and support questions about Band-in-a-Box using natural language.

ChatPG's knowledge base includes the full Band-in-a-Box User Manual and sales information from the website.

PG Music News
Holiday Weekend Hours

It's Victoria Day Long Weekend in Canada. Our Customer Service hours are:

Saturday, May 16: Closed
Sunday, May 17: Closed
Monday, May 18: 8:00am - 4:00pm

Regular hours
resume Tuesday, May 19th!

Today's the Last Day of the Band-in-a-Box 2026® for Mac Special!

Order before 11:59pm PDT today (May 15, 2026) to save up to 50% off your Band-in-a-Box® 2026 for Mac® upgrade and receive a FREE Bonus PAK loaded with great new Add-ons to use with this new version!

Don't wait - order today!

Check out all the new features in the redesigned Band-in-a-Box® 2026 for Mac®!

Band-in-a-Box® 2026 for Mac - Special Offers End at 11:59pm PDT on Friday, May 15th, 2026!

Order before 11:59pm PDT on Friday, May 15th and SAVE up to 50% on most Band-in-a-Box® version 2026 for Mac Upgrade packages... and that's not all! With your version 2026 for Mac purchase, we'll include a Bonus PAK full of great new Add-ons FREE! Upgrade to the 2026 49-PAK to receive even more NEW Add-ons including 20 additional RealTracks... that's 222 NEW RealTracks available with version Band-in-a-Box® 2026 for Mac!

Upgrade to Band-in-a-Box® 2026 for Mac® today for as little as $49! Check out the Band-in-a-Box® packages page for all available purchase options.

Learn more about the Free Bonus PAK and 49-PAK here.

If you have any questions about which package is the best option for you, just let us know. We're here to help!

202 New RealTracks Released with Band-in-a-Box 2026!

With Band-in-a-Box® 2026, we've released 202 incredible new RealTracks (in sets 468-488) in a variety of genres—featuring your most requested styles!

Jazz, Funk & World (Sets 468-475):
Our new jazz, funk & blues RealTracks include a groovin’ collection of RealTracks and RealDrums! These include more requested “soul jazz” RealTracks featuring artists Neil Swainson (bass), Charles Treadway (organ), Brent Mason (guitar), and Wes Little (drums). There are new “smooth jazz” styles (4), which include a RealTracks first: muted trumpet, as well as slick new smooth jazz brushes options for drums. Blues lovers will be thrilled—there are more “classic acoustic blues” styles, including guitar (5), bass (4), and drums (10) with blues master Colin Linden, featuring understated and tasty background acoustic soloing, plus brushes drums and acoustic bass. There are also new electric blues RealTracks, including electric blues with PG favorite Johnny Hiland (3) and soulful electric slide guitar from Colin Linden (4). If you love funk & gospel, there are great new options this year, including gospel organ (3) from Charles Treadway, as well as new funk, tango, and rock ’n’ roll drums (3) and bass (1). And for big, bold arrangements, we have uptempo soul horns (4) featuring a three-part hip horn section with options for a full mix or stems of each individual horn — plus an accompanying rhythm section (4) of drums, bass, guitar, and electric piano!

Rock & Pop (Sets 476–482):
Our new rock & pop RealTracks bring a powerful mix of requested favorites, fresh genres, and modern chart-inspired styles! We have more of our popular “Producer Layered Acoustic Guitars (15)” featuring Band-in-a-Box favorite Brent Mason. We’ve continued our much-requested disco styles (10), and added new Celtic guitar (5) with a more basic, accessible approach than our previous Drop-D or DADGAD offerings. There are also highly requested yacht rock styles (17), inspired by the smooth, polished soft-rock sound of the late ’70s and early ’80s — laid-back grooves, silky electric pianos, warm textures, elegant harmonic movement, and pristine production aesthetics. Fans of heavier styles will love our new glam metal (13), capturing the flashy, high-energy sound of ’80s arena-ready guitar rock. We also have a set of rootsy modern-folk rock (18), with a warm, organic sound combining contemporary folk textures and driving acoustic strumming. And we’ve added lots of new modern pop styles (16) — the kinds of sounds you’re hearing on the radio today, featuring exciting new drums, synths, and cutting-edge RealTracks arrangements.

Country, & Americana (Sets 483–488):
Our new country & Americana RealTracks deliver a rich collection of acoustic, electric, and roots-inspired styles! We have new country pop (9) with legendary guitarist Brent Mason. There is also a potpourri (14) of bouzouki, guitars, banjo, and more, perfect for adding texture and character to contemporary acoustic arrangements. We’ve added funky country guitar (5) with PG favorite Brent Mason, along with classic pedal steel styles (5) featuring steel great Doug Jernigan. There are more country songwriter styles (8) that provide intimate, rootsy foundations for storytelling and modern Americana writing. Finally, we have “background soloing” acoustic guitar (12) with Brent Mason — simpler, but still very tasty acoustic lines designed to sit beautifully behind vocals or act as a subtle standalone solo part.

Check out all the 202 new RealTracks (in sets 468-488)!

And, if you are looking for more, the 2026 49-PAK (for $49) includes an impressive collection of 20 bonus RealTracks, featuring exciting and inspiring additions to add to your RealTracks library. You'll get new country-rhythm guitar styles from PG Music favorites Johnny Hiland and Brent Mason, along with modern-pop grooves that capture today’s radio-ready sound! There are also new indie-folk styles with guitar, bass, 6-string bass used as a high-chording instrument, acoustic guitar, and banjo. Plus, dedicated "cymbal fills" RealDrums provide an added layer that work very well with low-key folky styles with other percussion.

The 2026 49-PAK is loaded with other great new add-ons as well. Learn more about the 2026 49-PAK!

2026 Free Bonus PAK & 49-PAK for Band-in-a-Box® 2026 for Mac®!

With your version 2026 for Mac Pro, MegaPAK, UltraPAK, UltraPAK+, Audiophile Edition or PlusPAK purchase, we'll include a Bonus PAK full of great new Add-ons for FREE! Or upgrade to the 2026 49-PAK for only $49 to receive even more NEW Add-ons including 20 additional RealTracks!

These PAKs are loaded with additional add-ons to supercharge your Band-in-a-Box®!

This Free Bonus PAK includes:

  • The 2026 RealCombos Booster PAK: -For Pro customers, this includes 27 new RealTracks and 23 new RealStyles. -For MegaPAK customers, this includes 25 new RealTracks and 23 new RealStyles. -For UltraPAK customers, this includes 12 new RealStyles.
  • MIDI Styles Set 92: Look Ma! More MIDI 15: Latin Jazz
  • MIDI SuperTracks Set 46: Piano & Organ
  • Instrumental Studies Set 24: Groovin' Blues Soloing
  • Artist Performance Set 19: Songs with Vocals 9
  • Playable RealTracks Set 5
  • RealDrums Stems Set 9: Cool Brushes
  • SynthMaster Sounds Set 1 (with audio demos)
  • iOS Android Band-in-a-Box® App
Looking for more great add-ons, then upgrade to the 2026 49-PAK for just $49 and you'll get:
  • 20 Bonus Unreleased RealTracks and RealDrums with 20 RealStyle.
  • FLAC Files (lossless audio files) for the 20 Bonus Unreleased RealTracks and RealDrums
  • MIDI Styles Set 93: Look Ma! More MIDI 16: SynthMaster
  • MIDI SuperTracks Set 47: More SynthMaster
  • Instrumental Studies 25 - Soul Jazz Guitar Soloing
  • Artist Performance Set 20: Songs with Vocals 10
  • RealDrums Stems Set 10: Groovin' Sticks
  • SynthMaster Sounds & Styles Set 2 (sounds & styles with audio demos)

Learn more about the Bonus PAK and 49-PAK for Band-in-a-Box® 2026 for Mac®!

XPro & Xtra Styles PAK Sets On Sale Now - Until May 15, 2026!

All of our XPro Styles PAKs and Xtra Styles PAKs are on sale until May 15th, 2026!

It's the perfect time to expand your Band-in-a-Box® style library with XPro and Xtra Styles PAKs. These additional styles for Band-in-a-Box® offer a wide range of genres designed to fit seamlessly into your projects. Each style is professionally arranged and mixed, helping enhance your songs while saving you time.

What are XPro Styles and Xtra Styles PAKs?

XPro Styles PAKs are styles that work with any version (Pro, MegaPAK, UltraPAK, UltraPAK+, or Audiophile Edition) of Band-in-a-Box® 2025 (or higher). XPro Styles PAKS 1-10 includes 1,000 styles!

Xtra Styles PAKs are styles that work with the UltraPAK, UltraPAK+, or Audiophile Edition of Band-in-a-Box® 2025 (or higher). Xtra Styles PAKs 1-21 includes 3,700 styles (and 35 MIDI styles)!

The XPro & Xtra Styles PAKs are not included in any Band-in-a-Box® package.

The XPro Styles PAKs 1-10 are available for only $29 ea (reg. $49 ea), or get them all in the XPro Styles PAK Bundle for only $149 (reg. $299)! Listen to demos and order now! For Mac or for Windows.

The Xtra Styles PAKs 1-21 are available for only $29 ea (reg. $49 ea), or get them all in the Xtra Styles PAK Bundle for only $199 (reg. $349)! Listen to demos and order now! For Mac or for Windows.

Note: XPro Styles PAKs require Band-in-a-Box® 2025 or higher and are compatible with ANY package, including the Pro, MegaPAK, UltraPAK, UltraPAK+, and Audiophile Edition.

The Xtra Styles require the UltraPAK, UltraPAK+, or Audiophile Edition of Band-in-a-Box®. (Xtra Styles PAK 19 requires the 2025 or higher UltraPAK, UltraPAK+, or Audiophile Edition. They will not work with the Pro or MegaPAK version as they require the RealTracks included in the UltraPAK, UltraPAK+, or Audiophile Edition.

Supercharge your Band-in-a-Box today with XPro Styles PAKs and Xtra Styles PAK Sets!

Band-in-a-Box 2026 for Mac Videos

With the release of Band-in-a-Box® 2026 for Mac, we’re rolling out a collection of brand-new videos on our YouTube channel. We’ll keep this forum post updated so you can easily find all the latest videos in one convenient spot.

Whether you're exploring new features, checking out the latest RealTracks or Style PAKs, this is your go-to guide for Band-in-a-Box® 2026.

Check out this forum post for "One Stop Shopping" of our Band-in-a-Box® 2026 Mac Videos!

Forum Statistics
Forums57
Topics86,306
Posts803,036
Members40,087
Most Online64,515
Apr 8th, 2026
Newest Members
Thekesslerboy, DC Cougar, luli, windsax, GregNYR
40,086 Registered Users
Top Posters(30 Days)
MarioD 132
rsdean 93
DC Ron 78
vicarn 77
zedd 60
Noel96 58
Today's Birthdays
Bob Rogers, knurd75, Tommycat
Powered by UBB.threads™ PHP Forum Software 7.7.5