Log in to post
|
Print Thread |
|
|
|
|
User Showcase
|
Joined: Sep 2010
Posts: 7,612
Veteran
|
OP
Veteran
Joined: Sep 2010
Posts: 7,612 |
When I posted a song a while back recorded using Sinsy, Don Gaynor had expressed hope that with this sort of technology he'd be able to sing again. While the fine folk at DynaVox haven't gotten back to him yet, I thought I might have a go at solving the problem. You may have noticed I've been a bit absent from the board since then. Here's the result: Loch Lomond (with harmony)It's "state of the art" 1980's formant synthesis. It's not the technology I'd intended to use, but that's a rather long story. I'm still working out the bugs in the code. It's not ready for end users, but it's finally "singing." There's a bit of post-processing on the vocals. I used the PG Vinyl plugin to reduce some of the popping on consonants, and a low-pass filter to kill some of the high-end noise. I've also run things through EZ Mix to add some reverb and compression. Most of the phonemes are fairly acceptable, but there's an /H/ and /L/ that seem to have gone missing. I've only tweaked one of the phonemes. At 0:15 there's what sounds like a breath noise. It's actually a bug, so I lowered the volume there. (I intend to add support for breaths at some point). The accompaniment is PJONPBA.STY (PopBalladPiano & Ac.Guitar[85RS]). I guessed at the chords from the sheet music, so there may be some clinkers in there. The harmony is just thirds above the melody, so there may be some clinkers there, too.
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Dec 2003
Posts: 8,987
Veteran
|
Veteran
Joined: Dec 2003
Posts: 8,987 |
I love it David! Thanki, mate!
Now you may understand the difficulty I ran into trying to overcome inertia and reluctance to change on the part of Corporations, especially from ootsiders. Very frustrating and disheartening. It's extremely difficult to keep on slugging when we see no progress. I have several dead-end projects that I am stymied on including "The iPod Project" (qv).
Last edited by Don Gaynor; 05/29/13 03:48 PM. Reason: To make a semblance of cents to Canadian readers and moderators.
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Sep 2010
Posts: 7,612
Veteran
|
OP
Veteran
Joined: Sep 2010
Posts: 7,612 |
Hopefully, I can shape it into something useful. I've taken a couple of days off work in an effort to get this into shape, and I've just found a major portion of the code that needs to be rewritten. Once this is stable, I still need to set up some sort of front end for it. The current plan is to have it read MusicXML files. I've got some code I've written for a different project I should be able to re-purpose. I also need to write some code to do dictionary lookup. I've got a nice hyphenated phonetic dictionary, so hopefully the majority of that work is already done. But... I still need to stabilize this code. If you're really lucky, this will give incentive to DynaVox to finish up their project, and you'll get some real voice synthesis. 
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Dec 2005
Posts: 4,047
Veteran
|
Veteran
Joined: Dec 2005
Posts: 4,047 |
David,
This effort you have made into this cutting edge technology is outstanding. I've spent thousands of hours since the 70's working on coding projects that had never been done before. Can really appreciate the efforts you have put into this.
Hang in there and continuing good efforts on your "projects". Note didn't say good luck. Patience and stamina, be with you.
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Sep 2010
Posts: 7,612
Veteran
|
OP
Veteran
Joined: Sep 2010
Posts: 7,612 |
This effort you have made into this cutting edge technology is outstanding. Yes, cutting edge 1980's technology at it's best.  This is about four generations removed from current voice synthesis technology. You might recall S.A.M. (Software Automatic Mouth), which was the basis of MacInTalk. If you play with the demo on that page (decompiled from assembly into C, and then converted into JavaScript!), you can hear the familial resemblance. Interestingly, the company that put out S.A.M. is still in business as SoftVoice, Inc., and it was their demo of Twinkle, Twinkle Little Star that convinced me that while formant synthesis might not create realistic results, it might be "good enough" for my purposes. Since they've been doing this for the last 30 years, I think their example is probably as good as this technology gets.
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Jun 2012
Posts: 2,888
Veteran
|
Veteran
Joined: Jun 2012
Posts: 2,888 |
Hi David,
a milestone in your efforts. Following your blog I know what a lot of work this was and still is.
Guenter
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Sep 2010
Posts: 7,612
Veteran
|
OP
Veteran
Joined: Sep 2010
Posts: 7,612 |
The next step it to make this usable. I'd like to write a UI that displays the music on a staff, and integrates with the CMU phonetic dictionary. Nothing terribly complex - just good enough to "get the job done." It'll output a .wav file, which BiaB can load. The rest is up to Don. 
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Aug 2012
Posts: 12,469
Veteran
|
Veteran
Joined: Aug 2012
Posts: 12,469 |
David,
I commend you on the effort that this must require. Standing ovation for that, alone. Bravo.
floyd
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: May 2008
Posts: 5,085
Veteran
|
Veteran
Joined: May 2008
Posts: 5,085 |
Kudos David. What a thoughtful thing to do - not to mention all the time and effort involved. I'm sure it will bring Don and others lots of fun in making their music when you're done.
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Sep 2010
Posts: 7,612
Veteran
|
OP
Veteran
Joined: Sep 2010
Posts: 7,612 |
Thanks! 
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Oct 2008
Posts: 19,839
Veteran
|
Veteran
Joined: Oct 2008
Posts: 19,839 |
Hi David, This is amazing! I can't even begin to comprehend the amount of effort that must have been involved. You realize that you'll have to change your signature's saying now! "Loch Lomond" is all about 'voice control' at it's absolute cleverest  Regards, Noel
MY SONGS...Audiophile BIAB 2025
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Oct 2008
Posts: 8,075
Veteran
|
Veteran
Joined: Oct 2008
Posts: 8,075 |
David,
I've got to be honest and say that my (very) first impression of this was that the timing of your presentation couldn't have been worse, considering the fact that Guenter has recently posted some excellent projects using a more advance synthetic voice. Comparisons are inevitable, and you are at a disadvantage being a single developer working for free without any kind of subsidy.
But the more I listen and compare your synth voice to the one Guenter is using, the more impressed I am with yours.
The biggest difference to my ear is that yours has a pronounced artifact where the syllables are joined together, whereas the other engine has figured out a way to blend the syllables more smoothly... which, in a song environment is definitely more musical.
Having dealt with artifacts when blending snippets in a DAW, I have to wonder if it would help to fade each of your separate syllables on both ends so the sharp edge that causes the artifact is less pronounced when they are joined together.
In the final analysis, all comparisons aside, what you have accomplished here is PHENOMEnal! (pun intended)
I wish you much luck with further development and I look forward to hearing more examples as time goes on.
Last edited by Pat Marr; 06/01/13 12:24 PM.
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Dec 2011
Posts: 15,415
Veteran
|
Veteran
Joined: Dec 2011
Posts: 15,415 |
The patience of Job must have gone into that. I can't imagine the code (and I used to do a little programming). Gonna be fascinating to see how it plays out. Thanks.
Our albums and singles are on Spotify, Apple Music, Amazon Music, YouTube Music, Pandora and more. If interested search on Janice Merritt. Thanks! Our Videos are here on our website.
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Sep 2010
Posts: 7,612
Veteran
|
OP
Veteran
Joined: Sep 2010
Posts: 7,612 |
Hi, Pat. Thanks for the your response. Before getting into details, I should clarify: this project is just re-implementing what others have already done, and much better than me. From text to speech: The MITalk system (Allen) was a great resource for me. The technology is essentially abandoned for other, better approaches. I'll give an explanation why I was unable to go down that route. Also, my immediate goal is to get Don something usable. If DynaVox finally gets him a better synthesis program, then, Hurrah!. No need for this program. I'm aware of the Vocaloid software - I've got the Avanna as well, because it's probably got the best English accent of all the current Vocaloids. I've also spent a lot of time looking at UTAU, a free synthetic singer written along the lines of Vocaloid. In fact, my initial approach was exactly what you suggested: record various phonemes (using Vowel/Consonant/Vowel patterns), cross-fade them together, and use pitch shifting. I've actually written a number of tools to do this. The stumbling block was the pitch shifting. The pitch shifting needs to shift some frequencies (the glottal pulse) and keep others fixed (the formants) or you get the "Mickey Mouse" effect. BiaB uses the astonishingly good elastiq algorithm. I couldn't find any free libraries that gave decent results - even the RubberBand library, which has formant preservation, didn't do an acceptable job. I tried FFT-based pitch shifting, but didn't have much luck. I got better results with PSOLA (Pitch Synchronous Overlap and Add), but there were significant artifacts: Here's an example. The examples I'd heard of formant-based synthesis convinced me that while it lacked realism, it was capable of creating intelligible and musical synthesis. I think you'll agree that, with some tuning, this synthesizer may not create realistic voices, but they can be understandable. And to be honest, I've been focused on just getting the code to work. I've spent very little time on fine-tuning the phonemes. This is alpha-software, and there's lots of room for improvement. That said, in Text-to-Speech Synthesis, Paul Taylor argues that formant-based synthesis is intrinsically un-natural because it can't capture the details of real speech, so I don't hold high hopes for it. I've considered mixing pre-recorded audio with synthesized sounds like eSpeak, but that raises plenty of issues. And there's still the issue of handling sounds like /B/, /D/ and /G/, which are voiced and consonants. So for the moment, I'm sticking with "pure" synthesis. I hope that somewhat explains that approach I've taken. Despite the many flaws, I figured it was time to move ahead with the project. For the moment, I'll be focusing on creating a UI.
|
|
|
|
|
|
|
|
|
|
|
|
User Showcase
|
Joined: Mar 2013
Posts: 4,494
Veteran
|
Veteran
Joined: Mar 2013
Posts: 4,494 |
Hi, David !
I am no expert on these matters which maybe makes me better judging the end result from a listeners point of view ? I am referring to the tune. Maybe I am partial as this tune has a special meaning for me. You see, once I sang this song to my dear wife Beni at the shore of this Loch and little did I know then, that the words held a prophecy as "me and my true love would never be allowed to meet again on the bonnie, bonnie banks of Loch Lomond" ! She died of cancer on October the 26 th 2011 !
I think you have done a marvellous job David ! Keep on the good work !
Cheers Dani
|
|
|
|
|
|
|
|
|
|
|
|
Ask sales and support questions about Band-in-a-Box using natural language.
ChatPG's knowledge base includes the full Band-in-a-Box User Manual and sales information from the website.
|
|
|
|
|
|
|
|
|
|
|
New RealTracks Released with Band-in-a-Box 2025!
We’ve expanded the Band-in-a-Box® RealTracks library with 202 incredible new RealTracks (in sets 449-467) across Jazz, Blues, Funk, World, Pop, Rock, Country, Americana, and Praise & Worship—featuring your most requested styles!
Jazz, Blues & World (Sets 449–455):
These RealTracks includes “Soul Jazz” with Neil Swainson (bass), Mike Clark (drums), Charles Treadway (organ), Miles Black (piano), and Brent Mason (guitar). Enjoy “Requested ’60s” jazz, classic acoustic blues with Colin Linden, and more of our popular 2-handed piano soloing. Plus, a RealTracks first—Tango with bandoneon, recorded in Argentina!
Rock & Pop (Sets 456–461):
This collection includes Disco, slap bass ‘70s/‘80s pop, modern and ‘80s metal with Andy Wood, and a unique “Songwriter Potpourri” featuring Chinese folk instruments, piano, banjo, and more. You’ll also find a muted electric guitar style (a RealTracks first!) and “Producer Layered Guitar” styles for slick "produced" sound.
Country, Americana & Praise (Sets 462–467):
We’ve added new RealTracks across bro country, Americana, praise & worship, vintage country, and songwriter piano. Highlights include Brent Mason (electric guitar), Eddie Bayers (drums), Doug Jernigan (pedal steel), John Jarvis (piano), Glen Duncan (banjo, mandolin & fiddle), Mike Harrison (electric bass) and more—offering everything from modern sounds to heartfelt Americana styles
Check out all the 202 New RealTracks (in sets 456-467)
And, if you are looking for more, the 2025 49-PAK (for $49) includes an additional 20 RealTracks with exciting new sounds and genre-spanning styles. Enjoy RealTracks firsts like Chinese instruments (guzheng & dizi), the bandoneon in an authentic Argentine tango trio, and the classic “tic-tac” baritone guitar for vintage country.
You’ll also get slick ’80s metal guitar from Andy Wood, modern metal with guitarist Nico Santora, bass player Nick Schendzielos, and drummer Aaron Stechauner, more praise & worship, indie-folk, modern/bro country with Brent Mason, and “Songwriter Americana” with Johnny Hiland.
Plus, enjoy user-requested styles like Soul Jazz RealDrums, fast Celtic Strathspey guitar, and Chill Hop piano & drums!
The 2025 49-PAK is loaded with other great new add-ons as well. Learn more about the 2025 49-PAK!
Bonus PAKs for Band-in-a-Box 2025 for Mac!
With your version 2025 for Mac Pro, MegaPAK, UltraPAK, UltraPAK+, Audiophile Edition or PlusPAK purchase, we'll include a Bonus PAK full of great new Add-ons FREE! Or upgrade to the 2025 49-PAK for only $49 to receive even more NEW Add-ons including 20 additional RealTracks!
These PAKs are loaded with additional add-ons to supercharge your Band-in-a-Box®!
This Free Bonus PAK includes:
- The 2025 RealCombos Booster PAK:
-For Pro customers, this includes 33 new RealTracks and 65+ new RealStyles.
-For MegaPAK customers, this includes 29 new RealTracks and 45+ new RealStyles.
-For UltraPAK customers, this includes 20 new RealStyles.
- Look Ma! More MIDI 13: Country & Americana
- Instrumental Studies Set 22: 2-Hand Piano Soloing - Rhythm Changes
- MIDI SuperTracks Set 44: Jazz Piano
- Artist Performance Set 17: Songs with Vocals 7
- Playable RealTracks Set 4
- RealDrums Stems Set 7: Jazz with Mike Clark
- SynthMaster Sounds and Styles (with audio demos)
- 128 GM MIDI Patch Audio Demos.
Looking for more great add-ons, then upgrade to the 2025 49-PAK for just $49 and you'll get:
- 20 Bonus Unreleased RealTracks and RealDrums with 20 RealStyles,
- FLAC Files (lossless audio files) for the 20 Bonus Unreleased RealTracks and RealDrums
- Look Ma! More MIDI 14: SynthMaster,
- Instrumental Studies Set 23: More '80s Hard Rock Soloing,
- MIDI SuperTracks Set 45: More SynthMaster
- Artist Performance Set 18: Songs with Vocals 8
- RealDrums Stems Set 8: Pop, Funk & More with Jerry Roe
Learn more about the Bonus PAKs for Band-in-a-Box® 2025 for Mac®!
New! Xtra Styles PAK 20 for Band-in-a-Box 2025 and Higher for Mac!
Xtra Styles PAK 20 for Mac & Windows Band-in-a-Box version 2025 (and higher) is here with 200 brand new RealStyles!
We're excited to bring you our latest and greatest in the all new Xtra Styles PAK 20 for Band-in-a-Box! This fresh installment is packed with 200 all-new styles spanning the rock & pop, jazz, and country genres you've come to expect, as well as the exciting inclusion of electronic styles!
In this PAK you’ll discover: Minimalist Modern Funk, New Wave Synth Pop, Hard Bop Latin Groove, Gospel Country Shuffle, Cinematic Synthwave, '60s Motown, Funky Lo-Fi Bossa, Heavy 1980s Metal, Soft Muted 12-8 Folk, J-Pop Jazz Fusion, and many more!
All the Xtra Styles PAKs 1 - 20 are on special for only $29 each (reg $49), or get all 209 PAKs for $199 (reg $399)! Order now!
Learn more and listen to demos of the Xtra Styles PAK 20.
Video: Xtra Styles PAK 20 Overview & Styles Demos: Watch now!
Note: The Xtra Styles require the UltraPAK, UltraPAK+, or Audiophile Edition of Band-in-a-Box®. (Xtra Styles PAK 20 requires the 2025 or higher UltraPAK, UltraPAK+, or Audiophile Edition. They will not work with the Pro or MegaPAK version because they need the RealTracks from the UltraPAK, UltraPAK+, or Audiophile Edition.
New! XPro Styles PAK 9 for Band-in-a-Box 2025 and higher for Mac!
We've just released XPro Styles PAK 9 for Mac & Windows Band-in-a-Box version 2025 (and higher) with 100 brand new RealStyles, plus 29 RealTracks/RealDrums!
We've been hard at it to bring you the latest and greatest in this 9th installment of our popular XPro Styles PAK series! Included are 75 styles spanning the rock & pop, jazz, and country genres (25 styles each) that fans have come to expect, as well as 25 styles in this volume's wildcard genre: funk & R&B!
If you're itching to get a sneak peek at what's included in XPro Styles PAK 9, here is a small helping of what you can look forward to: Funky R&B Horns, Upbeat Celtic Rock, Jazz Fusion Salsa, Gentle Indie Folk, Cool '60s Soul, Funky '70s R&B, Smooth Jazz Hip Hop, Acoustic Rockabilly Swing, Funky Reggae Dub, Dreamy Retro Latin Jazz, Retro Soul-Rock Fusion, and much more!
Special Pricing! Until July 31, 2024, all the XPro Styles PAKs 1 - 9 are on sale for only $29 ea (Reg. $49 ea), or get them all in the XPro Styles PAK Bundle for only $149 (reg. $299)! Order now!
Learn more and listen to demos of XPro Styles PAKs.
Video: XPro Styles PAK 9 Overview & Styles Demos: Watch now!
XPro Styles PAKs require Band-in-a-Box® 2025 or higher and are compatible with ANY package, including the Pro, MegaPAK, UltraPAK, UltraPAK+, and Audiophile Edition.
New! Xtra Styles PAK 20 for Band-in-a-Box 2025 and Higher for Windows!
Xtra Styles PAK 20 for Windows & Mac Band-in-a-Box version 2025 (and higher) is here with 200 brand new RealStyles!
We're excited to bring you our latest and greatest in the all new Xtra Styles PAK 20 for Band-in-a-Box! This fresh installment is packed with 200 all-new styles spanning the rock & pop, jazz, and country genres you've come to expect, as well as the exciting inclusion of electronic styles!
In this PAK you’ll discover: Minimalist Modern Funk, New Wave Synth Pop, Hard Bop Latin Groove, Gospel Country Shuffle, Cinematic Synthwave, '60s Motown, Funky Lo-Fi Bossa, Heavy 1980s Metal, Soft Muted 12-8 Folk, J-Pop Jazz Fusion, and many more!
All the Xtra Styles PAKs 1 - 20 are on special for only $29 each (reg $49), or get all 209 PAKs for $199 (reg $399)! Order now!
Learn more and listen to demos of the Xtra Styles PAK 20.
Video: Xtra Styles PAK 20 Overview & Styles Demos: Watch now!
Note: The Xtra Styles require the UltraPAK, UltraPAK+, or Audiophile Edition of Band-in-a-Box®. (Xtra Styles PAK 20 requires the 2025 or higher UltraPAK, UltraPAK+, or Audiophile Edition. They will not work with the Pro or MegaPAK version because they need the RealTracks from the UltraPAK, UltraPAK+, or Audiophile Edition.
New! XPro Styles PAK 9 for Band-in-a-Box 2025 and higher for Windows!
We've just released XPro Styles PAK 9 for Windows & Mac Band-in-a-Box version 2025 (and higher) with 100 brand new RealStyles, plus 29 RealTracks/RealDrums!
We've been hard at it to bring you the latest and greatest in this 9th installment of our popular XPro Styles PAK series! Included are 75 styles spanning the rock & pop, jazz, and country genres (25 styles each) that fans have come to expect, as well as 25 styles in this volume's wildcard genre: funk & R&B!
If you're itching to get a sneak peek at what's included in XPro Styles PAK 9, here is a small helping of what you can look forward to: Funky R&B Horns, Upbeat Celtic Rock, Jazz Fusion Salsa, Gentle Indie Folk, Cool '60s Soul, Funky '70s R&B, Smooth Jazz Hip Hop, Acoustic Rockabilly Swing, Funky Reggae Dub, Dreamy Retro Latin Jazz, Retro Soul-Rock Fusion, and much more!
Special Pricing! Until July 31, 2024, all the XPro Styles PAKs 1 - 9 are on sale for only $29 ea (Reg. $49 ea), or get them all in the XPro Styles PAK Bundle for only $149 (reg. $299)! Order now!
Learn more and listen to demos of XPro Styles PAKs.
Video: XPro Styles PAK 9 Overview & Styles Demos: Watch now!
XPro Styles PAKs require Band-in-a-Box® 2025 or higher and are compatible with ANY package, including the Pro, MegaPAK, UltraPAK, UltraPAK+, and Audiophile Edition.
Video: Band-in-a-Box® 2025 for Mac®: VST3 Plugin Support
Band-in-a-Box® 2025 for Mac® now includes support for VST3 plugins, alongside VST and AU. Use them with MIDI or audio tracks for even more creative possibilities in your music production.
Band-in-a-Box® 2025 for Macs®: VST3 Plugin Support
Video: Band-in-a-Box® 2025 for Mac®: Using VST3 Plugins
Join the conversation on our forum.
|
|
|
|
|
|
|
|
|
|
|
Forums58
Topics84,380
Posts778,470
Members39,636
|
Most Online25,754 Jan 24th, 2025
|
|
|
|
|
|
|
|
|