Results 1 to 15 of 15

Thread: Text to Speech Chip/Module

  1. #1
    Senior Member
    trash's Avatar
    Join Date
    Jan 2008
    Location
    Tamworth
    Posts
    4,089
    Thanks
    148
    Thanked 3,229 Times in 1,451 Posts
    Rep Power
    1288
    Reputation
    47674

    Default Text to Speech Chip/Module

    I'm still on my quest to find a text to speech module. And not having much luck.
    Needs to be TTL/RS232/I2C/SPI input, any audio output.

    What I want is a natural speech module that is stand alone. Robot voices and Stephen Hawking don't cut it.
    I want the sexy female voice that talks to you via your GPS etc.

    So chips like the speakjet. Definitely no good.

    TTS-EM-IC-HD2 is a good example of a true voice module, but it's +$300

    Raspberry Pi with espeak ... it's close, but it's still a bit too robotic.

    I can't even find any information of what kind of chipset dashboard GPS's use.
    There has to be something out there cheaper than the ~$150 price tag of a GPS.
    I'd rip apart an old GPS if I could get hold of one cheap enough or I thought there was a chance of success.

    I've even tried a dirt cheap chinese TTS module that had a good demo voice, I was going to teach it Engrish. But the module has no support from the supplier.
    "We take your money, you go away!" is the support I've got so far. $5 was worth the risk.

    So I'm open to suggestions.
    There are some older modules out there, but there is almost no information about them on the interweb. It's like they never existed.
    I'd be happy to get my hands on one of those old modules with just a little bit of technical data.
    Yes I am an agent of Satan, but my duties are largely ceremonial.



Look Here ->
  • #2
    Senior Member
    best4less's Avatar
    Join Date
    Jan 2008
    Location
    Australia
    Posts
    7,684
    Thanks
    3,487
    Thanked 2,207 Times in 1,132 Posts
    Rep Power
    758
    Reputation
    15165

    Default

    I watched this vid a while ago, not sure if it is the same thing you are looking at

    When you do things right, people won't be sure that you have done anything at all

  • #3
    Senior Member
    Downunder35m's Avatar
    Join Date
    May 2009
    Posts
    4,063
    Thanks
    101
    Thanked 1,797 Times in 862 Posts
    Rep Power
    631
    Reputation
    15724

    Default

    The GPS does not use a TTS chip, it uses a TTS engine, usually Loquendo, Nuance or the older Vocalizer voices.
    Similar story on mobile phones, so maybe it would be easier to find/progam a software for Android that accepts a serial input or file transfer and speaks out the result using the installed voice engine.
    Tomtom GO730 ,Navcore 9.004 ,Bootloader 5.5256 ,Map :Australia 845.2661
    ttmaps and Tomplayer on 16GB SDHC class6
    Password for all my files: downunder

  • The Following User Says Thank You to Downunder35m For This Useful Post:

    tristen (10-07-14)

  • #4
    Senior Member
    trash's Avatar
    Join Date
    Jan 2008
    Location
    Tamworth
    Posts
    4,089
    Thanks
    148
    Thanked 3,229 Times in 1,451 Posts
    Rep Power
    1288
    Reputation
    47674

    Default

    I'm trying to avoid that robot voice B4L.
    While the emic is ok for simple TTS aplications, it's really crap for the application I need TTS.

    Thanks Downunder, I don't care if it is a chip, module or engine.
    What I really need is a black box which will accept serial RS232 and spits out a sexy female voice. Pretty much that simple.
    Though it needs to run on 12VDC and has to be stand alone. If I could make a zombie GPS and just utilise it's voice function, then I'll do it.

    If I have to load phonomes onto something like a PIC chip, I'll do that can construct my own words from syllables etc. (obviously this isn't as easy as it sounds or I would have done it already)
    Or if I can find software for something like the raspberry pi then I'll try that. (I'm currently running espeak which slightly better than the emic, but still a little robotic).

    So re-purposing a GPS platform might work, if I knew what that platform was or even it's OS.
    Yes I am an agent of Satan, but my duties are largely ceremonial.

  • #5
    Premium Member

    Join Date
    Jan 2008
    Posts
    4,311
    Thanks
    5,982
    Thanked 4,171 Times in 1,771 Posts
    Rep Power
    1349
    Reputation
    50392

    Default

    Something like perhaps?

  • #6
    Senior Member
    trash's Avatar
    Join Date
    Jan 2008
    Location
    Tamworth
    Posts
    4,089
    Thanks
    148
    Thanked 3,229 Times in 1,451 Posts
    Rep Power
    1288
    Reputation
    47674

    Default

    Emic 2
    While the emic is ok for simple TTS aplications, it's really crap for the application I need TTS.
    If it sounds like robotic, then it's not appropriate for the job I intend to use it for.

    Lets just say I'm going to use it for a sperm extraction vending machine.
    Maybe something like this...
    So I need a pleasant sexy sounding female TTS.
    Getting your gnob gobbled by Stephen Hawking just isn't going to work.
    Yes I am an agent of Satan, but my duties are largely ceremonial.

  • The Following User Says Thank You to trash For This Useful Post:

    Tiny (14-07-14)

  • #7
    Senior Member
    Downunder35m's Avatar
    Join Date
    May 2009
    Posts
    4,063
    Thanks
    101
    Thanked 1,797 Times in 862 Posts
    Rep Power
    631
    Reputation
    15724

    Default

    How about , or , like the Nuance (former LoquendoTTS) products all these can be used on embedded systems.
    Of course a bit of work if needed to implement it into your program, but it might be easier to do it that way as you can use the same platform for the other tasks as well, not just the text output.
    Tomtom GO730 ,Navcore 9.004 ,Bootloader 5.5256 ,Map :Australia 845.2661
    ttmaps and Tomplayer on 16GB SDHC class6
    Password for all my files: downunder

  • The Following User Says Thank You to Downunder35m For This Useful Post:

    Tiny (14-07-14)

  • #8
    Senior Member
    weirdo's Avatar
    Join Date
    Jan 2008
    Posts
    5,458
    Thanks
    4,638
    Thanked 3,135 Times in 1,633 Posts
    Rep Power
    0
    Reputation
    29602

    Default

    PM me your postal address trash
    pretty sure I have an obsolete GPS unit in the shed you can have if you want to go that way

  • #9
    Senior Member
    best4less's Avatar
    Join Date
    Jan 2008
    Location
    Australia
    Posts
    7,684
    Thanks
    3,487
    Thanked 2,207 Times in 1,132 Posts
    Rep Power
    758
    Reputation
    15165

    Default



    Works for me Trash LOL
    When you do things right, people won't be sure that you have done anything at all

  • The Following 3 Users Say Thank You to best4less For This Useful Post:

    Tiny (14-07-14),trash (14-07-14),weirdo (13-07-14)

  • #10
    Crazy Diamond
    Tiny's Avatar
    Join Date
    Dec 2010
    Location
    Tasmania
    Age
    64
    Posts
    6,393
    Thanks
    11,000
    Thanked 5,437 Times in 2,652 Posts
    Rep Power
    2156
    Reputation
    89077

    Default

    If you do go down the GPS route; Garmin have free Voice Studio Software for creating personalised voices with a microphone.
    Not sure if you can customise it enough for your needs though.
    Also will require a compatible GPS to operate.

    Cheers, Tiny
    "You can lead a person to knowledge, but you can't make them think? If you're not part of the solution, you're part of the problem.
    The information is out there; you just have to let it in."

  • #11
    Senior Member
    trash's Avatar
    Join Date
    Jan 2008
    Location
    Tamworth
    Posts
    4,089
    Thanks
    148
    Thanked 3,229 Times in 1,451 Posts
    Rep Power
    1288
    Reputation
    47674

    Default

    Thanks again downunder.
    Cereproc supports raspberry pi, so there's a good chance I can use that provided they're not going to cost an arm and a leg.
    The other two don't appear to be of any use. It's a major pain in the arse to have to pick up another platform unless that platform is a black TTS box. (12V in, V.24 in, Audio out)

    I'm reluctant to work with the rasberry pi when all I wanted was an affordable (non robotic) TTS module.
    But the pi is cheap enough to make it worth a little bit of effort. I struggle with linux and python (not sure why) but I can manage and maybe even move my PIC control functions onto the pi.

    I'll see how much the Cereproc ends up costing, because ripping the voice box out of a GPS could end up being cheaper.

    I heard a female robotic voice via a UHF repeater today. Even though it still had a noticeable robotic quantisation noise, it was quite acceptable to listen to even over a radio channel.
    Yes I am an agent of Satan, but my duties are largely ceremonial.

  • #12
    Senior Member
    best4less's Avatar
    Join Date
    Jan 2008
    Location
    Australia
    Posts
    7,684
    Thanks
    3,487
    Thanked 2,207 Times in 1,132 Posts
    Rep Power
    758
    Reputation
    15165

    Default



    I tried this on my Samsung S3 but not to sexy LOL
    When you do things right, people won't be sure that you have done anything at all

  • #13
    Senior Member
    Downunder35m's Avatar
    Join Date
    May 2009
    Posts
    4,063
    Thanks
    101
    Thanked 1,797 Times in 862 Posts
    Rep Power
    631
    Reputation
    15724

    Default

    I don't know how good your coding skills are but there is a way for Python to make it speak:
    Depending on your skills it should also be possible to write a little Android app that accepts text over the USB or WiFi connection and and output using one of the avialable TTS engines.
    Basically like an Ebook reader with text input on demand.
    Did not do much searching on this but there is a chance projects for this already exists.
    Tomtom GO730 ,Navcore 9.004 ,Bootloader 5.5256 ,Map :Australia 845.2661
    ttmaps and Tomplayer on 16GB SDHC class6
    Password for all my files: downunder

  • #14
    Senior Member
    trash's Avatar
    Join Date
    Jan 2008
    Location
    Tamworth
    Posts
    4,089
    Thanks
    148
    Thanked 3,229 Times in 1,451 Posts
    Rep Power
    1288
    Reputation
    47674

    Default

    My coding skills are pretty ####ing awesome. (pat myself on the back) but I find because I have not exercised at lot of them I have lost of lot of skill in the higher level languages
    and my 32bit program has never been any good. The bus stopped at 16 bits and I got off.
    My main programming is with assembler and I can shit it in if the operating system doesn't get in the way.
    I'm a bottom feeder when it comes to low level and risc programming.

    So ideally, what I wanted to do was run a PIC chip which I can program in assembler and talk to a TTS piece of hardware. It >should< have been that simple.
    But without any TTS with a good sexy voice I have to compromise. The raspberry pi was that compromise and the thought was just to use the pi as a TTS module since it's about 10 times cheaper.
    But TTS on the pi is turning out to be a crock of shit. It doesn't exist and it's just as bad as a $5 speakjet.

    So, back to a risc mentality. While TTS would have been ideal, I can live with much less. What I need the TTS to say can be broken down into about 50 blocks of wav files and
    they can be strung together sequentially using a simple script like perl.

    Anyhow.... I have had another look at python. In days gone by I would have picked up this language in about 30 minutes. I must be getting old because it's kind of confusing.
    I struggle for the time, but I expect this will change. I've also had to go back and look at Perl.
    So at the moment I'm flipping between python and perl trying to work out which I can program with the least amount of fuss.

    What I'm currently running is MPG321 on the pi and it is playing the TTS generated and captured files as MP3 files. It works really well.
    The problem is now I have to make it work in perl or python. Which is where I'm at.

    Then I can port up some of the functions I would have had the PIC do. So in python it might check a GPIO for data input and then assemble the TTS phrase based upon that data.
    The actually programming of how to assemble the speech is a piece of piss. It's getting the data from the port and making perl or python run MPG321 and pass variables to it.

    I'm working on it now and I'm always open to tips and shortcuts if anybody would like to suggest some.
    Yes I am an agent of Satan, but my duties are largely ceremonial.

  • #15
    Senior Member
    Bibliophile's Avatar
    Join Date
    May 2009
    Location
    Sydney
    Age
    75
    Posts
    1,204
    Thanks
    674
    Thanked 1,166 Times in 389 Posts
    Rep Power
    573
    Reputation
    17596

    Default

    Many years ago, when the commonwealth games were in Brisbane, the ABC made a number of digitized speech devices for line identification.
    If you only require a limited number of responses you could get a female of your choice to say them and record / digitize them in WAV or MP3 and then play them back as required.
    For most of my life, I lived a delusion

  • Bookmarks

    Posting Permissions

    • You may not post new threads
    • You may not post replies
    • You may not post attachments
    • You may not edit your posts
    •