Re: Voice Recognition
- From: "Caroline" <Caroline@xxxxxxxxxx>
- Date: Sat, 24 Sep 2005 18:01:03 -0400
Thank you, Chris. That sounds like a good idea: a little more work, but
interesting and challenging. I have never done anything with spectra, so I
have to do some research on that as well. I've seen Casey Chestnut's "from
scratch" article and really admire his work. Too bad he won't share his code,
but I understand his point of view. The phonix solution seems like it would
work, but they won't even give out the pricing on their website, so I imagine
it could be quite expensive.
So far I think your idea is the best. Thanks for sharing, and I will post if
I find anything interesting.
"<ctacke/>" <ctacke_AT_OpenNETCF_com> wrote in message
news:%23NO518UwFHA.3860@xxxxxxxxxxxxxxxxxxxxxxx
> It's a tall order and I've not seen anyone implement it and share the
> code. Conceptually I know how you'd attack it, so I can at least point you
> where I'd go if I had to do it.
>
> 1. Use the OpenNETCF.Multimedia.Audio.Recorder to collect the sound. This
> will give you easy access to the audio data itself.
> 2. Run the audio data through one of the OpenNETCF.MathEx.FFT classes to
> get a spectrum on it. You might have to do an FFT over time to get a
> "fingerprint" on the sound (i.e. don't FFT the entire sound, but maybe a
> running FFT of a 128 or 256 data point window).
> 3. Once you have this you can generate a spectrum for a given word. Have
> several people say the word several times. Try to pick a wide array of
> speakers.
> 4. Now the fun - you need to grab a spectrum from a speaker and try to
> match it to a known spectrum. If you have several spectra for a given
> word you might try a closest match or a bounds fit. Who knows, maybe
> simple peak analysis would be enough - looking at the data is the only way
> to know.
>
> -Chris
>
>
> "Caroline" <Caroline@xxxxxxxxxx> wrote in message
> news:ObEi9nUwFHA.2232@xxxxxxxxxxxxxxxxxxxxxxx
>> Just need to expand on the requirements:
>> - This is not a web services app, so I need recognition running on the
>> PPC.
>> - It needs to understand different kinds of voices.
>>
>> I just started doing some research on this and so far I can't find any
>> sample code for that.
>>
>> Your advice would be appreciated.
>> Thank you,
>> Caroline
>>
>
>
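A rough sketch of steps 2 to 4 from Chris's outline, in Python rather than on
the Compact Framework, just to make the idea concrete. It does not call the
OpenNETCF Recorder or FFT classes, since their signatures are not given in
this thread. Every name here (dft_magnitudes, fingerprint, closest_word,
synthetic_tone) is hypothetical, the naive DFT stands in for a real FFT, and
averaging the windows into one spectrum is a simplification that throws away
timing; keeping the per-window sequence would be closer to a true "running"
fingerprint.

```python
import cmath
import math


def dft_magnitudes(window):
    """Magnitude spectrum of one window (naive DFT, lower half of the bins)."""
    n = len(window)
    mags = []
    for k in range(n // 2):
        acc = 0j
        for t, x in enumerate(window):
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        mags.append(abs(acc))
    return mags


def fingerprint(samples, window_size=128):
    """Average the spectra of consecutive windows, scaled to unit length.

    Assumption: non-overlapping windows of 128 points, one of the sizes
    suggested in the thread. Scaling makes loud and quiet speakers comparable.
    """
    starts = range(0, len(samples) - window_size + 1, window_size)
    windows = [samples[i:i + window_size] for i in starts]
    if not windows:
        raise ValueError("need at least one full window of samples")
    avg = [0.0] * (window_size // 2)
    for w in windows:
        for k, m in enumerate(dft_magnitudes(w)):
            avg[k] += m / len(windows)
    norm = math.sqrt(sum(v * v for v in avg)) or 1.0  # avoid dividing by 0 on silence
    return [v / norm for v in avg]


def closest_word(unknown, templates):
    """Closest-match lookup: the word whose nearest stored fingerprint wins.

    templates maps a word to a list of fingerprints (several speakers,
    several takes), as suggested in step 3.
    """
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    return min(templates,
               key=lambda word: min(dist(unknown, fp) for fp in templates[word]))


def synthetic_tone(bin_index, length=384, window_size=128,
                   amplitude=1.0, phase=0.0):
    """Stand-in for recorded audio: a sine centred on one DFT bin."""
    return [amplitude * math.sin(2 * math.pi * bin_index * t / window_size + phase)
            for t in range(length)]


if __name__ == "__main__":
    templates = {
        "low": [fingerprint(synthetic_tone(5))],
        "high": [fingerprint(synthetic_tone(20))],
    }
    # A quieter, phase-shifted "speaker" saying the same "word".
    unknown = fingerprint(synthetic_tone(5, amplitude=0.3, phase=1.0))
    print(closest_word(unknown, templates))
```

Real speech will not separate this cleanly, so treat the distance measure as a
starting point; as Chris says, looking at the data is the only way to know
whether closest match, a bounds fit, or peak analysis works best.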