KnowBrainer Speech Recognition
Topic Title: How to improve recognition accuracy for a user whose speech is not 'standard'
Topic Summary: how to improve recognition accuracy for a user whose speech is affected by cerebral palsy
Created On: 05/18/2023 06:12 PM
Status: Post and Reply
|
- SueW, 05/18/2023 06:12 PM
- Alan Cantor, 05/18/2023 08:01 PM
- Lunis Orcutt, 05/18/2023 08:23 PM
- wheels496, 05/19/2023 04:00 AM
- SueW, 05/20/2023 05:49 PM
- SueW, 05/20/2023 06:07 PM
- Alan Cantor, 07/22/2023 05:28 PM
- R. Wilke, 07/23/2023 01:12 PM
- ax, 07/23/2023 02:27 PM
- Alan Cantor, 07/23/2023 04:43 PM
|
I am assisting a client whose speech is affected by cerebral palsy. He has DPI V 15.61.200.010 and uses a high-quality noise-cancelling microphone with the Australian accent model. His computer is high spec, running Windows 11. Recognition accuracy is very poor.

I have tested Dragon myself on his computer, using the same microphone and my own user profile, and I get good results, so it seems clear that the reason for the poor recognition is my client’s speech. We have tried the Vocabulary tools, such as adding words and phrases to the vocabulary, training words and phrases, and Learn from specific documents. These help a little, but not enough.

My client is adamant that he achieved much better recognition accuracy on an older version of Dragon (V 11). He did lots of the readings, and after each reading Dragon’s recognition would improve. He is really keen to use Dragon on his computer as he has lots of things to do; he is the CEO of an online business. He doesn’t have many other assistive technology options that would work for him.

My question is: is there a way to create and import your own readings for Dragon 15 or 16? I have found some posts on this subject, but they are from many years ago and relate to earlier versions of Dragon.

Other thoughts: The level of recognition accuracy with Dragon 15 and 16 is very good, but is it possible that removing the additional readings has made it more difficult for users with ‘non-standard’ speech to improve recognition accuracy?

It is also interesting that my client is getting better results when using Siri and Voice Control on iPhone. This is useful for speech to text; however, he really needs to be able to use Dragon on his computer. I’m wondering why he would be getting significantly better recognition accuracy on his iPhone than with Dragon on his computer. It’s usually the other way round. Could it be that Siri and Voice Control on iPhone make more use of the probability model than the acoustic model? This is the only way I can explain that someone with non-standard speech is getting better recognition accuracy on his iPhone compared to using Dragon on his computer.

Sue
------------------------- Cheers SueW |
|
|
|
|
Hi Sue,
I'm experimenting with a technique to improve Dragon accuracy for someone with a non-standard accent and manner of speaking. The technique is labour-intensive and persnickety. I've tested it only once, and I could see the need to make the system work a little better. Accuracy was initially about 50%, and jumped to 70% in about 90 minutes. I'll be testing again in a few weeks. Not sure the technique is ready for prime-time. |
|
|
|
|
It's not unusual for someone with a verbal disability to experience better accuracy in Siri than in Dragon. By design, Dragon can be a bit picky. Rather than delving deeper, let's jump into it:
1. Open the DragonBar Settings / Microphone / Choose Microphone menu.
2. Remove the checkmark from Automatically adjust microphone level as I speak.
3. DPI 15 will not prompt you to rerun the Microphone Check, but you will need to do so. v16 is smarter.
4. Use the KnowBrainer Train Dragon command (courtesy of Monkey8). Those MIA training scripts were never removed; Nuance only removed the menu. We have heard that this training can help end-users with abnormal voices.

Because of your client's disability, your client has the option of receiving a free or discounted copy of KnowBrainer 2022. There is also a 30 day trial in our signature tag. We recommend downloading and installing a 30 day trial of KnowBrainer 2022. Then say Train Dragon.
------------------------- Change "No" to "Know" w/KnowBrainer 2022 |
|
|
|
|
Hello,
A Dragon trainer pointed me to the "Rainbow Passage", which seemingly includes every sound in the English language. I had about three sessions with a work colleague, who helped me with the training. Initially, I just dictated the passage, with my colleague helping me to correct and train every misrecognised utterance. I also trained the alphabet (Alpha, Bravo, et cetera).
------------------------- DP 16 |
|
|
|
|
Thanks for your replies.
Alan, I look forward to hearing more. Thanks for your tips Wheels496. Lunis, I will send you an email. Sue W ------------------------- Cheers SueW |
|
|
|
|
I followed Lunis' suggestion to download the free trial of KnowBrainer. I then used the Train Dragon command and, voilà, the readings appeared.
Thanks Lunis! and thanks to others for your suggestions. Alan, I would be interested to hear more about your method when it is ready. ------------------------- Cheers SueW |
|
|
|
|
Yesterday I tested my new training protocol. The process took a little over an hour. Accuracy climbed from around 50% to around 80%. |
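The thread does not say how the accuracy figures were measured. For anyone wanting to track before-and-after results in a repeatable way, here is a minimal sketch of one common approach: dictate a known passage, then compare Dragon's output with the reference text word by word. The function name `word_accuracy` and the scoring rule (1 minus word-level edit distance divided by reference length) are assumptions for illustration, not the protocol described above.

```python
def word_accuracy(reference: str, recognized: str) -> float:
    """Return word-level accuracy in [0, 1] for a dictated passage.

    Hypothetical helper: compares the text the speaker read aloud
    (reference) with what the recognizer produced (recognized).
    Case is ignored; punctuation is NOT stripped, so clean both
    strings first if punctuation should not count as an error.
    """
    ref = reference.lower().split()
    hyp = recognized.lower().split()
    if not ref:
        raise ValueError("reference text must contain at least one word")

    # Word-level Levenshtein distance, kept to two rows of the table.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from output
            insertion = curr[j - 1] + 1  # extra word in the output
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    errors = prev[-1]
    # Many inserted words can push errors above len(ref); floor at zero.
    return max(0.0, 1.0 - errors / len(ref))


if __name__ == "__main__":
    before = word_accuracy("the quick brown fox", "the quick crown fox")
    print(f"accuracy: {before:.0%}")
```

Scoring the same passage before and after a training session gives two comparable numbers, which is more reliable than an impression of how a session went.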
|
|
|
|
Alan,
This is really marvellous testing indeed. It might be worthwhile automating via the API, although that would take longer than just 20 hours, not even accounting for debugging. Nonetheless, great job, specifically as it clearly demonstrates some of the underlying concepts. ------------------------- The New Game in Town: DragonConnect |
|
|
|
|
Your approach is methodical and the validation of a principle impressive, Alan!
On DMO, recognition improvement from going through the corrections menu always seems marginal at best. It's perceptible, if one zooms in on a single correction that's repeatedly made. But I can't really be sure how long after the correction(s) any improvement would kick in or how durable it is.
Out of curiosity, on desktop Dragon at least, does any tangible improvement from your exercise get encapsulated/captured in a discrete file (or files) that can be exported and preserved? Or does an individual have to go through the "curated corrections" exercise all over again if they are forced into a new profile? |
|
|
|
|
The only way I can think of to preserve the results is to make a backup copy of the entire user profile. It's probably easier than backing up just the acoustic model; but who, if anybody, knows which file (or files) hold the acoustic model? |
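As a rough illustration of the "copy the whole profile" idea, here is a small sketch that copies a profile folder to a timestamped backup folder. The function name `backup_profile` and both paths in the usage example are hypothetical; the actual location of a Dragon user profile is not given in this thread and varies by version and installation, so check it before relying on this. It also assumes Dragon is closed while the copy runs.

```python
import shutil
from datetime import datetime
from pathlib import Path
from typing import Optional


def backup_profile(profile_dir, backup_root, now: Optional[datetime] = None) -> Path:
    """Copy an entire profile folder into backup_root/<name>-<timestamp>.

    Hypothetical helper: profile_dir is whatever folder holds the user
    profile on your machine. Returns the path of the new backup folder.
    """
    profile_dir = Path(profile_dir)
    if not profile_dir.is_dir():
        raise FileNotFoundError(f"profile folder not found: {profile_dir}")

    # A timestamp in the name keeps every backup, so an earlier state
    # (for example, before a training session) can still be restored.
    stamp = (now or datetime.now()).strftime("%Y%m%d-%H%M%S")
    destination = Path(backup_root) / f"{profile_dir.name}-{stamp}"

    # copytree creates missing parent folders and copies every file.
    shutil.copytree(profile_dir, destination)
    return destination


# Usage (both paths are placeholders, not real Dragon locations):
# backup_profile(r"C:\path\to\DragonProfiles\MyClient", r"D:\ProfileBackups")
```

Taking one backup before a training session and one after makes it possible to roll back if a session makes recognition worse.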
|
|