-
Notifications
You must be signed in to change notification settings - Fork 600
[EN] Match articles "a" and "an" for <the> #3014
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
28ea338
to
1d3718a
Compare
First of all, I don't think it's a good idea to prefix Second of all, if we're doing this, I don't see why there wouldn't be the Third, I don't see why this would apply strictly to vacuums and not every other entity. Fourth, to counter the the issues above and create new ones, why not add Finally, like I said numerous times before, I don't think it's wise to add incorrect sentences just to please Whisper or any other STT. The proper solution here would be to fix Whisper. I'd like to hear the other language leaders' comments on this. |
Oh I agree on most of this, and I can change the PR to have a[n] instead of just for the vacuum, I just thought it would an issue to have a change that would affect other commands. I would be more than happy to add the change directly to if that's what would be preferred. |
Also, as much as I would love to fix Whisper to follow grammar, but there is so much you can do with it to influence the output. OpenAI probably trained it on mega powerful datacenters with all the speech data they sucked off the internet, it's not really realistic to be able to somehow fix all the edge cases like this without the expertise and resources they had. |
1d3718a
to
c64ffb8
Compare
I went ahead and added "a[n]" directly to , since adding it to the intents themselves leads to wonky matching. |
Whisper (for me) seems to always put article
a
after vacuum commandsstart
orreturn
.Someone might also actually say it this way so let's handle that.