Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Please add an extended description for some sections #7

Open
mitrokun opened this issue Mar 22, 2024 · 3 comments
Open

Please add an extended description for some sections #7

mitrokun opened this issue Mar 22, 2024 · 3 comments

Comments

@mitrokun
Copy link

mitrokun commented Mar 22, 2024

From the position of an ordinary user🙄

After reading the description, a few questions remain.

What is the range of values for the "correct-sentences" parameter?

And the main misunderstanding is caused by the "No Correct Patterns" section. The meaning of the parameter is well explained, but it is completely unclear how to use it. It would be great to add an example of the actual use of this option.
I tried to search on Google, looked at the available examples and problems related to vosk, вut I didn't find an answer anywhere.

//Looks like I finally figured it out.
The first parameter is linked to error estimation.

And in the second case, it is enough to specify something like this if we want to have a request to Wikipedia
no_correct_patterns:

  • what is [the] [a] [an]
@roschi02
Copy link

roschi02 commented Oct 1, 2024

What is the range of values for the "correct-sentences" parameter?

I have the exact same question. Did you ever figure it out?

@synesthesiam
Copy link
Contributor

I've added some more to the README, but I'll reproduce here:

See description the score_cutoff parameter in the rapidfuzz docs for more details (weights=(1, 1, 3)).

So the range of the "correct-sentences" parameter depends on the size of the strings that are being compared. It's effectively the maximum number of "changes" that can be made try and turn the text that vosk gives back into one of the pre-defined sentences. I set the weights to (1, 1, 3) so that adding/removing a character is less expensive than changing an existing one.

@roschi02
Copy link

roschi02 commented Oct 2, 2024

Do I really need to run the script, or can I just change the values in the Vosk configuration?
And what exactly am I changing there? I can only enter one number, not a series of numbers.
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants