hello there! I’ ve already asked in the NLP subreddit, and a user suggested me to ask here.
So, here i am. I’ ve developed a little PC App, where user has to enter several inputs in several windows. The App read a dataset (excel file), processes the user inputs and shows the results. Finally, the dataset is updated.
Now, the App kinda works as intended (already implemented in a GUI via PySimpleGUI. Next task i’m trying to develop, is the vocal command. So, if the input is to “enter number”, i’m trying to achieve an automatic way to extract those informations from the vocal and automatically entering these inputs into the right input box (there are input boxes, listboxes, checkboxes etc…).
I’ve already worked a little bit to the starting point: from vocal to text. Has to be refined, but it kinda works.
I’m a bit more focused on the text part. I’m currently using SpaCy library. What i’m trying to understand are a couple of thing:
1)What is the best approach? Working with Regex for a fixed extraction of values, or trying to go deeper with a machine learning model or neural network? Pros and cos?
2)I’m figuring the pipeline like this: i need a NLP apporach in order to extract the right values, so, point is, trying to design a script who extracts the right informations. If, for example, the user is speaking, saying things don’t related to the inputs to enter, the model needs to understand that those words are not inputs. Then, the model has to extract the right informations. Final part is to create a dictionary in order to use the values stored, as inputs (that normally, would have been entered by user via keyboard).
3)So, my problem is actually a 4 parts problem: extracting text from voice; understand the sentences spoken by user; understanding correlations in order to extract only the “right” informations; create an object to store those informations in order to use them as input (problem for another day).
Now, could you help me figuring out the best approach and how to achieve it? It’s a bit overwheling, i must admit
submitted by /u/D3vil_Dant3
[link] [comments]