If you just need a transcription of what the user says, you can use the Google Cloud Speech-to-Text API and then run some natural-language analysis on the resulting text.
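As a rough sketch of that flow in Python, assuming the `google-cloud-speech` client library is installed, credentials are configured in the environment, and the audio is a short 16 kHz LINEAR16 WAV file (the function names, the file name and the word-count "analysis" are placeholders of mine, not anything prescribed by the API):

```python
import re
from collections import Counter


def transcribe_file(path, language_code="en-US", sample_rate_hertz=16000):
    """Send a short LINEAR16 audio file to Speech-to-Text and return the transcript."""
    # Imported lazily so the text helper below works without the client installed.
    from google.cloud import speech

    client = speech.SpeechClient()  # picks up credentials from the environment
    with open(path, "rb") as f:
        audio = speech.RecognitionAudio(content=f.read())
    config = speech.RecognitionConfig(
        encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
        sample_rate_hertz=sample_rate_hertz,  # assumption: must match the recording
        language_code=language_code,
    )
    # Synchronous call, intended for short clips.
    response = client.recognize(config=config, audio=audio)
    # Each result covers a consecutive chunk of audio; keep the top alternative.
    return " ".join(r.alternatives[0].transcript.strip() for r in response.results)


def word_counts(transcript):
    """Stand-in for the analysis step: lowercase word frequencies."""
    return Counter(re.findall(r"[a-z']+", transcript.lower()))


# Hypothetical usage ("command.wav" is a made-up file name):
#   text = transcribe_file("command.wav")
#   print(word_counts(text).most_common(5))
```

The analysis step is deliberately trivial here; once you have the transcript as a plain string you can swap in whatever text processing you actually need.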
For a recent project we used the built-in Windows speech recognition (speech to text only, no sentiment analysis or similar). For simple stuff it worked OK and was easy to set up.