Speech recognition – Page 3

Picovoice Puts Smarts Offline in 512K of Memory

Posted on January 2, 2019 by Al Williams

We live in the future. You can ask your personal assistant to turn on the lights, plan your commute, or set your thermostat. If they ever give Alexa sudo, she might be able to make a sandwich. However, you almost always see these devices sending data to some remote server in the sky to do the analysis and processing. There are some advantages to that, but it isn’t great for privacy as several recent news stories have pointed out. It also doesn’t work well when the network or those remote servers crash — another recent news story. But what’s the …read more

Continue reading Picovoice Puts Smarts Offline in 512K of Memory→

Speech Recognition Without A Voice

Posted on September 14, 2018 by Brian Benchoff

The biggest change in Human Computer Interaction over the past few years is the rise of voice assistants. The Siris and Alexas are our HAL 9000s, and soon we’ll be using these assistants to open the garage door. They might just do it this time.

What would happen if you could talk to these voice assistants without saying a word? Would that be telepathy? That’s exactly what [Annie Ho] is doing with Cerebro Voice, a project in this year’s Hackaday Prize.

At its core, the idea behind Cerebro Voice is based on subvocal recognition, a technique that detects electrical signals …read more

Continue reading Speech Recognition Without A Voice→

Talk To The Faucet

Posted on September 1, 2018 by Steven Dufresne

Your hands are filthy from working on your latest project and you need to run the water to wash them. But you don’t want to get the taps filthy too. Wouldn’t it be nice if you could just tell them to turn on hot, or cold? Or if the water’s too cold, you could tell them to make it warmer. [Vije Miller] did just that, he added servo motors to his kitchen tap and enlisted an AI to interpret his voice commands.

Look closely at the photo and you can guess that he started with a single-lever type of tap, …read more

Continue reading Talk To The Faucet→

Make A Natural Language Phone Bot Like Google’s Duplex AI

Posted on June 21, 2018 by Steven Dufresne

After seeing how Google’s Duplex AI was able to book a table at a restaurant by fooling a human maître d’ into thinking it was human, I wondered if it might be possible for us mere hackers to pull off the same feat. What could you or I do without Google’s legions of ace AI programmers and racks of neural network training hardware? Let’s look at the ways we can make a natural language bot of our own. As you’ll see, it’s entirely doable.

Breaking Down The Solution

One of the first steps in engineering a solution is to break …read more

Continue reading Make A Natural Language Phone Bot Like Google’s Duplex AI→

Speech Recognition For Linux Gets A Little Closer

Posted on January 18, 2018 by Al Williams

It has become commonplace to yell out commands to a little box and have it answer you. However, voice input for the desktop has never really gone mainstream. This is particularly slow for Linux users whose options are shockingly limited, although decent speech support is baked into recent versions of Windows and OS X Yosemite and beyond.

There are four well-known open speech recognition engines: CMU Sphinx, Julius, Kaldi, and the recent release of Mozilla’s DeepSpeech (part of their Common Voice initiative). The trick for Linux users is successfully setting them up and using them in applications. [Michael Sheldon] aims …read more

Continue reading Speech Recognition For Linux Gets A Little Closer→

Fooling Speech Recognition With Hidden Voice Commands

Posted on January 16, 2018 by Lewin Day

It’s 2018, and while true hoverboards still elude humanity, some future predictions have come true. It’s now possible to talk to computers, and most of the time they might even understand you. Speech recognition is usually achieved through the use of neural networks to process audio, in a way that some suggest mimics the operation of the human brain. However, as it turns out, they can be easily fooled.

The attack begins with an audio sample, generally of a simple spoken phrase, though music can also be used. The desired text that the computer should hear instead is then fed …read more

Continue reading Fooling Speech Recognition With Hidden Voice Commands→

Gong, an AI-based language tool to help sales and customer service reps, nabs $20M

Posted on July 12, 2017 by Ingrid Lunden

As artificial intelligence continues its spread into all aspects of computing, many believe that it will be the next big frontier in CRM. Today a startup called Gong.io underscores that trend: the Israeli startup, which has built a tool that uses… Continue reading Gong, an AI-based language tool to help sales and customer service reps, nabs $20M→

Ten Minute TensorFlow Speech Recognition

Posted on March 25, 2017 by Al Williams

Like a lot of people, we’ve been pretty interested in TensorFlow, the Google neural network software. If you want to experiment with using it for speech recognition, you’ll want to check out [Silicon Valley Data Science’s] GitHub repository which promises you a fast setup for a speech recognition demo. It even covers which items you need to install if you are using a CUDA GPU to accelerate processing or if you aren’t.

Another interesting thing is the use of TensorBoard to visualize the resulting neural network. This tool offers up a page in your browser that lets you visualize what’s …read more

Continue reading Ten Minute TensorFlow Speech Recognition→

Ten Minute TensorFlow Speech Recognition

Posted on March 25, 2017 by Al Williams

Another interesting thing is the use of TensorBoard to visualize the resulting neural network. This tool offers up a page in your browser that lets you visualize what’s …read more

Continue reading Ten Minute TensorFlow Speech Recognition→

Arduino Clock Is HAL 1000

Posted on December 11, 2016 by Al Williams

In the movie 2001: A Space Odyssey, HAL 9000 — the neurotic computer — had a birthday in 1992 (for some reason, in the book it is 1997). In the late 1960s, that date sounded impossibly far away, but now it seems like a distant memory. The only thing is, we are only now starting to get computers with voice I/O that are practical and even they are a far cry from HAL.

[GeraldF6] built an Arduino-based clock. That’s nothing new but thanks to a MOVI board (ok, shield), this clock has voice input and output as you can …read more

Continue reading Arduino Clock Is HAL 1000→