golgu-speechtotext released. raspberrypi speech to text recognition


here it is

http://cli-apps.org/content/show.php/golgu-speechtotext?content=159751

Well i was joking arount a bit with youtube and i have seen a video where raspberry recognized voice … so i did a little script😀  using google voice recognition service … and …

🙂

 

WHAT IS IT ?

it’s a script that records your voice with ffmpeg or avconv and it sends to google server to be converted into text.It can even performe some operation like open a browser show your webcam search for stuff on internet for now this function can be used only in english and italian🙂

TROUBLES WITH MIC :

if you have some problems with the default mic or if you want use the webcam’s mic you can set it in this way

-m hw:x,x , the default one is hw:0,0

where x,x must a set of numbers , you can understand them reading your configuration

#########################################################
00-00: ALC269VC Analog : ALC269VC Analog : playback 1 : capture 1
02-00: USB Audio : USB Audio : capture 1
#########################################################

if you get some problems recording audio just see this file /home/sem/.config/golgu-speechtotext/golgu-speechtotext.log, you could get some problem with ffmpeg \ avcon because some mic require 2 channels some other require 1 channel mono…

INSTALLATION

you need of :

ffmpeg or avconv awk

for debian systems :

sudo apt-get install awk ffmpeg #avconv
LANGUAGES

it supports these languages some code could be wrong … :

af sq ar az eu bn be bg ca zh-CN zh-TW hr cs da nl en eo et tl fi fr gl ka de el gu Creole iw hi hu is id ga it ja kn ko la lv lt mk ms mt no fa pl pt ro ru sr sk sl es sw sv ta te th tr uk ur vi cy yi

USAGE:

golgu-speechtotext <options>

-a : function run applications and commands activated , for keys look down …
-c : set channels , some mic require 2 channels some others require 1 channel to work properly with ffmpeg
-e : open golgu-speechtotext config file
-h : print this help
-l : print languages and exit
-i : set input language
-v : print version and exit
-w : set webcam device to use , example : -w /dev/video0
-ws: write settings on config file located here /home/sem/.config/golgu-speechtotext/golgu-speechtotext.cfg

########
EXAMPLES
########

————————
golgu-speechtotext -i en-us
————————

————————
golgu-speechtotext -i en-us -c
————————

#######################################
FUNCTIONS RUN APPLICATIONS AND COMMANDS
#######################################

this function if activated can run applications and do some stuff … like open the browser or show your webcam , if you have more than one it will ask you to pick one ..

————————
golgu-speechtotext -i en-us -a
————————

key words :

open browser
search for
show webcam
open text editor
what time is it
suspend computer

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: