Skip to content

Latest commit

 

History

History
100 lines (54 loc) · 3.83 KB

integrations.md

File metadata and controls

100 lines (54 loc) · 3.83 KB
layout title permalink
default
VOSK integration with Asterisk, Freeswitch and Jigasi
/integrations

Asterisk

See https://github.com/alphacep/vosk-asterisk

Freeswitch

See https://github.com/alphacep/freeswitch/tree/master/src/mod/asr_tts/mod_vosk

Jigasi

See jitsi/jigasi#294

Unimrcp

See the plugin for unimrcp server https://github.com/alphacep/unimrcp-vosk-plugin/tree/vosk-plugin

Call analytics

https://github.com/bogdal1993/voice_perception

ROS Robot operating system

Some draft of the plugin is here, not yet fully functional https://github.com/alphacep/ros-vosk. See also ROS wiki page http://wiki.ros.org/vosk. We need someone help to fully implement and test the integration.

Another alternative implmentaiton https://gitlab.com/bob-ros2/voskros

Other projects that use Vosk

https://github.com/ryohajika/ofxVosk - ofxVosk is an openFrameworks addon of vosk-api to use automatic speech recognition (ASR) functionality based on Kaldi project.

https://github.com/ideasman42/nerd-dictation - Offline Speech to Text for Desktop Linux. See demo video.

https://github.com/Stypox/dicio-android - assistant

https://github.com/Tadashi-Hikari/Athena - assistant

https://kdenlive.org - subtitle generation

https://github.com/openaudiosearch/openaudiosearch - audio library search

https://github.com/bablokb/pi-webradio - Internet radio with voice control for RaspberryPi

https://github.com/o-oconnell/mp4grep - mp4grep is a tool that transcribes and searches audio and video files for a regex pattern.

https://github.com/swentel/solfidola - solfège musical training

https://github.com/mosave/LVTerminal - Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server

https://github.com/iamsrp/dexter - Dexter, A Voice Controlled Assistant for RPi

https://github.com/SubtitleEdit/subtitleedit - Subtitle Editor in C#

https://numenvoice.org - Voice control for handsfree computing

https://github.com/omlins/JustSayIt.jl - Offline, low latency, highly accurate and secure speech to command translation software

https://github.com/OpenVoiceOS/ovos-stt-plugin-vosk - vosk STT plugin for mycroft

https://github.com/rafaelcaricio/gst-plugin-vosk - Vosk plugin for Gstreamer

https://github.com/janvarev/Irene-Voice-Assistant/ - Voice assistant in Russian

https://github.com/antiboredom/videogrep - Search in Videos

https://play.google.com/store/apps/details?id=com.africadevtalents.koumabiboro - Kouma Bi Boro is a speech recognition application in Mandinka language mainly based on the one spoken in Ivory Coast

https://github.com/opencast/opencast - The free and open source solution for automated video capture and distribution at scale.

https://github.com/bsaleh03/CaptionIt - Vosk-powered HUD

https://github.com/PhilippeRo/IBus-Speech-To-Text - This IBus engine uses VOSK (https://github.com/alphacep/vosk-api) for voice recognition and allows to dictate text in several languages in any application through IBus. It supports Wayland and likely Xorg, though it has not been tested with the latter.

https://github.com/PhilippeRo/gst-vosk/ - Gstreamer plugin for Vosk

https://github.com/ahmad081177/control-thymio-via-voice - Thymio II robot (educational robot) via voice commands using VOSK

https://github.com/mixmesh/vosk - Erlang bindings for Vosk

https://github.com/audapolis/audapolis - audio editor

https://github.com/TUM-Dev/TUM-Live-Voice-Service - Microservice that generates subtitles for TUM-Live

https://git.sr.ht/~geb/sprec - Speech recognition command for scripting

Bindings

https://github.com/igor725/lua-vosk - Lua bindings

https://github.com/diegolijo/speech-to-text - Cordova plugin

https://github.com/unusualprojects/GodotSpeechRecognition - Vosk in Godot