Self hosting voice assistant

Scrubbles@poptalk.scrubbles.tech · 11 months ago

Self hosting voice assistant

logos@sh.itjust.works · 11 months ago

There was Mycroft, but it got killed by a patent troll, although AFAIK you can still run it. OpenVoiceOS and Neon AI have picked up the ball.

motsu@lemmy.world · 11 months ago

Rhasspy. Idk if Rhasspy 3 is fully out, but I would wait for that and then set it up. (I have begun to see the Home Assistant side being released. It’s supposed to tie in a lot better than Rhasspy 2, and they even brought the dev on to the HA project.)

Yowasa@lemmy.world · 11 months ago

Didn’t Home Assistant hire the guy who made Rhasspy so he could work on their voice assistant?

cwagner@lemmy.cwagner.me · 11 months ago

Yeah, he switched from Mycroft to Nabu Casa. Not a lot of things happening with Rhasspy directly from what I can see (been following it for a while and had a semi-working setup before the world ran out of Pi Zero 2 W’s and I stopped), but HA has been getting more and more features. I think satellites are still missing, though.

Scrubbles@poptalk.scrubbles.tech · 11 months ago

So wait until there’s more with home assistant?

cwagner@lemmy.cwagner.me · 11 months ago

That’s my current plan ;) I absolutely need the satellite feature and the option to use voice commands for playing music.

Scrubbles@poptalk.scrubbles.tech · 11 months ago

Awesome, I’ll look more into this. Do you know if they’ll let us use our own voice models? Will it be natural, ChatGPT-style, or more scripted like Alexa? And the satellites, I assume that’s like what I was talking about, where I (hopefully someday) can flash my Google Minis and put HA on them instead?

cwagner@lemmy.cwagner.me · 11 months ago

> do you know if they’ll let us use our own voice models?

Probably? I don’t know what the tech in that area looks like.

> Will it be natural like ChatGPT style or more scripted like Alexa

Everything is about scripted commands, but you can use templates and variables. It requires more setup but is more reliable.
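
To give a rough idea of what "scripted commands with templates and variables" means, here is a minimal sketch in plain Python. It is not Home Assistant's or Rhasspy's actual sentence syntax; the `{slot}` notation, the intent names, and the helper functions are all made up for illustration. Each template is turned into a regular expression, and the variables come back as a dictionary.

```python
import re

def compile_template(template: str) -> re.Pattern:
    """Turn e.g. 'turn {state} the {device}' into a regex with named groups.

    The {slot} syntax is a hypothetical convention for this sketch only.
    """
    # Splitting on {name} leaves literal text at even indexes
    # and slot names at odd indexes.
    parts = re.split(r"\{(\w+)\}", template)
    pattern = ""
    for i, part in enumerate(parts):
        if i % 2 == 0:
            pattern += re.escape(part)        # literal words
        else:
            pattern += f"(?P<{part}>.+?)"     # variable slot
    return re.compile(f"^{pattern}$", re.IGNORECASE)

# Hypothetical command list: (template, intent name).
COMMANDS = [
    ("turn {state} the {device}", "set_power"),
    ("play {artist}", "play_music"),
]
COMPILED = [(compile_template(t), intent) for t, intent in COMMANDS]

def match_command(text: str):
    """Return (intent, variables) for the first matching template, else None."""
    for pattern, intent in COMPILED:
        match = pattern.match(text.strip())
        if match:
            return intent, match.groupdict()
    return None

print(match_command("turn on the kitchen light"))
print(match_command("what's the weather"))
```

Anything that doesn't fit a template is simply not understood, which is the trade-off: more setup, but predictable behaviour.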

> And the satellites, I assume that’s like what I was talking about where I (hopefully someday) can flash my Google Minis and put HA on them instead?

I’d guess the chances of that (or of me flashing my Alexas) are close to zero; those are far too locked down.

Satellites simply means that you can put a lower-power device somewhere, and it will let your central server do the heavy processing. So with Rhasspy, you’d have one more powerful device (like a Raspberry Pi 4) that would do speech-to-text, and smaller devices (like those Pi Zero 2 W’s I was never able to get back then) that only do wake-word recognition on-device (which needs to happen too fast for you to wait for the network) and, upon waking, simply send the audio to the central server for processing.
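
The split between satellite and central server can be sketched like this. It is a toy simulation, not Rhasspy or Home Assistant code: "audio" is faked with plain strings, there is no real network or speech-to-text, and the class names and the wake word are invented for the example. The point is only that the satellite decides locally whether it was woken, and forwards just the audio that follows.

```python
WAKE_WORD = "hey house"  # hypothetical wake word for this sketch

class CentralServer:
    """Stands in for the more powerful box that does the heavy processing."""

    def __init__(self):
        self.received = []  # everything satellites have forwarded

    def process(self, satellite_id: str, audio: str) -> str:
        # A real server would run speech-to-text here; we just echo.
        self.received.append((satellite_id, audio))
        return f"[{satellite_id}] transcribed: {audio}"

class Satellite:
    """Low-power device: local wake-word check, everything else is forwarded."""

    def __init__(self, satellite_id: str, server: CentralServer):
        self.satellite_id = satellite_id
        self.server = server
        self.awake = False

    def detect_wake_word(self, audio: str) -> bool:
        # Runs on-device so there is no network round trip before waking.
        return audio.strip().lower() == WAKE_WORD

    def hear(self, audio: str):
        """Feed one chunk of 'audio'; return the server reply or None."""
        if not self.awake:
            # Nothing leaves the device until the wake word is heard.
            self.awake = self.detect_wake_word(audio)
            return None
        self.awake = False  # go back to sleep after one command
        return self.server.process(self.satellite_id, audio)

server = CentralServer()
kitchen = Satellite("kitchen", server)
for chunk in ["background chatter", "hey house", "turn on the light"]:
    print(repr(chunk), "->", kitchen.hear(chunk))
```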

Scrubbles@poptalk.scrubbles.tech · 11 months ago

Great, thanks, I think that’s all I needed. I’ll start playing with it, but I’ll hold off on a major implementation until that’s all finished.

Midnitte@kbin.social · 11 months ago

As others alluded to, there’s Home Assistant Assist.

ErwinLottemann@feddit.de · 11 months ago

It’s also Home Assistant’s Year of the Voice!

christiannils@sh.itjust.works · 11 months ago

Did you check https://github.com/toverainc/willow-inference-server/ ? I tried it with an ESP-BOX (running https://github.com/toverainc/willow) and the first results are really promising.