Archives 2005 - 2019    Search

Cheap large diaphragm mics I liked, listening to YouTube vids through a tinny laptop speaker

published Feb 15, 2019 06:55   by admin ( last modified Feb 16, 2019 11:29 )

Audio-technica 4047 Oops, this isn't cheap! I started listening to samples at YouTube and thought "Wow this is great for a €70 mic". Turns out it's more like €700.

AKG P120 This is a €70 mic. I like what I hear on Youtube. Not as much as the 4047 above, but…

Samson C01U Pro USB

 

 


New insights into audio recording for video, from recent sessions

published Feb 15, 2019 04:40   by admin ( last modified Feb 16, 2019 01:19 )

So for the situations I tend to end up in, one-on-one interviews, conversations in front of a white board with 2 or more people and sometimes a group chiming in with opinions. here are some ideas with regards to wired, wireless, digital, number of inputs, where to record and so forth.

Insight number 1 — tracks

A video camera usually only has two sound tracks. If you want more mics than two, you will have to record those on a different recorder, being it a camera, a dedicated recorder or a sound card connected to a computer. This means that you need to do synchronization. Unless you are fancy with SMPTE or something, that will be done by matching audio automatically. If you already need to do that, it does not take much more time to match many tracks algorithmically.

Insight number 2 — synchronization

If you already accept synchronizing multiple sources, you might as well have the recording device on the subject. The exception being that two mics should be wireless and their receivers should be mounted on the main camera to get sound and images in lockstep.

Insight number 3 — lapels don't scale

It's a lot of work to mic people up. Well, not a lot of work but it does not scale to when you have 12 or 50 people. A handheld wireless mic which can be passed around can be a good option in those cases. Also, everyone understands how a normal handheld mic works, so don't go for one where you talk at the side of it or something. You could also use a portable recorder for this with built-in mics. Actually that may be the best option.

Insight number 4 — no digital

No need to go digital. It makes more and more sense in our day and age to go digital from the mic to the recorder, but since most cameras don't have digital sound inputs afaik, you'll have to wait for that for a while. And for your other devices, if one believes this blog post's insights to be true, you'll use portable recorders on the person, which come to think of it is kind of all digital, just a very short chain.

A possible setup from these insights

  • Two wireless lavalier/lapel mics with the receivers mounted on-camera. You could go with just one and use the other channel for picking up ambient sound for synchronization
  • Any other cameras used, can take up sound from any fixed mics, including boom mics, indeed they should use this to get some synchronization signal.
  • Two people can be be mic'ed up with the wireless mics,
  • any others can be mic'ed up with lavalier mics that go to a recorder directly on the person's body.

 

Being a bit more concrete

A. One wireless lavalier mic for the main camera,

for example Røde rødelink newscaster/filmmaker (untested by me).

B. More lavalier mics

All other lavalier mics used, would have their own recording device on the person's body. Here you would need to find good but affordable such recorders, or use mobile phones.

  • Sony has an apparently great little mobile recorder under €100, the Sony ICD-UX560, but ideally you'd want to go a bit cheaper than that. Also, it does not come with a clip, but it does have a mic input with bias (3-5V for electret condenser mics).
  • A mobile phone with e.g. the aputure a.lav mic. I have that one and I don't like it, though.

    I suspect some mobile phones apply compression, maybe there is a way to avoid that. Record that note App works well on Android as a recorder in my experience.

There are some novelties in the field, such as the

  • Sennheiser Memory Mic, but I wish it came in black and not white. It's also at ~€200 a bit on the expensive side.
  • Same price range for the Tascam DR-10L Digital Audio Recorder with Lavalier Mic .
  • There is also the Zoom F1-LP, with lavalier mic, roughly the same price.

 

C. One handheld microphone

One handheld microphone that you can use for non-mic'ed up people to speak through. Here is a recorder that is of interest for this:

  • Tascam DR-10X Plug-On Micro Linear PCM Recorder (XLR). Aroung €100.
  • Here is one that even supplies 48V phantom power: Saramonic SR-VRM1 Plug-On Linear PCM Recorder for XLR Microphones, around €125.
  • You could also use a handheld voice recorder such as the Sony ICD-UX560 (again) as a handheld microphone.
  • Or even use a mobile phone, possibly with a mic sticking out to make it obvious where to talk. Make sure to have a good screen lock, but not so good that you can't unlock it because the finger print holder went to lunch.

Risks with this approach

  • Uneven sound quality with so many different recording devices.
  • Synchronization not working due to sound images being too different between different mics.
  • It's hard to know with confidence if a local device is indeed recording. With a central mixing desk you could check. Maybe if you use phones you could have them report over the Internet their recording status.
  • People wandering away with the devices.
    If a person wanders away with a wireless mic, you lose the mic. If he wanders away with a recorder however, you also lose the recording from that person! And you will have to edit them out of the video. A fitting punishment maybe :).

Go full insight— No wireless mics?

Actually for when only two subjects need to be recorded, two wireless camera mounted systems might be the best choice. Maybe one should never go go full insight, but you could probably get away with only portable recorders on the subjects, and sync up via the cameras' built-in mics. Actually for when only two subjects need to be recorded, two wireless camera mounted systems might be the best. But, you could probably get away with only portable recorders on the subjects, and sync up via the cameras' built-in mics.

 


Align videos with audio analysis & stack them in a multi sound file

published Feb 14, 2019 01:25   by admin ( last modified Feb 17, 2019 11:57 )

With align-videos-by-sound, which is a python script/application you can align videos by their sound tracks.

After install, you can for example do this:

./bin/simple_stack_videos_by_sound_track --mode=direct --audio_mode=multi_streams tests/testfiles/3-secs-in.mp4 tests/testfiles/7-secs-in.mp4 tests/testfiles/full.mp4


It will present the 3 videos stacked in a "merged.mkv" file (one of the videos will be duplicated for symmetry), and the sound tracks from all 3 videos will be synced, for editing and mixing in e.g. Audacity (tested by me).

aligned tracks
Click to view full-size image…
Size: 80.3 kB

This is what it looks like in Audacity when you open the "merged.mkv" file. You can then export to multiple files and they will get the appropriate length of silence in the beginning.

Align-videos-by-sound is alpha software! And I am the maintainer of it, although all the heavy coding was done by Allison Deal (where i lifted the code from) and later Hiroaki Itoh.

 


Narrow a stereo track where things are panned hard left/right, with sox

published Feb 13, 2019 03:31   by admin ( last modified Feb 13, 2019 03:31 )
sox stereo.wav narrowstereo.wav remix 1v0.6,2v0.4  2v0.6,1v0.4

Credit to here: https://superuser.com/questions/348756/creating-stereo-file-from-two-mono-files-with-sox


An interesting guide on how to get an ecosystem going in your garden

published Feb 13, 2019 01:37   by admin ( last modified Feb 13, 2019 01:37 )

https://np.reddit.com/r/IWantToLearn/comments/apgslg/iwtl_how_to_create_a_better_environment_for/egaxmw5/

A proof to me that anything can be interesting once somebody explains it in a way that makes your brain grok the particular field.


Is bluetooth microphone audio always bad?

published Feb 12, 2019 12:50   by admin ( last modified Feb 17, 2019 01:15 )

Update: Here it is straight from the horse's mouth, or rather Antlion's mouth: All Bluetooth mic standards are bad, max they could get out was 16KHz sampling frequency (vid forwarded to where he talks about this):
https://youtu.be/UIp3Dh6rx7w?t=315

I put the sound track down for a screen cast with my Sony bluetooth headset. The result was not good, I had to redo it later with a wired mic. I then saw a video review of a higher quality bluetooth headset, but alas again the sound is no good from the mic:

https://youtu.be/xbxyqSVc_Io?t=265

This answer on Quora seems to indicate that with current standards bluetooth microophone audio will always suck, unless you hack the mic stream into some data protocol of bluetooth.

Why is the quality of my microphone so bad when I use my Bluetooth headset with my windows desktop computer? - Quora

 


How to get a stereo wav file from a Canon C300 camera with MXF format using ffmpeg

published Feb 12, 2019 03:45   by admin ( last modified Feb 12, 2019 04:12 )

This presumes you have been recording into the Canon C300 Mark I with two sound sources, split into separate channels. And you want the channels to be panned hard left and right, suitable for editing and comparison in e.g. Audacity (Although you can of course also export to different files and import).

This does the trick:

ffmpeg -i from_canon.MXF -filter_complex "[0:1] [0:2] amerge" -c:a pcm_s16le stereo.wav

Credit goes to FFMPEG: Need to mix dow multiple audio stream to single stereo - Stack Overflow

Software such as ffmpeg, Videolan Client (VLC) and audacity see Canon's MXF format as two separate sound channels inside a container, not as a left and right channel. In VLC you will only get one audio track for example, and you can switch to the other one if you want. So using ffmpeg, this won't work:

ffmpeg -i from_canon.MXF pcm_s16le stereo.wav

It will say that it will only include one audio stream in the wav file, althought there are two:

Stream mapping:
  Stream #0:1 -> #0:0 (pcm_s16le (native) -> pcm_s16le (native))

Mapping the channels into wav also won't work:

ffmpeg -i AA340601.MXF -map 0:1 -map 0:2 -c:a pcm_s16le all.wav

This will give the error:

WAVE files have exactly one stream
Could not write header for output file #0 (incorrect codec parameters ?): Invalid argument
Error initializing output stream 0:1 --

Ideas for using 12-48V phantom power to drive a 3v electret condenser microphone

published Feb 11, 2019 12:15   by admin ( last modified Feb 11, 2019 12:14 )

Not many sound cards I have found have multiple inputs for 3V electret condenser microphones. Maybe that makes sense, since such small mics are often used with headsets and as lapel/lavalier microphones, and those tend to be wireless if you're handling more than one.

Still, it would be fun to make a thingy that converts 48V phantom power to work with an elecret. Actually you can buy such an adapter off the shelf from Røde for about €20. That is probably the sane thing to do:

Rode VXLR+

However many people have thought about and designed their own solutions for this.

links
audio - Phantom Power supply for Electret mic - Electrical Engineering Stack Exchange
Convert 48V Phantom Power to 9V
This one is more about powering electrets in general, good read! Especially intrigued about just using two batteries in series and that's it:
Powering microphones
One of TI's examples is downregulating from 48V for the below circuit:
LM317HV 60V Input, 1.5A Adjustable Output Linear Regulator | TI.com

What is the best recording setup for recording discussions and interviews?

published Feb 09, 2019 03:40   by admin ( last modified Feb 12, 2019 01:21 )

Towards a setup with a focus on redundancy and forgiveness — so that botching a recording will hopefully be a thing of the past.

What we ideally would like to have, is a wireless system with high mic capacity, where all mics are sent digitally into a sound card and presented as separate channels to the audio workstation of your choice, for processing and mixing. Also, it would be nice to have multiple systems recording in parallel.

We can't get all the way to that on a small or medium budget. But we can get somewhat close.

Last week I rescued a conversation/interview track from a good videographer where unfortunately different sound levels and noise levels had been mixed together into a mono mix. Since people took turns in talking it wasn't too hard this time to make it really good by applying noise reduction and normalization per-region.

And I guess we've all have experienced a recording, by us or by someone else, where one participant just isn't audible or where the sound quality varies to the point that the listener must wonder if the people talking are all in the same room, or even at the same event.

If you have an experienced team that interviews you, they know how to set up things and they will have a dedicated sound technician. I was interviewed by such a team (from German 3sat)  a few months ago. Flawless.

But if you don't have access to that and I don't most of the time, it can be interesting to explore what technology can do to help to make sure you do not botch the recording. Do note that the equipment listed below is untested by me, with a couple of exceptions.

B&H Photo Video Audio has an article that touches on some of these topics, although they specifically talk about field recordings: How to Use a Portable Audio Recorder in Field Production

Many microphones are better than few microphones

If you have many microphones, errors in individual setups will have less grave consequences

Wireless is king

Many interesting conversations take place that aren't solely for the benefit of the recording. We can't have cables all over the place!

Separate digital tracks into the computer

It's better to be able to process separate channels afterwards and compensate for different levels of noise, sound characteristics and the like. Also, you can get rid of tracks that were no good. This is often referred to as multi channel recording.

The audio should be converted to digital as soon as possible.

And preferrably not converted back and forth a couple of times.

Setups that try to meet these requirements

The solution that meets all requirements (almost)

I'm adding this here: Samson — Go Mic Mobile®

With the lapel/lavalier option, this is really cool since it is all digital. It's a wireless system with a receiver that can send two independent channels, from two wireless devices, directly to USB.

I'm tempted to "drop my mic" here. However read on for systems with higher channel counts and many more options.

Wireless microphone system

AKG DMS Tetrad ~€700

A 4 channel wireless microphone system that is digital, and has 4 separate outputs. Separate outputs is kind of rare under €1000. The digital wireless part should make it more robust against interference (all wireless mic systems are turning digital nowadays). The Tetrad also has separate outputs for each microphone! But alas, it has no digital out.

Microphone arrays

As a backup, an advance conference microphone system may be good. This one has 4 microphone arrays and automatically listens in the direction of the current speaker. It has good reviews but not sure if everything is also available over USB and not just Bluetooth.

Bluetooth Speakerphone - eMeet M2 Black Wireless Conference Speakerphone ~€200

A cheaper version with no arrays is this (tested by me, ok) and it works well over USB:

Jabra Speak 510 UC USB/Bluetooth ~€100

Multi-channel sound cards

Some sound cards only allow stereo digital output, others offer 4,8 or 96 outputs for that matter. Sometimes it depends on the drivers and operating system in the computer how many channels you get. Behringer UMC1820 and Behringer UMC404HD  work well with Linux according to what I could read at LinuxMusicians— Linux support Behringer UMC404HD?.

My guess is that all cards below work well on Linux. OSX also does a good job with multi-channel while Windows struggles and needs bespoke drivers.

What is a channel

With "channel" below is meant the whole chain: An XLR/phone input with a mic preamp on the card that turns up as a separate channel in the computer's audio workstation.

Compact studio cards

Behringer UMC404HD  Four 24-bit channels. ~€100

Cheap, compact.

Focusrite Scarlett 18i8 2nd Gen ~€300 An eight channel 24-bit that is not a 19 incher for rack mount, but much more compact

19" studio cards

Behringer UMC1820~ €180 Eight 24-bit channels + ten virtual Adac ones.
Cheap, 19" for rack mount.

Behringer ADA8200 Ultragain ~€150 Eight 24-bit channels.
Cheap, 19" for rack mount.

Portable with extra built-in mics and built-in recording

These recorders function just as the above sound cards, but add the ability to store multi channel recordings directly and in parallel on the device, and are eminently portable and have a couple of replaceable microphones built-in.

Zoom H5 Handy Recorder | Zoom ~€250 Two 24-bit channels + 2 more with a separate clip-on adapter for ~€60

Zoom H6 Handy Recorder | Zoom ~€300 Four 24-bit channels + 2 more with a separate clip-on adapter for ~€60

Built-in recording and audio work stations

These work stations can function just as the above sound cards, but add the ability to store multi channel recordings directly and in parallel on the device, and are also mixing desks with a lof of hardware knobs and faders. It's really cool, but more to lug around and may look a bit intimidating to the participants.

Zoom LiveTrak L-12 Fourteeen 24-bit channels ~€500
Here is an inspiring video. Zoom LiveTrak L‑12 | Mixing pop/rock song - YouTube The multi channel source files are from a reference library, free for people to try their mixing chops on. Gorgeous song and singing voice, if you ask me! Some people report bleeding between higher track numbers for this device. Some of the channels on this one are not pre-amped.

Zoom R16 Eight 24-bit channels. ~€330
An oldie that still seems to give the best bang for the buck. Some of the channels on this one are not pre-amped.

Shoestring option

Some Linux distributions such as Ubuntu Studio with Ardour and Jack should be able to record from several sound cards at once. In fact I will try this a bit later with some old sound cards.

Here is a wireless microphone that records straight to USB: Fifine.

Not sure it's digital in the wireless part though.

Synchronization with video

One option is to record through the video mic and then use software that synchronizes your multi channel stuff with the video. One piece of software that does this is the python script align videos by sound. I maintain that software, but other people are the ones that have coded it, I just smacked a command line interface on it. I have used align videos by sound to synchronize a mobile phone voice recording with the video camera's internal mic recording, and it worked fine, reporting the correct offset.

General sound into the video cameras

In order to sync with video. the cameras should record sound even if we will not be using that sound, since it will be used for syncing. Conversely, one of the mics going into the multi channel card should be close to the camera mics to pick up the same pattern.

Redundancy

One option here is to mic up each person with two mics, and simply let the second mic record to the person's phone (or other phone). I have an Aputur A.lav lapel mic that can do that, but not happy with it. Worst case, just use the internal microphone of the phone!

 

 

 

 


Render times and hangs with Linux Flowblade editor

published Feb 08, 2019 06:40   by admin ( last modified Feb 08, 2019 06:46 )

Verdict: "Mp4 lossless"on Flowblade 2.0 is the winner, neglible RAM use and 330MB output file on a scale from 47MB (h264) to 6.9GB (ffyuv).

Material: an 11 minute screencast with sound.

Render times and behavior
Lossless output Flowblade version Result RAM use Render time Render size
ffyuv 1 series fine < 1GB 2m 34s 6.9GB
h264 1 series hangs computer but finishes 16GB 8m 30s 46.6MB
ffv 1 series hangs computer, no finish 14GB 3m 58s 1.9GB
MPEG-4 2 series fine < 1GB 2m 28s 328.5MB

 

You can then use ffmpeg to make a smaller lossless video. For example:

ffmpeg -i input.mp4 -c:v libx264 -crf 0 -c:a copy output.mkv

brings down the MPEG-4 file to 144MB with neglible RAM use.


Testing out deepspeech for speech to text recognition

published Feb 05, 2019 02:05   by admin ( last modified Feb 05, 2019 04:01 )

Contents: 16 minutes of heavily compressed "mushy" speech in British-ish English. Text file needed to create a script to redo the audio. I tried deepspeech 0.4.1 with its supplied model.

./bin/deepspeech --model models/output_graph.pb \
--alphabet models/alphabet.txt \
--trie models/trie --lm models/lm.binary \
--audio resampled-mono.wav > 16monoklr.txt

For comparison here's what Google transcribes from the same file:

this is the Swedish Land Registry logjam products contract blockchain system a Croma way running on s flex so I'd open a number of tabs here for each participant into the smart contract seller buyer sellers and buyers Bank the Land Registry itself and another server which in this case could be the demo…

First try on a 48KHz stereo wav file. The result is close to garbage and way too short:

his is the way should under his dream look him potomac changes and i conteanyng on spikes so i oftener of catechism in to the smart contract celeritate sang biting the understreak so i don't know server wichitas be the democrats not stage she told some pecorino is…

Second try 16KHz stereo version of the same file. Result, this is the entire contents of the file:

ohhhhhhhhh ohhhhhhhhh hhhhhhhhow hhhhhhhhow hhhhhhhhow here

Third try 16KHz mono version of the same file. The result is close to garbage and way too short:

this is these we should under destrem look him or he come look changed and a craney running on specs so i don't get a number of caterers to the smart contract selerier said or sankirtan the understreak self and an observer which he is caste democrat…


 

 


What's the best microphone for a screencast?

published Feb 02, 2019 01:55   by admin ( last modified Feb 02, 2019 02:34 )

I want super sound quality that doesn't preoccupy my hands, or is a table microphone, it needs to be close to my mouth.

My best microphone for this purpose that I own, is a T-bone TWS One (~€60) wireless headset that runs on 865 MHz to a wireless receiver. It's actually really good. I do not need any better. But it is a bit of work to set up:

  • the headset itself
  • the transmitter with batteries
  • the receiver, with power adapter
  • A 6.3mm output from the receiver that needs to go somewhere

So is there more convenient stuff out there?

What is not better

I have a lavalier, an Aputure A.lav. It is emphatically worse in sound quality than the T-bone. Its thin microphone cable is prone to picking up hum, and it makes a lot of scratching noises when cables move around. Maybe my copy shouldn't have passed quality control.

Also worse is a Sony MDR-XB950B1 headset over Bluetooth. If you post process with normalization, equalization and compression you can get up to the level of the sound of a depressed airline pilot over the plane's PA.

Here is the microphones I am looking at right now

Plantronics Voyager 4220 UC bluetooth headset €200

Has good reviews. A bit expensive though, but might be a good replacement for my Sony MDR-XB950B1. About €200.

Samson Go Mic USB, USB microphone €40

USB means there is very little analog cable length to pic up hum. It is also reasonably priced, very small with built-in clip and mic mount. About €50.

ModMic 4/5, clip-on analog microphone for headphones

Good reputation but does have a long analog cable. But maybe it is well shielded. Analog means it can be plugged into an analog mixer. However if you run a Linux distribution with Jack, such as MediBuntu, you ought to be able to do simultaneous recording from multiple sound cards, including USB microphones.

ModMic Wireless, clip-on wireless microphone for headphones €100

ModMic Wireless – Antlion Audio A bit expensive.

Overviews

 


Flowblade— a so far so good video editor on Linux

published Jan 31, 2019 11:50   by admin ( last modified Feb 01, 2019 12:11 )

This morning I opened my KDEnlive project that I spent 4 hours on yesterday, and found it had scrambled all my clips over three video timelines, changed durations and lost some clips.

So today I switched. I have tested many video editors but never before Flowblade. So far, so good! It took me an hour to recreate yesterday's work (since I now knew the source material better) and lo and behold, it hasn't corrupted my work yet. Thruth to be hold, neither had KDEnlive before yesterday.

I remember when I had to build a watchdog for the Linux Abiword word processor, a watchdog that reacted to every file change Abiword made and checked that into a local git repository. That version of Abiword had the habit of corrupting its document files. I thought I'd now need to do the same with KDEnlive. But I think I will stick with Flowblade now. It has crashed on me 5 times today, but so far not corrupted anything.

I found Flowblade to have a logical UI and you can name clips! But, you cannot cut soundtracks. That could be a dealbreaker. Maybe you can if you merge them with video tracks?

Ok, I have found a way to cut an audio clip.

  • Make sure all tracks are inactive
  • That includes the track the audio clip is on, if that track is active, it won't work
  • Select the audio you want to make a cut in
  • Press x, or select Edit→Cut Clip

Annotate screenshots in Ubuntu when Shutter is gone—Use Krita

published Jan 30, 2019 03:50   by admin ( last modified Jan 30, 2019 03:49 )

You can also use Flameshot if you use that tool to screenshot. Krita is better than Gimp, because you can draw ellipses directly without using selection tools and such. No arrows though as far as I have found. There might be Gimp plugins that do more stuff though.

 

Obviously use can use a vector editor such as InkScape, but then you have to think about preserving resolutions.


Link to comparison of lots of programming fonts — and my two favorites

published Jan 22, 2019 12:55   by admin ( last modified Jan 22, 2019 12:59 )

http://app.programmingfonts.org

 

My two favorites:

Bitstream Vera Sans Mono and Hack.


On the French yellow vest movement & similar movements

published Jan 22, 2019 11:55   by admin ( last modified Jan 22, 2019 11:52 )

This interesting article goes into some causes, but admits to not understand the movement.

https://www.the-american-interest.com/2019/01/21/the-problem-with-no-name/

I think there's a reason for yellow vests & such: Disorientation. Unclear road ahead, imaginations run on fumes. Society hasn't integrated the power of the Internet; it helped wipe out the old French parties. Monetary policy adds to our present confusion. Old extreme left/right baggage floats up.

The road ahead must bring clarity to a computerized & networked world. Now its effect is centrifugal & ppl trip on fear or hubris. I hope blockchains can help, bring clarity to what money is, & coordinate people to act politically. For the latter digital identities is important.


Impressions of libre MarkDown note-taking apps: Joplin & Boostnote.

published Jan 22, 2019 11:46   by admin ( last modified Jan 22, 2019 11:46 )

Impressions of libre MarkDown note-taking apps: Joplin & Boostnote. Joplin less cluttered, obvious how to make notes, has a menu bar. Boostnote UI more like VS Code editor put together at random 🙂 Actual usage may shift opinion…


Merge tools for git (untested)

published Jan 11, 2019 09:55   by admin ( last modified Jan 11, 2019 10:09 )

Getting OpenVPN to work when ipv6 support is missing

published Jan 07, 2019 02:39   by admin ( last modified Jan 07, 2019 02:39 )

You may get error messages such as:

"GDG6: NLMSG_ERROR: error Operation not supported"

"GDG6: remote_host_ipv6=n/a"

 

If it cannot get ipv6 connection, add these to your config file:

 

pull-filter ignore "route-ipv6"
pull-filter ignore "ifconfig-ipv6"

Hints to software to build your own Google Home

published Dec 17, 2018 03:05   by admin ( last modified Dec 17, 2018 03:03 )

Some guys are trying, however they are held back by production and fitting problems of PCB and LCD. Would've been easier if they had just used standard hardware I guess. But it's harder to charge for that.

https://www.kickstarter.com/projects/aiforeveryone/mycroft-mark-ii-the-open-voice-assistant

They do on the page mention a number of text to speech and command softwares, those could be worth to take a look at.

 

  • Pocket-sphinx
  • precise
  • Mozilla DeepSpeech
  • Adapt
  • Padatious
  • Mimic
  • CMU flite

 

So there's probably where you hould start looking!