I have the Coqui STT yesno model from GitHub, which is an ultra-compact speech recognition model that only recognises two words: yes and no.
I have the yesno.pbmm
and yesno.scorer
files.
I also have a tarball coqui-yesno-checkpoints.tar.gz
containing the following files:
ls -l coqui-yesno-checkpoints/
total 7356
-rw-r--r-- 1 jeremiah jeremiah 12 Jul 27 00:00 alphabet.txt
-rw-r--r-- 1 jeremiah jeremiah 1066076 Jul 26 23:59 best_dev-1909.data-00000-of-00001
-rw-r--r-- 1 jeremiah jeremiah 1377 Jul 26 23:59 best_dev-1909.index
-rw-r--r-- 1 jeremiah jeremiah 4476795 Jul 26 23:59 best_dev-1909.meta
-rw-r--r-- 1 jeremiah jeremiah 83 Jul 27 00:00 best_dev_checkpoint
-rw-r--r-- 1 jeremiah jeremiah 3861 Jul 27 00:02 flags.txt
-rw-r--r-- 1 jeremiah jeremiah 1368905 Jul 27 00:05 yesno-64dims.logs
-rw-r--r-- 1 jeremiah jeremiah 594622 Jul 27 00:05 yesno-64dims.logs.lm-optimizer
How do I convert this model into a .tflite
file for use with Coqui on embedded devices?
Looking for a command-line solution that can be easily scripted.