This is awesome man. We attempted to build something similar and wound up giving up and pivoting to transcripts w/ a punctuation model to enhance them.
If this was around at the time, we likely would have been able to make audio work.
Kudos for your work on this. Seems truly well architected and thought out. The spacy integration is especially awesome.
Thanks! I spent a decent amount of time messing around with regex nonsense before I realized Spacy could work for this. I decided to leave the regex approach in as an option anyway since it still works reasonably well and is lighter weight.
If this was around at the time, we likely would have been able to make audio work.
Kudos for your work on this. Seems truly well architected and thought out. The spacy integration is especially awesome.