
You don't need ML to trim out silence


What counts as silence is often problem-dependent... You may want ML to differentiate between noisy audio with speech and noisy audio without speech.


"Silence" is a problematic term. For me, that word encompasses squeaky chairs, typing on a loud keyboard, moving objects around on my table, etc. In a perfect world, Whisper, like a human, could easily distinguish a human voice from the din of my office and would try to transcribe only my voice.

Does anyone have solutions for clearing out "silence" from an audio file that work off something a bit more accurate than just "<= decibel x"?
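
For reference, the "<= decibel x" baseline I mean looks roughly like the sketch below. It assumes mono float samples in [-1, 1]; the names `frame_dbfs`, `trim_silence`, `frame_len`, and `threshold_db` and the -40 dB default are made up for illustration, not taken from any library.

```python
import math


def frame_dbfs(frame):
    """RMS level of a frame of float samples in [-1, 1], in dB relative to full scale."""
    if not frame:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in frame) / len(frame))
    return 20 * math.log10(rms) if rms > 0 else float("-inf")


def trim_silence(samples, frame_len=160, threshold_db=-40.0):
    """Drop every frame whose RMS level is at or below threshold_db.

    frame_len and threshold_db are illustrative values (assumptions),
    and would need tuning per recording.
    """
    kept = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        if frame_dbfs(frame) > threshold_db:
            kept.extend(frame)
    return kept


if __name__ == "__main__":
    # A loud tone, a very quiet stretch, then the tone again.
    tone = [0.5 * math.sin(2 * math.pi * i / 16) for i in range(320)]
    quiet = [0.001] * 320
    trimmed = trim_silence(tone + quiet + tone)
    print(len(tone + quiet + tone), "->", len(trimmed))
```

This is exactly the limitation: a keyboard clack or a squeaky chair sits well above any sensible threshold, so it is kept just like speech would be.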

Edited for grammar.



