You don't need ML to trim out silence

sdenton4 · on Dec 7, 2022

Silence is often problem dependent... You may want ML to differentiate between noisy audio with speech and noisy audio without speech.

darkpicnic · on Dec 7, 2022

"Silence" is a problematic term. For me, that word encompasses: squeaky chairs, typing on a loud keyboard, moving objects around on my table, etc. In a perfect world, Whisper —like a human— can easily distinguish a human voice from the din of my office, and only try and transcribe my voice.

Does anyone have solutions for clearing out "silence" from an audio file that works off something a bit more accurate than just "<= decibel x"?

Edited for grammar.