Using machine learning for audio and speech tasks is becoming more popular, as it opens up a new surface for human-computer interaction. NatML provides infrastructure for making predictions on audio models--specifically those that work with raw audio waveforms. Let's walk through the typical workflow: