Enable JavaScript to interact with content and submit forms on Wolfram websites. Learn how

Audio Processing

Topic
Overview »

Digital audio is widely available from speech, music, and natural sounds, most of which can also be algorithmically synthesized. Digital audio can be manipulated in a variety of ways, including editing (trim, split, join, ...), enhancing (amplify, denoise, ...), analyzing (visualize, classify, ...), and creating effects (pitch shift, adding reverb, ...). The Wolfram Language provides fully integrated support for audio, including fast in-memory data and large out-of-core files. The built-in audio supports a range of uses, from immediate playing and scrubbing to advanced programmatic processing and analysis.

Creating & Importing Audio »

Audio — represent audio using an array of data, a file or a URL

Import ▪ WebAudioSearch ▪ AudioCapture ▪ SpeechSynthesize ▪ ...

Visualization

AudioPlot — waveform plot of audio

Spectrogram ▪ Periodogram ▪ Cepstrogram

Audio Editing »

AudioAmplify — amplify an audio signal

AudioTrim ▪ AudioPitchShift ▪ AudioJoin ▪ ...

Filtering & Noise Removal »

LowpassFilter — lowpass filtering of audio

WienerFilter ▪ MeanFilter ▪ TotalVariationFilter ▪ ...

Audio Analysis »

AudioIntervals — return intervals of silence, audio, etc.

AudioMeasurements ▪ AudioLocalMeasurements ▪ ShortTimeFourier ▪ ...

Mean ▪ Variance ▪ Median ▪ Quantile ▪ ...

Machine Learning for Audio

AudioIdentify — attempt to identify what an audio signal is a recording of

PitchRecognize ▪ AudioInstanceQ ▪ FeatureSpacePlot ▪ Nearest ▪ ...

NetEncoder ▪ NetChain ▪ NetGraph ▪ ...

Understanding Speech »

SpeechRecognize — convert a spoken audio signal to text

SpeechCases ▪ SpeechInterpreter ▪ ...

Top