Audio Processing

Digital audio is widely available as recordings of speech, music, and natural sounds, most of which can also be synthesized algorithmically. It can be manipulated in a variety of ways, including editing (trim, split, join, ...), enhancing (amplify, denoise, ...), analyzing (visualize, classify, ...), and creating effects (pitch shift, reverb, ...). The Wolfram Language provides fully integrated support for audio, handling both fast in-memory data and large out-of-core files. This built-in support covers a range of uses, from immediate playing and scrubbing to advanced programmatic processing and analysis.

Creating & Importing Audio

Audio represent audio using an array of data, a file or a URL

Import  ▪  WebAudioSearch  ▪  AudioCapture  ▪  SpeechSynthesize  ▪  ...
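As a minimal sketch of the "array of data" route, the following builds an Audio object from sampled values. The 440 Hz tone, the 44.1 kHz sample rate, and the variable names are illustrative assumptions, and the file path in the comment is hypothetical.

```wolfram
(* Sample one second of a 440 Hz sine wave at 44.1 kHz (illustrative values) *)
sampleRate = 44100;
samples = Table[Sin[2 Pi 440 t], {t, 0., 1., 1./sampleRate}];

(* Wrap the raw samples in an Audio object, stating the sample rate explicitly *)
tone = Audio[samples, SampleRate -> sampleRate]

(* Importing works on a file instead; the path below is hypothetical *)
(* recording = Import["path/to/recording.wav"] *)
```

Setting SampleRate explicitly avoids relying on a default when the data comes from elsewhere.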

Visualization

AudioPlot waveform plot of audio

Spectrogram  ▪  Periodogram  ▪  Cepstrogram
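The visualization functions listed above can be applied directly to an Audio object. This sketch assumes a synthetic 440 Hz test tone (an illustrative choice) so that it does not depend on any external file.

```wolfram
(* Synthetic test signal: one second of a 440 Hz sine at 44.1 kHz *)
tone = Audio[Table[Sin[2 Pi 440 t], {t, 0., 1., 1./44100}], SampleRate -> 44100];

(* Time-domain waveform *)
AudioPlot[tone]

(* Time-frequency view *)
Spectrogram[tone]

(* Power spectrum estimate *)
Periodogram[tone]
```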

Audio Editing

AudioAmplify amplify an audio signal

AudioTrim  ▪  AudioPitchShift  ▪  AudioJoin  ▪  ...
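A short sketch of how the editing functions above might be chained. The test tone, the gain of 0.5, the trim interval, and the pitch ratio of 2 are all illustrative assumptions.

```wolfram
(* Synthetic test signal: one second of a 440 Hz sine at 44.1 kHz *)
tone = Audio[Table[Sin[2 Pi 440 t], {t, 0., 1., 1./44100}], SampleRate -> 44100];

(* Halve the amplitude *)
quieter = AudioAmplify[tone, 0.5];

(* Keep only the first half second *)
firstHalf = AudioTrim[quieter, {0, 0.5}];

(* Shift the pitch by a ratio of 2, i.e. one octave up *)
higher = AudioPitchShift[firstHalf, 2];

(* Concatenate the original-pitch and shifted segments *)
joined = AudioJoin[{firstHalf, higher}]
```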

Filtering & Noise Removal

LowpassFilter lowpass filtering of audio

WienerFilter  ▪  MeanFilter  ▪  TotalVariationFilter  ▪  ...
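As a rough illustration of filtering, the sketch below adds uniform noise to a test tone and then applies two of the filters listed above. The noise level, the 1000 Hz cutoff, and the Wiener filter radius of 5 are arbitrary assumptions, not recommended settings.

```wolfram
(* Build a noisy test signal from raw samples (illustrative values) *)
n = 44100;
clean = Table[0.8 Sin[2 Pi 440 t], {t, 0., 1., 1./n}];
noisy = Audio[clean + RandomReal[{-0.1, 0.1}, Length[clean]], SampleRate -> n];

(* Attenuate content above a 1000 Hz cutoff *)
smoothed = LowpassFilter[noisy, Quantity[1000, "Hertz"]];

(* Adaptive noise reduction over a range-5 neighborhood *)
denoised = WienerFilter[noisy, 5];
```

Comparing Spectrogram[noisy] with Spectrogram[smoothed] is one way to inspect what each filter removed.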

Audio Analysis

AudioIntervals return intervals of silence, audio, etc.

AudioMeasurements  ▪  AudioLocalMeasurements  ▪  ShortTimeFourier  ▪  ...

Mean  ▪  Variance  ▪  Median  ▪  Quantile  ▪  ...
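The analysis functions above can be tried on a signal with a known structure. This sketch assumes a one-second tone followed by half a second of digital silence; the signal and the chosen measurement names are illustrative.

```wolfram
(* One second of tone followed by half a second of silence *)
n = 44100;
burst = Table[0.8 Sin[2 Pi 440 t], {t, 0., 1., 1./n}];
signal = Audio[Join[burst, ConstantArray[0., n/2]], SampleRate -> n];

(* Intervals selected by the default criterion *)
AudioIntervals[signal]

(* Global measurements over the whole signal *)
AudioMeasurements[signal, {"Duration", "RMSAmplitude"}]

(* The same measurement computed locally, returned as a time series *)
AudioLocalMeasurements[signal, "RMSAmplitude"]
```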

Machine Learning for Audio

AudioIdentify attempt to identify what an audio signal is a recording of

PitchRecognize  ▪  AudioInstanceQ  ▪  FeatureSpacePlot  ▪  Nearest  ▪  ...

NetEncoder  ▪  NetChain  ▪  NetGraph  ▪  ...
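A hedged sketch of the recognition functions above. A pure sine tone is an assumption made for self-containment; real recordings give more meaningful results, and the first evaluation may need to download pretrained models.

```wolfram
(* Synthetic test signal: one second of a 440 Hz sine at 44.1 kHz *)
tone = Audio[Table[Sin[2 Pi 440 t], {t, 0., 1., 1./44100}], SampleRate -> 44100];

(* Estimated pitch over time *)
PitchRecognize[tone]

(* Best guess at what kind of sound this is; results on synthetic tones may be unhelpful *)
AudioIdentify[tone]
```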

Understanding Speech

SpeechRecognize convert a spoken audio signal to text

SpeechCases  ▪  SpeechInterpreter  ▪  ...
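One way to try speech recognition without a recording is a round trip through speech synthesis, sketched below. The sentence is an arbitrary example, the recognized text is not guaranteed to match it exactly, and the first evaluation may need to download models.

```wolfram
(* Synthesize a spoken sentence as an Audio object *)
spoken = SpeechSynthesize["The quick brown fox jumps over the lazy dog"];

(* Transcribe the synthesized speech back to text *)
SpeechRecognize[spoken]
```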