Audio Processing

Captured digital audio is widely available from speech, music, and natural sounds, most of which can also be algorithmically synthesized. Digital audio can be manipulated in a variety of ways, including editing (trim, split, join, ...), enhancing (amplify, denoise, ...), analyzing (visualize, classify, ...), and creating effects (pitch shift, adding reverb, ...).  The Wolfram Language provides fully integrated support for audio, including fast in-memory data and large out-of-core files. The built-in audio supports a range of uses, from immediate playing and scrubbing to advanced programmatic processing and analysis.

Creating & Importing Audio

Audio represent audio using an array of data, a file, or a URL

AudioCapture capture an audio signal from an input device

SpeechSynthesize synthesize a speech signal from text

WebAudioSearch  ▪  AudioGenerator  ▪  Import  ▪  ExampleData

Programmatic Capture & Playback »

AudioPlay create an audio stream and start playing

AudioPause  ▪  AudioStop  ▪  AudioStream  ▪  ...

Audio Properties »

AudioData extract data stored in audio

Duration  ▪  AudioSampleRate  ▪  AudioChannels  ▪  ...

Arithmetic & Statistical Operations

Plus  ▪  Times  ▪  Power  ▪  Abs  ▪  Log  ▪  ...

Mean  ▪  Variance  ▪  Median  ▪  Quantile  ▪  ...

Audio Visualization

AudioPlot waveform plot of audio

Spectrogram  ▪  Periodogram  ▪  Cepstrogram  ▪  ListPlot

Basic Audio Manipulation

AudioResample resample the audio to a different sample rate

AudioTrim  ▪  AudioPad  ▪  AudioDelete  ▪  AudioInsert  ▪  AudioReplace  ▪  AudioPartition  ▪  AudioSplit

AudioPan  ▪  AudioChannelMix  ▪  AudioNormalize  ▪  AudioAmplify

Audio Composition

ConformAudio return a list of audio objects with conforming properties

AudioJoin  ▪  AudioOverlay  ▪  AudioChannelSeparate  ▪  AudioChannelCombine

Audio Effects

AudioReverb create audio by adding reverberations to audio

AudioFade  ▪  AudioDelay  ▪  AudioTimeStretch  ▪  AudioPitchShift  ▪  AudioFrequencyShift  ▪  AudioSpectralMap  ▪  AudioSpectralTransformation

Audio Analysis »

AudioIntervals return intervals of silence, audio, etc.

AudioMeasurements  ▪  AudioLocalMeasurements  ▪  AudioAnnotate  ▪  ShortTimeFourier  ▪  CepstrogramArray  ▪  ...

Filtering & Noise Removal »

LowpassFilter lowpass filtering of audio

WienerFilter remove noise from audio using Wiener filtering

HighpassFilter  ▪  MeanFilter  ▪  TotalVariationFilter  ▪  ...

Understanding Audio Signals

AudioIdentify attempt to identify what an audio signal is a recording of

PitchRecognize recognize the pitch of an audio signal

SpeechRecognize perform speech-to-text

Machine Learning »

Classify, Predict create and apply classifiers or predictors to audio signals

Nearest  ▪  FeatureNearest  ▪  FeatureSpacePlot  ▪  FindClusters  ▪  ...

NetEncoder  ▪  NetChain  ▪  NetGraph  ▪  ...