VideoTranscribe

VideoTranscribe[video]

recognizes speech in an audio track and adds it to video as a subtitle track.

Details and Options

  • VideoTranscribe performs speech-to-text on an audio track of video and adds the recognized text to video as a subtitle track. It is typically used to generate closed captions for videos.
  • The following options can be given:
    AudioTrackSelection    Automatic             the audio track to transcribe
    Masking                All                   interval of interest
    Method                 Automatic             the method to use
    PerformanceGoal        $PerformanceGoal      aspects of performance to try to optimize
    ProgressReporting      $ProgressReporting    whether to report the progress of the computation
    TargetDevice           "CPU"                 the device on which to perform recognition
  • By default, speech in the whole signal is recognized. Use Masking->{int1,int2,…} to limit the recognition to the intervals int_i.
  • By default, if a method returns non-speech tokens (e.g. [applause]), they are included in the result. Use Method->{"NonSpeechReplacement"->replacements} to specify different replacements. Use "NonSpeechReplacement"->"" to remove them.
  • VideoTranscribe uses machine learning. Its methods, training sets and biases included therein may change and yield varied results in different versions of the Wolfram Language.
  • VideoTranscribe may download resources that will be stored in your local object store at $LocalBase and can be listed using LocalObjects[] and removed using ResourceRemove.
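
The Masking and Method settings described above could be used as in the following minimal sketch. The file name "talk.mp4" is hypothetical, and the sketch assumes that each masking interval is given as a {start, end} pair in seconds; neither detail is stated on this page.

```wolfram
(* "talk.mp4" is a hypothetical file name; substitute a real video file *)
video = Video["talk.mp4"];

(* limit recognition to the first 30 seconds
   (assumes intervals are {start, end} pairs in seconds) *)
VideoTranscribe[video, Masking -> {{0, 30}}]

(* drop non-speech tokens such as [applause] from the subtitle track *)
VideoTranscribe[video, Method -> {"NonSpeechReplacement" -> ""}]
```

Each call returns a new video with an added subtitle track, so the two settings can also be given together in one call if both behaviors are wanted.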

Examples


Basic Examples  (1)

Transcribe the second audio track of a video and add it as a subtitle track:

Options  (1)

AudioTrackSelection  (1)

Select a different track to transcribe:
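
The original input cell for this example is not shown here. A minimal sketch of what such a call might look like, assuming a hypothetical multi-track file "talk.mp4" and assuming that an integer setting selects the audio track by index:

```wolfram
(* "talk.mp4" is a hypothetical file with at least two audio tracks *)
video = Video["talk.mp4"];

(* transcribe the second audio track instead of the automatically chosen one
   (assumes an integer setting refers to the track index) *)
VideoTranscribe[video, AudioTrackSelection -> 2]
```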

Wolfram Research (2024), VideoTranscribe, Wolfram Language function, https://reference.wolfram.com/language/ref/VideoTranscribe.html.
