TextWords

TextWords["string"]

gives a list of the runs of characters identified as words in string.

TextWords["string",n]

gives the first n words in string.

Details

Examples

open allclose all

Basic Examples  (3)

Segment a string into a list of words:

TextWords separates words by punctuation as well as whitespace:

Get the first 10 words in a block of text:

Scope  (3)

TextWords preserves hyphenation:

Titles, currencies and other syntactic units are segmented as separate words:

Get a list of words from a ContentObject:

Applications  (1)

Make a WordCloud of words from a poem:

Properties & Relations  (2)

TextWords is equivalent to TextCases[,"Word"]:

TextStructure splits texts into the same words:

Possible Issues  (1)

Words returned by TextWords are identified structurally, and may not be dictionary words:

Wolfram Research (2015), TextWords, Wolfram Language function, https://reference.wolfram.com/language/ref/TextWords.html (updated 2016).

Text

Wolfram Research (2015), TextWords, Wolfram Language function, https://reference.wolfram.com/language/ref/TextWords.html (updated 2016).

CMS

Wolfram Language. 2015. "TextWords." Wolfram Language & System Documentation Center. Wolfram Research. Last Modified 2016. https://reference.wolfram.com/language/ref/TextWords.html.

APA

Wolfram Language. (2015). TextWords. Wolfram Language & System Documentation Center. Retrieved from https://reference.wolfram.com/language/ref/TextWords.html

BibTeX

@misc{reference.wolfram_2023_textwords, author="Wolfram Research", title="{TextWords}", year="2016", howpublished="\url{https://reference.wolfram.com/language/ref/TextWords.html}", note=[Accessed: 19-March-2024 ]}

BibLaTeX

@online{reference.wolfram_2023_textwords, organization={Wolfram Research}, title={TextWords}, year={2016}, url={https://reference.wolfram.com/language/ref/TextWords.html}, note=[Accessed: 19-March-2024 ]}