TextWords
TextWords["string"]
gives a list of the runs of characters identified as words in string.
TextWords["string",n]
gives the first n words in string.
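A minimal sketch of the two-argument form, assuming a short sample string chosen only for illustration:

```wolfram
(* Illustrative input string; not taken from the original page *)
text = "one two three four five";

(* Request only the first 3 words of the string *)
TextWords[text, 3]
```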
Details
- Characters in string that are not identified as being part of words are dropped by TextWords.
- TextWords[ContentObject[…]] gives words from the plain text contents of the ContentObject.
Examples
Basic Examples (3)
Segment a string into a list of words:
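A minimal sketch of this call; the sample sentence is an assumption made for illustration:

```wolfram
(* Split a plain sentence into its words *)
TextWords["The quick brown fox jumps over the lazy dog"]
```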
TextWords separates words by punctuation as well as whitespace:
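One way to try this, using a made-up string in which some words are separated only by punctuation, with no surrounding whitespace:

```wolfram
(* "Hello,world" and "fine;really" contain no spaces between the words *)
TextWords["Hello,world! This is fine;really."]
```

As noted under Details, characters not identified as part of a word, such as the punctuation here, are dropped from the result.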
Scope (3)
TextWords preserves hyphenation:
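A small sketch with an illustrative phrase (an assumption, not the original input); going by the statement above, hyphenated compounds should come back as single words:

```wolfram
(* Hyphenated compounds next to ordinary words and a comma *)
TextWords["A well-known, state-of-the-art method"]
```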
Titles, currencies and other syntactic units are segmented as separate words:
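A hedged sketch for exploring this behavior; the sentence is hypothetical, and the exact segmentation of each unit is not reproduced here:

```wolfram
(* Hypothetical sentence mixing a title and a currency amount *)
TextWords["Dr. Smith paid $5.50 for the book."]
```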
Get a list of words from a ContentObject:
Applications (1)
Make a WordCloud of words from a poem:
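A minimal sketch, assuming the opening lines of Poe's "The Raven" as the sample poem (any string of verse would do):

```wolfram
(* Sample poem text, supplied inline so the example is self-contained *)
poem = "Once upon a midnight dreary, while I pondered, weak and weary,
Over many a quaint and curious volume of forgotten lore";

(* WordCloud sizes each distinct word by how often it occurs in the list *)
WordCloud[TextWords[poem]]
```

With so short an excerpt most words occur only once; a longer text gives a more informative cloud.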
Properties & Relations (2)
Possible Issues (1)
Words returned by TextWords are identified structurally, and may not be dictionary words:
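One way to see this is to pass the result through DictionaryWordQ. The sentence below, with its invented words "blorfle" and "quzzed", is an assumption for illustration:

```wolfram
(* A sentence containing two made-up words *)
words = TextWords["The blorfle quzzed loudly"];

(* Pair each returned word with whether it is a recognized dictionary word *)
AssociationMap[DictionaryWordQ, words]
```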
Text
Wolfram Research (2015), TextWords, Wolfram Language function, https://reference.wolfram.com/language/ref/TextWords.html (updated 2016).