Linguistic Data

The Wolfram Language has not only convenient built-in multilingual dictionaries, but also built-in information on word meaning, structure, and usage, as well as the relationship between words. Together with the Wolfram Language's tightly integrated string manipulation functions, visualization, and data import and export, this provides a uniquely powerful platform for natural language computing.

WordList lists of words of various types in many languages

RandomWord random word of specified type

WordFrequencyData data on typical current and historical word frequencies

WordData properties of words and networks of relationships between them

WordDefinition definitions of words

Synonyms synonyms for a word

Antonyms antonyms for a word

PartOfSpeech possible parts of speech for a word

Languages & Translations

Language 6000+ recognized languages from around the world

WritingScript Alphabet Character GrammaticalUnit

WordTranslation translations of words in many languages

Alphabet alphabets for many languages

LanguageIdentify  ▪  Transliterate  ▪  AlphabeticOrder

TextTranslation translate text using an integrated external service

Textual Analysis »

Classify classify text based on language, topic, sentiment, or arbitrary training

StringSplit  ▪  StringCases  ▪  StringCount  ▪  Counts  ▪  Nearest  ▪  ...

TextStructure parse text into its grammatical structure

DictionaryLookup look up words in English and other dictionaries using string patterns

DictionaryWordQ test if a word is a correctly spelled dictionary word

SpellingCorrectionList list of spelling suggestions for misspelled words

Importing Data »

Import import or "scrape" text from all standard formats

"HTML"  ▪  "PDF"  ▪  "RTF"  ▪  "XML"  ▪  "TeX"  ▪  "Text"  ▪  "String"  ▪  ...

ResourceData access textual data in the Wolfram Data Repository

WikipediaData retrieve text from Wikipedia

WebSearch integrated web search in all languages, with snippets etc.

Number Words

IntegerName words for integers in many languages

Proper Names & Linguistic Entities »

GivenName Surname Person City Chemical Species  ▪  ...

Interpreter  ▪  SemanticInterpretation  ▪  SemanticImportString