Linguistic Data

The Wolfram Language has not only convenient built-in multilingual dictionaries, but also built-in information on word meaning, structure, and usage, as well as the relationships between words. Together with the Wolfram Language's tightly integrated string manipulation functions, visualization, and data import and export, this provides a uniquely powerful platform for natural language computing.


DictionaryLookup look up words in English and other dictionaries using string patterns
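
As a minimal sketch of pattern-based lookup, the following assumes the default English dictionary and a Spanish dictionary are available in the session (dictionary data may be fetched on first use). The variable names and the chosen patterns are illustrative only.

```wolfram
(* Words in the default English dictionary that begin with "wolf" *)
wolfWords = DictionaryLookup["wolf" ~~ ___];

(* The same kind of query against a named dictionary:
   Spanish words that end in "mente" *)
spanishWords = DictionaryLookup[{"Spanish", ___ ~~ "mente"}];

(* Show up to five matches from each result list *)
Take[wolfWords, UpTo[5]]
Take[spanishWords, UpTo[5]]
```

The pattern argument is an ordinary string pattern, so constructs such as `___` (any sequence of characters) can be combined with literal fragments.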

WordData properties of words and networks of relationships between them

"Definitions"  ▪  "Synonyms"  ▪  "BroaderTerms"  ▪  "PartOfSpeech"  ▪  "InflectedForms"  ▪  ...

Textual Analysis

Classify classify text based on language, topic, sentiment, or arbitrary training

StringSplit  ▪  StringCases  ▪  StringCount  ▪  Counts  ▪  Nearest  ▪  ...
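
The string functions above combine naturally for simple word statistics. The following is a small sketch on a made-up sentence; the sample text and variable names are assumptions for illustration.

```wolfram
(* Hypothetical sample text *)
text = "the cat saw the dog and the dog saw the cat";

(* Split on whitespace into a list of words *)
words = StringSplit[text];

(* Association of word -> number of occurrences, largest counts first *)
freq = Sort[Counts[words], Greater];

(* Count occurrences of a substring directly in the string *)
nThe = StringCount[text, "the"];

(* Extract every run of letters that starts with "d" *)
dWords = StringCases[text, "d" ~~ LetterCharacter ..];
```

Here `Counts` works on the list produced by `StringSplit`, while `StringCount` and `StringCases` operate on the original string, so the two approaches can be used to cross-check each other.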

Importing Data

Import import or "scrape" text from all standard formats

"HTML"  ▪  "PDF"  ▪  "RTF"  ▪  "XML"  ▪  "TeX"  ▪  "Text"  ▪  "String"  ▪  ...

LanguageData data on 6000+ languages

ExampleData access to standard sample texts, including complete books

Proper Names & Linguistic Entities

PersonData  ▪  CityData  ▪  ChemicalData  ▪  SpeciesData

Interpreter  ▪  SemanticInterpretation  ▪  SemanticImportString