Linguistic Data
The Wolfram Language has not only convenient built-in multilingual dictionaries, but also built-in information on word meaning, structure, and usage, as well as the relationship between words. Together with the Wolfram Language's tightly integrated string manipulation functions, visualization, and data import and export, this provides a uniquely powerful platform for natural language computing.
WordList — lists of words of various types in many languages
RandomWord — random word of specified type
WordFrequencyData — data on typical current and historical word frequencies
WordData — properties of words and networks of relationships between them
WordDefinition — definitions of words
Synonyms — synonyms for a word
Antonyms — antonyms for a word
PartOfSpeech — possible parts of speech for a word
Languages & Translations
Language — 6000+ recognized languages from around the world
WritingScript Alphabet Character GrammaticalUnit
WordTranslation — translations of words in many languages
Alphabet — alphabets for many languages
LanguageIdentify ▪ Transliterate ▪ AlphabeticOrder
TextTranslation — translate text using an integrated external service
Textual Analysis »
Classify — classify text based on language, topic, sentiment, or arbitrary training
StringSplit ▪ StringCases ▪ StringCount ▪ Counts ▪ Nearest ▪ ...
TextStructure — parse text into its grammatical structure
DictionaryLookup — look up words in English and other dictionaries using string patterns
DictionaryWordQ — test if a word is a correctly spelled dictionary word
SpellingCorrectionList — list of spelling suggestions for misspelled words
Importing Data »
Import — import or "scrape" text from all standard formats
"HTML" ▪ "PDF" ▪ "RTF" ▪ "XML" ▪ "TeX" ▪ "Text" ▪ "String" ▪ ...
ResourceData — access textual data in the Wolfram Data Repository
WikipediaData — retrieve text from Wikipedia
WebSearch — integrated web search in all languages, with snippets etc.
Number Words
IntegerName — words for integers in many languages
Proper Names & Linguistic Entities »
GivenName Surname Person City Chemical Species ...
Interpreter ▪ SemanticInterpretation ▪ SemanticImportString
LLM-Based Linguistics »
LLMResourceFunction — use functions from the Wolfram Prompt Repository