RemoveDiacritics
RemoveDiacritics[string]
replaces characters in string that have diacritics by their base ASCII characters, when possible.
Details and Options
- With the option setting Language->lang, RemoveDiacritics removes diacritics that do not occur in the basic alphabet of the language lang.
- RemoveDiacritics automatically threads over lists.
Examples
open allclose allScope (2)
RemoveDiacritics automatically threads over lists:
RemoveDiacritics works on strings of different scripts:
Options (2)
Language (2)
With Languagelang, RemoveDiacritics will not remove diacritics that are part of the basic alphabet of lang:
By default, Language option is set to $Language. Changing it has the same effect as setting the option:
Properties & Relations (2)
For a specified language, applying RemoveDiacritics to the result of Alphabet should return the same list of letters:
Using AlphabeticOrder with IgnoreDiacritics->True is equivalent to using AlphabeticOrder after applying RemoveDiacritics:
Possible Issues (1)
RemoveDiacritics breaks ligatures in basic ASCII characters when the ligature is not part of the language:
If it is part of the basic alphabet of the language, it only removes the diacritics:
Text
Wolfram Research (2015), RemoveDiacritics, Wolfram Language function, https://reference.wolfram.com/language/ref/RemoveDiacritics.html (updated 2015).
CMS
Wolfram Language. 2015. "RemoveDiacritics." Wolfram Language & System Documentation Center. Wolfram Research. Last Modified 2015. https://reference.wolfram.com/language/ref/RemoveDiacritics.html.
APA
Wolfram Language. (2015). RemoveDiacritics. Wolfram Language & System Documentation Center. Retrieved from https://reference.wolfram.com/language/ref/RemoveDiacritics.html