unidecode - stripping down text to ASCII in a sane way

published Feb 27, 2012 01:29 by admin (last modified Feb 27, 2012 01:29)


A Python module that turns "Jörgen" into "Jorgen" and "mañana" into "manana", and so on.

Note that this module generally produces better results than simply stripping accents from characters (which can be done in Python with the standard-library unicodedata module). It is based on hand-tuned character mappings that also contain, for example, ASCII approximations for symbols and non-Latin alphabets.
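
To make the comparison concrete, here is a rough sketch of the "simply stripping accents" approach using only the standard library, followed by the equivalent unidecode call. The helper name strip_accents is made up for this illustration, and the unidecode part assumes the third-party Unidecode package is installed (it is skipped otherwise).

```python
import unicodedata


def strip_accents(text):
    """Naive approach: decompose characters, drop combining marks,
    then discard whatever is still not ASCII.
    (strip_accents is a hypothetical helper, not part of unidecode.)"""
    decomposed = unicodedata.normalize("NFKD", text)
    without_marks = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    return without_marks.encode("ascii", "ignore").decode("ascii")


print(strip_accents("Jörgen"))  # Jorgen
print(strip_accents("mañana"))  # manana
print(strip_accents("Straße"))  # Strae  (ß has no decomposition, so it is lost)

# unidecode is a third-party package; assumed installed via `pip install Unidecode`.
try:
    from unidecode import unidecode
except ImportError:
    unidecode = None

if unidecode is not None:
    print(unidecode("Jörgen"))  # Jorgen, as described in the post
    print(unidecode("Straße"))  # uses a mapping table instead of dropping ß
```

The naive version works for accented Latin letters but silently drops anything without a decomposition, which is where a mapping table of the kind unidecode uses should do better.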



Read more: Unidecode 0.04.9 : Python Package Index