[ < ] [ > ]   [ << ] [ Up ] [ >> ]         [Top] [Contents] [Index] [ ? ]

4.14 recode.scm

Festival doesn't support different character sets directly. But it is 8-bit clean and you can use whatever character coding you like if you can process it in the form of 8-bit characters. The recode.scm module offers the following functions to convert between different character sets:

recode string from-coding to-coding

Return given string, originally encoded in from-coding, recoded to to-coding.

recode-utf8->current string

Return given string, originally encoded in UTF-8, recoded to the coding of the current voice.

Before applying normal recoding this function translates strings as specified in recode-special-utf8-translations variable. This allows you to convert some unicode characters in a special way, e.g. to translate empty space to space (thus separating words around it). recode-special-utf8-translations contains lists of two elements, the converted substring and its translation.

The recoding functions use the iconv program and temporary files to convert between character sets. There is no known better way to do the conversions.


[ < ] [ > ]   [ << ] [ Up ] [ >> ]

This document was generated by Milan Zamazal on August, 11 2009 using texi2html 1.78.