Big5 |
BIG5
|
Traditional Chinese |
|
EUC-JP |
EUCJP
|
Japanese |
|
EUC-KR |
EUCKR
|
Korean |
|
GB18030 |
GB18030
|
Chinese |
|
IBM420 |
IBM420
|
Arabic |
|
IBM424 |
IBM424
|
Hebrew |
|
IBM949 |
IBM949
|
Korean |
|
ISO-2022-CN |
ISO2022CN
|
Simplified Chinese |
|
ISO-2022-JP |
ISO2022JP
|
Japanese |
|
ISO-2022-KR |
ISO2022KR
|
Korean |
|
ISO-8859-1 |
ISO88591
|
Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |
|
ISO-8859-2 |
ISO88592
|
Czech, Hungarian, Polish, Romanian |
|
ISO-8859-5 |
ISO88595
|
Russian |
|
ISO-8859-6 |
ISO88596
|
Arabic |
|
ISO-8859-7 |
ISO88597
|
Greek |
|
ISO-8859-8 |
ISO88598
|
Hebrew |
|
ISO-8859-9 |
ISO88599
|
Turkish |
|
ISO-8859-15 |
ISO885915
|
Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |
Identical to ISO-8859-1 except for 8 characters, including the Euro currency symbol. |
KOI8-R |
KOI8R
|
Russian |
|
Shift_JIS |
SHIFTJIS
|
Japanese |
|
UTF-8 |
UTF8
|
All languages |
For loading data from delimited files (CSV, TSV, etc.), UTF-8 is the default. . . For loading data from all other supported file formats (JSON, Avro, etc.), as well as unloading data, UTF-8 is the only supported character set. |
UTF-16 |
UTF16
|
All languages |
|
UTF-16BE |
UTF16BE
|
All languages |
|
UTF-16LE |
UTF16LE
|
All languages |
|
UTF-32 |
UTF32
|
All languages |
|
UTF-32BE |
UTF32BE
|
All languages |
|
UTF-32LE |
UTF32LE
|
All languages |
|
windows-874 |
WINDOWS874
|
Thai |
|
windows-949 |
WINDOWS949
|
Korean |
|
windows-1250 |
WINDOWS1250
|
Czech, Hungarian, Polish, Romanian |
|
windows-1251 |
WINDOWS1251
|
Russian |
|
windows-1252 |
WINDOWS1252
|
Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |
|
windows-1253 |
WINDOWS1253
|
Greek |
|
windows-1254 |
WINDOWS1254
|
Turkish |
|
windows-1255 |
WINDOWS1255
|
Hebrew |
|
windows-1256 |
WINDOWS1256
|
Arabic |
|