Encoding List
List of encoding methods available in DSG ruleset configuration
The following rules use the encodings listed here:
- Binary
- HTML Form Media Type
- Text
- ProtegrityDataProtection
Standard encoding list
Method | Description |
---|---|
ascii | English (646, us-ascii) |
base64 | Base64 multiline MIME conversion (the result always includes a trailing '\n') |
big5 | Traditional Chinese (big5-tw, csbig5) |
big5hkscs | Traditional Chinese (big5-hkscs, hkscs) |
bz2 | Compression using bz2 |
cp037 | English (IBM037, IBM039) |
cp424 | Hebrew (EBCDIC-CP-HE, IBM424) |
cp437 | English (437, IBM437) |
cp500 | Western Europe (EBCDIC-CP-BE, EBCDIC-CP-CH, IBM500) |
cp720 | Arabic (cp720) |
cp737 | Greek (cp737) |
cp775 | Baltic languages (IBM775) |
cp850 | Western Europe (850, IBM850) |
cp852 | Central and Eastern Europe (852, IBM852) |
cp855 | Bulgarian, Byelorussian, Macedonian, Russian, Serbian (855, IBM855) |
cp856 | Hebrew (cp856) |
cp857 | Turkish (857, IBM857) |
cp858 | Western Europe (858, IBM858) |
cp860 | Portuguese (860, IBM860) |
cp861 | Icelandic (861, CP-IS, IBM861) |
cp862 | Hebrew (862, IBM862) |
cp863 | Canadian (863, IBM863) |
cp864 | Arabic (IBM864) |
cp865 | Danish, Norwegian (865, IBM865) |
cp866 | Russian (866, IBM866) |
cp869 | Greek (869, CP-GR, IBM869) |
cp874 | Thai (cp874) |
cp875 | Greek (cp875) |
cp932 | Japanese (932, ms932, mskanji, ms-kanji) |
cp949 | Korean (949, ms949, uhc) |
cp950 | Traditional Chinese (950, ms950) |
cp1006 | Urdu (cp1006) |
cp1026 | Turkish (ibm1026) |
cp1140 | Western Europe (ibm1140) |
cp1250 | Central and Eastern Europe (windows-1250) |
cp1251 | Bulgarian, Byelorussian, Macedonian, Russian, Serbian (windows-1251) |
cp1252 | Western Europe (windows-1252) |
cp1253 | Greek (windows-1253) |
cp1254 | Turkish (windows-1254) |
cp1255 | Hebrew (windows-1255) |
cp1256 | Arabic (windows-1256) |
cp1257 | Baltic languages (windows-1257) |
cp1258 | Vietnamese (windows-1258) |
euc_jp | Japanese (eucjp, ujis, u-jis) |
euc_jis_2004 | Japanese (jisx0213, eucjis2004) |
euc_jisx0213 | Japanese (eucjisx0213) |
euc_kr | Korean (euckr, korean, ksc5601, ks_c-5601, ks_c-5601-1987, ksx1001, ks_x-1001) |
gb2312 | Simplified Chinese (chinese, csiso58gb231280, euc-cn, euccn, eucgb2312-cn, gb2312-1980, gb2312-80, iso-ir-58) |
gbk | Unified Chinese (936, cp936, ms936) |
gb18030 | Unified Chinese (gb18030-2000) |
hex | Hexadecimal representation conversion (two digits per byte) |
hz | Simplified Chinese (hzgb, hz-gb, hz-gb-2312) |
iso2022_jp | Japanese (csiso2022jp, iso2022jp, iso-2022-jp) |
iso2022_jp_1 | Japanese (iso2022jp-1, iso-2022-jp-1) |
iso2022_jp_2 | Japanese, Korean, Simplified Chinese, Western Europe, Greek (iso2022jp-2, iso-2022-jp-2) |
iso2022_jp_2004 | Japanese (iso2022jp-2004, iso-2022-jp-2004) |
iso2022_jp_3 | Japanese (iso2022jp-3, iso-2022-jp-3) |
iso2022_jp_ext | Japanese (iso2022jp-ext, iso-2022-jp-ext) |
iso2022_kr | Korean (csiso2022kr, iso2022kr, iso-2022-kr) |
latin_1 | Western Europe (iso-8859-1, iso8859-1, 8859, cp819, latin, latin1, L1) |
iso8859_2 | Central and Eastern Europe (iso-8859-2, latin2, L2) |
iso8859_3 | Esperanto, Maltese (iso-8859-3, latin3, L3) |
iso8859_4 | Baltic languages (iso-8859-4, latin4, L4) |
iso8859_5 | Bulgarian, Byelorussian, Macedonian, Russian, Serbian (iso-8859-5, cyrillic) |
iso8859_6 | Arabic (iso-8859-6, arabic) |
iso8859_7 | Greek (iso-8859-7, greek, greek8) |
iso8859_8 | Hebrew (iso-8859-8, hebrew) |
iso8859_9 | Turkish (iso-8859-9, latin5, L5) |
iso8859_10 | Nordic languages (iso-8859-10, latin6, L6) |
iso8859_11 | Thai languages (iso-8859-11, thai) |
iso8859_13 | Baltic languages (iso-8859-13, latin7, L7) |
iso8859_14 | Celtic languages (iso-8859-14, latin8, L8) |
iso8859_15 | Western Europe (iso-8859-15, latin9, L9) |
iso8859_16 | South-Eastern Europe (iso-8859-16, latin10, L10) |
johab | Korean (cp1361, ms1361) |
koi8_r | Russian |
koi8_u | Ukrainian |
mac_cyrillic | Bulgarian, Byelorussian, Macedonian, Russian, Serbian (maccyrillic) |
mac_greek | Greek (macgreek) |
mac_iceland | Icelandic (maciceland) |
mac_latin2 | Central and Eastern Europe (maclatin2, maccentraleurope) |
mac_roman | Western Europe (macroman) |
mac_turkish | Turkish (macturkish) |
ptcp154 | Kazakh (csptcp154, pt154, cp154, cyrillic-asian) |
shift_jis | Japanese (csshiftjis, shiftjis, sjis, s_jis) |
shift_jis_2004 | Japanese (shiftjis2004, sjis_2004, sjis2004) |
shift_jisx0213 | Japanese (shiftjisx0213, sjisx0213, s_jisx0213) |
utf_32 | Unicode Transformation Format (U32, utf32) |
utf_32_be | Unicode Transformation Format (big endian) |
utf_32_le | Unicode Transformation Format (little endian) |
utf_16 | Unicode Transformation Format (U16, utf16) |
utf_16_be | Unicode Transformation Format (big endian, BMP only) |
utf_16_le | Unicode Transformation Format (little endian, BMP only) |
utf_7 | Unicode Transformation Format (U7, unicode-1-1-utf-7) |
utf_8 | Unicode Transformation Format (U8, UTF, utf8) |
utf_8_sig | Unicode Transformation Format (with BOM signature) |
zlib | Compression using zlib (zip) |
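
The method names in this table coincide with the codec names in Python's standard library, so Python's `codecs` module can give a rough idea of what a few of them produce. The sketch below rests on that assumption. It is not taken from DSG, and the sample values (`text`, `raw`) are made up for illustration, so DSG's actual output may differ.

```python
import codecs

text = "café"  # illustrative sample value, not from the DSG documentation

# Character encodings turn text into bytes; the same string gives
# different byte sequences under different methods.
latin1_bytes = text.encode("latin_1")      # one byte per character
utf8_bytes = text.encode("utf_8")          # 'é' becomes two bytes
utf8_sig_bytes = text.encode("utf_8_sig")  # UTF-8 preceded by a BOM signature

# Byte-to-byte transforms (base64, hex, zlib, bz2) go through codecs.encode.
raw = b"hello"
b64 = codecs.encode(raw, "base64")  # result ends with a trailing newline
hexed = codecs.encode(raw, "hex")   # two hexadecimal digits per byte

# Compression codecs are reversible with codecs.decode.
packed = codecs.encode(raw * 100, "zlib")
restored = codecs.decode(packed, "zlib")

print(latin1_bytes, utf8_bytes, utf8_sig_bytes)
print(b64, hexed, len(packed), restored == raw * 100)
```

The base64 result illustrates the trailing newline noted in the table, and the hex result shows the two-digits-per-byte expansion.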
External encoding list
- Base64
- HTML Encoding
- JSON Escape
- URI Encoding
- URI Encoding Plus
- XML Encoding
- Quoted Printable
- SQL Escape
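
This page does not define the external encodings, so the following is only a sketch of what encodings with these names conventionally produce. It uses Python standard-library functions as stand-ins, which is an assumption and not DSG's implementation, and the exact set of characters DSG escapes may differ. SQL Escape is omitted because the standard library has no direct equivalent.

```python
import html
import json
import quopri
from urllib.parse import quote, quote_plus
from xml.sax.saxutils import escape as xml_escape

# HTML encoding: markup characters become entities.
html_encoded = html.escape("Tom & <b>Jerry</b>")

# JSON escape: quotes and control characters are backslash-escaped.
json_escaped = json.dumps('say "hi"\n')

# URI encoding keeps '/' and writes a space as %20;
# the "plus" variant writes a space as '+' and also escapes '/'.
uri_encoded = quote("a b&c=d/e")
uri_plus_encoded = quote_plus("a b&c=d/e")

# XML encoding: '&', '<' and '>' become entities.
xml_encoded = xml_escape("a < b & c")

# Quoted-printable: non-ASCII bytes become =XX sequences.
qp_encoded = quopri.encodestring("café".encode("utf_8"))

for value in (html_encoded, json_escaped, uri_encoded,
              uri_plus_encoded, xml_encoded, qp_encoded):
    print(value)
```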
Proprietary encoding list
- Base128
- Unicode
- CJK
- High ASCII