Character Encoding References

ASCII:

This is the character set used most often by C programmers and it is the basic character set implemented by the char data type.

https://www.cs.cmu.edu/~pattis/15-1XX/common/handouts/ascii.html

UNICODE:

A multi-byte character encoding method that incorporates the language/character set into the format. UNICODE was made popular by Web programming where compatibility between countries is important. C extensions make it possible to work with UNICODE in C however there is not a lot of use in the embedded domain.

http://www.tamasoft.co.jp/en/general-info/unicode.html

http://www.unicode.org/versions/Unicode7.0.0/

EBSIDIC:

The original 8 bit character set widely used are large scale computers popular before the PC and C.

https://www.pcmag.com/encyclopedia/term/ebcdic-chart

SIXBIT:

Used on 36 bit computers so 6 characters could cleanly be packed into one memory location. Not so popular now since 32 and 64 bit systems are what we have today.

https://wiki2.org/en/Six-bit_character_code

BAUDOT:

A 5 bit character (very limited character set) used by old teletypes and paper tape.

https://cs.stanford.edu/people/eroberts/courses/soco/projects/2008-09/colossus/baudot.html