encoding-samples
v0.0.3
Published
Markdown documents saved with different encodings that contain specific character tables and text examples
Readme
Encoding samples
Markdown documents saved with different encodings that contain specific character tables and text examples. Use them to test encoding/decoding algorithms. Each document is also saved in generally accepted UTF-8 encoding so you can test conversion by loading both versions and comparing resulting strings.
Encodings
- KOI8-R
- KOI8-RU
- KOI8-T
- KOI8-U
- Windows-1251
- Windows-1252
- more to add...
Additional binary file contains all possible bytes from 0 to 255 to test loading file as binary string.
Sources
- http://clagnut.com/blog/2380
- https://ru.wikipedia.org/wiki/%D0%9A%D0%9E%D0%98-8
- https://en.wikipedia.org/wiki/Tajik_alphabet
- https://be.wikipedia.org/wiki/%D0%9F%D0%B0%D0%BD%D0%B3%D1%80%D0%B0%D0%BC%D0%B0
- https://ru.wikipedia.org/wiki/Windows-1251
- https://scratchpad.fandom.com/wiki/Character_Encoding_Recommendation_for_Languages
- https://ru.wikipedia.org/wiki/ISO_8859-1
Contributing
Feel free to add samples for uncovered encodings and suggest improvements and fixes to existing samples.
