Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GB2312 is not in any of its interchangable encodings #17

Open
Artoria2e5 opened this issue Dec 1, 2016 · 1 comment
Open

GB2312 is not in any of its interchangable encodings #17

Artoria2e5 opened this issue Dec 1, 2016 · 1 comment

Comments

@Artoria2e5
Copy link
Collaborator

GB2312, which is just a table without binary numbers, have various interchangable encodings like EUC-CN (the common "gb2312" encoding which looks like GBK) and HZ (which uses escapes). Bsdconv's current hex mappings should be all added with 0x8080 to generate the actual hex digits in EUC-CN.

@buganini
Copy link
Owner

buganini commented Dec 1, 2016

Current table for GB2312 came from http://glyph.iso10646hk.net/download/GB2312.TXT
And table for GBK came from http://icu-project.org/repos/icu/data/trunk/charset/source/gb18030/gbkuni30.txt
gbk decoder is probably problematic since it's transposed from encoder..
I forgot how I got them... I only used them to convert some short articles and id3 tags.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants