Base16384

A unicode-based encoding scheme that presents binary data (sequence of 8-bit bytes) in sequences of 14-bit printable Chinese characters. It saves 17% space compared to base64.

Inspired by fumiama/base16384.

Description

Base16384 uses 16384 (2¹⁴) Chinese characters (from \u4E00 to \u8DFF) to represent binary data.

If the length of the binary data is not a multiple of 7, we will add a \u3D0x (where x is the remainder modulo 7) after the output.

Comparison

	Base64	Base16384
Overhead	33%	14%
Charset	`[0-9a-zA-Z+/]`	`[\u4E00-\u8DFF]`
Example	`RXhhbXBsZQ==`	`彞吖菁穥㴀`

Usage

import { decode, encode } from 'base16384'

const buffer = encode('Example') // Uint16Array
new TextDecoder().decode(decode(buffer)) // 'Example'

API

encode(data)

data: string | Uint8Array original binary data
returns: Uint16Array base16384-encoded data

Encode binary data to base16384.

decode(data)

data: string | Uint16Array base16384-encoded data
returns: Uint8Array original binary data

Decode base16384 to binary data.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github/workflows		.github/workflows
src		src
tests		tests
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
package.json		package.json
readme.md		readme.md
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Base16384

Description

Comparison

Usage

API

encode(data)

decode(data)

About

Releases

Packages

Languages

License

shigma/base16384.js

Folders and files

Latest commit

History

Repository files navigation

Base16384

Description

Comparison

Usage

API

encode(data)

decode(data)

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages