Multibyte characters get corrupted when KEYBD trap is set #197

siteshwar · 2017-12-07T15:38:12Z

Steps to reproduce:

Add a KEYBD trap to /etc/kshrc (if one does not already exist):

# key bindings - make Delete, Home, End,... work
keybd_trap () {
  case ${.sh.edchar} in
    $'\e[1~') .sh.edchar=$'\001';; # Home = beginning-of-line
    $'\e[F')  .sh.edchar=$'\005';; # End = end-of-line
    $'\e[5~') .sh.edchar=$'\e>';; # PgUp = history-previous
    $'\e[6~') .sh.edchar=$'\e<';; # PgDn = history-next
    $'\e[3~') .sh.edchar=$'\004';; # Delete = delete-char
  esac
}
trap keybd_trap KEYBD

Add a test user and switch to it:

useradd test -s /bin/ksh
su - test

Type あいうえお from keyboard

Actual results:

BDFHJ is displayed.

Expected results:

あいうえお should be displayed.
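The garbled output appears consistent with each character's Unicode code point being cut down to its lowest byte: あ is U+3042 and 0x42 is ASCII B, い is U+3044 and 0x44 is D, and so on. The following is a minimal standalone sketch of that arithmetic, not ksh code; the helper name low_bytes is hypothetical and exists only for illustration.

```c
#include <stddef.h>

/*
 * Hypothetical helper (illustration only, not part of ksh):
 * store only the low 8 bits of each code point in out, the way a
 * wide value would be mangled if it were assigned to a char.
 * out must have room for n + 1 bytes.
 */
static void low_bytes(const unsigned int *cp, size_t n, char *out)
{
    for (size_t i = 0; i < n; i++)
        out[i] = (char)(cp[i] & 0xFF);
    out[n] = '\0';
}

/* Code points for あ い う え お */
static const unsigned int hiragana_aiueo[5] = {
    0x3042, 0x3044, 0x3046, 0x3048, 0x304A
};
```

Calling low_bytes(hiragana_aiueo, 5, out) with a 6-byte buffer leaves the string BDFHJ in out, which matches the actual results reported above.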

There is some discussion of this in the abandoned patch #83.

Related bug: https://bugzilla.redhat.com/show_bug.cgi?id=1503922


k-takahagi · 2017-12-12T05:48:40Z

Steps to reproduce:

rlogin localhost
Type あいうえお from keyboard

Actual results:

^A^Aいうえお is displayed.

Expected results:

あいうえお should be displayed.

Is this the same bug?

$ uname -s -v
SunOS 11.3
$

siteshwar · 2017-12-12T05:52:21Z

@k-takahagi Do you have a KEYBD trap set? It should not reproduce when the KEYBD trap is not set.

The KEYBD trap should now be fully functional for UTF-8 and other multibyte locales. Thanks to Johnothan King for finding this fix!

Analysis: The KEYBD trap code processes character code points stored in e_lbuf by ed_read(). But shell variables store bytes, not characters. So, in UTF-8 locales for example, the Unicode code points need to be converted to multibyte UTF-8 encoding. This is needed to calculate the length of each encoded character in bytes (which fixes the corruption issue) and for keytrap() to store its UTF-8 representation in ${.sh.edchar}.

src/cmd/ksh93/edit/edit.c: ed_getchar():
- Remove the workaround from the referenced commit.
- Use mbconv to convert input code points to bytes before adding them to inbuff, a char array that is passed on to keytrap().

Related: https://bugzilla.redhat.com/show_bug.cgi?id=1503922
Related: att#197
Related: #307
Resolves: #460

Co-authored-by: Johnothan King <[email protected]>
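The conversion step described in the analysis above can be sketched in standalone C. This is not the ksh implementation and does not reproduce mbconv; utf8_encode is a hypothetical helper that hard-codes UTF-8, whereas the actual fix is meant to work for other multibyte locales as well. It shows the two things the analysis says are needed: the encoded bytes and their length.

```c
#include <stddef.h>

/*
 * Hypothetical helper (illustration only, not ksh's mbconv):
 * encode one Unicode code point as UTF-8 into buf, which must hold
 * at least 4 bytes. Returns the number of bytes written, or 0 if
 * cp is a surrogate or lies outside the Unicode range.
 */
static size_t utf8_encode(unsigned long cp, unsigned char *buf)
{
    if (cp < 0x80) {                       /* 1 byte: plain ASCII */
        buf[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800) {                      /* 2 bytes */
        buf[0] = (unsigned char)(0xC0 | (cp >> 6));
        buf[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {                    /* 3 bytes */
        if (cp >= 0xD800 && cp <= 0xDFFF)
            return 0;                      /* surrogates are not characters */
        buf[0] = (unsigned char)(0xE0 | (cp >> 12));
        buf[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        buf[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp < 0x110000) {                   /* 4 bytes */
        buf[0] = (unsigned char)(0xF0 | (cp >> 18));
        buf[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        buf[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        buf[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0;                              /* beyond U+10FFFF */
}
```

For あ (U+3042) this yields the three bytes E3 81 82, so a byte-oriented consumer such as a shell variable would need to account for a length of 3 rather than 1.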

siteshwar added the bug label Dec 7, 2017

JohnoKing mentioned this issue May 14, 2021

Multibyte characters get corrupted when KEYBD trap is set ksh93/ksh#307

Closed
