HP-UX Reference (11i v2 04/09) - 1 User Commands A-M (vol 1)

e
eucset(1) eucset(1)
NAME
eucset - sets and gets code widths for ldterm
SYNOPSIS
eucset [-p]
eucset [[-c HP15-codeset ]or[
-c UTF8]or[-c GB18030]or[cswidth]]
DESCRIPTION
The
eucset command sets or gets (reports) the encoding and display widths of the Extended UNIX Code
(EUC), and UCS Transformation Format (UTF8), and GB18030 characters processed by the current input
terminal. EUC is an encoding method for codesets composed of single or multiple bytes. It permits appli-
cations and the terminal hardware to use the 7-bit US ASCII code and up to three single byte or multi-
byte code sets simultaneously.
The
eucset command without any options, first tries to set the codeset to one of the four HP15 codesets.
If unsuccessful, 7-bit US ASCII is used as the default codeset. This command must be used to specify any
other EUC codesets, whether they are single byte or multibyte. See the WARNINGS section, for special
warnings on the values of the cswidth argument.
For GB18030 or UTF8 setting, use the
-c option.
Options
The
eucset command recognizes the following options and arguments:
-p Displays the current settings of the EUC character widths for the terminal.
-c Sets the width to one of the four HP15 codesets, UTF8,orGB18030. The HP15 codesets
supported are SJIS, CCDC, GB, and BIG5.
EUC Code Set Classes
EUC divides codesets into four classes. Each codeset has two characteristics: the number of bytes for
encoding the characters in the codeset, and the number of display columns to display the characters in
the codeset. All characters within a codeset possess the same characteristics.
• Codeset 0 consists of all 7-bit, single byte ASCII characters. The most significant bit of each of
these characters is 0 (zero). Characters in codeset 0 require one byte for encoding, and occupy
one display column. These values are fixed for codeset 0 (zero). The 7-bit US ASCII code is the
primary EUC codeset, which is available to users without direct specification.
• Codeset 1 is a supplementary EUC codeset. Codeset 1 characters have an initial byte whose
most significant bit is 1. Characters in codeset 1 may require more than one byte for encoding,
and may require more than one display column. The
eucset command must be used to set the
characteristics for codeset 1.
• Codesets 2 and 3 are supplementary EUC codesets. Characters in these codesets have an initial
byte of SS2 or SS3, respectively. They require more than one byte for encoding, and may require
more than one display column. The
eucset command must be used to set the characteristics
for codesets 2 and 3.
The cswidth argument in the
eucset command line is a character string that describes the character
widths for codesets 1 through 3. This command does not allow the user to modify the settings for codeset
0. The character string is of the following format:
X1 [:Y1 ],X2[:Y2 ],X3[:Y3 ]
The value X1 is the number of bytes required to encode a character in codeset class 1. Y1 is the number
of display columns needed to display characters in this class. X2 is the number of bytes required to
encode a character in codeset 2, not counting the SS2 byte, and Y2 is the number of display columns for
codeset 2 characters. X3 is the number of bytes needed to encode characters in codeset 3, not counting
the SS3 byte, and Y3 is the number of display columns required for these characters. The values for the
column widths may be omitted if they are equal to the number of encoding bytes. If the encoding value of
any of the EUC codesets is set to
0 (zero), this indicates that the codeset does not exist. See the WARN-
INGS section for special warnings on the values of the cswidth argument.
If no cswidth argument is supplied, the
eucset command uses the value of the CSWIDTH environment
variable. If this variable is not present, the following default string is substituted:
HP-UX 11i Version 2: September 2004 − 1 − Hewlett-Packard Company Section 1−−265