ETC N413

,62,(&-7&6&1
,62,(&-7&6&:*1
'DWH ,62,(&-7&6&:*
ELWDQGELWFRGHVDQGWKHLUH[WHQVLRQ
6(&5(7$5,$7(/27
'2& 7<3( )LQDO7H[W6XEPLWWHGIRU,6SXEOLFDWLRQ
7,7/( )LQDO7H[WRI',6,QIRUPDWLRQ7HFKQRORJ\ELW
VLQJOHE\WHFRGHGJUDSKLFFKDUDFWHUVHWV3DUW/DWLQ
DOSKDEHW1R
6285&( 0U.,/DUVVRQ3URMHFW(GLWRU
352-(&7
-7&
67$786 ,QDFFRUGDQFHZLWK5HVROXWLRQ0DGRSWHGDWWKHWK
3OHQDU\PHHWLQJRI6&KHOGLQ&UHWH*UHHFHWKLVGRFXPHQW
LVVXEPLWWHGWR,77)WRJHWKHUZLWK'LVSRVLWLRQRI&RPPHQWV
5HSRUWFRQWDLQHGLQ1IRUSXEOLFDWLRQSURFHVVLQJ
$&7,21 ,' )<,
'8( '$7( ',675,%87,21 32DQG/0HPEHUVRI,62,(&-7&6&
:*&RQYHQHUV6HFUHWDULDWV
:*0HPEHUV
,62,(&-7&6HFUHWDULDW
,62,(&,77)
0(',80 3
12 2) 3$*(6 &RQWDFW 6HFUHWDULDW ,62,(& -7& 6& :* (/27 0UV .9HOOL DFWLQJ
$FKDUQRQ .DWR 3DWLVVLD $7+(16 ± *5((&(
7HO )D[ (PDLO NNE#HORWJU
&RQWDFW &RQYHQRU ,62,(& -7& 6& :* 0U (0HODJUDNLV
$FKDUQRQ .DWR 3DWLVVLD $7+(16 ± *5((&(
7HO )D[ (PDLO HHP#HORWJU
ISO/IEC 8859-4:1997 (E)
TE
X
T
© ISO/IEC
TITLE PAGE
19
97
-
11
-1
1
FI
N
A
L
To be provided by ITTF
ISO/IEC 8859-4:1997 (E)
© ISO/IEC
Contents
Page
Foreword . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
1
Scope . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2
Conformance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
3
Normative references
4
Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
5
Notation, code table and character names . . . . . . . . . . . 2
6
Specification of the coded character set . . . . . . . . . . . . . 3
7
Identification of the character set . . . . . . . . . . . . . . . . . . 6
TE
X
T
........................... 1
Annex A: Coverage of languages by parts 1 to 10 of
ISO/IEC 8859 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Annex B: Main differences between the First edition and
this Second edition of this part of ISO/IEC 8859 . 9
. . . . . . . . . . . . . . . . . . . . . . . . . . . 10
19
97
-
11
-1
1
FI
N
A
L
Annex C: Bibliography
© ISO/IEC 1997
All rights reserved. Unless otherwise specified, no part of this publication may be
reproduced or utilized in any form or by any means, electronic or mechanical,
including photocopying and microfilm, without permission in writing from the publisher.
ISO/IEC Copyright Office • Case Postale 56 • CH-1211 Genève 20 • Switzerland
ii
© ISO/IEC
ISO/IEC 8859-4:1997 (E)
Foreword
TE
X
T
ISO (the International Organization for Standardization) and IEC (the
International Electrotechnical Commission) form the specialized
system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of
International Standards through technical committees established by
the respective organization to deal with particular fields of technical
activity. ISO and IEC technical committees collaborate in fields of
mutual interest. Other international organizations, governmental and
nongovernmental, in liaison with ISO and IEC, also take part in the
work.
In the field of information technology, ISO and IEC have established
a joint technical committee, ISO/IEC JTC1. Draft International
Standards adopted by the joint technical committee are circulated to
national bodies for voting. Publication as an International Standard
requires approval by at least 75% of the national bodies casting a
vote.
International Standard ISO/IEC 8859-4 was prepared by Joint
Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 2, Character sets and information coding.
ISO/IEC 8859 consists of the following parts, under the general title
Information technology – 8-bit single-byte coded graphic character
sets:
Part 1:
Latin alphabet No. 1
–
Part 2:
Latin alphabet No. 2
–
Part 3:
Latin alphabet No. 3
–
Part 4:
Latin alphabet No. 4
–
Part 5:
Latin/Cyrillic alphabet
–
Part 6:
Latin/Arabic alphabet
–
Part 7:
Latin/Greek alphabet
–
Part 8:
Latin/Hebrew alphabet
–
Part 9:
Latin alphabet No. 5
–
Part 10: Latin alphabet No. 6
Annexes A to C of this part of ISO/IEC 8859 are for information only.
19
97
-
11
-1
1
FI
N
A
L
–
iii
ISO/IEC 8859-4:1997 (E)
© ISO/IEC
Introduction
19
97
-
11
-1
1
FI
N
A
L
TE
X
T
ISO/IEC 8859 consists of several parts. Each part specifies a set of
up to 191 graphic characters and the coded representation of these
characters by means of a single 8-bit byte. Each set is intended for
use for a particular group of languages.
iv
INTERNATIONAL STANDARD © ISO/IEC
ISO/IEC 8859-4:1997 (E)
Information technology –
8-bit single-byte coded graphic character sets –
Part 4: Latin alphabet No. 4
2.2 Conformance of devices
This part of ISO/IEC 8859 specifies a set of 191
coded graphic characters identified as Latin
alphabet No. 4.
A device is in conformance with this part of
ISO/IEC 8859 if it conforms to the requirements of
2.2.1, and either or both of 2.2.2 and 2.2.3. A claim
of conformance shall identify the document which
contains the description specified in 2.2.1.
TE
X
T
1 Scope
This set of coded graphic characters is intended for
use in data and text processing applications and
also for information interchange.
The set contains graphic characters used for
general purpose applications in typical office
environments in at least the following languages:
Danish, English, Estonian, Finnish, German,
Greenlandic, Latin, Latvian, Lithuanian, Norwegian,
Sámi (but see Annex A.1, Notes), Slovene and
Swedish.
L
This set of coded graphic characters may be
regarded as a version of an 8-bit code according to
ISO/IEC 2022 or ISO/IEC 4873 at level 1.
A
This part of ISO/IEC 8859 may not be used in
conjunction with any other parts of ISO/IEC 8859.
If coded characters from more than one part are to
be used together, by means of code extension
techniques, the equivalent coded character sets
from ISO/IEC 10367 should be used instead within
a version of ISO/IEC 4873 at level 2 or level 3.
FI
N
The coded characters in this set may be used in
conjunction with coded control functions selected
from ISO/IEC 6429. However, control functions are
not used to create composite graphic symbols from
two or more graphic characters (see clause 6).
19
97
-
11
-1
1
NOTE – ISO/IEC 8859 is not intended for use with
Telematic services defined by ITU-T. If information coded
according to ISO/IEC 8859 is to be transferred to such
services, it will have to conform to the requirements of
those services at the access-point.
2 Conformance
2.1 Conformance of information interchange
A coded-character-data-element (CC-data-element)
within coded information for interchange is in
conformance with this part of ISO/IEC 8859 if all the
coded representations of graphic characters within
that CC-data-element conform to the requirements
of clause 6.
2.2.1 Device description
A device that conforms to this part of ISO/IEC 8859
shall be the subject of a description that identifies
the means by which the user may supply characters
to the device, or may recognize them when they are
made available to him, as specified respectively in
2.2.2 and 2.2.3.
2.2.2 Originating devices
An originating device shall allow its user to supply
any sequence of characters from those specified in
clause 6, and shall be capable of transmitting their
coded representations within a CC-data-element.
2.2.3 Receiving devices
A receiving device shall be capable of receiving and
interpreting any coded representations of characters
that are within a CC-data-element, and that conform
to clause 6, and shall make the corresponding
characters available to its user in such a way that
the user can identify them from among those
specified there, and can distinguish them from each
other.
3 Normative references
The following standards contain provisions which,
through reference in this text, constitute provisions
of this part of ISO/IEC 8859. At the time of publication, the editions indicated were valid. All standards
are subject to revision, and parties to agreements
based on this part of ISO/IEC 8859 are encouraged
to investigate the possibility of applying the most
recent editions of the standards indicated below.
Members of IEC and ISO maintain registers of
currently valid International Standards.
1
ISO/IEC 8859-4:1997 (E)
© ISO/IEC
ISO/IEC 2022:1994,
Information technology –
Character code structure and extension techniques.
ISO/IEC 4873:1991,
Information technology –
ISO 8-bit code for information interchange –
Structure and rules for implementation.
ISO/IEC 8824-1:1995, Information technology –
Abstract Syntax Notation One (ASN.1): Specification of basic notation.
The bit combinations may be interpreted to represent
numbers in binary notation by attributing the
following weights to the individual bits:
Bit
Weight
b8
b7
b6
b5
b4
b3
b2
b1
128
64
32
16
8
4
2
1
For the purposes of this part of ISO/IEC 8859 the
following definitions apply:
Using these weights, the bit combinations are
identified by notations of the form xx/yy, where xx
and yy are numbers in the range 00 to 15. The
correspondence between the notations of the form
xx/yy and the bit combinations consisting of the bits
b 8 to b1 is as follows:
4.1 bit combination: An ordered set of bits used
for the representation of characters.
– xx is the number represented by b 8, b 7, b 6 and b 5
where these bits are given the weights 8, 4, 2, and
1 respectively.
TE
X
T
4 Definitions
4.2 byte: A bit string that is operated upon as a unit.
4.3 character: A member of a set of elements
used for the organization, control, or representation
of data.
4.4 code table: A table showing the characters
allocated to each bit combination in a code.
4.5 coded character set; code:
A set of
unambiguous rules that establishes a character set
and the one-to-one relationship between the
characters of the set and their bit combinations.
A
L
4.6 coded-character-data-element (CC-dataelement): An element of interchanged information
that is specified to consist of a sequence of coded
representations of characters, in accordance with
one or more identified standards for coded
character sets.
N
4.7 graphic character: A character, other than a
control function, that has a visual representation
normally handwritten, printed or displayed, and that
has a coded representation consisting of one or
more bit combinations.
NOTE – In ISO/IEC 8859 a single bit combination is used
to represent each character.
FI
4.8 graphic symbol: A visual representation of a
graphic character or of a control function.
11
-1
1
4.9 position: That part of a code table identified
by its column and row coordinates.
19
97
-
5 Notation, code table, and names
5.1 Notation
The bits of the bit combinations of the 8-bit code are
identified by b 8, b7, b 6, b 5, b 4, b 3, b2, and b1, where
b 8 is the highest-order, or most-significant bit and b 1
is the lowest-order, or least-significant bit.
2
– yy is the number represented by b 4, b 3, b 2 and b 1
where these bits are given the weights 8, 4, 2, and
1 respectively.
The bit combinations are also identified by notations
of the form hk, where h and k are numbers in the
range 0 to F in hexadecimal notation. The number
h is the same as the number xx described above,
and the number k the same as the number yy
described above.
5.2 Layout of the code table
An 8-bit code table consists of 256 positions
arranged in 16 columns and 16 rows. The columns
and the rows are numbered 00 to 15. In hexadecimal notation the columns and the rows are
numbered 0 to F.
The code table positions are identified by notations
of the form xx/yy, where xx is the column number
and yy is the row number. The column and row
numbers are shown at the top and left edges of the
table respectively. The code table positions are
also identified by notations of the form hk, where h
is the column number and k is the row number in
hexadecimal notation.
The column and row
numbers are shown at the bottom and right edges of
the table respectively.
The positions of the code table are in one-to-one
correspondence with the bit combinations of the
code. The notation of a code table position, of the
form xx/yy, or of the form hk, is the same as that of
the corresponding bit combination.
5.3 Names and meanings
This part of ISO/IEC 8859 assigns a unique name
and a unique identifier to each graphic character.
These names and identifiers have been taken from
© ISO/IEC
ISO/IEC 8859-4:1997 (E)
ISO/IEC 10646-1 (E). This part of ISO/IEC 8859
also specifies an acronym for each of the characters
SPACE, NO-BREAK SPACE and SOFT HYPHEN.
For acronyms only Latin capital letters A to Z are
used. It is intended that the acronyms be retained in
all translations of the text.
Except for SPACE (SP), NO-BREAK SPACE
(NBSP) and SOFT HYPHEN (SHY), this part of
ISO/IEC 8859 does not define and does not restrict
the meanings of graphic characters.
5.3.1 SPACE (SP)
A graphic character the visual representation of
which consists of the absence of a graphic symbol.
5.3.2 NO-BREAK SPACE (NBSP)
L
A graphic character the visual representation of
which consists of the absence of a graphic symbol,
for use when a line break is to be prevented in the
text as presented.
5.3.3 SOFT HYPHEN (SHY)
A
A graphic character that is imaged by a graphic
symbol identical with, or similar to, that representing
HYPHEN, for use when a line break has been
established within a word.
N
6 Specification of the coded character set
FI
This part of ISO/IEC 8859 specifies 191 characters
allocated to the bit combinations of the code table
(table 2). None of these characters are combining
characters.
NOTE – Combining characters are described in ISO/IEC
2022:1994 subclause 6.3.3.
19
97
-
11
-1
1
Control functions, such as BACKSPACE or
CARRIAGE RETURN, shall not be used to create
composite graphic symbols, which are made up
from the graphic representations of two or more
characters.
6.1 Characters of the set and their coded
representation
See table 1.
Bit
combi- Hex Identifier Name
nation
02/00
02/01
02/02
02/03
02/04
02/05
02/06
02/07
02/08
02/09
02/10
02/11
02/12
02/13
02/14
02/15
03/00
03/01
03/02
03/03
03/04
03/05
03/06
03/07
03/08
03/09
03/10
03/11
03/12
03/13
03/14
03/15
04/00
04/01
04/02
04/03
04/04
04/05
04/06
04/07
04/08
04/09
04/10
04/11
04/12
04/13
04/14
04/15
05/00
05/01
05/02
05/03
05/04
05/05
05/06
05/07
05/08
05/09
05/10
05/11
05/12
05/13
05/14
05/15
20
21
22
23
24
25
26
27
28
29
2A
2B
2C
2D
2E
2F
30
31
32
33
34
35
36
37
38
39
3A
3B
3C
3D
3E
3F
40
41
42
43
44
45
46
47
48
49
4A
4B
4C
4D
4E
4F
50
51
52
53
54
55
56
57
58
59
5A
5B
5C
5D
5E
5F
U+0020
U+0021
U+0022
U+0023
U+0024
U+0025
U+0026
U+0027
U+0028
U+0029
U+002A
U+002B
U+002C
U+002D
U+002E
U+002F
U+0030
U+0031
U+0032
U+0033
U+0034
U+0035
U+0036
U+0037
U+0038
U+0039
U+003A
U+003B
U+003C
U+003D
U+003E
U+003F
U+0040
U+0041
U+0042
U+0043
U+0044
U+0045
U+0046
U+0047
U+0048
U+0049
U+004A
U+004B
U+004C
U+004D
U+004E
U+004F
U+0050
U+0051
U+0052
U+0053
U+0054
U+0055
U+0056
U+0057
U+0058
U+0059
U+005A
U+005B
U+005C
U+005D
U+005E
U+005F
SPACE
EXCLAMATION MARK
QUOTATION MARK
NUMBER SIGN
DOLLAR SIGN
PERCENT SIGN
AMPERSAND
APOSTROPHE
LEFT PARENTHESIS
RIGHT PARENTHESIS
ASTERISK
PLUS SIGN
COMMA
HYPHEN-MINUS
FULL STOP
SOLIDUS
DIGIT ZERO
DIGIT ONE
DIGIT TWO
DIGIT THREE
DIGIT FOUR
DIGIT FIVE
DIGIT SIX
DIGIT SEVEN
DIGIT EIGHT
DIGIT NINE
COLON
SEMICOLON
LESS-THAN SIGN
EQUALS SIGN
GREATER-THAN SIGN
QUESTION MARK
COMMERCIAL AT
LATIN CAPITAL LETTER A
LATIN CAPITAL LETTER B
LATIN CAPITAL LETTER C
LATIN CAPITAL LETTER D
LATIN CAPITAL LETTER E
LATIN CAPITAL LETTER F
LATIN CAPITAL LETTER G
LATIN CAPITAL LETTER H
LATIN CAPITAL LETTER I
LATIN CAPITAL LETTER J
LATIN CAPITAL LETTER K
LATIN CAPITAL LETTER L
LATIN CAPITAL LETTER M
LATIN CAPITAL LETTER N
LATIN CAPITAL LETTER O
LATIN CAPITAL LETTER P
LATIN CAPITAL LETTER Q
LATIN CAPITAL LETTER R
LATIN CAPITAL LETTER S
LATIN CAPITAL LETTER T
LATIN CAPITAL LETTER U
LATIN CAPITAL LETTER V
LATIN CAPITAL LETTER W
LATIN CAPITAL LETTER X
LATIN CAPITAL LETTER Y
LATIN CAPITAL LETTER Z
LEFT SQUARE BRACKET
REVERSE SOLIDUS
RIGHT SQUARE BRACKET
CIRCUMFLEX ACCENT
LOW LINE
TE
X
T
This part of ISO/IEC 8859 specifies a graphic
symbol for each graphic character. This symbol is
shown in the corresponding position of the code
table. However, this part, or any other part, of
ISO/IEC 8859 does not specify a particular style or
font design for imaging graphic characters. Annex
B of ISO/IEC 10367 gives further information on this
subject.
Table 1 – Character set, coded representation
3
ISO/IEC 8859-4:1997 (E)
© ISO/IEC
Table 1 (continued)
Bit
combi- Hex Identifier
nation
Bit
combi- Hex Identifier
nation
Name
60
61
62
63
64
65
66
67
68
69
6A
6B
6C
6D
6E
6F
70
71
72
73
74
75
76
77
78
79
7A
7B
7C
7D
7E
U+0060
U+0061
U+0062
U+0063
U+0064
U+0065
U+0066
U+0067
U+0068
U+0069
U+006A
U+006B
U+006C
U+006D
U+006E
U+006F
U+0070
U+0071
U+0072
U+0073
U+0074
U+0075
U+0076
U+0077
U+0078
U+0079
U+007A
U+007B
U+007C
U+007D
U+007E
GRAVE ACCENT
LATIN SMALL LETTER A
LATIN SMALL LETTER B
LATIN SMALL LETTER C
LATIN SMALL LETTER D
LATIN SMALL LETTER E
LATIN SMALL LETTER F
LATIN SMALL LETTER G
LATIN SMALL LETTER H
LATIN SMALL LETTER I
LATIN SMALL LETTER J
LATIN SMALL LETTER K
LATIN SMALL LETTER L
LATIN SMALL LETTER M
LATIN SMALL LETTER N
LATIN SMALL LETTER O
LATIN SMALL LETTER P
LATIN SMALL LETTER Q
LATIN SMALL LETTER R
LATIN SMALL LETTER S
LATIN SMALL LETTER T
LATIN SMALL LETTER U
LATIN SMALL LETTER V
LATIN SMALL LETTER W
LATIN SMALL LETTER X
LATIN SMALL LETTER Y
LATIN SMALL LETTER Z
LEFT CURLY BRACKET
VERTICAL LINE
RIGHT CURLY BRACKET
TILDE
10/00
10/01
10/02
10/03
10/04
10/05
10/06
10/07
10/08
10/09
10/10
10/11
10/12
10/13
10/14
10/15
11/00
11/01
11/02
11/03
11/04
11/05
11/06
11/07
11/08
11/09
11/10
11/11
11/12
11/13
11/14
11/15
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
AA
AB
AC
AD
AE
AF
B0
B1
B2
B3
B4
B5
B6
B7
B8
B9
BA
BB
BC
BD
BE
BF
U+00A0
U+0104
U+0138
U+0156
U+00A4
U+0128
U+013B
U+00A7
U+00A8
U+0160
U+0112
U+0122
U+0166
U+00AD
U+017D
U+00AF
U+00B0
U+0105
U+02DB
U+0157
U+00B4
U+0129
U+013C
U+02C7
U+00B8
U+0161
U+0113
U+0123
U+0167
U+014A
U+017E
U+014B
NO-BREAK SPACE
LATIN CAPITAL LETTER A WITH OGONEK
LATIN SMALL LETTER KRA (Greenlandic)
LATIN CAPITAL LETTER R WITH CEDILLA
CURRENCY SIGN
LATIN CAPITAL LETTER I WITH TILDE
LATIN CAPITAL LETTER L WITH CEDILLA
SECTION SIGN
DIAERESIS
LATIN CAPITAL LETTER S WITH CARON
LATIN CAPITAL LETTER E WITH MACRON
LATIN CAPITAL LETTER G WITH CEDILLA
LATIN CAPITAL LETTER T WITH STROKE
SOFT HYPHEN
LATIN CAPITAL LETTER Z WITH CARON
MACRON
DEGREE SIGN
LATIN SMALL LETTER A WITH OGONEK
OGONEK
LATIN SMALL LETTER R WITH CEDILLA
ACUTE ACCENT
LATIN SMALL LETTER I WITH TILDE
LATIN SMALL LETTER L WITH CEDILLA
CARON
CEDILLA
LATIN SMALL LETTER S WITH CARON
LATIN SMALL LETTER E WITH MACRON
LATIN SMALL LETTER G WITH CEDILLA
LATIN SMALL LETTER T WITH STROKE
LATIN CAPITAL LETTER ENG (Sámi)
LATIN SMALL LETTER Z WITH CARON
LATIN SMALL LETTER ENG (Sámi)
L
A
N
1
-1
11
97
-
19
12/00
12/01
12/02
12/03
12/04
12/05
12/06
12/07
12/08
12/09
12/10
12/11
12/12
12/13
12/14
12/15
13/00
13/01
13/02
13/03
13/04
13/05
13/06
13/07
13/08
13/09
13/10
13/11
13/12
13/13
13/14
13/15
14/00
14/01
14/02
14/03
14/04
14/05
14/06
14/07
14/08
14/09
14/10
14/11
14/12
14/13
14/14
14/15
15/00
15/01
15/02
15/03
15/04
15/05
15/06
15/07
15/08
15/09
15/10
15/11
15/12
15/13
15/14
15/15
C0 U+0100
C1 U+00C1
C2 U+00C2
C3 U+00C3
C4 U+00C4
C5 U+00C5
C6 U+00C6
C7 U+012E
C8 U+010C
C9 U+00C9
CA U+0118
CB U+00CB
CC U+0116
CD U+00CD
CE U+00CE
CF U+012A
D0 U+0110
D1 U+0145
D2 U+014C
D3 U+0136
D4 U+00D4
D5 U+00D5
D6 U+00D6
D7 U+00D7
D8 U+00D8
D9 U+0172
DA U+00DA
DB U+00DB
DC U+00DC
DD U+0168
DE U+016A
DF U+00DF
E0 U+0101
E1 U+00E1
E2 U+00E2
E3 U+00E3
E4 U+00E4
E5 U+00E5
E6 U+00E6
E7 U+012F
E8 U+010D
E9 U+00E9
EA U+0119
EB U+00EB
EC U+0117
ED U+00ED
EE U+00EE
EF U+012B
F0 U+0111
F1 U+0146
F2 U+014D
F3 U+0137
F4 U+00F4
F5 U+00F5
F6 U+00F6
F7 U+00F7
F8 U+00F8
F9 U+0173
FA U+00FA
FB U+00FB
FC U+00FC
FD U+0169
FE U+016B
FF U+02D9
Name
LATIN CAPITAL LETTER A WITH MACRON
LATIN CAPITAL LETTER A WITH ACUTE
LATIN CAPITAL LETTER A WITH CIRCUMFLEX
LATIN CAPITAL LETTER A WITH TILDE
LATIN CAPITAL LETTER A WITH DIAERESIS
LATIN CAPITAL LETTER A WITH RING ABOVE
LATIN CAPITAL LETTER AE
LATIN CAPITAL LETTER I WITH OGONEK
LATIN CAPITAL LETTER C WITH CARON
LATIN CAPITAL LETTER E WITH ACUTE
LATIN CAPITAL LETTER E WITH OGONEK
LATIN CAPITAL LETTER E WITH DIAERESIS
LATIN CAPITAL LETTER E WITH DOT ABOVE
LATIN CAPITAL LETTER I WITH ACUTE
LATIN CAPITAL LETTER I WITH CIRCUMFLEX
LATIN CAPITAL LETTER I WITH MACRON
LATIN CAPITAL LETTER D WITH STROKE
LATIN CAPITAL LETTER N WITH CEDILLA
LATIN CAPITAL LETTER O WITH MACRON
LATIN CAPITAL LETTER K WITH CEDILLA
LATIN CAPITAL LETTER O WITH CIRCUMFLEX
LATIN CAPITAL LETTER O WITH TILDE
LATIN CAPITAL LETTER O WITH DIAERESIS
MULTIPLICATION SIGN
LATIN CAPITAL LETTER O WITH STROKE
LATIN CAPITAL LETTER U WITH OGONEK
LATIN CAPITAL LETTER U WITH ACUTE
LATIN CAPITAL LETTER U WITH CIRCUMFLEX
LATIN CAPITAL LETTER U WITH DIAERESIS
LATIN CAPITAL LETTER U WITH TILDE
LATIN CAPITAL LETTER U WITH MACRON
LATIN SMALL LETTER SHARP S (German)
LATIN SMALL LETTER A WITH MACRON
LATIN SMALL LETTER A WITH ACUTE
LATIN SMALL LETTER A WITH CIRCUMFLEX
LATIN SMALL LETTER A WITH TILDE
LATIN SMALL LETTER A WITH DIAERESIS
LATIN SMALL LETTER A WITH RING ABOVE
LATIN SMALL LETTER AE
LATIN SMALL LETTER I WITH OGONEK
LATIN SMALL LETTER C WITH CARON
LATIN SMALL LETTER E WITH ACUTE
LATIN SMALL LETTER E WITH OGONEK
LATIN SMALL LETTER E WITH DIAERESIS
LATIN SMALL LETTER E WITH DOT ABOVE
LATIN SMALL LETTER I WITH ACUTE
LATIN SMALL LETTER I WITH CIRCUMFLEX
LATIN SMALL LETTER I WITH MACRON
LATIN SMALL LETTER D WITH STROKE
LATIN SMALL LETTER N WITH CEDILLA
LATIN SMALL LETTER O WITH MACRON
LATIN SMALL LETTER K WITH CEDILLA
LATIN SMALL LETTER O WITH CIRCUMFLEX
LATIN SMALL LETTER O WITH TILDE
LATIN SMALL LETTER O WITH DIAERESIS
DIVISION SIGN
LATIN SMALL LETTER O WITH STROKE
LATIN SMALL LETTER U WITH OGONEK
LATIN SMALL LETTER U WITH ACUTE
LATIN SMALL LETTER U WITH CIRCUMFLEX
LATIN SMALL LETTER U WITH DIAERESIS
LATIN SMALL LETTER U WITH TILDE
LATIN SMALL LETTER U WITH MACRON
DOT ABOVE
TE
X
T
06/00
06/01
06/02
06/03
06/04
06/05
06/06
06/07
06/08
06/09
06/10
06/11
06/12
06/13
06/14
06/15
07/00
07/01
07/02
07/03
07/04
07/05
07/06
07/07
07/08
07/09
07/10
07/11
07/12
07/13
07/14
FI
4
Table 1 (concluded)
© ISO/IEC
ISO/IEC 8859-4:1997 (E)
6.2 Code table
For each character in the set the code table
(table 2) shows a graphic symbol at the position in
the code table corresponding to the bit combination
specified in table 1.
The shaded positions in the code table correspond
to bit combinations that do not represent graphic
characters. Their use is outside the scope of
ISO/IEC 8859; it is specified in other International
Standards, for example ISO/IEC 6429.
TE
X
T
Table 2 – Code table of Latin alphabet No. 4
NBSP
SHY
19
97
-
11
-1
1
FI
N
A
L
SP
x
he
5
ISO/IEC 8859-4:1997 (E)
© ISO/IEC
7 Identification of the character set
7.1 Identification according to ISO/IEC 2022
and ISO/IEC 4873
The graphic characters of this part of ISO/IEC 8859
constitute a single coded character set. However in
accordance with ISO/IEC 2022 and ISO/IEC 4873
the code table of this part of ISO/IEC 8859 may be
considered to consist of the following components:
–
a 94-character G0 graphic character set
represented by bit combinations 02/01 to 07/14;
–
a 96-character G1 graphic character set
represented by bit combinations 10/00 to 15/15.
When the identification methods of ISO/IEC 2022 or
ISO/IEC 4873 are used this part of ISO/IEC 8859
shall be identified by the following pair of
designation functions:
GZD4
04/02
(ESC 02/08 04/02)
G1D6
04/04
(ESC 02/13 04/04)
NOTE – The corresponding escape sequences are
shown in parentheses.
L
7.2 Identification according to ISO/IEC 8824-1
(ASN.1)
19
97
-
11
-1
1
FI
N
A
In the terminology of ISO/IEC 8824-1 the character
set of this part of ISO/IEC 8859 and the
corresponding coded representations are distinct,
and are known as the "character abstract syntax"
and the "character transfer syntax" respectively.
6
–
character set
{ iso standard 8859 4 abstract-syntax (1) }
–
coded representations
{ iso standard 8859 4 transfer-syntax (0) }
The corresponding object descriptors shall be:
–
character set
"ISO 8859 part 4 repertoire"
TE
X
T
–
The character SPACE represented by bit
combination 02/00;
When the identification methods of ISO/IEC 8824-1
are used this part of ISO/IEC 8859 shall be
identified by the following object identifiers:
–
coded representations "ISO 8859 part 4 code"
7.3 Identification using the ISO International
register of coded character sets to be used
with escape sequences
According to 7.1 above the character set of this part
of ISO/IEC 8859 may be considered to consist of
the character SPACE, a 94-character G0 graphic
character set, and a 96-character G1 graphic
character set. The G0 and G1 graphic character
sets may be identified by the use of the Registration
Numbers from the ISO International register of
coded character sets to be used with escape
sequences.
When these registration numbers are used this part
of ISO/IEC 8859 shall be identified by the following
pair of registration numbers:
– G0 graphic character set ISO-IR 6
– G1 graphic character set ISO-IR 110
© ISO/IEC
ISO/IEC 8859-4:1997 (E)
Annex A
(informative)
Coverage of languages by parts 1 to 10 of ISO/IEC 8859
A.1 Languages of European origin written in Latin script
ISO/IEC
ISO/IEC
ISO/IEC
ISO/IEC
ISO/IEC
ISO/IEC
8859-1
8859-2
8859-3
8859-4
8859-9
8859-10
Latin
Latin
Latin
Latin
Latin
Latin
alphabet
alphabet
alphabet
alphabet
alphabet
alphabet
No.
No.
No.
No.
No.
No.
1
2
3
4
5
6
The following official and regional languages written
in Europe are covered by the Latin alphabets 1–6 as
indicated by number in table A.1:
TE
X
T
The following parts of ISO/IEC 8859 specify coded
character sets which comprise various different
selections of characters based on the Latin
alphabet. These sets are identified by the numbers
1 to 6 as shown:
Table A.1 – Language coverage
Language
Covered by alphabet(s) Language
Albanian
Basque
Breton
Catalan
Croat
Czech
Danish
Dutch
English
Esperanto
Estonian
Faroese
Finnish
French
1
1
1
1
2
2
4
2
1
1
(1)
5
5
5
4
1 The list of languages in table A.1 is not exhaustive.
It shows the languages that are included in the Scope
clause of each part of ISO/IEC 8859.
FI
1
1
1
1
2
3
4
4
5
5
5
5
6
6
5
6
6
2
1
1
(new orthography)
6 Italian
Latin
4
6 Latvian
6 Lithuanian
4 5 6 Luxemburgish
Maltese
(3)
(5)
3
3
A
1
1
1
Frisian
Galician
German
Greenlandic
Hungarian
Icelandic
6 Irish Gaelic
L
5
5
5
5
N
NOTES
2
Covered by alphabet(s) Language
2 For writing French three characters (Œ, œ, Ÿ) not
specified in parts 1, 3 and 9, are also needed.
1
1
3
3
2
1
4
4
4
5
5
6
6
5
Norwegian
Polish
Portuguese
Rhaeto-Romanic
Romanian
Sámi
Scottish Gaelic
Slovak
Slovene
Sorbian
Spanish
Swedish
Turkish
Covered by alphabet(s)
1
4
5
6
2
1
1
3
5
5
2
4
1
6
5
2
2
2
4
1
1
4
(3)
6
5
5
5
6
3
4 There are several official written languages outside
Europe that are covered by Latin alphabet No. 1.
Examples are Indonesian/Malay, Tagalog (Philippines),
Swahili, Afrikaans.
5
Use of Latin alphabet No. 3 for Turkish is deprecated.
19
97
-
11
-1
1
3 The various Sámi languages use partly differing
orthographies. The character sets in parts 4 and 10 cover
the requirements of the Sámi languages most commonly
used in Finland, Norway and Sweden. For the Skolt Sámi
language used in Finland and Norway additional
characters are needed. These are included in ISO-IR 158
and 197.
7
ISO/IEC 8859-4:1997 (E)
© ISO/IEC
A.2 Languages written in non-Latin scripts
The following parts of ISO/IEC 8859 specify coded
character sets which include graphic characters
from alphabets other than the Latin alphabet:
8859-5
8859-6
8859-7
8859-8
Latin/Cyrillic alphabet
Latin/Arabic alphabet
Latin/Greek alphabet
Latin/Hebrew alphabet
The Cyrillic characters included in part 5 cover
Bulgarian, Byelorussian, (Slavic) Macedonian,
Russian, Serbian and Ukrainian (as written up to
1990, see also Scope of part 5).
The Arabic characters included in part 6 cover
Arabic. The Greek characters included in part 7
cover Greek (monotonikó orthography). The Hebrew
characters included in part 8 cover Hebrew.
19
97
-
11
-1
1
FI
N
A
L
TE
X
T
ISO/IEC
ISO/IEC
ISO/IEC
ISO/IEC
The following official and regional languages are
covered by these alphabets:
8
© ISO/IEC
ISO/IEC 8859-4:1997 (E)
Annex B
(informative)
Main differences between the First edition and this Second edition of
this part of ISO/IEC 8859
B.4 A new Annex A has been added that identifies
the coverage of languages by parts 1–10 of ISO/IEC
8859.
B.5 Various editorial adjustments and clarifications
have been made to the text of the standard. The
hexadecimal equivalents of the bit combinations
have been added to tables 1 and 2, and a revised
font has been used for the graphic symbols in
table 2.
TE
X
T
B.1 The names of the graphic characters have
been amended where necessary to align them with
the names of characters adopted for all standards
on coded character sets developed under the
responsibility of ISO/IEC JTC 1. For each character
the short identifiers specified in ISO/IEC 10646-1
Amendment 9 have been added to table 1.
B.2 The new style of conformance clause, adopted
for all standards on coded character sets, has been
introduced.
B.6 Annex C, Bibliography, has been added.
B.3 Object identifiers conforming to Abstract Syntax
Notation One (ASN.1, see ISO/IEC 8824-1) are
specified in 7.2 for the character set, and the
corresponding coded representations, of this part of
ISO/IEC 8859.
19
97
-
11
-1
1
FI
N
A
L
Registration numbers from the International register
of coded character sets to be used with escape
sequences, have been included as an additional
method of identifying the coded character set of this
part of ISO/IEC 8859.
9
ISO/IEC 8859-4:1997 (E)
© ISO/IEC
Annex C
(informative)
Bibliography
ISO/IEC 6429:1992,
ISO/IEC 10367:1991,
8-bit codes.
Information technology – Control functions for coded character sets.
Information technology – Standardized coded graphic character sets for use in
TE
X
T
ISO/IEC 10646-1:1993, Information technology – Universal Multiple-Octet Coded Character Set (UCS) –
Part 1: Architecture and Basic Multilingual Plane.
19
97
-
11
-1
1
FI
N
A
L
ISO International register of coded character sets to be used with escape sequences.
10