,62,(&-7&6&1 ,62,(&-7&6&:*1 'DWH ,62,(&-7&6&:* ELWDQGELWFRGHVDQGWKHLUH[WHQVLRQ 6(&5(7$5,$7(/27 '2& 7<3( )LQDO7H[W6XEPLWWHGIRU,6SXEOLFDWLRQ 7,7/( )LQDO7H[WRI',6,QIRUPDWLRQ7HFKQRORJ\ELW VLQJOHE\WHFRGHGJUDSKLFFKDUDFWHUVHWV3DUW/DWLQ DOSKDEHW1R 6285&( 0U.,/DUVVRQ3URMHFW(GLWRU 352-(&7 -7& 67$786 ,QDFFRUGDQFHZLWK5HVROXWLRQ0DGRSWHGDWWKHWK 3OHQDU\PHHWLQJRI6&KHOGLQ&UHWH*UHHFHWKLVGRFXPHQW LVVXEPLWWHGWR,77)WRJHWKHUZLWK'LVSRVLWLRQRI&RPPHQWV 5HSRUWFRQWDLQHGLQ1IRUSXEOLFDWLRQSURFHVVLQJ $&7,21 ,' )<, '8( '$7( ',675,%87,21 32DQG/0HPEHUVRI,62,(&-7&6& :*&RQYHQHUV6HFUHWDULDWV :*0HPEHUV ,62,(&-7&6HFUHWDULDW ,62,(&,77) 0(',80 3 12 2) 3$*(6 &RQWDFW 6HFUHWDULDW ,62,(& -7& 6& :* (/27 0UV .9HOOL DFWLQJ $FKDUQRQ .DWR 3DWLVVLD $7+(16 ± *5((&( 7HO )D[ (PDLO NNE#HORWJU &RQWDFW &RQYHQRU ,62,(& -7& 6& :* 0U (0HODJUDNLV $FKDUQRQ .DWR 3DWLVVLD $7+(16 ± *5((&( 7HO )D[ (PDLO HHP#HORWJU ISO/IEC 8859-4:1997 (E) TE X T © ISO/IEC TITLE PAGE 19 97 - 11 -1 1 FI N A L To be provided by ITTF ISO/IEC 8859-4:1997 (E) © ISO/IEC Contents Page Foreword . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv 1 Scope . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 2 Conformance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 3 Normative references 4 Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 5 Notation, code table and character names . . . . . . . . . . . 2 6 Specification of the coded character set . . . . . . . . . . . . . 3 7 Identification of the character set . . . . . . . . . . . . . . . . . . 6 TE X T ........................... 1 Annex A: Coverage of languages by parts 1 to 10 of ISO/IEC 8859 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Annex B: Main differences between the First edition and this Second edition of this part of ISO/IEC 8859 . 9 . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 19 97 - 11 -1 1 FI N A L Annex C: Bibliography © ISO/IEC 1997 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from the publisher. ISO/IEC Copyright Office • Case Postale 56 • CH-1211 Genève 20 • Switzerland ii © ISO/IEC ISO/IEC 8859-4:1997 (E) Foreword TE X T ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and nongovernmental, in liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC1. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75% of the national bodies casting a vote. International Standard ISO/IEC 8859-4 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology, Subcommittee SC 2, Character sets and information coding. ISO/IEC 8859 consists of the following parts, under the general title Information technology – 8-bit single-byte coded graphic character sets: Part 1: Latin alphabet No. 1 – Part 2: Latin alphabet No. 2 – Part 3: Latin alphabet No. 3 – Part 4: Latin alphabet No. 4 – Part 5: Latin/Cyrillic alphabet – Part 6: Latin/Arabic alphabet – Part 7: Latin/Greek alphabet – Part 8: Latin/Hebrew alphabet – Part 9: Latin alphabet No. 5 – Part 10: Latin alphabet No. 6 Annexes A to C of this part of ISO/IEC 8859 are for information only. 19 97 - 11 -1 1 FI N A L – iii ISO/IEC 8859-4:1997 (E) © ISO/IEC Introduction 19 97 - 11 -1 1 FI N A L TE X T ISO/IEC 8859 consists of several parts. Each part specifies a set of up to 191 graphic characters and the coded representation of these characters by means of a single 8-bit byte. Each set is intended for use for a particular group of languages. iv INTERNATIONAL STANDARD © ISO/IEC ISO/IEC 8859-4:1997 (E) Information technology – 8-bit single-byte coded graphic character sets – Part 4: Latin alphabet No. 4 2.2 Conformance of devices This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as Latin alphabet No. 4. A device is in conformance with this part of ISO/IEC 8859 if it conforms to the requirements of 2.2.1, and either or both of 2.2.2 and 2.2.3. A claim of conformance shall identify the document which contains the description specified in 2.2.1. TE X T 1 Scope This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: Danish, English, Estonian, Finnish, German, Greenlandic, Latin, Latvian, Lithuanian, Norwegian, Sámi (but see Annex A.1, Notes), Slovene and Swedish. L This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1. A This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3. FI N The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6). 19 97 - 11 -1 1 NOTE – ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point. 2 Conformance 2.1 Conformance of information interchange A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with this part of ISO/IEC 8859 if all the coded representations of graphic characters within that CC-data-element conform to the requirements of clause 6. 2.2.1 Device description A device that conforms to this part of ISO/IEC 8859 shall be the subject of a description that identifies the means by which the user may supply characters to the device, or may recognize them when they are made available to him, as specified respectively in 2.2.2 and 2.2.3. 2.2.2 Originating devices An originating device shall allow its user to supply any sequence of characters from those specified in clause 6, and shall be capable of transmitting their coded representations within a CC-data-element. 2.2.3 Receiving devices A receiving device shall be capable of receiving and interpreting any coded representations of characters that are within a CC-data-element, and that conform to clause 6, and shall make the corresponding characters available to its user in such a way that the user can identify them from among those specified there, and can distinguish them from each other. 3 Normative references The following standards contain provisions which, through reference in this text, constitute provisions of this part of ISO/IEC 8859. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this part of ISO/IEC 8859 are encouraged to investigate the possibility of applying the most recent editions of the standards indicated below. Members of IEC and ISO maintain registers of currently valid International Standards. 1 ISO/IEC 8859-4:1997 (E) © ISO/IEC ISO/IEC 2022:1994, Information technology – Character code structure and extension techniques. ISO/IEC 4873:1991, Information technology – ISO 8-bit code for information interchange – Structure and rules for implementation. ISO/IEC 8824-1:1995, Information technology – Abstract Syntax Notation One (ASN.1): Specification of basic notation. The bit combinations may be interpreted to represent numbers in binary notation by attributing the following weights to the individual bits: Bit Weight b8 b7 b6 b5 b4 b3 b2 b1 128 64 32 16 8 4 2 1 For the purposes of this part of ISO/IEC 8859 the following definitions apply: Using these weights, the bit combinations are identified by notations of the form xx/yy, where xx and yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b 8 to b1 is as follows: 4.1 bit combination: An ordered set of bits used for the representation of characters. – xx is the number represented by b 8, b 7, b 6 and b 5 where these bits are given the weights 8, 4, 2, and 1 respectively. TE X T 4 Definitions 4.2 byte: A bit string that is operated upon as a unit. 4.3 character: A member of a set of elements used for the organization, control, or representation of data. 4.4 code table: A table showing the characters allocated to each bit combination in a code. 4.5 coded character set; code: A set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their bit combinations. A L 4.6 coded-character-data-element (CC-dataelement): An element of interchanged information that is specified to consist of a sequence of coded representations of characters, in accordance with one or more identified standards for coded character sets. N 4.7 graphic character: A character, other than a control function, that has a visual representation normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations. NOTE – In ISO/IEC 8859 a single bit combination is used to represent each character. FI 4.8 graphic symbol: A visual representation of a graphic character or of a control function. 11 -1 1 4.9 position: That part of a code table identified by its column and row coordinates. 19 97 - 5 Notation, code table, and names 5.1 Notation The bits of the bit combinations of the 8-bit code are identified by b 8, b7, b 6, b 5, b 4, b 3, b2, and b1, where b 8 is the highest-order, or most-significant bit and b 1 is the lowest-order, or least-significant bit. 2 – yy is the number represented by b 4, b 3, b 2 and b 1 where these bits are given the weights 8, 4, 2, and 1 respectively. The bit combinations are also identified by notations of the form hk, where h and k are numbers in the range 0 to F in hexadecimal notation. The number h is the same as the number xx described above, and the number k the same as the number yy described above. 5.2 Layout of the code table An 8-bit code table consists of 256 positions arranged in 16 columns and 16 rows. The columns and the rows are numbered 00 to 15. In hexadecimal notation the columns and the rows are numbered 0 to F. The code table positions are identified by notations of the form xx/yy, where xx is the column number and yy is the row number. The column and row numbers are shown at the top and left edges of the table respectively. The code table positions are also identified by notations of the form hk, where h is the column number and k is the row number in hexadecimal notation. The column and row numbers are shown at the bottom and right edges of the table respectively. The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The notation of a code table position, of the form xx/yy, or of the form hk, is the same as that of the corresponding bit combination. 5.3 Names and meanings This part of ISO/IEC 8859 assigns a unique name and a unique identifier to each graphic character. These names and identifiers have been taken from © ISO/IEC ISO/IEC 8859-4:1997 (E) ISO/IEC 10646-1 (E). This part of ISO/IEC 8859 also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE and SOFT HYPHEN. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be retained in all translations of the text. Except for SPACE (SP), NO-BREAK SPACE (NBSP) and SOFT HYPHEN (SHY), this part of ISO/IEC 8859 does not define and does not restrict the meanings of graphic characters. 5.3.1 SPACE (SP) A graphic character the visual representation of which consists of the absence of a graphic symbol. 5.3.2 NO-BREAK SPACE (NBSP) L A graphic character the visual representation of which consists of the absence of a graphic symbol, for use when a line break is to be prevented in the text as presented. 5.3.3 SOFT HYPHEN (SHY) A A graphic character that is imaged by a graphic symbol identical with, or similar to, that representing HYPHEN, for use when a line break has been established within a word. N 6 Specification of the coded character set FI This part of ISO/IEC 8859 specifies 191 characters allocated to the bit combinations of the code table (table 2). None of these characters are combining characters. NOTE – Combining characters are described in ISO/IEC 2022:1994 subclause 6.3.3. 19 97 - 11 -1 1 Control functions, such as BACKSPACE or CARRIAGE RETURN, shall not be used to create composite graphic symbols, which are made up from the graphic representations of two or more characters. 6.1 Characters of the set and their coded representation See table 1. Bit combi- Hex Identifier Name nation 02/00 02/01 02/02 02/03 02/04 02/05 02/06 02/07 02/08 02/09 02/10 02/11 02/12 02/13 02/14 02/15 03/00 03/01 03/02 03/03 03/04 03/05 03/06 03/07 03/08 03/09 03/10 03/11 03/12 03/13 03/14 03/15 04/00 04/01 04/02 04/03 04/04 04/05 04/06 04/07 04/08 04/09 04/10 04/11 04/12 04/13 04/14 04/15 05/00 05/01 05/02 05/03 05/04 05/05 05/06 05/07 05/08 05/09 05/10 05/11 05/12 05/13 05/14 05/15 20 21 22 23 24 25 26 27 28 29 2A 2B 2C 2D 2E 2F 30 31 32 33 34 35 36 37 38 39 3A 3B 3C 3D 3E 3F 40 41 42 43 44 45 46 47 48 49 4A 4B 4C 4D 4E 4F 50 51 52 53 54 55 56 57 58 59 5A 5B 5C 5D 5E 5F U+0020 U+0021 U+0022 U+0023 U+0024 U+0025 U+0026 U+0027 U+0028 U+0029 U+002A U+002B U+002C U+002D U+002E U+002F U+0030 U+0031 U+0032 U+0033 U+0034 U+0035 U+0036 U+0037 U+0038 U+0039 U+003A U+003B U+003C U+003D U+003E U+003F U+0040 U+0041 U+0042 U+0043 U+0044 U+0045 U+0046 U+0047 U+0048 U+0049 U+004A U+004B U+004C U+004D U+004E U+004F U+0050 U+0051 U+0052 U+0053 U+0054 U+0055 U+0056 U+0057 U+0058 U+0059 U+005A U+005B U+005C U+005D U+005E U+005F SPACE EXCLAMATION MARK QUOTATION MARK NUMBER SIGN DOLLAR SIGN PERCENT SIGN AMPERSAND APOSTROPHE LEFT PARENTHESIS RIGHT PARENTHESIS ASTERISK PLUS SIGN COMMA HYPHEN-MINUS FULL STOP SOLIDUS DIGIT ZERO DIGIT ONE DIGIT TWO DIGIT THREE DIGIT FOUR DIGIT FIVE DIGIT SIX DIGIT SEVEN DIGIT EIGHT DIGIT NINE COLON SEMICOLON LESS-THAN SIGN EQUALS SIGN GREATER-THAN SIGN QUESTION MARK COMMERCIAL AT LATIN CAPITAL LETTER A LATIN CAPITAL LETTER B LATIN CAPITAL LETTER C LATIN CAPITAL LETTER D LATIN CAPITAL LETTER E LATIN CAPITAL LETTER F LATIN CAPITAL LETTER G LATIN CAPITAL LETTER H LATIN CAPITAL LETTER I LATIN CAPITAL LETTER J LATIN CAPITAL LETTER K LATIN CAPITAL LETTER L LATIN CAPITAL LETTER M LATIN CAPITAL LETTER N LATIN CAPITAL LETTER O LATIN CAPITAL LETTER P LATIN CAPITAL LETTER Q LATIN CAPITAL LETTER R LATIN CAPITAL LETTER S LATIN CAPITAL LETTER T LATIN CAPITAL LETTER U LATIN CAPITAL LETTER V LATIN CAPITAL LETTER W LATIN CAPITAL LETTER X LATIN CAPITAL LETTER Y LATIN CAPITAL LETTER Z LEFT SQUARE BRACKET REVERSE SOLIDUS RIGHT SQUARE BRACKET CIRCUMFLEX ACCENT LOW LINE TE X T This part of ISO/IEC 8859 specifies a graphic symbol for each graphic character. This symbol is shown in the corresponding position of the code table. However, this part, or any other part, of ISO/IEC 8859 does not specify a particular style or font design for imaging graphic characters. Annex B of ISO/IEC 10367 gives further information on this subject. Table 1 – Character set, coded representation 3 ISO/IEC 8859-4:1997 (E) © ISO/IEC Table 1 (continued) Bit combi- Hex Identifier nation Bit combi- Hex Identifier nation Name 60 61 62 63 64 65 66 67 68 69 6A 6B 6C 6D 6E 6F 70 71 72 73 74 75 76 77 78 79 7A 7B 7C 7D 7E U+0060 U+0061 U+0062 U+0063 U+0064 U+0065 U+0066 U+0067 U+0068 U+0069 U+006A U+006B U+006C U+006D U+006E U+006F U+0070 U+0071 U+0072 U+0073 U+0074 U+0075 U+0076 U+0077 U+0078 U+0079 U+007A U+007B U+007C U+007D U+007E GRAVE ACCENT LATIN SMALL LETTER A LATIN SMALL LETTER B LATIN SMALL LETTER C LATIN SMALL LETTER D LATIN SMALL LETTER E LATIN SMALL LETTER F LATIN SMALL LETTER G LATIN SMALL LETTER H LATIN SMALL LETTER I LATIN SMALL LETTER J LATIN SMALL LETTER K LATIN SMALL LETTER L LATIN SMALL LETTER M LATIN SMALL LETTER N LATIN SMALL LETTER O LATIN SMALL LETTER P LATIN SMALL LETTER Q LATIN SMALL LETTER R LATIN SMALL LETTER S LATIN SMALL LETTER T LATIN SMALL LETTER U LATIN SMALL LETTER V LATIN SMALL LETTER W LATIN SMALL LETTER X LATIN SMALL LETTER Y LATIN SMALL LETTER Z LEFT CURLY BRACKET VERTICAL LINE RIGHT CURLY BRACKET TILDE 10/00 10/01 10/02 10/03 10/04 10/05 10/06 10/07 10/08 10/09 10/10 10/11 10/12 10/13 10/14 10/15 11/00 11/01 11/02 11/03 11/04 11/05 11/06 11/07 11/08 11/09 11/10 11/11 11/12 11/13 11/14 11/15 A0 A1 A2 A3 A4 A5 A6 A7 A8 A9 AA AB AC AD AE AF B0 B1 B2 B3 B4 B5 B6 B7 B8 B9 BA BB BC BD BE BF U+00A0 U+0104 U+0138 U+0156 U+00A4 U+0128 U+013B U+00A7 U+00A8 U+0160 U+0112 U+0122 U+0166 U+00AD U+017D U+00AF U+00B0 U+0105 U+02DB U+0157 U+00B4 U+0129 U+013C U+02C7 U+00B8 U+0161 U+0113 U+0123 U+0167 U+014A U+017E U+014B NO-BREAK SPACE LATIN CAPITAL LETTER A WITH OGONEK LATIN SMALL LETTER KRA (Greenlandic) LATIN CAPITAL LETTER R WITH CEDILLA CURRENCY SIGN LATIN CAPITAL LETTER I WITH TILDE LATIN CAPITAL LETTER L WITH CEDILLA SECTION SIGN DIAERESIS LATIN CAPITAL LETTER S WITH CARON LATIN CAPITAL LETTER E WITH MACRON LATIN CAPITAL LETTER G WITH CEDILLA LATIN CAPITAL LETTER T WITH STROKE SOFT HYPHEN LATIN CAPITAL LETTER Z WITH CARON MACRON DEGREE SIGN LATIN SMALL LETTER A WITH OGONEK OGONEK LATIN SMALL LETTER R WITH CEDILLA ACUTE ACCENT LATIN SMALL LETTER I WITH TILDE LATIN SMALL LETTER L WITH CEDILLA CARON CEDILLA LATIN SMALL LETTER S WITH CARON LATIN SMALL LETTER E WITH MACRON LATIN SMALL LETTER G WITH CEDILLA LATIN SMALL LETTER T WITH STROKE LATIN CAPITAL LETTER ENG (Sámi) LATIN SMALL LETTER Z WITH CARON LATIN SMALL LETTER ENG (Sámi) L A N 1 -1 11 97 - 19 12/00 12/01 12/02 12/03 12/04 12/05 12/06 12/07 12/08 12/09 12/10 12/11 12/12 12/13 12/14 12/15 13/00 13/01 13/02 13/03 13/04 13/05 13/06 13/07 13/08 13/09 13/10 13/11 13/12 13/13 13/14 13/15 14/00 14/01 14/02 14/03 14/04 14/05 14/06 14/07 14/08 14/09 14/10 14/11 14/12 14/13 14/14 14/15 15/00 15/01 15/02 15/03 15/04 15/05 15/06 15/07 15/08 15/09 15/10 15/11 15/12 15/13 15/14 15/15 C0 U+0100 C1 U+00C1 C2 U+00C2 C3 U+00C3 C4 U+00C4 C5 U+00C5 C6 U+00C6 C7 U+012E C8 U+010C C9 U+00C9 CA U+0118 CB U+00CB CC U+0116 CD U+00CD CE U+00CE CF U+012A D0 U+0110 D1 U+0145 D2 U+014C D3 U+0136 D4 U+00D4 D5 U+00D5 D6 U+00D6 D7 U+00D7 D8 U+00D8 D9 U+0172 DA U+00DA DB U+00DB DC U+00DC DD U+0168 DE U+016A DF U+00DF E0 U+0101 E1 U+00E1 E2 U+00E2 E3 U+00E3 E4 U+00E4 E5 U+00E5 E6 U+00E6 E7 U+012F E8 U+010D E9 U+00E9 EA U+0119 EB U+00EB EC U+0117 ED U+00ED EE U+00EE EF U+012B F0 U+0111 F1 U+0146 F2 U+014D F3 U+0137 F4 U+00F4 F5 U+00F5 F6 U+00F6 F7 U+00F7 F8 U+00F8 F9 U+0173 FA U+00FA FB U+00FB FC U+00FC FD U+0169 FE U+016B FF U+02D9 Name LATIN CAPITAL LETTER A WITH MACRON LATIN CAPITAL LETTER A WITH ACUTE LATIN CAPITAL LETTER A WITH CIRCUMFLEX LATIN CAPITAL LETTER A WITH TILDE LATIN CAPITAL LETTER A WITH DIAERESIS LATIN CAPITAL LETTER A WITH RING ABOVE LATIN CAPITAL LETTER AE LATIN CAPITAL LETTER I WITH OGONEK LATIN CAPITAL LETTER C WITH CARON LATIN CAPITAL LETTER E WITH ACUTE LATIN CAPITAL LETTER E WITH OGONEK LATIN CAPITAL LETTER E WITH DIAERESIS LATIN CAPITAL LETTER E WITH DOT ABOVE LATIN CAPITAL LETTER I WITH ACUTE LATIN CAPITAL LETTER I WITH CIRCUMFLEX LATIN CAPITAL LETTER I WITH MACRON LATIN CAPITAL LETTER D WITH STROKE LATIN CAPITAL LETTER N WITH CEDILLA LATIN CAPITAL LETTER O WITH MACRON LATIN CAPITAL LETTER K WITH CEDILLA LATIN CAPITAL LETTER O WITH CIRCUMFLEX LATIN CAPITAL LETTER O WITH TILDE LATIN CAPITAL LETTER O WITH DIAERESIS MULTIPLICATION SIGN LATIN CAPITAL LETTER O WITH STROKE LATIN CAPITAL LETTER U WITH OGONEK LATIN CAPITAL LETTER U WITH ACUTE LATIN CAPITAL LETTER U WITH CIRCUMFLEX LATIN CAPITAL LETTER U WITH DIAERESIS LATIN CAPITAL LETTER U WITH TILDE LATIN CAPITAL LETTER U WITH MACRON LATIN SMALL LETTER SHARP S (German) LATIN SMALL LETTER A WITH MACRON LATIN SMALL LETTER A WITH ACUTE LATIN SMALL LETTER A WITH CIRCUMFLEX LATIN SMALL LETTER A WITH TILDE LATIN SMALL LETTER A WITH DIAERESIS LATIN SMALL LETTER A WITH RING ABOVE LATIN SMALL LETTER AE LATIN SMALL LETTER I WITH OGONEK LATIN SMALL LETTER C WITH CARON LATIN SMALL LETTER E WITH ACUTE LATIN SMALL LETTER E WITH OGONEK LATIN SMALL LETTER E WITH DIAERESIS LATIN SMALL LETTER E WITH DOT ABOVE LATIN SMALL LETTER I WITH ACUTE LATIN SMALL LETTER I WITH CIRCUMFLEX LATIN SMALL LETTER I WITH MACRON LATIN SMALL LETTER D WITH STROKE LATIN SMALL LETTER N WITH CEDILLA LATIN SMALL LETTER O WITH MACRON LATIN SMALL LETTER K WITH CEDILLA LATIN SMALL LETTER O WITH CIRCUMFLEX LATIN SMALL LETTER O WITH TILDE LATIN SMALL LETTER O WITH DIAERESIS DIVISION SIGN LATIN SMALL LETTER O WITH STROKE LATIN SMALL LETTER U WITH OGONEK LATIN SMALL LETTER U WITH ACUTE LATIN SMALL LETTER U WITH CIRCUMFLEX LATIN SMALL LETTER U WITH DIAERESIS LATIN SMALL LETTER U WITH TILDE LATIN SMALL LETTER U WITH MACRON DOT ABOVE TE X T 06/00 06/01 06/02 06/03 06/04 06/05 06/06 06/07 06/08 06/09 06/10 06/11 06/12 06/13 06/14 06/15 07/00 07/01 07/02 07/03 07/04 07/05 07/06 07/07 07/08 07/09 07/10 07/11 07/12 07/13 07/14 FI 4 Table 1 (concluded) © ISO/IEC ISO/IEC 8859-4:1997 (E) 6.2 Code table For each character in the set the code table (table 2) shows a graphic symbol at the position in the code table corresponding to the bit combination specified in table 1. The shaded positions in the code table correspond to bit combinations that do not represent graphic characters. Their use is outside the scope of ISO/IEC 8859; it is specified in other International Standards, for example ISO/IEC 6429. TE X T Table 2 – Code table of Latin alphabet No. 4 NBSP SHY 19 97 - 11 -1 1 FI N A L SP x he 5 ISO/IEC 8859-4:1997 (E) © ISO/IEC 7 Identification of the character set 7.1 Identification according to ISO/IEC 2022 and ISO/IEC 4873 The graphic characters of this part of ISO/IEC 8859 constitute a single coded character set. However in accordance with ISO/IEC 2022 and ISO/IEC 4873 the code table of this part of ISO/IEC 8859 may be considered to consist of the following components: – a 94-character G0 graphic character set represented by bit combinations 02/01 to 07/14; – a 96-character G1 graphic character set represented by bit combinations 10/00 to 15/15. When the identification methods of ISO/IEC 2022 or ISO/IEC 4873 are used this part of ISO/IEC 8859 shall be identified by the following pair of designation functions: GZD4 04/02 (ESC 02/08 04/02) G1D6 04/04 (ESC 02/13 04/04) NOTE – The corresponding escape sequences are shown in parentheses. L 7.2 Identification according to ISO/IEC 8824-1 (ASN.1) 19 97 - 11 -1 1 FI N A In the terminology of ISO/IEC 8824-1 the character set of this part of ISO/IEC 8859 and the corresponding coded representations are distinct, and are known as the "character abstract syntax" and the "character transfer syntax" respectively. 6 – character set { iso standard 8859 4 abstract-syntax (1) } – coded representations { iso standard 8859 4 transfer-syntax (0) } The corresponding object descriptors shall be: – character set "ISO 8859 part 4 repertoire" TE X T – The character SPACE represented by bit combination 02/00; When the identification methods of ISO/IEC 8824-1 are used this part of ISO/IEC 8859 shall be identified by the following object identifiers: – coded representations "ISO 8859 part 4 code" 7.3 Identification using the ISO International register of coded character sets to be used with escape sequences According to 7.1 above the character set of this part of ISO/IEC 8859 may be considered to consist of the character SPACE, a 94-character G0 graphic character set, and a 96-character G1 graphic character set. The G0 and G1 graphic character sets may be identified by the use of the Registration Numbers from the ISO International register of coded character sets to be used with escape sequences. When these registration numbers are used this part of ISO/IEC 8859 shall be identified by the following pair of registration numbers: – G0 graphic character set ISO-IR 6 – G1 graphic character set ISO-IR 110 © ISO/IEC ISO/IEC 8859-4:1997 (E) Annex A (informative) Coverage of languages by parts 1 to 10 of ISO/IEC 8859 A.1 Languages of European origin written in Latin script ISO/IEC ISO/IEC ISO/IEC ISO/IEC ISO/IEC ISO/IEC 8859-1 8859-2 8859-3 8859-4 8859-9 8859-10 Latin Latin Latin Latin Latin Latin alphabet alphabet alphabet alphabet alphabet alphabet No. No. No. No. No. No. 1 2 3 4 5 6 The following official and regional languages written in Europe are covered by the Latin alphabets 1–6 as indicated by number in table A.1: TE X T The following parts of ISO/IEC 8859 specify coded character sets which comprise various different selections of characters based on the Latin alphabet. These sets are identified by the numbers 1 to 6 as shown: Table A.1 – Language coverage Language Covered by alphabet(s) Language Albanian Basque Breton Catalan Croat Czech Danish Dutch English Esperanto Estonian Faroese Finnish French 1 1 1 1 2 2 4 2 1 1 (1) 5 5 5 4 1 The list of languages in table A.1 is not exhaustive. It shows the languages that are included in the Scope clause of each part of ISO/IEC 8859. FI 1 1 1 1 2 3 4 4 5 5 5 5 6 6 5 6 6 2 1 1 (new orthography) 6 Italian Latin 4 6 Latvian 6 Lithuanian 4 5 6 Luxemburgish Maltese (3) (5) 3 3 A 1 1 1 Frisian Galician German Greenlandic Hungarian Icelandic 6 Irish Gaelic L 5 5 5 5 N NOTES 2 Covered by alphabet(s) Language 2 For writing French three characters (Œ, œ, Ÿ) not specified in parts 1, 3 and 9, are also needed. 1 1 3 3 2 1 4 4 4 5 5 6 6 5 Norwegian Polish Portuguese Rhaeto-Romanic Romanian Sámi Scottish Gaelic Slovak Slovene Sorbian Spanish Swedish Turkish Covered by alphabet(s) 1 4 5 6 2 1 1 3 5 5 2 4 1 6 5 2 2 2 4 1 1 4 (3) 6 5 5 5 6 3 4 There are several official written languages outside Europe that are covered by Latin alphabet No. 1. Examples are Indonesian/Malay, Tagalog (Philippines), Swahili, Afrikaans. 5 Use of Latin alphabet No. 3 for Turkish is deprecated. 19 97 - 11 -1 1 3 The various Sámi languages use partly differing orthographies. The character sets in parts 4 and 10 cover the requirements of the Sámi languages most commonly used in Finland, Norway and Sweden. For the Skolt Sámi language used in Finland and Norway additional characters are needed. These are included in ISO-IR 158 and 197. 7 ISO/IEC 8859-4:1997 (E) © ISO/IEC A.2 Languages written in non-Latin scripts The following parts of ISO/IEC 8859 specify coded character sets which include graphic characters from alphabets other than the Latin alphabet: 8859-5 8859-6 8859-7 8859-8 Latin/Cyrillic alphabet Latin/Arabic alphabet Latin/Greek alphabet Latin/Hebrew alphabet The Cyrillic characters included in part 5 cover Bulgarian, Byelorussian, (Slavic) Macedonian, Russian, Serbian and Ukrainian (as written up to 1990, see also Scope of part 5). The Arabic characters included in part 6 cover Arabic. The Greek characters included in part 7 cover Greek (monotonikó orthography). The Hebrew characters included in part 8 cover Hebrew. 19 97 - 11 -1 1 FI N A L TE X T ISO/IEC ISO/IEC ISO/IEC ISO/IEC The following official and regional languages are covered by these alphabets: 8 © ISO/IEC ISO/IEC 8859-4:1997 (E) Annex B (informative) Main differences between the First edition and this Second edition of this part of ISO/IEC 8859 B.4 A new Annex A has been added that identifies the coverage of languages by parts 1–10 of ISO/IEC 8859. B.5 Various editorial adjustments and clarifications have been made to the text of the standard. The hexadecimal equivalents of the bit combinations have been added to tables 1 and 2, and a revised font has been used for the graphic symbols in table 2. TE X T B.1 The names of the graphic characters have been amended where necessary to align them with the names of characters adopted for all standards on coded character sets developed under the responsibility of ISO/IEC JTC 1. For each character the short identifiers specified in ISO/IEC 10646-1 Amendment 9 have been added to table 1. B.2 The new style of conformance clause, adopted for all standards on coded character sets, has been introduced. B.6 Annex C, Bibliography, has been added. B.3 Object identifiers conforming to Abstract Syntax Notation One (ASN.1, see ISO/IEC 8824-1) are specified in 7.2 for the character set, and the corresponding coded representations, of this part of ISO/IEC 8859. 19 97 - 11 -1 1 FI N A L Registration numbers from the International register of coded character sets to be used with escape sequences, have been included as an additional method of identifying the coded character set of this part of ISO/IEC 8859. 9 ISO/IEC 8859-4:1997 (E) © ISO/IEC Annex C (informative) Bibliography ISO/IEC 6429:1992, ISO/IEC 10367:1991, 8-bit codes. Information technology – Control functions for coded character sets. Information technology – Standardized coded graphic character sets for use in TE X T ISO/IEC 10646-1:1993, Information technology – Universal Multiple-Octet Coded Character Set (UCS) – Part 1: Architecture and Basic Multilingual Plane. 19 97 - 11 -1 1 FI N A L ISO International register of coded character sets to be used with escape sequences. 10