1
2
3
4
5
6
7Internet Engineering Task Force (IETF) P. Faltstrom, Ed.
8Request for Comments: 5892 Cisco
9Category: Standards Track August 2010
10ISSN: 2070-1721
11
12
13 The Unicode Code Points and
14 Internationalized Domain Names for Applications (IDNA)
15
16Abstract
17
18 This document specifies rules for deciding whether a code point,
19 considered in isolation or in context, is a candidate for inclusion
20 in an Internationalized Domain Name (IDN).
21
22 It is part of the specification of Internationalizing Domain Names in
23 Applications 2008 (IDNA2008).
24
25Status of This Memo
26
27 This is an Internet Standards Track document.
28
29 This document is a product of the Internet Engineering Task Force
30 (IETF). It represents the consensus of the IETF community. It has
31 received public review and has been approved for publication by the
32 Internet Engineering Steering Group (IESG). Further information on
33 Internet Standards is available in Section 2 of RFC 5741.
34
35 Information about the current status of this document, any errata,
36 and how to provide feedback on it may be obtained at
37 http://www.rfc-editor.org/info/rfc5892.
38
39Copyright Notice
40
41 Copyright (c) 2010 IETF Trust and the persons identified as the
42 document authors. All rights reserved.
43
44 This document is subject to BCP 78 and the IETF Trust's Legal
45 Provisions Relating to IETF Documents
46 (http://trustee.ietf.org/license-info) in effect on the date of
47 publication of this document. Please review these documents
48 carefully, as they describe your rights and restrictions with respect
49 to this document. Code Components extracted from this document must
50 include Simplified BSD License text as described in Section 4.e of
51 the Trust Legal Provisions and are provided without warranty as
52 described in the Simplified BSD License.
53
54
55
56
57
58Faltstrom Standards Track [Page 1]
59
60RFC 5892 IDNA Code Points August 2010
61
62
63Table of Contents
64
65 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3
66 2. Category Definitions Used to Calculate Derived Property
67 Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
68 2.1. LetterDigits (A) . . . . . . . . . . . . . . . . . . . . . 5
69 2.2. Unstable (B) . . . . . . . . . . . . . . . . . . . . . . . 6
70 2.3. IgnorableProperties (C) . . . . . . . . . . . . . . . . . 6
71 2.4. IgnorableBlocks (D) . . . . . . . . . . . . . . . . . . . 7
72 2.5. LDH (E) . . . . . . . . . . . . . . . . . . . . . . . . . 7
73 2.6. Exceptions (F) . . . . . . . . . . . . . . . . . . . . . . 7
74 2.7. BackwardCompatible (G) . . . . . . . . . . . . . . . . . . 9
75 2.8. JoinControl (H) . . . . . . . . . . . . . . . . . . . . . 9
76 2.9. OldHangulJamo (I) . . . . . . . . . . . . . . . . . . . . 9
77 2.10. Unassigned (J) . . . . . . . . . . . . . . . . . . . . . . 9
78 3. Calculation of the Derived Property . . . . . . . . . . . . . 10
79 4. Code Points . . . . . . . . . . . . . . . . . . . . . . . . . 10
80 5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 11
81 5.1. IDNA-Derived Property Value Registry . . . . . . . . . . . 11
82 5.2. IDNA Context Registry . . . . . . . . . . . . . . . . . . 11
83 5.2.1. Template for Context Registry . . . . . . . . . . . . 11
84 6. Security Considerations . . . . . . . . . . . . . . . . . . . 12
85 7. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . 12
86 Appendix A. Contextual Rules Registry . . . . . . . . . . . . . 13
87 Appendix A.1. ZERO WIDTH NON-JOINER . . . . . . . . . . . . . . . 15
88 Appendix A.2. ZERO WIDTH JOINER . . . . . . . . . . . . . . . . . 16
89 Appendix A.3. MIDDLE DOT . . . . . . . . . . . . . . . . . . . . . 16
90 Appendix A.4. GREEK LOWER NUMERAL SIGN (KERAIA) . . . . . . . . . 17
91 Appendix A.5. HEBREW PUNCTUATION GERESH . . . . . . . . . . . . . 17
92 Appendix A.6. HEBREW PUNCTUATION GERSHAYIM . . . . . . . . . . . . 18
93 Appendix A.7. KATAKANA MIDDLE DOT . . . . . . . . . . . . . . . . 18
94 Appendix A.8. ARABIC-INDIC DIGITS . . . . . . . . . . . . . . . . 19
95 Appendix A.9. EXTENDED ARABIC-INDIC DIGITS . . . . . . . . . . . . 19
96 Appendix B. Code Points 0x0000 - 0x10FFFF . . . . . . . . . . . 20
97 Appendix B.1. Code Points in Unicode Character Database (UCD)
98 Format . . . . . . . . . . . . . . . . . . . . . . . 20
99 8. References . . . . . . . . . . . . . . . . . . . . . . . . . . 69
100 8.1. Normative References . . . . . . . . . . . . . . . . . . . 69
101 8.2. Informative References . . . . . . . . . . . . . . . . . . 69
102
103
104
105
106
107
108
109
110
111
112
113
114Faltstrom Standards Track [Page 2]
115
116RFC 5892 IDNA Code Points August 2010
117
118
1191. Introduction
120
121 RFC 4690 [RFC4690] suggests an inclusion-based approach for selecting
122 the code points from The Unicode Standard [Unicode52] that should be
123 included in the list of code points that may be used in
124 Internationalized Domain Names.
125
126 Specifically, RFC 4690 [RFC4690] says the following:
127
128 The IAB has concluded that there is a consensus within the broader
129 community that lists of code points should be specified by the use
130 of an inclusion-based mechanism (i.e., identifying the characters
131 that are permitted), rather than by excluding a small number of
132 characters from the total Unicode set as Stringprep [RFC3454] and
133 Nameprep [RFC3491] do today. That conclusion should be reviewed
134 by the IETF community and action taken as appropriate.
135
136 This document reviews and classifies the collections of code points
137 in the Unicode character set by examining various properties of the
138 code points. It then defines an algorithm for determining a derived
139 property value. It specifies a procedure, and not a table, of code
140 points so that the algorithm can be used to determine code point sets
141 independent of the version of Unicode that is in use.
142
143 This document is not intended to specify precisely how these property
144 values are to be applied in IDN labels. That information appears in
145 the Protocol document [RFC5891], but it is important to understand
146 that the assignment of a value of this property to a particular
147 character is not sufficient to determine whether it can be used in a
148 given label. In particular, some combinations of allowed code points
149 are not advisable for use in IDNs due to rules specific to a script
150 or class of characters. The requirement for such rules is linked to
151 the operations in the Protocol document and especially to the
152 characters designated as requiring contextual rules.
153
154 The value of the property is to be interpreted as follows.
155
156 o PROTOCOL VALID: Those that are allowed to be used in IDNs. Code
157 points with this property value are permitted for general use in
158 IDNs. However, that a label consists only of code points that
159 have this property value does not imply that the label can be used
160 in DNS. See the Protocol document for algorithms to make
161 decisions about labels in domain names. The abbreviated term
162 PVALID is used to refer to this value in the rest of this
163 document.
164
165
166
167
168
169
170Faltstrom Standards Track [Page 3]
171
172RFC 5892 IDNA Code Points August 2010
173
174
175 o CONTEXTUAL RULE REQUIRED: Some characteristics of the character,
176 such as it being invisible in certain contexts or problematic in
177 others, require that it not be used in labels unless specific
178 other characters or properties are present. The abbreviated term
179 CONTEXT is used to refer to this value in the rest of this
180 document. There are two subdivisions of CONTEXTUAL RULE REQUIRED,
181 one for Join_controls (called CONTEXTJ) and for other characters
182 (called CONTEXTO). These are discussed in more detail below and
183 in the Protocol document.
184
185 o DISALLOWED: Those that should clearly not be included in IDNs.
186 Code points with this property value are not permitted in IDNs.
187
188 o UNASSIGNED: Those code points that are not designated (i.e., are
189 unassigned) in the Unicode Standard.
190
191 The mechanisms described here allow determination of the value of the
192 property for future versions of Unicode (including characters added
193 after Unicode 5.2). Changes in Unicode properties that do not affect
194 the outcome of this process do not affect IDN. For example, a
195 character can have its Unicode General_Category value (see
196 [Unicode52]) change from So to Sm or from Lo to Ll, without affecting
197 the algorithm results. Moreover, even if such changes were the
198 result, the BackwardCompatible list (Section 2.7) can be adjusted to
199 ensure the stability of the results.
200
201 Some code points need to be allowed in exceptional circumstances but
202 should be excluded in all other cases; these rules are also described
203 in other documents. The most notable of these are the Join Control
204 characters, U+200D ZERO WIDTH JOINER and U+200C ZERO WIDTH
205 NON-JOINER. Both of them have the derived property value CONTEXTJ.
206 A character with the derived property value CONTEXTJ or CONTEXTO
207 (CONTEXTUAL RULE REQUIRED) is not to be used unless an appropriate
208 rule has been established and the context of the character is
209 consistent with that rule. It is invalid to either register a string
210 containing these characters or even to look one up unless such a
211 contextual rule is found and satisfied. Please see Appendix A, "The
212 Contextual Rules Registry", for more information.
213
214 This document is part of a series that, together, constitute a
215 proposal for updating the IDNA standards to resolve issues uncovered
216 in recent years, cover a broader range of scripts, and provide for
217 migration to newer versions of Unicode. See the Rationale document
218 [RFC5894] for a broader discussion.
219
220 The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
221 "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
222 document are to be interpreted as described in RFC 2119 [RFC2119].
223
224
225
226Faltstrom Standards Track [Page 4]
227
228RFC 5892 IDNA Code Points August 2010
229
230
2312. Category Definitions Used to Calculate Derived Property Value
232
233 The derived property obtains its value based on a two-step procedure.
234 First, characters are placed in one or more character categories
235 based on either core properties defined by the Unicode Standard or by
236 treating the code point as an exception and addressing the code point
237 by its code point value. These categories are not mutually
238 exclusive.
239
240 In the second step, set operations are used with these categories to
241 determine the values for an IDN-specific property. Those operations
242 are specified in Section 3.
243
244 Unicode property names and property value names may have short
245 abbreviations, such as gc for the General_Category property, and Ll
246 for the Lowercase_Letter property value of the gc property.
247
248 In the following specification of categories, the operation that
249 returns the value of a particular Unicode character property for a
250 code point is designated by using the formal name of that property
251 (from PropertyAliases.txt) followed by '(cp)'. For example, the
252 value of the General_Category property for a code point is indicated
253 by General_Category(cp).
254
2552.1. LetterDigits (A)
256
257 A: General_Category(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc}
258
259 These rules identify characters commonly used in mnemonics and often
260 informally described as "language characters". In general, only code
261 points assigned to this category are suitable for use in IDN.
262
263 For more information, see Section 4.5 of The Unicode Standard
264 [Unicode].
265
266 The categories used in this rule are:
267
268 o Ll - Lowercase_Letter
269
270 o Lu - Uppercase_Letter
271
272 o Lo - Other_Letter
273
274 o Nd - Decimal_Number
275
276 o Lm - Modifier_Letter
277
278
279
280
281
282Faltstrom Standards Track [Page 5]
283
284RFC 5892 IDNA Code Points August 2010
285
286
287 o Mn - Nonspacing_Mark
288
289 o Mc - Spacing_Mark
290
2912.2. Unstable (B)
292
293 B: toNFKC(toCaseFold(toNFKC(cp))) != cp
294
295 This category is used to group the characters that are not stable
296 under Normalization Form K (NFKC) and case folding. In general,
297 these code points are not suitable for use for IDN.
298
299 The toCaseFold() operation is defined in Section 3.13 of The Unicode
300 Standard [Unicode].
301
302 The toNFKC() operation returns the code point in normalization form
303 KC. For more information, see Section 5 of Unicode Standard Annex
304 #15 [TR15].
305
306 It should be noted that NFKC is used, although Normalization Form C
307 (NFC) is used in the "IDNA Protocol" document [RFC5891].
308
3092.3. IgnorableProperties (C)
310
311 C: Default_Ignorable_Code_Point(cp) = True or
312 White_Space(cp) = True or
313 Noncharacter_Code_Point(cp) = True
314
315 This category is used to group code points that are not recommended
316 for use in identifiers. In general, these code points are not
317 suitable for use in an IDN.
318
319 The definition for Default_Ignorable_Code_Point can be found in
320 DerivedCoreProperties.txt [DerivedCoreProperties] and is at the time
321 of Unicode 5.2:
322
323 Other_Default_Ignorable_Code_Point + Cf (Format characters)
324 + Variation_Selector - White_Space - FFF9..FFFB (Annotation
325 Characters) - 0600..0603, 06DD, 070F (exceptional Cf characters
326 that should be visible)
327
328
329
330
331
332
333
334
335
336
337
338Faltstrom Standards Track [Page 6]
339
340RFC 5892 IDNA Code Points August 2010
341
342
3432.4. IgnorableBlocks (D)
344
345 D: Block(cp) is in {Combining Diacritical Marks for Symbols,
346 Musical Symbols, Ancient Greek Musical Notation}
347
348 This category is used to identify code points that are not useful in
349 mnemonics or that are otherwise impractical for IDN use. In general,
350 these code points are not suitable for use for IDN.
351
352 The definition of blocks can be found in Blocks.txt [BlockNames].
353
3542.5. LDH (E)
355
356 E: cp is in {002D, 0030..0039, 0061..007A}
357
358 This category is used in the second step to preserve the traditional
359 "hostname" (LDH -- as described in the Definitions document
360 [RFC5890]) characters ('-', 0-9, and a-z). In general, these code
361 points are suitable for use for IDN. Note that there are other rules
362 regarding the code point U+002D HYPHEN-MINUS that are specified in
363 the IDNA Protocol Specification [RFC5891].
364
3652.6. Exceptions (F)
366
367 F: cp is in {00B7, 00DF, 0375, 03C2, 05F3, 05F4, 0640, 0660,
368 0661, 0662, 0663, 0664, 0665, 0666, 0667, 0668,
369 0669, 06F0, 06F1, 06F2, 06F3, 06F4, 06F5, 06F6,
370 06F7, 06F8, 06F9, 06FD, 06FE, 07FA, 0F0B, 3007,
371 302E, 302F, 3031, 3032, 3033, 3034, 3035, 303B,
372 30FB}
373
374 This category explicitly lists code points for which the category
375 cannot be assigned using only the core property values that exist in
376 the Unicode standard. The values are according to the table below:
377
378 PVALID -- Would otherwise have been DISALLOWED
379
380 00DF; PVALID # LATIN SMALL LETTER SHARP S
381 03C2; PVALID # GREEK SMALL LETTER FINAL SIGMA
382 06FD; PVALID # ARABIC SIGN SINDHI AMPERSAND
383 06FE; PVALID # ARABIC SIGN SINDHI POSTPOSITION MEN
384 0F0B; PVALID # TIBETAN MARK INTERSYLLABIC TSHEG
385 3007; PVALID # IDEOGRAPHIC NUMBER ZERO
386
387
388
389
390
391
392
393
394Faltstrom Standards Track [Page 7]
395
396RFC 5892 IDNA Code Points August 2010
397
398
399 CONTEXTO -- Would otherwise have been DISALLOWED
400
401 00B7; CONTEXTO # MIDDLE DOT
402 0375; CONTEXTO # GREEK LOWER NUMERAL SIGN (KERAIA)
403 05F3; CONTEXTO # HEBREW PUNCTUATION GERESH
404 05F4; CONTEXTO # HEBREW PUNCTUATION GERSHAYIM
405 30FB; CONTEXTO # KATAKANA MIDDLE DOT
406
407 CONTEXTO -- Would otherwise have been PVALID
408
409 0660; CONTEXTO # ARABIC-INDIC DIGIT ZERO
410 0661; CONTEXTO # ARABIC-INDIC DIGIT ONE
411 0662; CONTEXTO # ARABIC-INDIC DIGIT TWO
412 0663; CONTEXTO # ARABIC-INDIC DIGIT THREE
413 0664; CONTEXTO # ARABIC-INDIC DIGIT FOUR
414 0665; CONTEXTO # ARABIC-INDIC DIGIT FIVE
415 0666; CONTEXTO # ARABIC-INDIC DIGIT SIX
416 0667; CONTEXTO # ARABIC-INDIC DIGIT SEVEN
417 0668; CONTEXTO # ARABIC-INDIC DIGIT EIGHT
418 0669; CONTEXTO # ARABIC-INDIC DIGIT NINE
419 06F0; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ZERO
420 06F1; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ONE
421 06F2; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT TWO
422 06F3; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT THREE
423 06F4; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT FOUR
424 06F5; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT FIVE
425 06F6; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT SIX
426 06F7; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT SEVEN
427 06F8; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT EIGHT
428 06F9; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT NINE
429
430 DISALLOWED -- Would otherwise have been PVALID
431
432 0640; DISALLOWED # ARABIC TATWEEL
433 07FA; DISALLOWED # NKO LAJANYALAN
434 302E; DISALLOWED # HANGUL SINGLE DOT TONE MARK
435 302F; DISALLOWED # HANGUL DOUBLE DOT TONE MARK
436 3031; DISALLOWED # VERTICAL KANA REPEAT MARK
437 3032; DISALLOWED # VERTICAL KANA REPEAT WITH VOICED SOUND MARK
438 3033; DISALLOWED # VERTICAL KANA REPEAT MARK UPPER HALF
439 3034; DISALLOWED # VERTICAL KANA REPEAT WITH VOICED SOUND MARK UPPER HA
440 3035; DISALLOWED # VERTICAL KANA REPEAT MARK LOWER HALF
441 303B; DISALLOWED # VERTICAL IDEOGRAPHIC ITERATION MARK
442
443
444
445
446
447
448
449
450Faltstrom Standards Track [Page 8]
451
452RFC 5892 IDNA Code Points August 2010
453
454
4552.7. BackwardCompatible (G)
456
457 G: cp is in {}
458
459 This category includes the code points that property values in
460 versions of Unicode after 5.2 have changed in such a way that the
461 derived property value would no longer be PVALID or DISALLOWED. If
462 changes are made to future versions of Unicode so that code points
463 might change the property value from PVALID or DISALLOWED, then this
464 table can be updated and keep special exception values so that the
465 property values for code points stay stable.
466
4672.8. JoinControl (H)
468
469 H: Join_Control(cp) = True
470
471 This category consists of Join Control characters (i.e., they are not
472 in LetterDigits (Section 2.1) but are still required in IDN labels
473 under some circumstances).
474
4752.9. OldHangulJamo (I)
476
477 I: Hangul_Syllable_Type(cp) is in {L, V, T}
478
479 This category consists of all conjoining Hangul Jamo (Leading Jamo,
480 Vowel Jamo, and Trailing Jamo).
481
482 Elimination of conjoining Hangul Jamo from the set of PVALID
483 characters results in restricting the set of Korean PVALID characters
484 just to preformed, modern Hangul syllable characters. Old Hangul
485 syllables, which must be spelled with sequences of conjoining Hangul
486 Jamo, are not PVALID for IDNs.
487
4882.10. Unassigned (J)
489
490 J: General_Category(cp) is in {Cn} and
491 Noncharacter_Code_Point(cp) = False
492
493 This category consists of code points in the Unicode character set
494 that are not (yet) assigned. It should be noted that Unicode
495 distinguishes between "unassigned code points" and "unassigned
496 characters". The unassigned code points are all but (Cn -
497 Noncharacters), while the unassigned *characters* are all but (Cn +
498 Cs).
499
500
501
502
503
504
505
506Faltstrom Standards Track [Page 9]
507
508RFC 5892 IDNA Code Points August 2010
509
510
5113. Calculation of the Derived Property
512
513 As described above (Section 1) and in more detail in the IDNA
514 Protocol document [RFC5891], possible values of the IDN property are:
515
516 o PVALID
517
518 o CONTEXTJ
519
520 o CONTEXTO
521
522 o DISALLOWED
523
524 o UNASSIGNED
525
526 The algorithm to calculate the value of the derived property is as
527 follows. If the name of a rule (such as Exception) is used, that
528 implies the set of code points that the rule defines, while the same
529 name as a function call (such as Exception(cp)) implies the value cp
530 has in the Exceptions table.
531
532 If .cp. .in. Exceptions Then Exceptions(cp);
533 Else If .cp. .in. BackwardCompatible Then BackwardCompatible(cp);
534 Else If .cp. .in. Unassigned Then UNASSIGNED;
535 Else If .cp. .in. LDH Then PVALID;
536 Else If .cp. .in. JoinControl Then CONTEXTJ;
537 Else If .cp. .in. Unstable Then DISALLOWED;
538 Else If .cp. .in. IgnorableProperties Then DISALLOWED;
539 Else If .cp. .in. IgnorableBlocks Then DISALLOWED;
540 Else If .cp. .in. OldHangulJamo Then DISALLOWED;
541 Else If .cp. .in. LetterDigits Then PVALID;
542 Else DISALLOWED;
543
5444. Code Points
545
546 The categories and rules defined in Sections 2 and 3 apply to all
547 Unicode code points. The table in Appendix B shows, for illustrative
548 purposes, the consequences of the categories and classification
549 rules, and the resulting property values.
550
551 The list of code points that can be found in Appendix B is
552 non-normative. Sections 2 and 3 are normative.
553
554
555
556
557
558
559
560
561
562Faltstrom Standards Track [Page 10]
563
564RFC 5892 IDNA Code Points August 2010
565
566
5675. IANA Considerations
568
5695.1. IDNA-Derived Property Value Registry
570
571 IANA has created a registry with the derived properties for the
572 versions of Unicode released after (and including) version 5.2. The
573 derived property value is to be calculated in cooperation with a
574 designated expert [RFC5226] according to the specifications in
575 Sections 2 and 3 and not by copying the non-normative table found in
576 Appendix B.
577
578 If non-backward-compatible changes or other problems arise during the
579 creation or designated expert review of the table of derived property
580 values, they should be flagged for the IESG. Changes to the rules
581 (as specified in Sections 2 and 3), including BackwardCompatible
582 (Section 2.7) (a set that is at release of this document is empty)
583 require IETF Review, as described in RFC 5226 [RFC5226].
584
5855.2. IDNA Context Registry
586
587 For characters that are defined in the IDNA derived property value
588 registry (Section 5.1) as CONTEXTO or CONTEXTJ and that therefore
589 require a contextual rule, IANA has created and now maintains a list
590 of approved contextual rules. Additions or changes to these rules
591 require IETF Review, as described in [RFC5226].
592
593 Appendix A contains further discussion and a table from which that
594 registry can be initialized.
595
5965.2.1. Template for Context Registry
597
598 The following information is to be given when a new rule is created.
599
600 Name: Unique name of the rule
601
602 Code point: Rule that should be applied when this code point
603 exists in the label
604
605 Overview: Description in plain English on what the rule verifies
606
607 Lookup: Should the rule be applied at time of lookup?
608
609 Rule Set: The set of rules, with a reference to the defining
610 document.
611
612
613
614
615
616
617
618Faltstrom Standards Track [Page 11]
619
620RFC 5892 IDNA Code Points August 2010
621
622
6236. Security Considerations
624
625 Security Considerations for this version of IDNA, except for the
626 special issues associated with right-to-left scripts and characters,
627 are described in the Definitions document [RFC5890]. Specific issues
628 for labels containing characters associated with scripts written
629 right to left appear in the Bidi document [RFC5893].
630
6317. Acknowledgements
632
633 This document would not have been possible to produce without input
634 from many people. The main contributors are (in alphabetical order)
635 Harald Alvestrand, Vint Cerf, Tina Dam, Mark Davis, Gihan Dias,
636 Mouhammet Diop, Michael Everson, Asmus Freytag, Debbie Garside, Paul
637 Hoffman, Kent Karlsson, Cary Karp, Jaeyoun Kim, John Klensin, Olaf
638 Kolkman, Gervase Markham, Ram Mohan, Lisa Moore, Yngve Pettersen,
639 Erik van der Poel, Hualin Qian, Rick Reed, Pete Resnick, Lakmal
640 Silva, Michel Suignard, Andrew Sullivan, Wil Tan, Kenneth Whistler,
641 Chris Wright, and Yoshiro Yoneya.
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674Faltstrom Standards Track [Page 12]
675
676RFC 5892 IDNA Code Points August 2010
677
678
679Appendix A. Contextual Rules Registry
680
681 As discussed in Section 5.2 and in the IANA Considerations section of
682 the Rationale document [RFC5894], a registry of rules that define the
683 contexts in which particular PROTOCOL-VALID characters, characters
684 associated with a requirement for Contextual Information, are
685 permitted. These rules are expressed as tests on the label in which
686 the characters appear (all, or any part of, the label may be tested).
687
688 The grammatical rules are expressed in pseudo-code. The conventions
689 used for that pseudo-code are explained here.
690
691 Each rule is constructed as a Boolean expression that evaluates to
692 either True or False. A simple "True;" or "False;" rule sets the
693 default result value for the rule set. Subsequent conditional rules
694 that evaluate to True or False may re-set the result value.
695
696 A special value "Undefined" is used to deal with any error
697 conditions, such as an attempt to test a character before the start
698 of a label or after the end of a label. If any term of a rule
699 evaluates to Undefined, further evaluation of the rule immediately
700 terminates, as the result value of the rule will itself be Undefined.
701
702 cp represents the code point to be tested.
703
704 FirstChar is a special term that denotes the first code point in a
705 label.
706
707 LastChar is a special term that denotes the last code point in a
708 label.
709
710 .eq. represents the equality relation.
711
712 A .eq. B evaluates to True if A equals B.
713
714 .is. represents checking the position in a label.
715
716 A .is. B evaluates to True if A and B have same position in
717 the same label.
718
719 .ne. represents the non-equality relation.
720
721 A .ne. B evaluates to True if A is not equal to B.
722
723 .in. represents the set inclusion relation.
724
725 A .in. B evaluates to True if A is a member of the set B.
726
727
728
729
730Faltstrom Standards Track [Page 13]
731
732RFC 5892 IDNA Code Points August 2010
733
734
735 A functional notation, Function_Name(cp), is used to express either
736 string positions within a label, Boolean character property tests of
737 a code point, or a regular expression match. When such function
738 names refer to Boolean character property tests, the function names
739 use the exact Unicode character property name for the property in
740 question, and "cp" is evaluated as the Unicode value of the code
741 point to be tested, rather than as its position in the label. When
742 such function names refer to string positions within a label, "cp" is
743 evaluated as its position in the label.
744
745 RegExpMatch(X) takes as its parameter X a schematic regular
746 expression consisting of a mix of Unicode character property values
747 and literal Unicode code points.
748
749 Script(cp) returns the value of the Unicode Script property, as
750 defined in Scripts.txt in the Unicode Character Database.
751
752 Canonical_Combining_Class(cp) returns the value of the Unicode
753 Canonical_Combining_Class property, as defined in UnicodeData.txt in
754 the Unicode Character Database.
755
756 Before(cp) returns the code point of the character immediately
757 preceding cp in logical order in the string representing the label.
758 Before(FirstChar) evaluates to Undefined.
759
760 After(cp) returns the code point of the character immediately
761 following cp in logical order in the string representing the label.
762 After(LastChar) evaluates to Undefined.
763
764 Note that "Before" and "After" do not refer to the visual display
765 order of the character in a label, which may be reversed or otherwise
766 modified by the bidirectional algorithm for labels including
767 characters from scripts written right to left. Instead, "Before" and
768 "After" refer to the network order of the character in the label.
769
770 The clauses "Then True" and "Then False" imply exit from the
771 pseudo-code routine with the corresponding result.
772
773 Repeated evaluation for all characters in a label makes use of the
774 special construct:
775
776 For All Characters:
777
778 Expression;
779
780 End For;
781
782
783
784
785
786Faltstrom Standards Track [Page 14]
787
788RFC 5892 IDNA Code Points August 2010
789
790
791 This construct requires repeated evaluation of "Expression" for each
792 code point in the label, starting from FirstChar and proceeding to
793 LastChar.
794
795 The different fields in the rules are to be interpreted as follows:
796
797 Code point:
798 The code point, or code points, to which this rule is to be
799 applied. Normally, this implies that if any of the code points in
800 a label is as defined, then the rules should be applied. If
801 evaluated to True, the code point is OK as used; if evaluated to
802 False, it is not OK.
803
804 Overview:
805 A description of the goal with the rule, in plain English.
806
807 Lookup:
808 True if application of this rule is recommended at lookup time;
809 False otherwise.
810
811 Rule Set:
812 The rule set itself, as described above.
813
814Appendix A.1. ZERO WIDTH NON-JOINER
815
816 Code point:
817 U+200C
818
819 Overview:
820 This may occur in a formally cursive script (such as Arabic) in a
821 context where it breaks a cursive connection as required for
822 orthographic rules, as in the Persian language, for example. It
823 also may occur in Indic scripts in a consonant-conjunct context
824 (immediately following a virama), to control required display of
825 such conjuncts.
826
827 Lookup:
828 True
829
830 Rule Set:
831
832 False;
833
834 If Canonical_Combining_Class(Before(cp)) .eq. Virama Then True;
835
836 If RegExpMatch((Joining_Type:{L,D})(Joining_Type:T)*\u200C
837
838 (Joining_Type:T)*(Joining_Type:{R,D})) Then True;
839
840
841
842Faltstrom Standards Track [Page 15]
843
844RFC 5892 IDNA Code Points August 2010
845
846
847Appendix A.2. ZERO WIDTH JOINER
848
849 Code point:
850 U+200D
851
852 Overview:
853 This may occur in Indic scripts in a consonant-conjunct context
854 (immediately following a virama), to control required display of
855 such conjuncts.
856
857 Lookup:
858 True
859
860 Rule Set:
861
862 False;
863
864 If Canonical_Combining_Class(Before(cp)) .eq. Virama Then True;
865
866Appendix A.3. MIDDLE DOT
867
868 Code point:
869 U+00B7
870
871 Overview:
872 Between 'l' (U+006C) characters only, used to permit the Catalan
873 character ela geminada to be expressed.
874
875 Lookup:
876 False
877
878 Rule Set:
879
880 False;
881
882 If Before(cp) .eq. U+006C And
883
884 After(cp) .eq. U+006C Then True;
885
886
887
888
889
890
891
892
893
894
895
896
897
898Faltstrom Standards Track [Page 16]
899
900RFC 5892 IDNA Code Points August 2010
901
902
903Appendix A.4. GREEK LOWER NUMERAL SIGN (KERAIA)
904
905 Code point:
906 U+0375
907
908 Overview:
909 The script of the following character MUST be Greek.
910
911 Lookup:
912 False
913
914 Rule Set:
915
916 False;
917
918 If Script(After(cp)) .eq. Greek Then True;
919
920Appendix A.5. HEBREW PUNCTUATION GERESH
921
922 Code point:
923 U+05F3
924
925 Overview:
926 The script of the preceding character MUST be Hebrew.
927
928 Lookup:
929 False
930
931 Rule Set:
932
933 False;
934
935 If Script(Before(cp)) .eq. Hebrew Then True;
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954Faltstrom Standards Track [Page 17]
955
956RFC 5892 IDNA Code Points August 2010
957
958
959Appendix A.6. HEBREW PUNCTUATION GERSHAYIM
960
961 Code point:
962 U+05F4
963
964 Overview:
965 The script of the preceding character MUST be Hebrew.
966
967 Lookup:
968 False
969
970 Rule Set:
971
972 False;
973
974 If Script(Before(cp)) .eq. Hebrew Then True;
975
976Appendix A.7. KATAKANA MIDDLE DOT
977
978 Code point:
979 U+30FB
980
981 Overview:
982 Note that the Script of Katakana Middle Dot is not any of
983 "Hiragana", "Katakana", or "Han". The effect of this rule is to
984 require at least one character in the label to be in one of those
985 scripts.
986
987 Lookup:
988 False
989
990 Rule Set:
991
992 False;
993
994 For All Characters:
995
996 If Script(cp) .in. {Hiragana, Katakana, Han} Then True;
997
998 End For;
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010Faltstrom Standards Track [Page 18]
1011
1012RFC 5892 IDNA Code Points August 2010
1013
1014
1015Appendix A.8. ARABIC-INDIC DIGITS
1016
1017 Code point:
1018 0660..0669
1019
1020 Overview:
1021 Can not be mixed with Extended Arabic-Indic Digits.
1022
1023 Lookup:
1024 False
1025
1026 Rule Set:
1027
1028 True;
1029
1030 For All Characters:
1031
1032 If cp .in. 06F0..06F9 Then False;
1033
1034 End For;
1035
1036Appendix A.9. EXTENDED ARABIC-INDIC DIGITS
1037
1038 Code point:
1039 06F0..06F9
1040
1041 Overview:
1042 Can not be mixed with Arabic-Indic Digits.
1043
1044 Lookup:
1045 False
1046
1047 Rule Set:
1048
1049 True;
1050
1051 For All Characters:
1052
1053 If cp .in. 0660..0669 Then False;
1054
1055 End For;
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066Faltstrom Standards Track [Page 19]
1067
1068RFC 5892 IDNA Code Points August 2010
1069
1070
1071Appendix B. Code Points 0x0000 - 0x10FFFF
1072
1073 If one applies the rules (Section 3) to the code points 0x0000 to
1074 0x10FFFF to Unicode 5.2, the result is as follows.
1075
1076 This list is non-normative, and only included for illustrative
1077 purposes. Specifically, what is displayed in the third column is not
1078 the formal name of the code point (as defined in Section 4.8 of The
1079 Unicode Standard [Unicode52]). The differences exist, for example,
1080 for the code points that have the code point value as part of the
1081 name (for example, CJK UNIFIED IDEOGRAPH-4E00) and the naming of
1082 Hangul syllables. For many code points, what you see is the official
1083 name.
1084
1085Appendix B.1. Code Points in Unicode Character Database (UCD) Format
1086
10870000..002C ; DISALLOWED # <control>..COMMA
1088002D ; PVALID # HYPHEN-MINUS
1089002E..002F ; DISALLOWED # FULL STOP..SOLIDUS
10900030..0039 ; PVALID # DIGIT ZERO..DIGIT NINE
1091003A..0060 ; DISALLOWED # COLON..GRAVE ACCENT
10920061..007A ; PVALID # LATIN SMALL LETTER A..LATIN SMALL LETTER Z
1093007B..00B6 ; DISALLOWED # LEFT CURLY BRACKET..PILCROW SIGN
109400B7 ; CONTEXTO # MIDDLE DOT
109500B8..00DE ; DISALLOWED # CEDILLA..LATIN CAPITAL LETTER THORN
109600DF..00F6 ; PVALID # LATIN SMALL LETTER SHARP S..LATIN SMALL LETT
109700F7 ; DISALLOWED # DIVISION SIGN
109800F8..00FF ; PVALID # LATIN SMALL LETTER O WITH STROKE..LATIN SMAL
10990100 ; DISALLOWED # LATIN CAPITAL LETTER A WITH MACRON
11000101 ; PVALID # LATIN SMALL LETTER A WITH MACRON
11010102 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE
11020103 ; PVALID # LATIN SMALL LETTER A WITH BREVE
11030104 ; DISALLOWED # LATIN CAPITAL LETTER A WITH OGONEK
11040105 ; PVALID # LATIN SMALL LETTER A WITH OGONEK
11050106 ; DISALLOWED # LATIN CAPITAL LETTER C WITH ACUTE
11060107 ; PVALID # LATIN SMALL LETTER C WITH ACUTE
11070108 ; DISALLOWED # LATIN CAPITAL LETTER C WITH CIRCUMFLEX
11080109 ; PVALID # LATIN SMALL LETTER C WITH CIRCUMFLEX
1109010A ; DISALLOWED # LATIN CAPITAL LETTER C WITH DOT ABOVE
1110010B ; PVALID # LATIN SMALL LETTER C WITH DOT ABOVE
1111010C ; DISALLOWED # LATIN CAPITAL LETTER C WITH CARON
1112010D ; PVALID # LATIN SMALL LETTER C WITH CARON
1113010E ; DISALLOWED # LATIN CAPITAL LETTER D WITH CARON
1114010F ; PVALID # LATIN SMALL LETTER D WITH CARON
11150110 ; DISALLOWED # LATIN CAPITAL LETTER D WITH STROKE
11160111 ; PVALID # LATIN SMALL LETTER D WITH STROKE
11170112 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON
11180113 ; PVALID # LATIN SMALL LETTER E WITH MACRON
1119
1120
1121
1122Faltstrom Standards Track [Page 20]
1123
1124RFC 5892 IDNA Code Points August 2010
1125
1126
11270114 ; DISALLOWED # LATIN CAPITAL LETTER E WITH BREVE
11280115 ; PVALID # LATIN SMALL LETTER E WITH BREVE
11290116 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOT ABOVE
11300117 ; PVALID # LATIN SMALL LETTER E WITH DOT ABOVE
11310118 ; DISALLOWED # LATIN CAPITAL LETTER E WITH OGONEK
11320119 ; PVALID # LATIN SMALL LETTER E WITH OGONEK
1133011A ; DISALLOWED # LATIN CAPITAL LETTER E WITH CARON
1134011B ; PVALID # LATIN SMALL LETTER E WITH CARON
1135011C ; DISALLOWED # LATIN CAPITAL LETTER G WITH CIRCUMFLEX
1136011D ; PVALID # LATIN SMALL LETTER G WITH CIRCUMFLEX
1137011E ; DISALLOWED # LATIN CAPITAL LETTER G WITH BREVE
1138011F ; PVALID # LATIN SMALL LETTER G WITH BREVE
11390120 ; DISALLOWED # LATIN CAPITAL LETTER G WITH DOT ABOVE
11400121 ; PVALID # LATIN SMALL LETTER G WITH DOT ABOVE
11410122 ; DISALLOWED # LATIN CAPITAL LETTER G WITH CEDILLA
11420123 ; PVALID # LATIN SMALL LETTER G WITH CEDILLA
11430124 ; DISALLOWED # LATIN CAPITAL LETTER H WITH CIRCUMFLEX
11440125 ; PVALID # LATIN SMALL LETTER H WITH CIRCUMFLEX
11450126 ; DISALLOWED # LATIN CAPITAL LETTER H WITH STROKE
11460127 ; PVALID # LATIN SMALL LETTER H WITH STROKE
11470128 ; DISALLOWED # LATIN CAPITAL LETTER I WITH TILDE
11480129 ; PVALID # LATIN SMALL LETTER I WITH TILDE
1149012A ; DISALLOWED # LATIN CAPITAL LETTER I WITH MACRON
1150012B ; PVALID # LATIN SMALL LETTER I WITH MACRON
1151012C ; DISALLOWED # LATIN CAPITAL LETTER I WITH BREVE
1152012D ; PVALID # LATIN SMALL LETTER I WITH BREVE
1153012E ; DISALLOWED # LATIN CAPITAL LETTER I WITH OGONEK
1154012F ; PVALID # LATIN SMALL LETTER I WITH OGONEK
11550130 ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOT ABOVE
11560131 ; PVALID # LATIN SMALL LETTER DOTLESS I
11570132..0134 ; DISALLOWED # LATIN CAPITAL LIGATURE IJ..LATIN CAPITAL LET
11580135 ; PVALID # LATIN SMALL LETTER J WITH CIRCUMFLEX
11590136 ; DISALLOWED # LATIN CAPITAL LETTER K WITH CEDILLA
11600137..0138 ; PVALID # LATIN SMALL LETTER K WITH CEDILLA..LATIN SMA
11610139 ; DISALLOWED # LATIN CAPITAL LETTER L WITH ACUTE
1162013A ; PVALID # LATIN SMALL LETTER L WITH ACUTE
1163013B ; DISALLOWED # LATIN CAPITAL LETTER L WITH CEDILLA
1164013C ; PVALID # LATIN SMALL LETTER L WITH CEDILLA
1165013D ; DISALLOWED # LATIN CAPITAL LETTER L WITH CARON
1166013E ; PVALID # LATIN SMALL LETTER L WITH CARON
1167013F..0141 ; DISALLOWED # LATIN CAPITAL LETTER L WITH MIDDLE DOT..LATI
11680142 ; PVALID # LATIN SMALL LETTER L WITH STROKE
11690143 ; DISALLOWED # LATIN CAPITAL LETTER N WITH ACUTE
11700144 ; PVALID # LATIN SMALL LETTER N WITH ACUTE
11710145 ; DISALLOWED # LATIN CAPITAL LETTER N WITH CEDILLA
11720146 ; PVALID # LATIN SMALL LETTER N WITH CEDILLA
11730147 ; DISALLOWED # LATIN CAPITAL LETTER N WITH CARON
11740148 ; PVALID # LATIN SMALL LETTER N WITH CARON
1175
1176
1177
1178Faltstrom Standards Track [Page 21]
1179
1180RFC 5892 IDNA Code Points August 2010
1181
1182
11830149..014A ; DISALLOWED # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE.
1184014B ; PVALID # LATIN SMALL LETTER ENG
1185014C ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON
1186014D ; PVALID # LATIN SMALL LETTER O WITH MACRON
1187014E ; DISALLOWED # LATIN CAPITAL LETTER O WITH BREVE
1188014F ; PVALID # LATIN SMALL LETTER O WITH BREVE
11890150 ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
11900151 ; PVALID # LATIN SMALL LETTER O WITH DOUBLE ACUTE
11910152 ; DISALLOWED # LATIN CAPITAL LIGATURE OE
11920153 ; PVALID # LATIN SMALL LIGATURE OE
11930154 ; DISALLOWED # LATIN CAPITAL LETTER R WITH ACUTE
11940155 ; PVALID # LATIN SMALL LETTER R WITH ACUTE
11950156 ; DISALLOWED # LATIN CAPITAL LETTER R WITH CEDILLA
11960157 ; PVALID # LATIN SMALL LETTER R WITH CEDILLA
11970158 ; DISALLOWED # LATIN CAPITAL LETTER R WITH CARON
11980159 ; PVALID # LATIN SMALL LETTER R WITH CARON
1199015A ; DISALLOWED # LATIN CAPITAL LETTER S WITH ACUTE
1200015B ; PVALID # LATIN SMALL LETTER S WITH ACUTE
1201015C ; DISALLOWED # LATIN CAPITAL LETTER S WITH CIRCUMFLEX
1202015D ; PVALID # LATIN SMALL LETTER S WITH CIRCUMFLEX
1203015E ; DISALLOWED # LATIN CAPITAL LETTER S WITH CEDILLA
1204015F ; PVALID # LATIN SMALL LETTER S WITH CEDILLA
12050160 ; DISALLOWED # LATIN CAPITAL LETTER S WITH CARON
12060161 ; PVALID # LATIN SMALL LETTER S WITH CARON
12070162 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CEDILLA
12080163 ; PVALID # LATIN SMALL LETTER T WITH CEDILLA
12090164 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CARON
12100165 ; PVALID # LATIN SMALL LETTER T WITH CARON
12110166 ; DISALLOWED # LATIN CAPITAL LETTER T WITH STROKE
12120167 ; PVALID # LATIN SMALL LETTER T WITH STROKE
12130168 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE
12140169 ; PVALID # LATIN SMALL LETTER U WITH TILDE
1215016A ; DISALLOWED # LATIN CAPITAL LETTER U WITH MACRON
1216016B ; PVALID # LATIN SMALL LETTER U WITH MACRON
1217016C ; DISALLOWED # LATIN CAPITAL LETTER U WITH BREVE
1218016D ; PVALID # LATIN SMALL LETTER U WITH BREVE
1219016E ; DISALLOWED # LATIN CAPITAL LETTER U WITH RING ABOVE
1220016F ; PVALID # LATIN SMALL LETTER U WITH RING ABOVE
12210170 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
12220171 ; PVALID # LATIN SMALL LETTER U WITH DOUBLE ACUTE
12230172 ; DISALLOWED # LATIN CAPITAL LETTER U WITH OGONEK
12240173 ; PVALID # LATIN SMALL LETTER U WITH OGONEK
12250174 ; DISALLOWED # LATIN CAPITAL LETTER W WITH CIRCUMFLEX
12260175 ; PVALID # LATIN SMALL LETTER W WITH CIRCUMFLEX
12270176 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
12280177 ; PVALID # LATIN SMALL LETTER Y WITH CIRCUMFLEX
12290178..0179 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DIAERESIS..LATIN
1230017A ; PVALID # LATIN SMALL LETTER Z WITH ACUTE
1231
1232
1233
1234Faltstrom Standards Track [Page 22]
1235
1236RFC 5892 IDNA Code Points August 2010
1237
1238
1239017B ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DOT ABOVE
1240017C ; PVALID # LATIN SMALL LETTER Z WITH DOT ABOVE
1241017D ; DISALLOWED # LATIN CAPITAL LETTER Z WITH CARON
1242017E ; PVALID # LATIN SMALL LETTER Z WITH CARON
1243017F ; DISALLOWED # LATIN SMALL LETTER LONG S
12440180 ; PVALID # LATIN SMALL LETTER B WITH STROKE
12450181..0182 ; DISALLOWED # LATIN CAPITAL LETTER B WITH HOOK..LATIN CAPI
12460183 ; PVALID # LATIN SMALL LETTER B WITH TOPBAR
12470184 ; DISALLOWED # LATIN CAPITAL LETTER TONE SIX
12480185 ; PVALID # LATIN SMALL LETTER TONE SIX
12490186..0187 ; DISALLOWED # LATIN CAPITAL LETTER OPEN O..LATIN CAPITAL L
12500188 ; PVALID # LATIN SMALL LETTER C WITH HOOK
12510189..018B ; DISALLOWED # LATIN CAPITAL LETTER AFRICAN D..LATIN CAPITA
1252018C..018D ; PVALID # LATIN SMALL LETTER D WITH TOPBAR..LATIN SMAL
1253018E..0191 ; DISALLOWED # LATIN CAPITAL LETTER REVERSED E..LATIN CAPIT
12540192 ; PVALID # LATIN SMALL LETTER F WITH HOOK
12550193..0194 ; DISALLOWED # LATIN CAPITAL LETTER G WITH HOOK..LATIN CAPI
12560195 ; PVALID # LATIN SMALL LETTER HV
12570196..0198 ; DISALLOWED # LATIN CAPITAL LETTER IOTA..LATIN CAPITAL LET
12580199..019B ; PVALID # LATIN SMALL LETTER K WITH HOOK..LATIN SMALL
1259019C..019D ; DISALLOWED # LATIN CAPITAL LETTER TURNED M..LATIN CAPITAL
1260019E ; PVALID # LATIN SMALL LETTER N WITH LONG RIGHT LEG
1261019F..01A0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MIDDLE TILDE..LA
126201A1 ; PVALID # LATIN SMALL LETTER O WITH HORN
126301A2 ; DISALLOWED # LATIN CAPITAL LETTER OI
126401A3 ; PVALID # LATIN SMALL LETTER OI
126501A4 ; DISALLOWED # LATIN CAPITAL LETTER P WITH HOOK
126601A5 ; PVALID # LATIN SMALL LETTER P WITH HOOK
126701A6..01A7 ; DISALLOWED # LATIN LETTER YR..LATIN CAPITAL LETTER TONE T
126801A8 ; PVALID # LATIN SMALL LETTER TONE TWO
126901A9 ; DISALLOWED # LATIN CAPITAL LETTER ESH
127001AA..01AB ; PVALID # LATIN LETTER REVERSED ESH LOOP..LATIN SMALL
127101AC ; DISALLOWED # LATIN CAPITAL LETTER T WITH HOOK
127201AD ; PVALID # LATIN SMALL LETTER T WITH HOOK
127301AE..01AF ; DISALLOWED # LATIN CAPITAL LETTER T WITH RETROFLEX HOOK..
127401B0 ; PVALID # LATIN SMALL LETTER U WITH HORN
127501B1..01B3 ; DISALLOWED # LATIN CAPITAL LETTER UPSILON..LATIN CAPITAL
127601B4 ; PVALID # LATIN SMALL LETTER Y WITH HOOK
127701B5 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH STROKE
127801B6 ; PVALID # LATIN SMALL LETTER Z WITH STROKE
127901B7..01B8 ; DISALLOWED # LATIN CAPITAL LETTER EZH..LATIN CAPITAL LETT
128001B9..01BB ; PVALID # LATIN SMALL LETTER EZH REVERSED..LATIN LETTE
128101BC ; DISALLOWED # LATIN CAPITAL LETTER TONE FIVE
128201BD..01C3 ; PVALID # LATIN SMALL LETTER TONE FIVE..LATIN LETTER R
128301C4..01CD ; DISALLOWED # LATIN CAPITAL LETTER DZ WITH CARON..LATIN CA
128401CE ; PVALID # LATIN SMALL LETTER A WITH CARON
128501CF ; DISALLOWED # LATIN CAPITAL LETTER I WITH CARON
128601D0 ; PVALID # LATIN SMALL LETTER I WITH CARON
1287
1288
1289
1290Faltstrom Standards Track [Page 23]
1291
1292RFC 5892 IDNA Code Points August 2010
1293
1294
129501D1 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CARON
129601D2 ; PVALID # LATIN SMALL LETTER O WITH CARON
129701D3 ; DISALLOWED # LATIN CAPITAL LETTER U WITH CARON
129801D4 ; PVALID # LATIN SMALL LETTER U WITH CARON
129901D5 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND MA
130001D6 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND MACR
130101D7 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND AC
130201D8 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND ACUT
130301D9 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND CA
130401DA ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND CARO
130501DB ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND GR
130601DC..01DD ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND GRAV
130701DE ; DISALLOWED # LATIN CAPITAL LETTER A WITH DIAERESIS AND MA
130801DF ; PVALID # LATIN SMALL LETTER A WITH DIAERESIS AND MACR
130901E0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT ABOVE AND MA
131001E1 ; PVALID # LATIN SMALL LETTER A WITH DOT ABOVE AND MACR
131101E2 ; DISALLOWED # LATIN CAPITAL LETTER AE WITH MACRON
131201E3 ; PVALID # LATIN SMALL LETTER AE WITH MACRON
131301E4 ; DISALLOWED # LATIN CAPITAL LETTER G WITH STROKE
131401E5 ; PVALID # LATIN SMALL LETTER G WITH STROKE
131501E6 ; DISALLOWED # LATIN CAPITAL LETTER G WITH CARON
131601E7 ; PVALID # LATIN SMALL LETTER G WITH CARON
131701E8 ; DISALLOWED # LATIN CAPITAL LETTER K WITH CARON
131801E9 ; PVALID # LATIN SMALL LETTER K WITH CARON
131901EA ; DISALLOWED # LATIN CAPITAL LETTER O WITH OGONEK
132001EB ; PVALID # LATIN SMALL LETTER O WITH OGONEK
132101EC ; DISALLOWED # LATIN CAPITAL LETTER O WITH OGONEK AND MACRO
132201ED ; PVALID # LATIN SMALL LETTER O WITH OGONEK AND MACRON
132301EE ; DISALLOWED # LATIN CAPITAL LETTER EZH WITH CARON
132401EF..01F0 ; PVALID # LATIN SMALL LETTER EZH WITH CARON..LATIN SMA
132501F1..01F4 ; DISALLOWED # LATIN CAPITAL LETTER DZ..LATIN CAPITAL LETTE
132601F5 ; PVALID # LATIN SMALL LETTER G WITH ACUTE
132701F6..01F8 ; DISALLOWED # LATIN CAPITAL LETTER HWAIR..LATIN CAPITAL LE
132801F9 ; PVALID # LATIN SMALL LETTER N WITH GRAVE
132901FA ; DISALLOWED # LATIN CAPITAL LETTER A WITH RING ABOVE AND A
133001FB ; PVALID # LATIN SMALL LETTER A WITH RING ABOVE AND ACU
133101FC ; DISALLOWED # LATIN CAPITAL LETTER AE WITH ACUTE
133201FD ; PVALID # LATIN SMALL LETTER AE WITH ACUTE
133301FE ; DISALLOWED # LATIN CAPITAL LETTER O WITH STROKE AND ACUTE
133401FF ; PVALID # LATIN SMALL LETTER O WITH STROKE AND ACUTE
13350200 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOUBLE GRAVE
13360201 ; PVALID # LATIN SMALL LETTER A WITH DOUBLE GRAVE
13370202 ; DISALLOWED # LATIN CAPITAL LETTER A WITH INVERTED BREVE
13380203 ; PVALID # LATIN SMALL LETTER A WITH INVERTED BREVE
13390204 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOUBLE GRAVE
13400205 ; PVALID # LATIN SMALL LETTER E WITH DOUBLE GRAVE
13410206 ; DISALLOWED # LATIN CAPITAL LETTER E WITH INVERTED BREVE
13420207 ; PVALID # LATIN SMALL LETTER E WITH INVERTED BREVE
1343
1344
1345
1346Faltstrom Standards Track [Page 24]
1347
1348RFC 5892 IDNA Code Points August 2010
1349
1350
13510208 ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOUBLE GRAVE
13520209 ; PVALID # LATIN SMALL LETTER I WITH DOUBLE GRAVE
1353020A ; DISALLOWED # LATIN CAPITAL LETTER I WITH INVERTED BREVE
1354020B ; PVALID # LATIN SMALL LETTER I WITH INVERTED BREVE
1355020C ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOUBLE GRAVE
1356020D ; PVALID # LATIN SMALL LETTER O WITH DOUBLE GRAVE
1357020E ; DISALLOWED # LATIN CAPITAL LETTER O WITH INVERTED BREVE
1358020F ; PVALID # LATIN SMALL LETTER O WITH INVERTED BREVE
13590210 ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOUBLE GRAVE
13600211 ; PVALID # LATIN SMALL LETTER R WITH DOUBLE GRAVE
13610212 ; DISALLOWED # LATIN CAPITAL LETTER R WITH INVERTED BREVE
13620213 ; PVALID # LATIN SMALL LETTER R WITH INVERTED BREVE
13630214 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOUBLE GRAVE
13640215 ; PVALID # LATIN SMALL LETTER U WITH DOUBLE GRAVE
13650216 ; DISALLOWED # LATIN CAPITAL LETTER U WITH INVERTED BREVE
13660217 ; PVALID # LATIN SMALL LETTER U WITH INVERTED BREVE
13670218 ; DISALLOWED # LATIN CAPITAL LETTER S WITH COMMA BELOW
13680219 ; PVALID # LATIN SMALL LETTER S WITH COMMA BELOW
1369021A ; DISALLOWED # LATIN CAPITAL LETTER T WITH COMMA BELOW
1370021B ; PVALID # LATIN SMALL LETTER T WITH COMMA BELOW
1371021C ; DISALLOWED # LATIN CAPITAL LETTER YOGH
1372021D ; PVALID # LATIN SMALL LETTER YOGH
1373021E ; DISALLOWED # LATIN CAPITAL LETTER H WITH CARON
1374021F ; PVALID # LATIN SMALL LETTER H WITH CARON
13750220 ; DISALLOWED # LATIN CAPITAL LETTER N WITH LONG RIGHT LEG
13760221 ; PVALID # LATIN SMALL LETTER D WITH CURL
13770222 ; DISALLOWED # LATIN CAPITAL LETTER OU
13780223 ; PVALID # LATIN SMALL LETTER OU
13790224 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH HOOK
13800225 ; PVALID # LATIN SMALL LETTER Z WITH HOOK
13810226 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT ABOVE
13820227 ; PVALID # LATIN SMALL LETTER A WITH DOT ABOVE
13830228 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CEDILLA
13840229 ; PVALID # LATIN SMALL LETTER E WITH CEDILLA
1385022A ; DISALLOWED # LATIN CAPITAL LETTER O WITH DIAERESIS AND MA
1386022B ; PVALID # LATIN SMALL LETTER O WITH DIAERESIS AND MACR
1387022C ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND MACRON
1388022D ; PVALID # LATIN SMALL LETTER O WITH TILDE AND MACRON
1389022E ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT ABOVE
1390022F ; PVALID # LATIN SMALL LETTER O WITH DOT ABOVE
13910230 ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT ABOVE AND MA
13920231 ; PVALID # LATIN SMALL LETTER O WITH DOT ABOVE AND MACR
13930232 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH MACRON
13940233..0239 ; PVALID # LATIN SMALL LETTER Y WITH MACRON..LATIN SMAL
1395023A..023B ; DISALLOWED # LATIN CAPITAL LETTER A WITH STROKE..LATIN CA
1396023C ; PVALID # LATIN SMALL LETTER C WITH STROKE
1397023D..023E ; DISALLOWED # LATIN CAPITAL LETTER L WITH BAR..LATIN CAPIT
1398023F..0240 ; PVALID # LATIN SMALL LETTER S WITH SWASH TAIL..LATIN
1399
1400
1401
1402Faltstrom Standards Track [Page 25]
1403
1404RFC 5892 IDNA Code Points August 2010
1405
1406
14070241 ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL STOP
14080242 ; PVALID # LATIN SMALL LETTER GLOTTAL STOP
14090243..0246 ; DISALLOWED # LATIN CAPITAL LETTER B WITH STROKE..LATIN CA
14100247 ; PVALID # LATIN SMALL LETTER E WITH STROKE
14110248 ; DISALLOWED # LATIN CAPITAL LETTER J WITH STROKE
14120249 ; PVALID # LATIN SMALL LETTER J WITH STROKE
1413024A ; DISALLOWED # LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL
1414024B ; PVALID # LATIN SMALL LETTER Q WITH HOOK TAIL
1415024C ; DISALLOWED # LATIN CAPITAL LETTER R WITH STROKE
1416024D ; PVALID # LATIN SMALL LETTER R WITH STROKE
1417024E ; DISALLOWED # LATIN CAPITAL LETTER Y WITH STROKE
1418024F..02AF ; PVALID # LATIN SMALL LETTER Y WITH STROKE..LATIN SMAL
141902B0..02B8 ; DISALLOWED # MODIFIER LETTER SMALL H..MODIFIER LETTER SMA
142002B9..02C1 ; PVALID # MODIFIER LETTER PRIME..MODIFIER LETTER REVER
142102C2..02C5 ; DISALLOWED # MODIFIER LETTER LEFT ARROWHEAD..MODIFIER LET
142202C6..02D1 ; PVALID # MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER
142302D2..02EB ; DISALLOWED # MODIFIER LETTER CENTRED RIGHT HALF RING..MOD
142402EC ; PVALID # MODIFIER LETTER VOICING
142502ED ; DISALLOWED # MODIFIER LETTER UNASPIRATED
142602EE ; PVALID # MODIFIER LETTER DOUBLE APOSTROPHE
142702EF..02FF ; DISALLOWED # MODIFIER LETTER LOW DOWN ARROWHEAD..MODIFIER
14280300..033F ; PVALID # COMBINING GRAVE ACCENT..COMBINING DOUBLE OVE
14290340..0341 ; DISALLOWED # COMBINING GRAVE TONE MARK..COMBINING ACUTE T
14300342 ; PVALID # COMBINING GREEK PERISPOMENI
14310343..0345 ; DISALLOWED # COMBINING GREEK KORONIS..COMBINING GREEK YPO
14320346..034E ; PVALID # COMBINING BRIDGE ABOVE..COMBINING UPWARDS AR
1433034F ; DISALLOWED # COMBINING GRAPHEME JOINER
14340350..036F ; PVALID # COMBINING RIGHT ARROWHEAD ABOVE..COMBINING L
14350370 ; DISALLOWED # GREEK CAPITAL LETTER HETA
14360371 ; PVALID # GREEK SMALL LETTER HETA
14370372 ; DISALLOWED # GREEK CAPITAL LETTER ARCHAIC SAMPI
14380373 ; PVALID # GREEK SMALL LETTER ARCHAIC SAMPI
14390374 ; DISALLOWED # GREEK NUMERAL SIGN
14400375 ; CONTEXTO # GREEK LOWER NUMERAL SIGN
14410376 ; DISALLOWED # GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA
14420377 ; PVALID # GREEK SMALL LETTER PAMPHYLIAN DIGAMMA
14430378..0379 ; UNASSIGNED # <reserved>..<reserved>
1444037A ; DISALLOWED # GREEK YPOGEGRAMMENI
1445037B..037D ; PVALID # GREEK SMALL REVERSED LUNATE SIGMA SYMBOL..GR
1446037E ; DISALLOWED # GREEK QUESTION MARK
1447037F..0383 ; UNASSIGNED # <reserved>..<reserved>
14480384..038A ; DISALLOWED # GREEK TONOS..GREEK CAPITAL LETTER IOTA WITH
1449038B ; UNASSIGNED # <reserved>
1450038C ; DISALLOWED # GREEK CAPITAL LETTER OMICRON WITH TONOS
1451038D ; UNASSIGNED # <reserved>
1452038E..038F ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH TONOS..GRE
14530390 ; PVALID # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND T
14540391..03A1 ; DISALLOWED # GREEK CAPITAL LETTER ALPHA..GREEK CAPITAL LE
1455
1456
1457
1458Faltstrom Standards Track [Page 26]
1459
1460RFC 5892 IDNA Code Points August 2010
1461
1462
146303A2 ; UNASSIGNED # <reserved>
146403A3..03AB ; DISALLOWED # GREEK CAPITAL LETTER SIGMA..GREEK CAPITAL LE
146503AC..03CE ; PVALID # GREEK SMALL LETTER ALPHA WITH TONOS..GREEK S
146603CF..03D6 ; DISALLOWED # GREEK CAPITAL KAI SYMBOL..GREEK PI SYMBOL
146703D7 ; PVALID # GREEK KAI SYMBOL
146803D8 ; DISALLOWED # GREEK LETTER ARCHAIC KOPPA
146903D9 ; PVALID # GREEK SMALL LETTER ARCHAIC KOPPA
147003DA ; DISALLOWED # GREEK LETTER STIGMA
147103DB ; PVALID # GREEK SMALL LETTER STIGMA
147203DC ; DISALLOWED # GREEK LETTER DIGAMMA
147303DD ; PVALID # GREEK SMALL LETTER DIGAMMA
147403DE ; DISALLOWED # GREEK LETTER KOPPA
147503DF ; PVALID # GREEK SMALL LETTER KOPPA
147603E0 ; DISALLOWED # GREEK LETTER SAMPI
147703E1 ; PVALID # GREEK SMALL LETTER SAMPI
147803E2 ; DISALLOWED # COPTIC CAPITAL LETTER SHEI
147903E3 ; PVALID # COPTIC SMALL LETTER SHEI
148003E4 ; DISALLOWED # COPTIC CAPITAL LETTER FEI
148103E5 ; PVALID # COPTIC SMALL LETTER FEI
148203E6 ; DISALLOWED # COPTIC CAPITAL LETTER KHEI
148303E7 ; PVALID # COPTIC SMALL LETTER KHEI
148403E8 ; DISALLOWED # COPTIC CAPITAL LETTER HORI
148503E9 ; PVALID # COPTIC SMALL LETTER HORI
148603EA ; DISALLOWED # COPTIC CAPITAL LETTER GANGIA
148703EB ; PVALID # COPTIC SMALL LETTER GANGIA
148803EC ; DISALLOWED # COPTIC CAPITAL LETTER SHIMA
148903ED ; PVALID # COPTIC SMALL LETTER SHIMA
149003EE ; DISALLOWED # COPTIC CAPITAL LETTER DEI
149103EF ; PVALID # COPTIC SMALL LETTER DEI
149203F0..03F2 ; DISALLOWED # GREEK KAPPA SYMBOL..GREEK LUNATE SIGMA SYMBO
149303F3 ; PVALID # GREEK LETTER YOT
149403F4..03F7 ; DISALLOWED # GREEK CAPITAL THETA SYMBOL..GREEK CAPITAL LE
149503F8 ; PVALID # GREEK SMALL LETTER SHO
149603F9..03FA ; DISALLOWED # GREEK CAPITAL LUNATE SIGMA SYMBOL..GREEK CAP
149703FB..03FC ; PVALID # GREEK SMALL LETTER SAN..GREEK RHO WITH STROK
149803FD..042F ; DISALLOWED # GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL..
14990430..045F ; PVALID # CYRILLIC SMALL LETTER A..CYRILLIC SMALL LETT
15000460 ; DISALLOWED # CYRILLIC CAPITAL LETTER OMEGA
15010461 ; PVALID # CYRILLIC SMALL LETTER OMEGA
15020462 ; DISALLOWED # CYRILLIC CAPITAL LETTER YAT
15030463 ; PVALID # CYRILLIC SMALL LETTER YAT
15040464 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED E
15050465 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED E
15060466 ; DISALLOWED # CYRILLIC CAPITAL LETTER LITTLE YUS
15070467 ; PVALID # CYRILLIC SMALL LETTER LITTLE YUS
15080468 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED LITTLE YUS
15090469 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED LITTLE YUS
1510046A ; DISALLOWED # CYRILLIC CAPITAL LETTER BIG YUS
1511
1512
1513
1514Faltstrom Standards Track [Page 27]
1515
1516RFC 5892 IDNA Code Points August 2010
1517
1518
1519046B ; PVALID # CYRILLIC SMALL LETTER BIG YUS
1520046C ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED BIG YUS
1521046D ; PVALID # CYRILLIC SMALL LETTER IOTIFIED BIG YUS
1522046E ; DISALLOWED # CYRILLIC CAPITAL LETTER KSI
1523046F ; PVALID # CYRILLIC SMALL LETTER KSI
15240470 ; DISALLOWED # CYRILLIC CAPITAL LETTER PSI
15250471 ; PVALID # CYRILLIC SMALL LETTER PSI
15260472 ; DISALLOWED # CYRILLIC CAPITAL LETTER FITA
15270473 ; PVALID # CYRILLIC SMALL LETTER FITA
15280474 ; DISALLOWED # CYRILLIC CAPITAL LETTER IZHITSA
15290475 ; PVALID # CYRILLIC SMALL LETTER IZHITSA
15300476 ; DISALLOWED # CYRILLIC CAPITAL LETTER IZHITSA WITH DOUBLE
15310477 ; PVALID # CYRILLIC SMALL LETTER IZHITSA WITH DOUBLE GR
15320478 ; DISALLOWED # CYRILLIC CAPITAL LETTER UK
15330479 ; PVALID # CYRILLIC SMALL LETTER UK
1534047A ; DISALLOWED # CYRILLIC CAPITAL LETTER ROUND OMEGA
1535047B ; PVALID # CYRILLIC SMALL LETTER ROUND OMEGA
1536047C ; DISALLOWED # CYRILLIC CAPITAL LETTER OMEGA WITH TITLO
1537047D ; PVALID # CYRILLIC SMALL LETTER OMEGA WITH TITLO
1538047E ; DISALLOWED # CYRILLIC CAPITAL LETTER OT
1539047F ; PVALID # CYRILLIC SMALL LETTER OT
15400480 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOPPA
15410481 ; PVALID # CYRILLIC SMALL LETTER KOPPA
15420482 ; DISALLOWED # CYRILLIC THOUSANDS SIGN
15430483..0487 ; PVALID # COMBINING CYRILLIC TITLO..COMBINING CYRILLIC
15440488..048A ; DISALLOWED # COMBINING CYRILLIC HUNDRED THOUSANDS SIGN..C
1545048B ; PVALID # CYRILLIC SMALL LETTER SHORT I WITH TAIL
1546048C ; DISALLOWED # CYRILLIC CAPITAL LETTER SEMISOFT SIGN
1547048D ; PVALID # CYRILLIC SMALL LETTER SEMISOFT SIGN
1548048E ; DISALLOWED # CYRILLIC CAPITAL LETTER ER WITH TICK
1549048F ; PVALID # CYRILLIC SMALL LETTER ER WITH TICK
15500490 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH UPTURN
15510491 ; PVALID # CYRILLIC SMALL LETTER GHE WITH UPTURN
15520492 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH STROKE
15530493 ; PVALID # CYRILLIC SMALL LETTER GHE WITH STROKE
15540494 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK
15550495 ; PVALID # CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK
15560496 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER
15570497 ; PVALID # CYRILLIC SMALL LETTER ZHE WITH DESCENDER
15580498 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZE WITH DESCENDER
15590499 ; PVALID # CYRILLIC SMALL LETTER ZE WITH DESCENDER
1560049A ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH DESCENDER
1561049B ; PVALID # CYRILLIC SMALL LETTER KA WITH DESCENDER
1562049C ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH VERTICAL STR
1563049D ; PVALID # CYRILLIC SMALL LETTER KA WITH VERTICAL STROK
1564049E ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH STROKE
1565049F ; PVALID # CYRILLIC SMALL LETTER KA WITH STROKE
156604A0 ; DISALLOWED # CYRILLIC CAPITAL LETTER BASHKIR KA
1567
1568
1569
1570Faltstrom Standards Track [Page 28]
1571
1572RFC 5892 IDNA Code Points August 2010
1573
1574
157504A1 ; PVALID # CYRILLIC SMALL LETTER BASHKIR KA
157604A2 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH DESCENDER
157704A3 ; PVALID # CYRILLIC SMALL LETTER EN WITH DESCENDER
157804A4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE EN GHE
157904A5 ; PVALID # CYRILLIC SMALL LIGATURE EN GHE
158004A6 ; DISALLOWED # CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK
158104A7 ; PVALID # CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK
158204A8 ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN HA
158304A9 ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN HA
158404AA ; DISALLOWED # CYRILLIC CAPITAL LETTER ES WITH DESCENDER
158504AB ; PVALID # CYRILLIC SMALL LETTER ES WITH DESCENDER
158604AC ; DISALLOWED # CYRILLIC CAPITAL LETTER TE WITH DESCENDER
158704AD ; PVALID # CYRILLIC SMALL LETTER TE WITH DESCENDER
158804AE ; DISALLOWED # CYRILLIC CAPITAL LETTER STRAIGHT U
158904AF ; PVALID # CYRILLIC SMALL LETTER STRAIGHT U
159004B0 ; DISALLOWED # CYRILLIC CAPITAL LETTER STRAIGHT U WITH STRO
159104B1 ; PVALID # CYRILLIC SMALL LETTER STRAIGHT U WITH STROKE
159204B2 ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH DESCENDER
159304B3 ; PVALID # CYRILLIC SMALL LETTER HA WITH DESCENDER
159404B4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE TE TSE
159504B5 ; PVALID # CYRILLIC SMALL LIGATURE TE TSE
159604B6 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH DESCENDER
159704B7 ; PVALID # CYRILLIC SMALL LETTER CHE WITH DESCENDER
159804B8 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH VERTICAL ST
159904B9 ; PVALID # CYRILLIC SMALL LETTER CHE WITH VERTICAL STRO
160004BA ; DISALLOWED # CYRILLIC CAPITAL LETTER SHHA
160104BB ; PVALID # CYRILLIC SMALL LETTER SHHA
160204BC ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN CHE
160304BD ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN CHE
160404BE ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN CHE WITH D
160504BF ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN CHE WITH DES
160604C0..04C1 ; DISALLOWED # CYRILLIC LETTER PALOCHKA..CYRILLIC CAPITAL L
160704C2 ; PVALID # CYRILLIC SMALL LETTER ZHE WITH BREVE
160804C3 ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH HOOK
160904C4 ; PVALID # CYRILLIC SMALL LETTER KA WITH HOOK
161004C5 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH TAIL
161104C6 ; PVALID # CYRILLIC SMALL LETTER EL WITH TAIL
161204C7 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH HOOK
161304C8 ; PVALID # CYRILLIC SMALL LETTER EN WITH HOOK
161404C9 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH TAIL
161504CA ; PVALID # CYRILLIC SMALL LETTER EN WITH TAIL
161604CB ; DISALLOWED # CYRILLIC CAPITAL LETTER KHAKASSIAN CHE
161704CC ; PVALID # CYRILLIC SMALL LETTER KHAKASSIAN CHE
161804CD ; DISALLOWED # CYRILLIC CAPITAL LETTER EM WITH TAIL
161904CE..04CF ; PVALID # CYRILLIC SMALL LETTER EM WITH TAIL..CYRILLIC
162004D0 ; DISALLOWED # CYRILLIC CAPITAL LETTER A WITH BREVE
162104D1 ; PVALID # CYRILLIC SMALL LETTER A WITH BREVE
162204D2 ; DISALLOWED # CYRILLIC CAPITAL LETTER A WITH DIAERESIS
1623
1624
1625
1626Faltstrom Standards Track [Page 29]
1627
1628RFC 5892 IDNA Code Points August 2010
1629
1630
163104D3 ; PVALID # CYRILLIC SMALL LETTER A WITH DIAERESIS
163204D4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE A IE
163304D5 ; PVALID # CYRILLIC SMALL LIGATURE A IE
163404D6 ; DISALLOWED # CYRILLIC CAPITAL LETTER IE WITH BREVE
163504D7 ; PVALID # CYRILLIC SMALL LETTER IE WITH BREVE
163604D8 ; DISALLOWED # CYRILLIC CAPITAL LETTER SCHWA
163704D9 ; PVALID # CYRILLIC SMALL LETTER SCHWA
163804DA ; DISALLOWED # CYRILLIC CAPITAL LETTER SCHWA WITH DIAERESIS
163904DB ; PVALID # CYRILLIC SMALL LETTER SCHWA WITH DIAERESIS
164004DC ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHE WITH DIAERESIS
164104DD ; PVALID # CYRILLIC SMALL LETTER ZHE WITH DIAERESIS
164204DE ; DISALLOWED # CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS
164304DF ; PVALID # CYRILLIC SMALL LETTER ZE WITH DIAERESIS
164404E0 ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN DZE
164504E1 ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN DZE
164604E2 ; DISALLOWED # CYRILLIC CAPITAL LETTER I WITH MACRON
164704E3 ; PVALID # CYRILLIC SMALL LETTER I WITH MACRON
164804E4 ; DISALLOWED # CYRILLIC CAPITAL LETTER I WITH DIAERESIS
164904E5 ; PVALID # CYRILLIC SMALL LETTER I WITH DIAERESIS
165004E6 ; DISALLOWED # CYRILLIC CAPITAL LETTER O WITH DIAERESIS
165104E7 ; PVALID # CYRILLIC SMALL LETTER O WITH DIAERESIS
165204E8 ; DISALLOWED # CYRILLIC CAPITAL LETTER BARRED O
165304E9 ; PVALID # CYRILLIC SMALL LETTER BARRED O
165404EA ; DISALLOWED # CYRILLIC CAPITAL LETTER BARRED O WITH DIAERE
165504EB ; PVALID # CYRILLIC SMALL LETTER BARRED O WITH DIAERESI
165604EC ; DISALLOWED # CYRILLIC CAPITAL LETTER E WITH DIAERESIS
165704ED ; PVALID # CYRILLIC SMALL LETTER E WITH DIAERESIS
165804EE ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH MACRON
165904EF ; PVALID # CYRILLIC SMALL LETTER U WITH MACRON
166004F0 ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH DIAERESIS
166104F1 ; PVALID # CYRILLIC SMALL LETTER U WITH DIAERESIS
166204F2 ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH DOUBLE ACUTE
166304F3 ; PVALID # CYRILLIC SMALL LETTER U WITH DOUBLE ACUTE
166404F4 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH DIAERESIS
166504F5 ; PVALID # CYRILLIC SMALL LETTER CHE WITH DIAERESIS
166604F6 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH DESCENDER
166704F7 ; PVALID # CYRILLIC SMALL LETTER GHE WITH DESCENDER
166804F8 ; DISALLOWED # CYRILLIC CAPITAL LETTER YERU WITH DIAERESIS
166904F9 ; PVALID # CYRILLIC SMALL LETTER YERU WITH DIAERESIS
167004FA ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH STROKE AND
167104FB ; PVALID # CYRILLIC SMALL LETTER GHE WITH STROKE AND HO
167204FC ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH HOOK
167304FD ; PVALID # CYRILLIC SMALL LETTER HA WITH HOOK
167404FE ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH STROKE
167504FF ; PVALID # CYRILLIC SMALL LETTER HA WITH STROKE
16760500 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DE
16770501 ; PVALID # CYRILLIC SMALL LETTER KOMI DE
16780502 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DJE
1679
1680
1681
1682Faltstrom Standards Track [Page 30]
1683
1684RFC 5892 IDNA Code Points August 2010
1685
1686
16870503 ; PVALID # CYRILLIC SMALL LETTER KOMI DJE
16880504 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI ZJE
16890505 ; PVALID # CYRILLIC SMALL LETTER KOMI ZJE
16900506 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DZJE
16910507 ; PVALID # CYRILLIC SMALL LETTER KOMI DZJE
16920508 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI LJE
16930509 ; PVALID # CYRILLIC SMALL LETTER KOMI LJE
1694050A ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI NJE
1695050B ; PVALID # CYRILLIC SMALL LETTER KOMI NJE
1696050C ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI SJE
1697050D ; PVALID # CYRILLIC SMALL LETTER KOMI SJE
1698050E ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI TJE
1699050F ; PVALID # CYRILLIC SMALL LETTER KOMI TJE
17000510 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED ZE
17010511 ; PVALID # CYRILLIC SMALL LETTER REVERSED ZE
17020512 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH HOOK
17030513 ; PVALID # CYRILLIC SMALL LETTER EL WITH HOOK
17040514 ; DISALLOWED # CYRILLIC CAPITAL LETTER LHA
17050515 ; PVALID # CYRILLIC SMALL LETTER LHA
17060516 ; DISALLOWED # CYRILLIC CAPITAL LETTER RHA
17070517 ; PVALID # CYRILLIC SMALL LETTER RHA
17080518 ; DISALLOWED # CYRILLIC CAPITAL LETTER YAE
17090519 ; PVALID # CYRILLIC SMALL LETTER YAE
1710051A ; DISALLOWED # CYRILLIC CAPITAL LETTER QA
1711051B ; PVALID # CYRILLIC SMALL LETTER QA
1712051C ; DISALLOWED # CYRILLIC CAPITAL LETTER WE
1713051D ; PVALID # CYRILLIC SMALL LETTER WE
1714051E ; DISALLOWED # CYRILLIC CAPITAL LETTER ALEUT KA
1715051F ; PVALID # CYRILLIC SMALL LETTER ALEUT KA
17160520 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK
17170521 ; PVALID # CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK
17180522 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK
17190523 ; PVALID # CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK
17200524 ; DISALLOWED # CYRILLIC CAPITAL LETTER PE WITH DESCENDER
17210525 ; PVALID # CYRILLIC SMALL LETTER PE WITH DESCENDER
17220526..0530 ; UNASSIGNED # <reserved>..<reserved>
17230531..0556 ; DISALLOWED # ARMENIAN CAPITAL LETTER AYB..ARMENIAN CAPITA
17240557..0558 ; UNASSIGNED # <reserved>..<reserved>
17250559 ; PVALID # ARMENIAN MODIFIER LETTER LEFT HALF RING
1726055A..055F ; DISALLOWED # ARMENIAN APOSTROPHE..ARMENIAN ABBREVIATION M
17270560 ; UNASSIGNED # <reserved>
17280561..0586 ; PVALID # ARMENIAN SMALL LETTER AYB..ARMENIAN SMALL LE
17290587 ; DISALLOWED # ARMENIAN SMALL LIGATURE ECH YIWN
17300588 ; UNASSIGNED # <reserved>
17310589..058A ; DISALLOWED # ARMENIAN FULL STOP..ARMENIAN HYPHEN
1732058B..0590 ; UNASSIGNED # <reserved>..<reserved>
17330591..05BD ; PVALID # HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
173405BE ; DISALLOWED # HEBREW PUNCTUATION MAQAF
1735
1736
1737
1738Faltstrom Standards Track [Page 31]
1739
1740RFC 5892 IDNA Code Points August 2010
1741
1742
174305BF ; PVALID # HEBREW POINT RAFE
174405C0 ; DISALLOWED # HEBREW PUNCTUATION PASEQ
174505C1..05C2 ; PVALID # HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
174605C3 ; DISALLOWED # HEBREW PUNCTUATION SOF PASUQ
174705C4..05C5 ; PVALID # HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT
174805C6 ; DISALLOWED # HEBREW PUNCTUATION NUN HAFUKHA
174905C7 ; PVALID # HEBREW POINT QAMATS QATAN
175005C8..05CF ; UNASSIGNED # <reserved>..<reserved>
175105D0..05EA ; PVALID # HEBREW LETTER ALEF..HEBREW LETTER TAV
175205EB..05EF ; UNASSIGNED # <reserved>..<reserved>
175305F0..05F2 ; PVALID # HEBREW LIGATURE YIDDISH DOUBLE VAV..HEBREW L
175405F3..05F4 ; CONTEXTO # HEBREW PUNCTUATION GERESH..HEBREW PUNCTUATIO
175505F5..05FF ; UNASSIGNED # <reserved>..<reserved>
17560600..0603 ; DISALLOWED # ARABIC NUMBER SIGN..ARABIC SIGN SAFHA
17570604..0605 ; UNASSIGNED # <reserved>..<reserved>
17580606..060F ; DISALLOWED # ARABIC-INDIC CUBE ROOT..ARABIC SIGN MISRA
17590610..061A ; PVALID # ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..AR
1760061B ; DISALLOWED # ARABIC SEMICOLON
1761061C..061D ; UNASSIGNED # <reserved>..<reserved>
1762061E..061F ; DISALLOWED # ARABIC TRIPLE DOT PUNCTUATION MARK..ARABIC Q
17630620 ; UNASSIGNED # <reserved>
17640621..063F ; PVALID # ARABIC LETTER HAMZA..ARABIC LETTER FARSI YEH
17650640 ; DISALLOWED # ARABIC TATWEEL
17660641..065E ; PVALID # ARABIC LETTER FEH..ARABIC FATHA WITH TWO DOT
1767065F ; UNASSIGNED # <reserved>
17680660..0669 ; CONTEXTO # ARABIC-INDIC DIGIT ZERO..ARABIC-INDIC DIGIT
1769066A..066D ; DISALLOWED # ARABIC PERCENT SIGN..ARABIC FIVE POINTED STA
1770066E..0674 ; PVALID # ARABIC LETTER DOTLESS BEH..ARABIC LETTER HIG
17710675..0678 ; DISALLOWED # ARABIC LETTER HIGH HAMZA ALEF..ARABIC LETTER
17720679..06D3 ; PVALID # ARABIC LETTER TTEH..ARABIC LETTER YEH BARREE
177306D4 ; DISALLOWED # ARABIC FULL STOP
177406D5..06DC ; PVALID # ARABIC LETTER AE..ARABIC SMALL HIGH SEEN
177506DD..06DE ; DISALLOWED # ARABIC END OF AYAH..ARABIC START OF RUB EL H
177606DF..06E8 ; PVALID # ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL
177706E9 ; DISALLOWED # ARABIC PLACE OF SAJDAH
177806EA..06EF ; PVALID # ARABIC EMPTY CENTRE LOW STOP..ARABIC LETTER
177906F0..06F9 ; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ZERO..EXTENDED A
178006FA..06FF ; PVALID # ARABIC LETTER SHEEN WITH DOT BELOW..ARABIC L
17810700..070D ; DISALLOWED # SYRIAC END OF PARAGRAPH..SYRIAC HARKLEAN AST
1782070E ; UNASSIGNED # <reserved>
1783070F ; DISALLOWED # SYRIAC ABBREVIATION MARK
17840710..074A ; PVALID # SYRIAC LETTER ALAPH..SYRIAC BARREKH
1785074B..074C ; UNASSIGNED # <reserved>..<reserved>
1786074D..07B1 ; PVALID # SYRIAC LETTER SOGDIAN ZHAIN..THAANA LETTER N
178707B2..07BF ; UNASSIGNED # <reserved>..<reserved>
178807C0..07F5 ; PVALID # NKO DIGIT ZERO..NKO LOW TONE APOSTROPHE
178907F6..07FA ; DISALLOWED # NKO SYMBOL OO DENNEN..NKO LAJANYALAN
179007FB..07FF ; UNASSIGNED # <reserved>..<reserved>
1791
1792
1793
1794Faltstrom Standards Track [Page 32]
1795
1796RFC 5892 IDNA Code Points August 2010
1797
1798
17990800..082D ; PVALID # SAMARITAN LETTER ALAF..SAMARITAN MARK NEQUDA
1800082E..082F ; UNASSIGNED # <reserved>..<reserved>
18010830..083E ; DISALLOWED # SAMARITAN PUNCTUATION NEQUDAA..SAMARITAN PUN
1802083F..08FF ; UNASSIGNED # <reserved>..<reserved>
18030900..0939 ; PVALID # DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANA
1804093A..093B ; UNASSIGNED # <reserved>..<reserved>
1805093C..094E ; PVALID # DEVANAGARI SIGN NUKTA..DEVANAGARI VOWEL SIGN
1806094F ; UNASSIGNED # <reserved>
18070950..0955 ; PVALID # DEVANAGARI OM..DEVANAGARI VOWEL SIGN CANDRA
18080956..0957 ; UNASSIGNED # <reserved>..<reserved>
18090958..095F ; DISALLOWED # DEVANAGARI LETTER QA..DEVANAGARI LETTER YYA
18100960..0963 ; PVALID # DEVANAGARI LETTER VOCALIC RR..DEVANAGARI VOW
18110964..0965 ; DISALLOWED # DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
18120966..096F ; PVALID # DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
18130970 ; DISALLOWED # DEVANAGARI ABBREVIATION SIGN
18140971..0972 ; PVALID # DEVANAGARI SIGN HIGH SPACING DOT..DEVANAGARI
18150973..0978 ; UNASSIGNED # <reserved>..<reserved>
18160979..097F ; PVALID # DEVANAGARI LETTER ZHA..DEVANAGARI LETTER BBA
18170980 ; UNASSIGNED # <reserved>
18180981..0983 ; PVALID # BENGALI SIGN CANDRABINDU..BENGALI SIGN VISAR
18190984 ; UNASSIGNED # <reserved>
18200985..098C ; PVALID # BENGALI LETTER A..BENGALI LETTER VOCALIC L
1821098D..098E ; UNASSIGNED # <reserved>..<reserved>
1822098F..0990 ; PVALID # BENGALI LETTER E..BENGALI LETTER AI
18230991..0992 ; UNASSIGNED # <reserved>..<reserved>
18240993..09A8 ; PVALID # BENGALI LETTER O..BENGALI LETTER NA
182509A9 ; UNASSIGNED # <reserved>
182609AA..09B0 ; PVALID # BENGALI LETTER PA..BENGALI LETTER RA
182709B1 ; UNASSIGNED # <reserved>
182809B2 ; PVALID # BENGALI LETTER LA
182909B3..09B5 ; UNASSIGNED # <reserved>..<reserved>
183009B6..09B9 ; PVALID # BENGALI LETTER SHA..BENGALI LETTER HA
183109BA..09BB ; UNASSIGNED # <reserved>..<reserved>
183209BC..09C4 ; PVALID # BENGALI SIGN NUKTA..BENGALI VOWEL SIGN VOCAL
183309C5..09C6 ; UNASSIGNED # <reserved>..<reserved>
183409C7..09C8 ; PVALID # BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
183509C9..09CA ; UNASSIGNED # <reserved>..<reserved>
183609CB..09CE ; PVALID # BENGALI VOWEL SIGN O..BENGALI LETTER KHANDA
183709CF..09D6 ; UNASSIGNED # <reserved>..<reserved>
183809D7 ; PVALID # BENGALI AU LENGTH MARK
183909D8..09DB ; UNASSIGNED # <reserved>..<reserved>
184009DC..09DD ; DISALLOWED # BENGALI LETTER RRA..BENGALI LETTER RHA
184109DE ; UNASSIGNED # <reserved>
184209DF ; DISALLOWED # BENGALI LETTER YYA
184309E0..09E3 ; PVALID # BENGALI LETTER VOCALIC RR..BENGALI VOWEL SIG
184409E4..09E5 ; UNASSIGNED # <reserved>..<reserved>
184509E6..09F1 ; PVALID # BENGALI DIGIT ZERO..BENGALI LETTER RA WITH L
184609F2..09FB ; DISALLOWED # BENGALI RUPEE MARK..BENGALI GANDA MARK
1847
1848
1849
1850Faltstrom Standards Track [Page 33]
1851
1852RFC 5892 IDNA Code Points August 2010
1853
1854
185509FC..0A00 ; UNASSIGNED # <reserved>..<reserved>
18560A01..0A03 ; PVALID # GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN VISA
18570A04 ; UNASSIGNED # <reserved>
18580A05..0A0A ; PVALID # GURMUKHI LETTER A..GURMUKHI LETTER UU
18590A0B..0A0E ; UNASSIGNED # <reserved>..<reserved>
18600A0F..0A10 ; PVALID # GURMUKHI LETTER EE..GURMUKHI LETTER AI
18610A11..0A12 ; UNASSIGNED # <reserved>..<reserved>
18620A13..0A28 ; PVALID # GURMUKHI LETTER OO..GURMUKHI LETTER NA
18630A29 ; UNASSIGNED # <reserved>
18640A2A..0A30 ; PVALID # GURMUKHI LETTER PA..GURMUKHI LETTER RA
18650A31 ; UNASSIGNED # <reserved>
18660A32 ; PVALID # GURMUKHI LETTER LA
18670A33 ; DISALLOWED # GURMUKHI LETTER LLA
18680A34 ; UNASSIGNED # <reserved>
18690A35 ; PVALID # GURMUKHI LETTER VA
18700A36 ; DISALLOWED # GURMUKHI LETTER SHA
18710A37 ; UNASSIGNED # <reserved>
18720A38..0A39 ; PVALID # GURMUKHI LETTER SA..GURMUKHI LETTER HA
18730A3A..0A3B ; UNASSIGNED # <reserved>..<reserved>
18740A3C ; PVALID # GURMUKHI SIGN NUKTA
18750A3D ; UNASSIGNED # <reserved>
18760A3E..0A42 ; PVALID # GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN
18770A43..0A46 ; UNASSIGNED # <reserved>..<reserved>
18780A47..0A48 ; PVALID # GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN
18790A49..0A4A ; UNASSIGNED # <reserved>..<reserved>
18800A4B..0A4D ; PVALID # GURMUKHI VOWEL SIGN OO..GURMUKHI SIGN VIRAMA
18810A4E..0A50 ; UNASSIGNED # <reserved>..<reserved>
18820A51 ; PVALID # GURMUKHI SIGN UDAAT
18830A52..0A58 ; UNASSIGNED # <reserved>..<reserved>
18840A59..0A5B ; DISALLOWED # GURMUKHI LETTER KHHA..GURMUKHI LETTER ZA
18850A5C ; PVALID # GURMUKHI LETTER RRA
18860A5D ; UNASSIGNED # <reserved>
18870A5E ; DISALLOWED # GURMUKHI LETTER FA
18880A5F..0A65 ; UNASSIGNED # <reserved>..<reserved>
18890A66..0A75 ; PVALID # GURMUKHI DIGIT ZERO..GURMUKHI SIGN YAKASH
18900A76..0A80 ; UNASSIGNED # <reserved>..<reserved>
18910A81..0A83 ; PVALID # GUJARATI SIGN CANDRABINDU..GUJARATI SIGN VIS
18920A84 ; UNASSIGNED # <reserved>
18930A85..0A8D ; PVALID # GUJARATI LETTER A..GUJARATI VOWEL CANDRA E
18940A8E ; UNASSIGNED # <reserved>
18950A8F..0A91 ; PVALID # GUJARATI LETTER E..GUJARATI VOWEL CANDRA O
18960A92 ; UNASSIGNED # <reserved>
18970A93..0AA8 ; PVALID # GUJARATI LETTER O..GUJARATI LETTER NA
18980AA9 ; UNASSIGNED # <reserved>
18990AAA..0AB0 ; PVALID # GUJARATI LETTER PA..GUJARATI LETTER RA
19000AB1 ; UNASSIGNED # <reserved>
19010AB2..0AB3 ; PVALID # GUJARATI LETTER LA..GUJARATI LETTER LLA
19020AB4 ; UNASSIGNED # <reserved>
1903
1904
1905
1906Faltstrom Standards Track [Page 34]
1907
1908RFC 5892 IDNA Code Points August 2010
1909
1910
19110AB5..0AB9 ; PVALID # GUJARATI LETTER VA..GUJARATI LETTER HA
19120ABA..0ABB ; UNASSIGNED # <reserved>..<reserved>
19130ABC..0AC5 ; PVALID # GUJARATI SIGN NUKTA..GUJARATI VOWEL SIGN CAN
19140AC6 ; UNASSIGNED # <reserved>
19150AC7..0AC9 ; PVALID # GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN C
19160ACA ; UNASSIGNED # <reserved>
19170ACB..0ACD ; PVALID # GUJARATI VOWEL SIGN O..GUJARATI SIGN VIRAMA
19180ACE..0ACF ; UNASSIGNED # <reserved>..<reserved>
19190AD0 ; PVALID # GUJARATI OM
19200AD1..0ADF ; UNASSIGNED # <reserved>..<reserved>
19210AE0..0AE3 ; PVALID # GUJARATI LETTER VOCALIC RR..GUJARATI VOWEL S
19220AE4..0AE5 ; UNASSIGNED # <reserved>..<reserved>
19230AE6..0AEF ; PVALID # GUJARATI DIGIT ZERO..GUJARATI DIGIT NINE
19240AF0 ; UNASSIGNED # <reserved>
19250AF1 ; DISALLOWED # GUJARATI RUPEE SIGN
19260AF2..0B00 ; UNASSIGNED # <reserved>..<reserved>
19270B01..0B03 ; PVALID # ORIYA SIGN CANDRABINDU..ORIYA SIGN VISARGA
19280B04 ; UNASSIGNED # <reserved>
19290B05..0B0C ; PVALID # ORIYA LETTER A..ORIYA LETTER VOCALIC L
19300B0D..0B0E ; UNASSIGNED # <reserved>..<reserved>
19310B0F..0B10 ; PVALID # ORIYA LETTER E..ORIYA LETTER AI
19320B11..0B12 ; UNASSIGNED # <reserved>..<reserved>
19330B13..0B28 ; PVALID # ORIYA LETTER O..ORIYA LETTER NA
19340B29 ; UNASSIGNED # <reserved>
19350B2A..0B30 ; PVALID # ORIYA LETTER PA..ORIYA LETTER RA
19360B31 ; UNASSIGNED # <reserved>
19370B32..0B33 ; PVALID # ORIYA LETTER LA..ORIYA LETTER LLA
19380B34 ; UNASSIGNED # <reserved>
19390B35..0B39 ; PVALID # ORIYA LETTER VA..ORIYA LETTER HA
19400B3A..0B3B ; UNASSIGNED # <reserved>..<reserved>
19410B3C..0B44 ; PVALID # ORIYA SIGN NUKTA..ORIYA VOWEL SIGN VOCALIC R
19420B45..0B46 ; UNASSIGNED # <reserved>..<reserved>
19430B47..0B48 ; PVALID # ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
19440B49..0B4A ; UNASSIGNED # <reserved>..<reserved>
19450B4B..0B4D ; PVALID # ORIYA VOWEL SIGN O..ORIYA SIGN VIRAMA
19460B4E..0B55 ; UNASSIGNED # <reserved>..<reserved>
19470B56..0B57 ; PVALID # ORIYA AI LENGTH MARK..ORIYA AU LENGTH MARK
19480B58..0B5B ; UNASSIGNED # <reserved>..<reserved>
19490B5C..0B5D ; DISALLOWED # ORIYA LETTER RRA..ORIYA LETTER RHA
19500B5E ; UNASSIGNED # <reserved>
19510B5F..0B63 ; PVALID # ORIYA LETTER YYA..ORIYA VOWEL SIGN VOCALIC L
19520B64..0B65 ; UNASSIGNED # <reserved>..<reserved>
19530B66..0B6F ; PVALID # ORIYA DIGIT ZERO..ORIYA DIGIT NINE
19540B70 ; DISALLOWED # ORIYA ISSHAR
19550B71 ; PVALID # ORIYA LETTER WA
19560B72..0B81 ; UNASSIGNED # <reserved>..<reserved>
19570B82..0B83 ; PVALID # TAMIL SIGN ANUSVARA..TAMIL SIGN VISARGA
19580B84 ; UNASSIGNED # <reserved>
1959
1960
1961
1962Faltstrom Standards Track [Page 35]
1963
1964RFC 5892 IDNA Code Points August 2010
1965
1966
19670B85..0B8A ; PVALID # TAMIL LETTER A..TAMIL LETTER UU
19680B8B..0B8D ; UNASSIGNED # <reserved>..<reserved>
19690B8E..0B90 ; PVALID # TAMIL LETTER E..TAMIL LETTER AI
19700B91 ; UNASSIGNED # <reserved>
19710B92..0B95 ; PVALID # TAMIL LETTER O..TAMIL LETTER KA
19720B96..0B98 ; UNASSIGNED # <reserved>..<reserved>
19730B99..0B9A ; PVALID # TAMIL LETTER NGA..TAMIL LETTER CA
19740B9B ; UNASSIGNED # <reserved>
19750B9C ; PVALID # TAMIL LETTER JA
19760B9D ; UNASSIGNED # <reserved>
19770B9E..0B9F ; PVALID # TAMIL LETTER NYA..TAMIL LETTER TTA
19780BA0..0BA2 ; UNASSIGNED # <reserved>..<reserved>
19790BA3..0BA4 ; PVALID # TAMIL LETTER NNA..TAMIL LETTER TA
19800BA5..0BA7 ; UNASSIGNED # <reserved>..<reserved>
19810BA8..0BAA ; PVALID # TAMIL LETTER NA..TAMIL LETTER PA
19820BAB..0BAD ; UNASSIGNED # <reserved>..<reserved>
19830BAE..0BB9 ; PVALID # TAMIL LETTER MA..TAMIL LETTER HA
19840BBA..0BBD ; UNASSIGNED # <reserved>..<reserved>
19850BBE..0BC2 ; PVALID # TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN UU
19860BC3..0BC5 ; UNASSIGNED # <reserved>..<reserved>
19870BC6..0BC8 ; PVALID # TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
19880BC9 ; UNASSIGNED # <reserved>
19890BCA..0BCD ; PVALID # TAMIL VOWEL SIGN O..TAMIL SIGN VIRAMA
19900BCE..0BCF ; UNASSIGNED # <reserved>..<reserved>
19910BD0 ; PVALID # TAMIL OM
19920BD1..0BD6 ; UNASSIGNED # <reserved>..<reserved>
19930BD7 ; PVALID # TAMIL AU LENGTH MARK
19940BD8..0BE5 ; UNASSIGNED # <reserved>..<reserved>
19950BE6..0BEF ; PVALID # TAMIL DIGIT ZERO..TAMIL DIGIT NINE
19960BF0..0BFA ; DISALLOWED # TAMIL NUMBER TEN..TAMIL NUMBER SIGN
19970BFB..0C00 ; UNASSIGNED # <reserved>..<reserved>
19980C01..0C03 ; PVALID # TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
19990C04 ; UNASSIGNED # <reserved>
20000C05..0C0C ; PVALID # TELUGU LETTER A..TELUGU LETTER VOCALIC L
20010C0D ; UNASSIGNED # <reserved>
20020C0E..0C10 ; PVALID # TELUGU LETTER E..TELUGU LETTER AI
20030C11 ; UNASSIGNED # <reserved>
20040C12..0C28 ; PVALID # TELUGU LETTER O..TELUGU LETTER NA
20050C29 ; UNASSIGNED # <reserved>
20060C2A..0C33 ; PVALID # TELUGU LETTER PA..TELUGU LETTER LLA
20070C34 ; UNASSIGNED # <reserved>
20080C35..0C39 ; PVALID # TELUGU LETTER VA..TELUGU LETTER HA
20090C3A..0C3C ; UNASSIGNED # <reserved>..<reserved>
20100C3D..0C44 ; PVALID # TELUGU SIGN AVAGRAHA..TELUGU VOWEL SIGN VOCA
20110C45 ; UNASSIGNED # <reserved>
20120C46..0C48 ; PVALID # TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
20130C49 ; UNASSIGNED # <reserved>
20140C4A..0C4D ; PVALID # TELUGU VOWEL SIGN O..TELUGU SIGN VIRAMA
2015
2016
2017
2018Faltstrom Standards Track [Page 36]
2019
2020RFC 5892 IDNA Code Points August 2010
2021
2022
20230C4E..0C54 ; UNASSIGNED # <reserved>..<reserved>
20240C55..0C56 ; PVALID # TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
20250C57 ; UNASSIGNED # <reserved>
20260C58..0C59 ; PVALID # TELUGU LETTER TSA..TELUGU LETTER DZA
20270C5A..0C5F ; UNASSIGNED # <reserved>..<reserved>
20280C60..0C63 ; PVALID # TELUGU LETTER VOCALIC RR..TELUGU VOWEL SIGN
20290C64..0C65 ; UNASSIGNED # <reserved>..<reserved>
20300C66..0C6F ; PVALID # TELUGU DIGIT ZERO..TELUGU DIGIT NINE
20310C70..0C77 ; UNASSIGNED # <reserved>..<reserved>
20320C78..0C7F ; DISALLOWED # TELUGU FRACTION DIGIT ZERO FOR ODD POWERS OF
20330C80..0C81 ; UNASSIGNED # <reserved>..<reserved>
20340C82..0C83 ; PVALID # KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
20350C84 ; UNASSIGNED # <reserved>
20360C85..0C8C ; PVALID # KANNADA LETTER A..KANNADA LETTER VOCALIC L
20370C8D ; UNASSIGNED # <reserved>
20380C8E..0C90 ; PVALID # KANNADA LETTER E..KANNADA LETTER AI
20390C91 ; UNASSIGNED # <reserved>
20400C92..0CA8 ; PVALID # KANNADA LETTER O..KANNADA LETTER NA
20410CA9 ; UNASSIGNED # <reserved>
20420CAA..0CB3 ; PVALID # KANNADA LETTER PA..KANNADA LETTER LLA
20430CB4 ; UNASSIGNED # <reserved>
20440CB5..0CB9 ; PVALID # KANNADA LETTER VA..KANNADA LETTER HA
20450CBA..0CBB ; UNASSIGNED # <reserved>..<reserved>
20460CBC..0CC4 ; PVALID # KANNADA SIGN NUKTA..KANNADA VOWEL SIGN VOCAL
20470CC5 ; UNASSIGNED # <reserved>
20480CC6..0CC8 ; PVALID # KANNADA VOWEL SIGN E..KANNADA VOWEL SIGN AI
20490CC9 ; UNASSIGNED # <reserved>
20500CCA..0CCD ; PVALID # KANNADA VOWEL SIGN O..KANNADA SIGN VIRAMA
20510CCE..0CD4 ; UNASSIGNED # <reserved>..<reserved>
20520CD5..0CD6 ; PVALID # KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
20530CD7..0CDD ; UNASSIGNED # <reserved>..<reserved>
20540CDE ; PVALID # KANNADA LETTER FA
20550CDF ; UNASSIGNED # <reserved>
20560CE0..0CE3 ; PVALID # KANNADA LETTER VOCALIC RR..KANNADA VOWEL SIG
20570CE4..0CE5 ; UNASSIGNED # <reserved>..<reserved>
20580CE6..0CEF ; PVALID # KANNADA DIGIT ZERO..KANNADA DIGIT NINE
20590CF0 ; UNASSIGNED # <reserved>
20600CF1..0CF2 ; DISALLOWED # KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADH
20610CF3..0D01 ; UNASSIGNED # <reserved>..<reserved>
20620D02..0D03 ; PVALID # MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISA
20630D04 ; UNASSIGNED # <reserved>
20640D05..0D0C ; PVALID # MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC
20650D0D ; UNASSIGNED # <reserved>
20660D0E..0D10 ; PVALID # MALAYALAM LETTER E..MALAYALAM LETTER AI
20670D11 ; UNASSIGNED # <reserved>
20680D12..0D28 ; PVALID # MALAYALAM LETTER O..MALAYALAM LETTER NA
20690D29 ; UNASSIGNED # <reserved>
20700D2A..0D39 ; PVALID # MALAYALAM LETTER PA..MALAYALAM LETTER HA
2071
2072
2073
2074Faltstrom Standards Track [Page 37]
2075
2076RFC 5892 IDNA Code Points August 2010
2077
2078
20790D3A..0D3C ; UNASSIGNED # <reserved>..<reserved>
20800D3D..0D44 ; PVALID # MALAYALAM SIGN AVAGRAHA..MALAYALAM VOWEL SIG
20810D45 ; UNASSIGNED # <reserved>
20820D46..0D48 ; PVALID # MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN
20830D49 ; UNASSIGNED # <reserved>
20840D4A..0D4D ; PVALID # MALAYALAM VOWEL SIGN O..MALAYALAM SIGN VIRAM
20850D4E..0D56 ; UNASSIGNED # <reserved>..<reserved>
20860D57 ; PVALID # MALAYALAM AU LENGTH MARK
20870D58..0D5F ; UNASSIGNED # <reserved>..<reserved>
20880D60..0D63 ; PVALID # MALAYALAM LETTER VOCALIC RR..MALAYALAM VOWEL
20890D64..0D65 ; UNASSIGNED # <reserved>..<reserved>
20900D66..0D6F ; PVALID # MALAYALAM DIGIT ZERO..MALAYALAM DIGIT NINE
20910D70..0D75 ; DISALLOWED # MALAYALAM NUMBER TEN..MALAYALAM FRACTION THR
20920D76..0D78 ; UNASSIGNED # <reserved>..<reserved>
20930D79 ; DISALLOWED # MALAYALAM DATE MARK
20940D7A..0D7F ; PVALID # MALAYALAM LETTER CHILLU NN..MALAYALAM LETTER
20950D80..0D81 ; UNASSIGNED # <reserved>..<reserved>
20960D82..0D83 ; PVALID # SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARG
20970D84 ; UNASSIGNED # <reserved>
20980D85..0D96 ; PVALID # SINHALA LETTER AYANNA..SINHALA LETTER AUYANN
20990D97..0D99 ; UNASSIGNED # <reserved>..<reserved>
21000D9A..0DB1 ; PVALID # SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA L
21010DB2 ; UNASSIGNED # <reserved>
21020DB3..0DBB ; PVALID # SINHALA LETTER SANYAKA DAYANNA..SINHALA LETT
21030DBC ; UNASSIGNED # <reserved>
21040DBD ; PVALID # SINHALA LETTER DANTAJA LAYANNA
21050DBE..0DBF ; UNASSIGNED # <reserved>..<reserved>
21060DC0..0DC6 ; PVALID # SINHALA LETTER VAYANNA..SINHALA LETTER FAYAN
21070DC7..0DC9 ; UNASSIGNED # <reserved>..<reserved>
21080DCA ; PVALID # SINHALA SIGN AL-LAKUNA
21090DCB..0DCE ; UNASSIGNED # <reserved>..<reserved>
21100DCF..0DD4 ; PVALID # SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL
21110DD5 ; UNASSIGNED # <reserved>
21120DD6 ; PVALID # SINHALA VOWEL SIGN DIGA PAA-PILLA
21130DD7 ; UNASSIGNED # <reserved>
21140DD8..0DDF ; PVALID # SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOW
21150DE0..0DF1 ; UNASSIGNED # <reserved>..<reserved>
21160DF2..0DF3 ; PVALID # SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHAL
21170DF4 ; DISALLOWED # SINHALA PUNCTUATION KUNDDALIYA
21180DF5..0E00 ; UNASSIGNED # <reserved>..<reserved>
21190E01..0E32 ; PVALID # THAI CHARACTER KO KAI..THAI CHARACTER SARA A
21200E33 ; DISALLOWED # THAI CHARACTER SARA AM
21210E34..0E3A ; PVALID # THAI CHARACTER SARA I..THAI CHARACTER PHINTH
21220E3B..0E3E ; UNASSIGNED # <reserved>..<reserved>
21230E3F ; DISALLOWED # THAI CURRENCY SYMBOL BAHT
21240E40..0E4E ; PVALID # THAI CHARACTER SARA E..THAI CHARACTER YAMAKK
21250E4F ; DISALLOWED # THAI CHARACTER FONGMAN
21260E50..0E59 ; PVALID # THAI DIGIT ZERO..THAI DIGIT NINE
2127
2128
2129
2130Faltstrom Standards Track [Page 38]
2131
2132RFC 5892 IDNA Code Points August 2010
2133
2134
21350E5A..0E5B ; DISALLOWED # THAI CHARACTER ANGKHANKHU..THAI CHARACTER KH
21360E5C..0E80 ; UNASSIGNED # <reserved>..<reserved>
21370E81..0E82 ; PVALID # LAO LETTER KO..LAO LETTER KHO SUNG
21380E83 ; UNASSIGNED # <reserved>
21390E84 ; PVALID # LAO LETTER KHO TAM
21400E85..0E86 ; UNASSIGNED # <reserved>..<reserved>
21410E87..0E88 ; PVALID # LAO LETTER NGO..LAO LETTER CO
21420E89 ; UNASSIGNED # <reserved>
21430E8A ; PVALID # LAO LETTER SO TAM
21440E8B..0E8C ; UNASSIGNED # <reserved>..<reserved>
21450E8D ; PVALID # LAO LETTER NYO
21460E8E..0E93 ; UNASSIGNED # <reserved>..<reserved>
21470E94..0E97 ; PVALID # LAO LETTER DO..LAO LETTER THO TAM
21480E98 ; UNASSIGNED # <reserved>
21490E99..0E9F ; PVALID # LAO LETTER NO..LAO LETTER FO SUNG
21500EA0 ; UNASSIGNED # <reserved>
21510EA1..0EA3 ; PVALID # LAO LETTER MO..LAO LETTER LO LING
21520EA4 ; UNASSIGNED # <reserved>
21530EA5 ; PVALID # LAO LETTER LO LOOT
21540EA6 ; UNASSIGNED # <reserved>
21550EA7 ; PVALID # LAO LETTER WO
21560EA8..0EA9 ; UNASSIGNED # <reserved>..<reserved>
21570EAA..0EAB ; PVALID # LAO LETTER SO SUNG..LAO LETTER HO SUNG
21580EAC ; UNASSIGNED # <reserved>
21590EAD..0EB2 ; PVALID # LAO LETTER O..LAO VOWEL SIGN AA
21600EB3 ; DISALLOWED # LAO VOWEL SIGN AM
21610EB4..0EB9 ; PVALID # LAO VOWEL SIGN I..LAO VOWEL SIGN UU
21620EBA ; UNASSIGNED # <reserved>
21630EBB..0EBD ; PVALID # LAO VOWEL SIGN MAI KON..LAO SEMIVOWEL SIGN N
21640EBE..0EBF ; UNASSIGNED # <reserved>..<reserved>
21650EC0..0EC4 ; PVALID # LAO VOWEL SIGN E..LAO VOWEL SIGN AI
21660EC5 ; UNASSIGNED # <reserved>
21670EC6 ; PVALID # LAO KO LA
21680EC7 ; UNASSIGNED # <reserved>
21690EC8..0ECD ; PVALID # LAO TONE MAI EK..LAO NIGGAHITA
21700ECE..0ECF ; UNASSIGNED # <reserved>..<reserved>
21710ED0..0ED9 ; PVALID # LAO DIGIT ZERO..LAO DIGIT NINE
21720EDA..0EDB ; UNASSIGNED # <reserved>..<reserved>
21730EDC..0EDD ; DISALLOWED # LAO HO NO..LAO HO MO
21740EDE..0EFF ; UNASSIGNED # <reserved>..<reserved>
21750F00 ; PVALID # TIBETAN SYLLABLE OM
21760F01..0F0A ; DISALLOWED # TIBETAN MARK GTER YIG MGO TRUNCATED A..TIBET
21770F0B ; PVALID # TIBETAN MARK INTERSYLLABIC TSHEG
21780F0C..0F17 ; DISALLOWED # TIBETAN MARK DELIMITER TSHEG BSTAR..TIBETAN
21790F18..0F19 ; PVALID # TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN
21800F1A..0F1F ; DISALLOWED # TIBETAN SIGN RDEL DKAR GCIG..TIBETAN SIGN RD
21810F20..0F29 ; PVALID # TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE
21820F2A..0F34 ; DISALLOWED # TIBETAN DIGIT HALF ONE..TIBETAN MARK BSDUS R
2183
2184
2185
2186Faltstrom Standards Track [Page 39]
2187
2188RFC 5892 IDNA Code Points August 2010
2189
2190
21910F35 ; PVALID # TIBETAN MARK NGAS BZUNG NYI ZLA
21920F36 ; DISALLOWED # TIBETAN MARK CARET -DZUD RTAGS BZHI MIG CAN
21930F37 ; PVALID # TIBETAN MARK NGAS BZUNG SGOR RTAGS
21940F38 ; DISALLOWED # TIBETAN MARK CHE MGO
21950F39 ; PVALID # TIBETAN MARK TSA -PHRU
21960F3A..0F3D ; DISALLOWED # TIBETAN MARK GUG RTAGS GYON..TIBETAN MARK AN
21970F3E..0F42 ; PVALID # TIBETAN SIGN YAR TSHES..TIBETAN LETTER GA
21980F43 ; DISALLOWED # TIBETAN LETTER GHA
21990F44..0F47 ; PVALID # TIBETAN LETTER NGA..TIBETAN LETTER JA
22000F48 ; UNASSIGNED # <reserved>
22010F49..0F4C ; PVALID # TIBETAN LETTER NYA..TIBETAN LETTER DDA
22020F4D ; DISALLOWED # TIBETAN LETTER DDHA
22030F4E..0F51 ; PVALID # TIBETAN LETTER NNA..TIBETAN LETTER DA
22040F52 ; DISALLOWED # TIBETAN LETTER DHA
22050F53..0F56 ; PVALID # TIBETAN LETTER NA..TIBETAN LETTER BA
22060F57 ; DISALLOWED # TIBETAN LETTER BHA
22070F58..0F5B ; PVALID # TIBETAN LETTER MA..TIBETAN LETTER DZA
22080F5C ; DISALLOWED # TIBETAN LETTER DZHA
22090F5D..0F68 ; PVALID # TIBETAN LETTER WA..TIBETAN LETTER A
22100F69 ; DISALLOWED # TIBETAN LETTER KSSA
22110F6A..0F6C ; PVALID # TIBETAN LETTER FIXED-FORM RA..TIBETAN LETTER
22120F6D..0F70 ; UNASSIGNED # <reserved>..<reserved>
22130F71..0F72 ; PVALID # TIBETAN VOWEL SIGN AA..TIBETAN VOWEL SIGN I
22140F73 ; DISALLOWED # TIBETAN VOWEL SIGN II
22150F74 ; PVALID # TIBETAN VOWEL SIGN U
22160F75..0F79 ; DISALLOWED # TIBETAN VOWEL SIGN UU..TIBETAN VOWEL SIGN VO
22170F7A..0F80 ; PVALID # TIBETAN VOWEL SIGN E..TIBETAN VOWEL SIGN REV
22180F81 ; DISALLOWED # TIBETAN VOWEL SIGN REVERSED II
22190F82..0F84 ; PVALID # TIBETAN SIGN NYI ZLA NAA DA..TIBETAN MARK HA
22200F85 ; DISALLOWED # TIBETAN MARK PALUTA
22210F86..0F8B ; PVALID # TIBETAN SIGN LCI RTAGS..TIBETAN SIGN GRU MED
22220F8C..0F8F ; UNASSIGNED # <reserved>..<reserved>
22230F90..0F92 ; PVALID # TIBETAN SUBJOINED LETTER KA..TIBETAN SUBJOIN
22240F93 ; DISALLOWED # TIBETAN SUBJOINED LETTER GHA
22250F94..0F97 ; PVALID # TIBETAN SUBJOINED LETTER NGA..TIBETAN SUBJOI
22260F98 ; UNASSIGNED # <reserved>
22270F99..0F9C ; PVALID # TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOI
22280F9D ; DISALLOWED # TIBETAN SUBJOINED LETTER DDHA
22290F9E..0FA1 ; PVALID # TIBETAN SUBJOINED LETTER NNA..TIBETAN SUBJOI
22300FA2 ; DISALLOWED # TIBETAN SUBJOINED LETTER DHA
22310FA3..0FA6 ; PVALID # TIBETAN SUBJOINED LETTER NA..TIBETAN SUBJOIN
22320FA7 ; DISALLOWED # TIBETAN SUBJOINED LETTER BHA
22330FA8..0FAB ; PVALID # TIBETAN SUBJOINED LETTER MA..TIBETAN SUBJOIN
22340FAC ; DISALLOWED # TIBETAN SUBJOINED LETTER DZHA
22350FAD..0FB8 ; PVALID # TIBETAN SUBJOINED LETTER WA..TIBETAN SUBJOIN
22360FB9 ; DISALLOWED # TIBETAN SUBJOINED LETTER KSSA
22370FBA..0FBC ; PVALID # TIBETAN SUBJOINED LETTER FIXED-FORM WA..TIBE
22380FBD ; UNASSIGNED # <reserved>
2239
2240
2241
2242Faltstrom Standards Track [Page 40]
2243
2244RFC 5892 IDNA Code Points August 2010
2245
2246
22470FBE..0FC5 ; DISALLOWED # TIBETAN KU RU KHA..TIBETAN SYMBOL RDO RJE
22480FC6 ; PVALID # TIBETAN SYMBOL PADMA GDAN
22490FC7..0FCC ; DISALLOWED # TIBETAN SYMBOL RDO RJE RGYA GRAM..TIBETAN SY
22500FCD ; UNASSIGNED # <reserved>
22510FCE..0FD8 ; DISALLOWED # TIBETAN SIGN RDEL NAG RDEL DKAR..LEFT-FACING
22520FD9..0FFF ; UNASSIGNED # <reserved>..<reserved>
22531000..1049 ; PVALID # MYANMAR LETTER KA..MYANMAR DIGIT NINE
2254104A..104F ; DISALLOWED # MYANMAR SIGN LITTLE SECTION..MYANMAR SYMBOL
22551050..109D ; PVALID # MYANMAR LETTER SHA..MYANMAR VOWEL SIGN AITON
2256109E..10C5 ; DISALLOWED # MYANMAR SYMBOL SHAN ONE..GEORGIAN CAPITAL LE
225710C6..10CF ; UNASSIGNED # <reserved>..<reserved>
225810D0..10FA ; PVALID # GEORGIAN LETTER AN..GEORGIAN LETTER AIN
225910FB..10FC ; DISALLOWED # GEORGIAN PARAGRAPH SEPARATOR..MODIFIER LETTE
226010FD..10FF ; UNASSIGNED # <reserved>..<reserved>
22611100..11FF ; DISALLOWED # HANGUL CHOSEONG KIYEOK..HANGUL JONGSEONG SSA
22621200..1248 ; PVALID # ETHIOPIC SYLLABLE HA..ETHIOPIC SYLLABLE QWA
22631249 ; UNASSIGNED # <reserved>
2264124A..124D ; PVALID # ETHIOPIC SYLLABLE QWI..ETHIOPIC SYLLABLE QWE
2265124E..124F ; UNASSIGNED # <reserved>..<reserved>
22661250..1256 ; PVALID # ETHIOPIC SYLLABLE QHA..ETHIOPIC SYLLABLE QHO
22671257 ; UNASSIGNED # <reserved>
22681258 ; PVALID # ETHIOPIC SYLLABLE QHWA
22691259 ; UNASSIGNED # <reserved>
2270125A..125D ; PVALID # ETHIOPIC SYLLABLE QHWI..ETHIOPIC SYLLABLE QH
2271125E..125F ; UNASSIGNED # <reserved>..<reserved>
22721260..1288 ; PVALID # ETHIOPIC SYLLABLE BA..ETHIOPIC SYLLABLE XWA
22731289 ; UNASSIGNED # <reserved>
2274128A..128D ; PVALID # ETHIOPIC SYLLABLE XWI..ETHIOPIC SYLLABLE XWE
2275128E..128F ; UNASSIGNED # <reserved>..<reserved>
22761290..12B0 ; PVALID # ETHIOPIC SYLLABLE NA..ETHIOPIC SYLLABLE KWA
227712B1 ; UNASSIGNED # <reserved>
227812B2..12B5 ; PVALID # ETHIOPIC SYLLABLE KWI..ETHIOPIC SYLLABLE KWE
227912B6..12B7 ; UNASSIGNED # <reserved>..<reserved>
228012B8..12BE ; PVALID # ETHIOPIC SYLLABLE KXA..ETHIOPIC SYLLABLE KXO
228112BF ; UNASSIGNED # <reserved>
228212C0 ; PVALID # ETHIOPIC SYLLABLE KXWA
228312C1 ; UNASSIGNED # <reserved>
228412C2..12C5 ; PVALID # ETHIOPIC SYLLABLE KXWI..ETHIOPIC SYLLABLE KX
228512C6..12C7 ; UNASSIGNED # <reserved>..<reserved>
228612C8..12D6 ; PVALID # ETHIOPIC SYLLABLE WA..ETHIOPIC SYLLABLE PHAR
228712D7 ; UNASSIGNED # <reserved>
228812D8..1310 ; PVALID # ETHIOPIC SYLLABLE ZA..ETHIOPIC SYLLABLE GWA
22891311 ; UNASSIGNED # <reserved>
22901312..1315 ; PVALID # ETHIOPIC SYLLABLE GWI..ETHIOPIC SYLLABLE GWE
22911316..1317 ; UNASSIGNED # <reserved>..<reserved>
22921318..135A ; PVALID # ETHIOPIC SYLLABLE GGA..ETHIOPIC SYLLABLE FYA
2293135B..135E ; UNASSIGNED # <reserved>..<reserved>
2294135F ; PVALID # ETHIOPIC COMBINING GEMINATION MARK
2295
2296
2297
2298Faltstrom Standards Track [Page 41]
2299
2300RFC 5892 IDNA Code Points August 2010
2301
2302
23031360..137C ; DISALLOWED # ETHIOPIC SECTION MARK..ETHIOPIC NUMBER TEN T
2304137D..137F ; UNASSIGNED # <reserved>..<reserved>
23051380..138F ; PVALID # ETHIOPIC SYLLABLE SEBATBEIT MWA..ETHIOPIC SY
23061390..1399 ; DISALLOWED # ETHIOPIC TONAL MARK YIZET..ETHIOPIC TONAL MA
2307139A..139F ; UNASSIGNED # <reserved>..<reserved>
230813A0..13F4 ; PVALID # CHEROKEE LETTER A..CHEROKEE LETTER YV
230913F5..13FF ; UNASSIGNED # <reserved>..<reserved>
23101400 ; DISALLOWED # CANADIAN SYLLABICS HYPHEN
23111401..166C ; PVALID # CANADIAN SYLLABICS E..CANADIAN SYLLABICS CAR
2312166D..166E ; DISALLOWED # CANADIAN SYLLABICS CHI SIGN..CANADIAN SYLLAB
2313166F..167F ; PVALID # CANADIAN SYLLABICS QAI..CANADIAN SYLLABICS B
23141680 ; DISALLOWED # OGHAM SPACE MARK
23151681..169A ; PVALID # OGHAM LETTER BEITH..OGHAM LETTER PEITH
2316169B..169C ; DISALLOWED # OGHAM FEATHER MARK..OGHAM REVERSED FEATHER M
2317169D..169F ; UNASSIGNED # <reserved>..<reserved>
231816A0..16EA ; PVALID # RUNIC LETTER FEHU FEOH FE F..RUNIC LETTER X
231916EB..16F0 ; DISALLOWED # RUNIC SINGLE PUNCTUATION..RUNIC BELGTHOR SYM
232016F1..16FF ; UNASSIGNED # <reserved>..<reserved>
23211700..170C ; PVALID # TAGALOG LETTER A..TAGALOG LETTER YA
2322170D ; UNASSIGNED # <reserved>
2323170E..1714 ; PVALID # TAGALOG LETTER LA..TAGALOG SIGN VIRAMA
23241715..171F ; UNASSIGNED # <reserved>..<reserved>
23251720..1734 ; PVALID # HANUNOO LETTER A..HANUNOO SIGN PAMUDPOD
23261735..1736 ; DISALLOWED # PHILIPPINE SINGLE PUNCTUATION..PHILIPPINE DO
23271737..173F ; UNASSIGNED # <reserved>..<reserved>
23281740..1753 ; PVALID # BUHID LETTER A..BUHID VOWEL SIGN U
23291754..175F ; UNASSIGNED # <reserved>..<reserved>
23301760..176C ; PVALID # TAGBANWA LETTER A..TAGBANWA LETTER YA
2331176D ; UNASSIGNED # <reserved>
2332176E..1770 ; PVALID # TAGBANWA LETTER LA..TAGBANWA LETTER SA
23331771 ; UNASSIGNED # <reserved>
23341772..1773 ; PVALID # TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
23351774..177F ; UNASSIGNED # <reserved>..<reserved>
23361780..17B3 ; PVALID # KHMER LETTER KA..KHMER INDEPENDENT VOWEL QAU
233717B4..17B5 ; DISALLOWED # KHMER VOWEL INHERENT AQ..KHMER VOWEL INHEREN
233817B6..17D3 ; PVALID # KHMER VOWEL SIGN AA..KHMER SIGN BATHAMASAT
233917D4..17D6 ; DISALLOWED # KHMER SIGN KHAN..KHMER SIGN CAMNUC PII KUUH
234017D7 ; PVALID # KHMER SIGN LEK TOO
234117D8..17DB ; DISALLOWED # KHMER SIGN BEYYAL..KHMER CURRENCY SYMBOL RIE
234217DC..17DD ; PVALID # KHMER SIGN AVAKRAHASANYA..KHMER SIGN ATTHACA
234317DE..17DF ; UNASSIGNED # <reserved>..<reserved>
234417E0..17E9 ; PVALID # KHMER DIGIT ZERO..KHMER DIGIT NINE
234517EA..17EF ; UNASSIGNED # <reserved>..<reserved>
234617F0..17F9 ; DISALLOWED # KHMER SYMBOL LEK ATTAK SON..KHMER SYMBOL LEK
234717FA..17FF ; UNASSIGNED # <reserved>..<reserved>
23481800..180E ; DISALLOWED # MONGOLIAN BIRGA..MONGOLIAN VOWEL SEPARATOR
2349180F ; UNASSIGNED # <reserved>
23501810..1819 ; PVALID # MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE
2351
2352
2353
2354Faltstrom Standards Track [Page 42]
2355
2356RFC 5892 IDNA Code Points August 2010
2357
2358
2359181A..181F ; UNASSIGNED # <reserved>..<reserved>
23601820..1877 ; PVALID # MONGOLIAN LETTER A..MONGOLIAN LETTER MANCHU
23611878..187F ; UNASSIGNED # <reserved>..<reserved>
23621880..18AA ; PVALID # MONGOLIAN LETTER ALI GALI ANUSVARA ONE..MONG
236318AB..18AF ; UNASSIGNED # <reserved>..<reserved>
236418B0..18F5 ; PVALID # CANADIAN SYLLABICS OY..CANADIAN SYLLABICS CA
236518F6..18FF ; UNASSIGNED # <reserved>..<reserved>
23661900..191C ; PVALID # LIMBU VOWEL-CARRIER LETTER..LIMBU LETTER HA
2367191D..191F ; UNASSIGNED # <reserved>..<reserved>
23681920..192B ; PVALID # LIMBU VOWEL SIGN A..LIMBU SUBJOINED LETTER W
2369192C..192F ; UNASSIGNED # <reserved>..<reserved>
23701930..193B ; PVALID # LIMBU SMALL LETTER KA..LIMBU SIGN SA-I
2371193C..193F ; UNASSIGNED # <reserved>..<reserved>
23721940 ; DISALLOWED # LIMBU SIGN LOO
23731941..1943 ; UNASSIGNED # <reserved>..<reserved>
23741944..1945 ; DISALLOWED # LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK
23751946..196D ; PVALID # LIMBU DIGIT ZERO..TAI LE LETTER AI
2376196E..196F ; UNASSIGNED # <reserved>..<reserved>
23771970..1974 ; PVALID # TAI LE LETTER TONE-2..TAI LE LETTER TONE-6
23781975..197F ; UNASSIGNED # <reserved>..<reserved>
23791980..19AB ; PVALID # NEW TAI LUE LETTER HIGH QA..NEW TAI LUE LETT
238019AC..19AF ; UNASSIGNED # <reserved>..<reserved>
238119B0..19C9 ; PVALID # NEW TAI LUE VOWEL SIGN VOWEL SHORTENER..NEW
238219CA..19CF ; UNASSIGNED # <reserved>..<reserved>
238319D0..19DA ; PVALID # NEW TAI LUE DIGIT ZERO..NEW TAI LUE THAM DIG
238419DB..19DD ; UNASSIGNED # <reserved>..<reserved>
238519DE..19FF ; DISALLOWED # NEW TAI LUE SIGN LAE..KHMER SYMBOL DAP-PRAM
23861A00..1A1B ; PVALID # BUGINESE LETTER KA..BUGINESE VOWEL SIGN AE
23871A1C..1A1D ; UNASSIGNED # <reserved>..<reserved>
23881A1E..1A1F ; DISALLOWED # BUGINESE PALLAWA..BUGINESE END OF SECTION
23891A20..1A5E ; PVALID # TAI THAM LETTER HIGH KA..TAI THAM CONSONANT
23901A5F ; UNASSIGNED # <reserved>
23911A60..1A7C ; PVALID # TAI THAM SIGN SAKOT..TAI THAM SIGN KHUEN-LUE
23921A7D..1A7E ; UNASSIGNED # <reserved>..<reserved>
23931A7F..1A89 ; PVALID # TAI THAM COMBINING CRYPTOGRAMMIC DOT..TAI TH
23941A8A..1A8F ; UNASSIGNED # <reserved>..<reserved>
23951A90..1A99 ; PVALID # TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGI
23961A9A..1A9F ; UNASSIGNED # <reserved>..<reserved>
23971AA0..1AA6 ; DISALLOWED # TAI THAM SIGN WIANG..TAI THAM SIGN REVERSED
23981AA7 ; PVALID # TAI THAM SIGN MAI YAMOK
23991AA8..1AAD ; DISALLOWED # TAI THAM SIGN KAAN..TAI THAM SIGN CAANG
24001AAE..1AFF ; UNASSIGNED # <reserved>..<reserved>
24011B00..1B4B ; PVALID # BALINESE SIGN ULU RICEM..BALINESE LETTER ASY
24021B4C..1B4F ; UNASSIGNED # <reserved>..<reserved>
24031B50..1B59 ; PVALID # BALINESE DIGIT ZERO..BALINESE DIGIT NINE
24041B5A..1B6A ; DISALLOWED # BALINESE PANTI..BALINESE MUSICAL SYMBOL DANG
24051B6B..1B73 ; PVALID # BALINESE MUSICAL SYMBOL COMBINING TEGEH..BAL
24061B74..1B7C ; DISALLOWED # BALINESE MUSICAL SYMBOL RIGHT-HAND OPEN DUG.
2407
2408
2409
2410Faltstrom Standards Track [Page 43]
2411
2412RFC 5892 IDNA Code Points August 2010
2413
2414
24151B7D..1B7F ; UNASSIGNED # <reserved>..<reserved>
24161B80..1BAA ; PVALID # SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PAMA
24171BAB..1BAD ; UNASSIGNED # <reserved>..<reserved>
24181BAE..1BB9 ; PVALID # SUNDANESE LETTER KHA..SUNDANESE DIGIT NINE
24191BBA..1BFF ; UNASSIGNED # <reserved>..<reserved>
24201C00..1C37 ; PVALID # LEPCHA LETTER KA..LEPCHA SIGN NUKTA
24211C38..1C3A ; UNASSIGNED # <reserved>..<reserved>
24221C3B..1C3F ; DISALLOWED # LEPCHA PUNCTUATION TA-ROL..LEPCHA PUNCTUATIO
24231C40..1C49 ; PVALID # LEPCHA DIGIT ZERO..LEPCHA DIGIT NINE
24241C4A..1C4C ; UNASSIGNED # <reserved>..<reserved>
24251C4D..1C7D ; PVALID # LEPCHA LETTER TTA..OL CHIKI AHAD
24261C7E..1C7F ; DISALLOWED # OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTU
24271C80..1CCF ; UNASSIGNED # <reserved>..<reserved>
24281CD0..1CD2 ; PVALID # VEDIC TONE KARSHANA..VEDIC TONE PRENKHA
24291CD3 ; DISALLOWED # VEDIC SIGN NIHSHVASA
24301CD4..1CF2 ; PVALID # VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC
24311CF3..1CFF ; UNASSIGNED # <reserved>..<reserved>
24321D00..1D2B ; PVALID # LATIN LETTER SMALL CAPITAL A..CYRILLIC LETTE
24331D2C..1D2E ; DISALLOWED # MODIFIER LETTER CAPITAL A..MODIFIER LETTER C
24341D2F ; PVALID # MODIFIER LETTER CAPITAL BARRED B
24351D30..1D3A ; DISALLOWED # MODIFIER LETTER CAPITAL D..MODIFIER LETTER C
24361D3B ; PVALID # MODIFIER LETTER CAPITAL REVERSED N
24371D3C..1D4D ; DISALLOWED # MODIFIER LETTER CAPITAL O..MODIFIER LETTER S
24381D4E ; PVALID # MODIFIER LETTER SMALL TURNED I
24391D4F..1D6A ; DISALLOWED # MODIFIER LETTER SMALL K..GREEK SUBSCRIPT SMA
24401D6B..1D77 ; PVALID # LATIN SMALL LETTER UE..LATIN SMALL LETTER TU
24411D78 ; DISALLOWED # MODIFIER LETTER CYRILLIC EN
24421D79..1D9A ; PVALID # LATIN SMALL LETTER INSULAR G..LATIN SMALL LE
24431D9B..1DBF ; DISALLOWED # MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER
24441DC0..1DE6 ; PVALID # COMBINING DOTTED GRAVE ACCENT..COMBINING LAT
24451DE7..1DFC ; UNASSIGNED # <reserved>..<reserved>
24461DFD..1DFF ; PVALID # COMBINING ALMOST EQUAL TO BELOW..COMBINING R
24471E00 ; DISALLOWED # LATIN CAPITAL LETTER A WITH RING BELOW
24481E01 ; PVALID # LATIN SMALL LETTER A WITH RING BELOW
24491E02 ; DISALLOWED # LATIN CAPITAL LETTER B WITH DOT ABOVE
24501E03 ; PVALID # LATIN SMALL LETTER B WITH DOT ABOVE
24511E04 ; DISALLOWED # LATIN CAPITAL LETTER B WITH DOT BELOW
24521E05 ; PVALID # LATIN SMALL LETTER B WITH DOT BELOW
24531E06 ; DISALLOWED # LATIN CAPITAL LETTER B WITH LINE BELOW
24541E07 ; PVALID # LATIN SMALL LETTER B WITH LINE BELOW
24551E08 ; DISALLOWED # LATIN CAPITAL LETTER C WITH CEDILLA AND ACUT
24561E09 ; PVALID # LATIN SMALL LETTER C WITH CEDILLA AND ACUTE
24571E0A ; DISALLOWED # LATIN CAPITAL LETTER D WITH DOT ABOVE
24581E0B ; PVALID # LATIN SMALL LETTER D WITH DOT ABOVE
24591E0C ; DISALLOWED # LATIN CAPITAL LETTER D WITH DOT BELOW
24601E0D ; PVALID # LATIN SMALL LETTER D WITH DOT BELOW
24611E0E ; DISALLOWED # LATIN CAPITAL LETTER D WITH LINE BELOW
24621E0F ; PVALID # LATIN SMALL LETTER D WITH LINE BELOW
2463
2464
2465
2466Faltstrom Standards Track [Page 44]
2467
2468RFC 5892 IDNA Code Points August 2010
2469
2470
24711E10 ; DISALLOWED # LATIN CAPITAL LETTER D WITH CEDILLA
24721E11 ; PVALID # LATIN SMALL LETTER D WITH CEDILLA
24731E12 ; DISALLOWED # LATIN CAPITAL LETTER D WITH CIRCUMFLEX BELOW
24741E13 ; PVALID # LATIN SMALL LETTER D WITH CIRCUMFLEX BELOW
24751E14 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON AND GRAVE
24761E15 ; PVALID # LATIN SMALL LETTER E WITH MACRON AND GRAVE
24771E16 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON AND ACUTE
24781E17 ; PVALID # LATIN SMALL LETTER E WITH MACRON AND ACUTE
24791E18 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX BELOW
24801E19 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX BELOW
24811E1A ; DISALLOWED # LATIN CAPITAL LETTER E WITH TILDE BELOW
24821E1B ; PVALID # LATIN SMALL LETTER E WITH TILDE BELOW
24831E1C ; DISALLOWED # LATIN CAPITAL LETTER E WITH CEDILLA AND BREV
24841E1D ; PVALID # LATIN SMALL LETTER E WITH CEDILLA AND BREVE
24851E1E ; DISALLOWED # LATIN CAPITAL LETTER F WITH DOT ABOVE
24861E1F ; PVALID # LATIN SMALL LETTER F WITH DOT ABOVE
24871E20 ; DISALLOWED # LATIN CAPITAL LETTER G WITH MACRON
24881E21 ; PVALID # LATIN SMALL LETTER G WITH MACRON
24891E22 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DOT ABOVE
24901E23 ; PVALID # LATIN SMALL LETTER H WITH DOT ABOVE
24911E24 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DOT BELOW
24921E25 ; PVALID # LATIN SMALL LETTER H WITH DOT BELOW
24931E26 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DIAERESIS
24941E27 ; PVALID # LATIN SMALL LETTER H WITH DIAERESIS
24951E28 ; DISALLOWED # LATIN CAPITAL LETTER H WITH CEDILLA
24961E29 ; PVALID # LATIN SMALL LETTER H WITH CEDILLA
24971E2A ; DISALLOWED # LATIN CAPITAL LETTER H WITH BREVE BELOW
24981E2B ; PVALID # LATIN SMALL LETTER H WITH BREVE BELOW
24991E2C ; DISALLOWED # LATIN CAPITAL LETTER I WITH TILDE BELOW
25001E2D ; PVALID # LATIN SMALL LETTER I WITH TILDE BELOW
25011E2E ; DISALLOWED # LATIN CAPITAL LETTER I WITH DIAERESIS AND AC
25021E2F ; PVALID # LATIN SMALL LETTER I WITH DIAERESIS AND ACUT
25031E30 ; DISALLOWED # LATIN CAPITAL LETTER K WITH ACUTE
25041E31 ; PVALID # LATIN SMALL LETTER K WITH ACUTE
25051E32 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DOT BELOW
25061E33 ; PVALID # LATIN SMALL LETTER K WITH DOT BELOW
25071E34 ; DISALLOWED # LATIN CAPITAL LETTER K WITH LINE BELOW
25081E35 ; PVALID # LATIN SMALL LETTER K WITH LINE BELOW
25091E36 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOT BELOW
25101E37 ; PVALID # LATIN SMALL LETTER L WITH DOT BELOW
25111E38 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOT BELOW AND MA
25121E39 ; PVALID # LATIN SMALL LETTER L WITH DOT BELOW AND MACR
25131E3A ; DISALLOWED # LATIN CAPITAL LETTER L WITH LINE BELOW
25141E3B ; PVALID # LATIN SMALL LETTER L WITH LINE BELOW
25151E3C ; DISALLOWED # LATIN CAPITAL LETTER L WITH CIRCUMFLEX BELOW
25161E3D ; PVALID # LATIN SMALL LETTER L WITH CIRCUMFLEX BELOW
25171E3E ; DISALLOWED # LATIN CAPITAL LETTER M WITH ACUTE
25181E3F ; PVALID # LATIN SMALL LETTER M WITH ACUTE
2519
2520
2521
2522Faltstrom Standards Track [Page 45]
2523
2524RFC 5892 IDNA Code Points August 2010
2525
2526
25271E40 ; DISALLOWED # LATIN CAPITAL LETTER M WITH DOT ABOVE
25281E41 ; PVALID # LATIN SMALL LETTER M WITH DOT ABOVE
25291E42 ; DISALLOWED # LATIN CAPITAL LETTER M WITH DOT BELOW
25301E43 ; PVALID # LATIN SMALL LETTER M WITH DOT BELOW
25311E44 ; DISALLOWED # LATIN CAPITAL LETTER N WITH DOT ABOVE
25321E45 ; PVALID # LATIN SMALL LETTER N WITH DOT ABOVE
25331E46 ; DISALLOWED # LATIN CAPITAL LETTER N WITH DOT BELOW
25341E47 ; PVALID # LATIN SMALL LETTER N WITH DOT BELOW
25351E48 ; DISALLOWED # LATIN CAPITAL LETTER N WITH LINE BELOW
25361E49 ; PVALID # LATIN SMALL LETTER N WITH LINE BELOW
25371E4A ; DISALLOWED # LATIN CAPITAL LETTER N WITH CIRCUMFLEX BELOW
25381E4B ; PVALID # LATIN SMALL LETTER N WITH CIRCUMFLEX BELOW
25391E4C ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND ACUTE
25401E4D ; PVALID # LATIN SMALL LETTER O WITH TILDE AND ACUTE
25411E4E ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND DIAERE
25421E4F ; PVALID # LATIN SMALL LETTER O WITH TILDE AND DIAERESI
25431E50 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON AND GRAVE
25441E51 ; PVALID # LATIN SMALL LETTER O WITH MACRON AND GRAVE
25451E52 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON AND ACUTE
25461E53 ; PVALID # LATIN SMALL LETTER O WITH MACRON AND ACUTE
25471E54 ; DISALLOWED # LATIN CAPITAL LETTER P WITH ACUTE
25481E55 ; PVALID # LATIN SMALL LETTER P WITH ACUTE
25491E56 ; DISALLOWED # LATIN CAPITAL LETTER P WITH DOT ABOVE
25501E57 ; PVALID # LATIN SMALL LETTER P WITH DOT ABOVE
25511E58 ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT ABOVE
25521E59 ; PVALID # LATIN SMALL LETTER R WITH DOT ABOVE
25531E5A ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT BELOW
25541E5B ; PVALID # LATIN SMALL LETTER R WITH DOT BELOW
25551E5C ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT BELOW AND MA
25561E5D ; PVALID # LATIN SMALL LETTER R WITH DOT BELOW AND MACR
25571E5E ; DISALLOWED # LATIN CAPITAL LETTER R WITH LINE BELOW
25581E5F ; PVALID # LATIN SMALL LETTER R WITH LINE BELOW
25591E60 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT ABOVE
25601E61 ; PVALID # LATIN SMALL LETTER S WITH DOT ABOVE
25611E62 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT BELOW
25621E63 ; PVALID # LATIN SMALL LETTER S WITH DOT BELOW
25631E64 ; DISALLOWED # LATIN CAPITAL LETTER S WITH ACUTE AND DOT AB
25641E65 ; PVALID # LATIN SMALL LETTER S WITH ACUTE AND DOT ABOV
25651E66 ; DISALLOWED # LATIN CAPITAL LETTER S WITH CARON AND DOT AB
25661E67 ; PVALID # LATIN SMALL LETTER S WITH CARON AND DOT ABOV
25671E68 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT BELOW AND DO
25681E69 ; PVALID # LATIN SMALL LETTER S WITH DOT BELOW AND DOT
25691E6A ; DISALLOWED # LATIN CAPITAL LETTER T WITH DOT ABOVE
25701E6B ; PVALID # LATIN SMALL LETTER T WITH DOT ABOVE
25711E6C ; DISALLOWED # LATIN CAPITAL LETTER T WITH DOT BELOW
25721E6D ; PVALID # LATIN SMALL LETTER T WITH DOT BELOW
25731E6E ; DISALLOWED # LATIN CAPITAL LETTER T WITH LINE BELOW
25741E6F ; PVALID # LATIN SMALL LETTER T WITH LINE BELOW
2575
2576
2577
2578Faltstrom Standards Track [Page 46]
2579
2580RFC 5892 IDNA Code Points August 2010
2581
2582
25831E70 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CIRCUMFLEX BELOW
25841E71 ; PVALID # LATIN SMALL LETTER T WITH CIRCUMFLEX BELOW
25851E72 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS BELOW
25861E73 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS BELOW
25871E74 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE BELOW
25881E75 ; PVALID # LATIN SMALL LETTER U WITH TILDE BELOW
25891E76 ; DISALLOWED # LATIN CAPITAL LETTER U WITH CIRCUMFLEX BELOW
25901E77 ; PVALID # LATIN SMALL LETTER U WITH CIRCUMFLEX BELOW
25911E78 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE AND ACUTE
25921E79 ; PVALID # LATIN SMALL LETTER U WITH TILDE AND ACUTE
25931E7A ; DISALLOWED # LATIN CAPITAL LETTER U WITH MACRON AND DIAER
25941E7B ; PVALID # LATIN SMALL LETTER U WITH MACRON AND DIAERES
25951E7C ; DISALLOWED # LATIN CAPITAL LETTER V WITH TILDE
25961E7D ; PVALID # LATIN SMALL LETTER V WITH TILDE
25971E7E ; DISALLOWED # LATIN CAPITAL LETTER V WITH DOT BELOW
25981E7F ; PVALID # LATIN SMALL LETTER V WITH DOT BELOW
25991E80 ; DISALLOWED # LATIN CAPITAL LETTER W WITH GRAVE
26001E81 ; PVALID # LATIN SMALL LETTER W WITH GRAVE
26011E82 ; DISALLOWED # LATIN CAPITAL LETTER W WITH ACUTE
26021E83 ; PVALID # LATIN SMALL LETTER W WITH ACUTE
26031E84 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DIAERESIS
26041E85 ; PVALID # LATIN SMALL LETTER W WITH DIAERESIS
26051E86 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DOT ABOVE
26061E87 ; PVALID # LATIN SMALL LETTER W WITH DOT ABOVE
26071E88 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DOT BELOW
26081E89 ; PVALID # LATIN SMALL LETTER W WITH DOT BELOW
26091E8A ; DISALLOWED # LATIN CAPITAL LETTER X WITH DOT ABOVE
26101E8B ; PVALID # LATIN SMALL LETTER X WITH DOT ABOVE
26111E8C ; DISALLOWED # LATIN CAPITAL LETTER X WITH DIAERESIS
26121E8D ; PVALID # LATIN SMALL LETTER X WITH DIAERESIS
26131E8E ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DOT ABOVE
26141E8F ; PVALID # LATIN SMALL LETTER Y WITH DOT ABOVE
26151E90 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH CIRCUMFLEX
26161E91 ; PVALID # LATIN SMALL LETTER Z WITH CIRCUMFLEX
26171E92 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DOT BELOW
26181E93 ; PVALID # LATIN SMALL LETTER Z WITH DOT BELOW
26191E94 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH LINE BELOW
26201E95..1E99 ; PVALID # LATIN SMALL LETTER Z WITH LINE BELOW..LATIN
26211E9A..1E9B ; DISALLOWED # LATIN SMALL LETTER A WITH RIGHT HALF RING..L
26221E9C..1E9D ; PVALID # LATIN SMALL LETTER LONG S WITH DIAGONAL STRO
26231E9E ; DISALLOWED # LATIN CAPITAL LETTER SHARP S
26241E9F ; PVALID # LATIN SMALL LETTER DELTA
26251EA0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT BELOW
26261EA1 ; PVALID # LATIN SMALL LETTER A WITH DOT BELOW
26271EA2 ; DISALLOWED # LATIN CAPITAL LETTER A WITH HOOK ABOVE
26281EA3 ; PVALID # LATIN SMALL LETTER A WITH HOOK ABOVE
26291EA4 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND A
26301EA5 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACU
2631
2632
2633
2634Faltstrom Standards Track [Page 47]
2635
2636RFC 5892 IDNA Code Points August 2010
2637
2638
26391EA6 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND G
26401EA7 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRA
26411EA8 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND H
26421EA9 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND HOO
26431EAA ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND T
26441EAB ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND TIL
26451EAC ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND D
26461EAD ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT
26471EAE ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND ACUTE
26481EAF ; PVALID # LATIN SMALL LETTER A WITH BREVE AND ACUTE
26491EB0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND GRAVE
26501EB1 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND GRAVE
26511EB2 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND HOOK A
26521EB3 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND HOOK ABO
26531EB4 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND TILDE
26541EB5 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND TILDE
26551EB6 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND DOT BE
26561EB7 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND DOT BELO
26571EB8 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOT BELOW
26581EB9 ; PVALID # LATIN SMALL LETTER E WITH DOT BELOW
26591EBA ; DISALLOWED # LATIN CAPITAL LETTER E WITH HOOK ABOVE
26601EBB ; PVALID # LATIN SMALL LETTER E WITH HOOK ABOVE
26611EBC ; DISALLOWED # LATIN CAPITAL LETTER E WITH TILDE
26621EBD ; PVALID # LATIN SMALL LETTER E WITH TILDE
26631EBE ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND A
26641EBF ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACU
26651EC0 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND G
26661EC1 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND GRA
26671EC2 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND H
26681EC3 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND HOO
26691EC4 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND T
26701EC5 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND TIL
26711EC6 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND D
26721EC7 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT
26731EC8 ; DISALLOWED # LATIN CAPITAL LETTER I WITH HOOK ABOVE
26741EC9 ; PVALID # LATIN SMALL LETTER I WITH HOOK ABOVE
26751ECA ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOT BELOW
26761ECB ; PVALID # LATIN SMALL LETTER I WITH DOT BELOW
26771ECC ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT BELOW
26781ECD ; PVALID # LATIN SMALL LETTER O WITH DOT BELOW
26791ECE ; DISALLOWED # LATIN CAPITAL LETTER O WITH HOOK ABOVE
26801ECF ; PVALID # LATIN SMALL LETTER O WITH HOOK ABOVE
26811ED0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND A
26821ED1 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND ACU
26831ED2 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND G
26841ED3 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRA
26851ED4 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND H
26861ED5 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND HOO
2687
2688
2689
2690Faltstrom Standards Track [Page 48]
2691
2692RFC 5892 IDNA Code Points August 2010
2693
2694
26951ED6 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND T
26961ED7 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND TIL
26971ED8 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND D
26981ED9 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT
26991EDA ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND ACUTE
27001EDB ; PVALID # LATIN SMALL LETTER O WITH HORN AND ACUTE
27011EDC ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND GRAVE
27021EDD ; PVALID # LATIN SMALL LETTER O WITH HORN AND GRAVE
27031EDE ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND HOOK AB
27041EDF ; PVALID # LATIN SMALL LETTER O WITH HORN AND HOOK ABOV
27051EE0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND TILDE
27061EE1 ; PVALID # LATIN SMALL LETTER O WITH HORN AND TILDE
27071EE2 ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND DOT BEL
27081EE3 ; PVALID # LATIN SMALL LETTER O WITH HORN AND DOT BELOW
27091EE4 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOT BELOW
27101EE5 ; PVALID # LATIN SMALL LETTER U WITH DOT BELOW
27111EE6 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HOOK ABOVE
27121EE7 ; PVALID # LATIN SMALL LETTER U WITH HOOK ABOVE
27131EE8 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND ACUTE
27141EE9 ; PVALID # LATIN SMALL LETTER U WITH HORN AND ACUTE
27151EEA ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND GRAVE
27161EEB ; PVALID # LATIN SMALL LETTER U WITH HORN AND GRAVE
27171EEC ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND HOOK AB
27181EED ; PVALID # LATIN SMALL LETTER U WITH HORN AND HOOK ABOV
27191EEE ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND TILDE
27201EEF ; PVALID # LATIN SMALL LETTER U WITH HORN AND TILDE
27211EF0 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND DOT BEL
27221EF1 ; PVALID # LATIN SMALL LETTER U WITH HORN AND DOT BELOW
27231EF2 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH GRAVE
27241EF3 ; PVALID # LATIN SMALL LETTER Y WITH GRAVE
27251EF4 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DOT BELOW
27261EF5 ; PVALID # LATIN SMALL LETTER Y WITH DOT BELOW
27271EF6 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH HOOK ABOVE
27281EF7 ; PVALID # LATIN SMALL LETTER Y WITH HOOK ABOVE
27291EF8 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH TILDE
27301EF9 ; PVALID # LATIN SMALL LETTER Y WITH TILDE
27311EFA ; DISALLOWED # LATIN CAPITAL LETTER MIDDLE-WELSH LL
27321EFB ; PVALID # LATIN SMALL LETTER MIDDLE-WELSH LL
27331EFC ; DISALLOWED # LATIN CAPITAL LETTER MIDDLE-WELSH V
27341EFD ; PVALID # LATIN SMALL LETTER MIDDLE-WELSH V
27351EFE ; DISALLOWED # LATIN CAPITAL LETTER Y WITH LOOP
27361EFF..1F07 ; PVALID # LATIN SMALL LETTER Y WITH LOOP..GREEK SMALL
27371F08..1F0F ; DISALLOWED # GREEK CAPITAL LETTER ALPHA WITH PSILI..GREEK
27381F10..1F15 ; PVALID # GREEK SMALL LETTER EPSILON WITH PSILI..GREEK
27391F16..1F17 ; UNASSIGNED # <reserved>..<reserved>
27401F18..1F1D ; DISALLOWED # GREEK CAPITAL LETTER EPSILON WITH PSILI..GRE
27411F1E..1F1F ; UNASSIGNED # <reserved>..<reserved>
27421F20..1F27 ; PVALID # GREEK SMALL LETTER ETA WITH PSILI..GREEK SMA
2743
2744
2745
2746Faltstrom Standards Track [Page 49]
2747
2748RFC 5892 IDNA Code Points August 2010
2749
2750
27511F28..1F2F ; DISALLOWED # GREEK CAPITAL LETTER ETA WITH PSILI..GREEK C
27521F30..1F37 ; PVALID # GREEK SMALL LETTER IOTA WITH PSILI..GREEK SM
27531F38..1F3F ; DISALLOWED # GREEK CAPITAL LETTER IOTA WITH PSILI..GREEK
27541F40..1F45 ; PVALID # GREEK SMALL LETTER OMICRON WITH PSILI..GREEK
27551F46..1F47 ; UNASSIGNED # <reserved>..<reserved>
27561F48..1F4D ; DISALLOWED # GREEK CAPITAL LETTER OMICRON WITH PSILI..GRE
27571F4E..1F4F ; UNASSIGNED # <reserved>..<reserved>
27581F50..1F57 ; PVALID # GREEK SMALL LETTER UPSILON WITH PSILI..GREEK
27591F58 ; UNASSIGNED # <reserved>
27601F59 ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA
27611F5A ; UNASSIGNED # <reserved>
27621F5B ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
27631F5C ; UNASSIGNED # <reserved>
27641F5D ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
27651F5E ; UNASSIGNED # <reserved>
27661F5F ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND
27671F60..1F67 ; PVALID # GREEK SMALL LETTER OMEGA WITH PSILI..GREEK S
27681F68..1F6F ; DISALLOWED # GREEK CAPITAL LETTER OMEGA WITH PSILI..GREEK
27691F70 ; PVALID # GREEK SMALL LETTER ALPHA WITH VARIA
27701F71 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH OXIA
27711F72 ; PVALID # GREEK SMALL LETTER EPSILON WITH VARIA
27721F73 ; DISALLOWED # GREEK SMALL LETTER EPSILON WITH OXIA
27731F74 ; PVALID # GREEK SMALL LETTER ETA WITH VARIA
27741F75 ; DISALLOWED # GREEK SMALL LETTER ETA WITH OXIA
27751F76 ; PVALID # GREEK SMALL LETTER IOTA WITH VARIA
27761F77 ; DISALLOWED # GREEK SMALL LETTER IOTA WITH OXIA
27771F78 ; PVALID # GREEK SMALL LETTER OMICRON WITH VARIA
27781F79 ; DISALLOWED # GREEK SMALL LETTER OMICRON WITH OXIA
27791F7A ; PVALID # GREEK SMALL LETTER UPSILON WITH VARIA
27801F7B ; DISALLOWED # GREEK SMALL LETTER UPSILON WITH OXIA
27811F7C ; PVALID # GREEK SMALL LETTER OMEGA WITH VARIA
27821F7D ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH OXIA
27831F7E..1F7F ; UNASSIGNED # <reserved>..<reserved>
27841F80..1FAF ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH PSILI AND YPOG
27851FB0..1FB1 ; PVALID # GREEK SMALL LETTER ALPHA WITH VRACHY..GREEK
27861FB2..1FB4 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH VARIA AND YPOG
27871FB5 ; UNASSIGNED # <reserved>
27881FB6 ; PVALID # GREEK SMALL LETTER ALPHA WITH PERISPOMENI
27891FB7..1FC4 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH PERISPOMENI AN
27901FC5 ; UNASSIGNED # <reserved>
27911FC6 ; PVALID # GREEK SMALL LETTER ETA WITH PERISPOMENI
27921FC7..1FCF ; DISALLOWED # GREEK SMALL LETTER ETA WITH PERISPOMENI AND
27931FD0..1FD2 ; PVALID # GREEK SMALL LETTER IOTA WITH VRACHY..GREEK S
27941FD3 ; DISALLOWED # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND O
27951FD4..1FD5 ; UNASSIGNED # <reserved>..<reserved>
27961FD6..1FD7 ; PVALID # GREEK SMALL LETTER IOTA WITH PERISPOMENI..GR
27971FD8..1FDB ; DISALLOWED # GREEK CAPITAL LETTER IOTA WITH VRACHY..GREEK
27981FDC ; UNASSIGNED # <reserved>
2799
2800
2801
2802Faltstrom Standards Track [Page 50]
2803
2804RFC 5892 IDNA Code Points August 2010
2805
2806
28071FDD..1FDF ; DISALLOWED # GREEK DASIA AND VARIA..GREEK DASIA AND PERIS
28081FE0..1FE2 ; PVALID # GREEK SMALL LETTER UPSILON WITH VRACHY..GREE
28091FE3 ; DISALLOWED # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AN
28101FE4..1FE7 ; PVALID # GREEK SMALL LETTER RHO WITH PSILI..GREEK SMA
28111FE8..1FEF ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH VRACHY..GR
28121FF0..1FF1 ; UNASSIGNED # <reserved>..<reserved>
28131FF2..1FF4 ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH VARIA AND YPOG
28141FF5 ; UNASSIGNED # <reserved>
28151FF6 ; PVALID # GREEK SMALL LETTER OMEGA WITH PERISPOMENI
28161FF7..1FFE ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH PERISPOMENI AN
28171FFF ; UNASSIGNED # <reserved>
28182000..200B ; DISALLOWED # EN QUAD..ZERO WIDTH SPACE
2819200C..200D ; CONTEXTJ # ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
2820200E..2064 ; DISALLOWED # LEFT-TO-RIGHT MARK..INVISIBLE PLUS
28212065..2069 ; UNASSIGNED # <reserved>..<reserved>
2822206A..2071 ; DISALLOWED # INHIBIT SYMMETRIC SWAPPING..SUPERSCRIPT LATI
28232072..2073 ; UNASSIGNED # <reserved>..<reserved>
28242074..208E ; DISALLOWED # SUPERSCRIPT FOUR..SUBSCRIPT RIGHT PARENTHESI
2825208F ; UNASSIGNED # <reserved>
28262090..2094 ; DISALLOWED # LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCR
28272095..209F ; UNASSIGNED # <reserved>..<reserved>
282820A0..20B8 ; DISALLOWED # EURO-CURRENCY SIGN..TENGE SIGN
282920B9..20CF ; UNASSIGNED # <reserved>..<reserved>
283020D0..20F0 ; DISALLOWED # COMBINING LEFT HARPOON ABOVE..COMBINING ASTE
283120F1..20FF ; UNASSIGNED # <reserved>..<reserved>
28322100..214D ; DISALLOWED # ACCOUNT OF..AKTIESELSKAB
2833214E ; PVALID # TURNED SMALL F
2834214F..2183 ; DISALLOWED # SYMBOL FOR SAMARITAN SOURCE..ROMAN NUMERAL R
28352184 ; PVALID # LATIN SMALL LETTER REVERSED C
28362185..2189 ; DISALLOWED # ROMAN NUMERAL SIX LATE FORM..VULGAR FRACTION
2837218A..218F ; UNASSIGNED # <reserved>..<reserved>
28382190..23E8 ; DISALLOWED # LEFTWARDS ARROW..DECIMAL EXPONENT SYMBOL
283923E9..23FF ; UNASSIGNED # <reserved>..<reserved>
28402400..2426 ; DISALLOWED # SYMBOL FOR NULL..SYMBOL FOR SUBSTITUTE FORM
28412427..243F ; UNASSIGNED # <reserved>..<reserved>
28422440..244A ; DISALLOWED # OCR HOOK..OCR DOUBLE BACKSLASH
2843244B..245F ; UNASSIGNED # <reserved>..<reserved>
28442460..26CD ; DISALLOWED # CIRCLED DIGIT ONE..DISABLED CAR
284526CE ; UNASSIGNED # <reserved>
284626CF..26E1 ; DISALLOWED # PICK..RESTRICTED LEFT ENTRY-2
284726E2 ; UNASSIGNED # <reserved>
284826E3 ; DISALLOWED # HEAVY CIRCLE WITH STROKE AND TWO DOTS ABOVE
284926E4..26E7 ; UNASSIGNED # <reserved>..<reserved>
285026E8..26FF ; DISALLOWED # BLACK CROSS ON SHIELD..WHITE FLAG WITH HORIZ
28512700 ; UNASSIGNED # <reserved>
28522701..2704 ; DISALLOWED # UPPER BLADE SCISSORS..WHITE SCISSORS
28532705 ; UNASSIGNED # <reserved>
28542706..2709 ; DISALLOWED # TELEPHONE LOCATION SIGN..ENVELOPE
2855
2856
2857
2858Faltstrom Standards Track [Page 51]
2859
2860RFC 5892 IDNA Code Points August 2010
2861
2862
2863270A..270B ; UNASSIGNED # <reserved>..<reserved>
2864270C..2727 ; DISALLOWED # VICTORY HAND..WHITE FOUR POINTED STAR
28652728 ; UNASSIGNED # <reserved>
28662729..274B ; DISALLOWED # STRESS OUTLINED WHITE STAR..HEAVY EIGHT TEAR
2867274C ; UNASSIGNED # <reserved>
2868274D ; DISALLOWED # SHADOWED WHITE CIRCLE
2869274E ; UNASSIGNED # <reserved>
2870274F..2752 ; DISALLOWED # LOWER RIGHT DROP-SHADOWED WHITE SQUARE..UPPE
28712753..2755 ; UNASSIGNED # <reserved>..<reserved>
28722756..275E ; DISALLOWED # BLACK DIAMOND MINUS WHITE X..HEAVY DOUBLE CO
2873275F..2760 ; UNASSIGNED # <reserved>..<reserved>
28742761..2794 ; DISALLOWED # CURVED STEM PARAGRAPH SIGN ORNAMENT..HEAVY W
28752795..2797 ; UNASSIGNED # <reserved>..<reserved>
28762798..27AF ; DISALLOWED # HEAVY SOUTH EAST ARROW..NOTCHED LOWER RIGHT-
287727B0 ; UNASSIGNED # <reserved>
287827B1..27BE ; DISALLOWED # NOTCHED UPPER RIGHT-SHADOWED WHITE RIGHTWARD
287927BF ; UNASSIGNED # <reserved>
288027C0..27CA ; DISALLOWED # THREE DIMENSIONAL ANGLE..VERTICAL BAR WITH H
288127CB ; UNASSIGNED # <reserved>
288227CC ; DISALLOWED # LONG DIVISION
288327CD..27CF ; UNASSIGNED # <reserved>..<reserved>
288427D0..2B4C ; DISALLOWED # WHITE DIAMOND WITH CENTRED DOT..RIGHTWARDS A
28852B4D..2B4F ; UNASSIGNED # <reserved>..<reserved>
28862B50..2B59 ; DISALLOWED # WHITE MEDIUM STAR..HEAVY CIRCLED SALTIRE
28872B5A..2BFF ; UNASSIGNED # <reserved>..<reserved>
28882C00..2C2E ; DISALLOWED # GLAGOLITIC CAPITAL LETTER AZU..GLAGOLITIC CA
28892C2F ; UNASSIGNED # <reserved>
28902C30..2C5E ; PVALID # GLAGOLITIC SMALL LETTER AZU..GLAGOLITIC SMAL
28912C5F ; UNASSIGNED # <reserved>
28922C60 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOUBLE BAR
28932C61 ; PVALID # LATIN SMALL LETTER L WITH DOUBLE BAR
28942C62..2C64 ; DISALLOWED # LATIN CAPITAL LETTER L WITH MIDDLE TILDE..LA
28952C65..2C66 ; PVALID # LATIN SMALL LETTER A WITH STROKE..LATIN SMAL
28962C67 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DESCENDER
28972C68 ; PVALID # LATIN SMALL LETTER H WITH DESCENDER
28982C69 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DESCENDER
28992C6A ; PVALID # LATIN SMALL LETTER K WITH DESCENDER
29002C6B ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DESCENDER
29012C6C ; PVALID # LATIN SMALL LETTER Z WITH DESCENDER
29022C6D..2C70 ; DISALLOWED # LATIN CAPITAL LETTER ALPHA..LATIN CAPITAL LE
29032C71 ; PVALID # LATIN SMALL LETTER V WITH RIGHT HOOK
29042C72 ; DISALLOWED # LATIN CAPITAL LETTER W WITH HOOK
29052C73..2C74 ; PVALID # LATIN SMALL LETTER W WITH HOOK..LATIN SMALL
29062C75 ; DISALLOWED # LATIN CAPITAL LETTER HALF H
29072C76..2C7B ; PVALID # LATIN SMALL LETTER HALF H..LATIN LETTER SMAL
29082C7C..2C80 ; DISALLOWED # LATIN SUBSCRIPT SMALL LETTER J..COPTIC CAPIT
29092C81 ; PVALID # COPTIC SMALL LETTER ALFA
29102C82 ; DISALLOWED # COPTIC CAPITAL LETTER VIDA
2911
2912
2913
2914Faltstrom Standards Track [Page 52]
2915
2916RFC 5892 IDNA Code Points August 2010
2917
2918
29192C83 ; PVALID # COPTIC SMALL LETTER VIDA
29202C84 ; DISALLOWED # COPTIC CAPITAL LETTER GAMMA
29212C85 ; PVALID # COPTIC SMALL LETTER GAMMA
29222C86 ; DISALLOWED # COPTIC CAPITAL LETTER DALDA
29232C87 ; PVALID # COPTIC SMALL LETTER DALDA
29242C88 ; DISALLOWED # COPTIC CAPITAL LETTER EIE
29252C89 ; PVALID # COPTIC SMALL LETTER EIE
29262C8A ; DISALLOWED # COPTIC CAPITAL LETTER SOU
29272C8B ; PVALID # COPTIC SMALL LETTER SOU
29282C8C ; DISALLOWED # COPTIC CAPITAL LETTER ZATA
29292C8D ; PVALID # COPTIC SMALL LETTER ZATA
29302C8E ; DISALLOWED # COPTIC CAPITAL LETTER HATE
29312C8F ; PVALID # COPTIC SMALL LETTER HATE
29322C90 ; DISALLOWED # COPTIC CAPITAL LETTER THETHE
29332C91 ; PVALID # COPTIC SMALL LETTER THETHE
29342C92 ; DISALLOWED # COPTIC CAPITAL LETTER IAUDA
29352C93 ; PVALID # COPTIC SMALL LETTER IAUDA
29362C94 ; DISALLOWED # COPTIC CAPITAL LETTER KAPA
29372C95 ; PVALID # COPTIC SMALL LETTER KAPA
29382C96 ; DISALLOWED # COPTIC CAPITAL LETTER LAULA
29392C97 ; PVALID # COPTIC SMALL LETTER LAULA
29402C98 ; DISALLOWED # COPTIC CAPITAL LETTER MI
29412C99 ; PVALID # COPTIC SMALL LETTER MI
29422C9A ; DISALLOWED # COPTIC CAPITAL LETTER NI
29432C9B ; PVALID # COPTIC SMALL LETTER NI
29442C9C ; DISALLOWED # COPTIC CAPITAL LETTER KSI
29452C9D ; PVALID # COPTIC SMALL LETTER KSI
29462C9E ; DISALLOWED # COPTIC CAPITAL LETTER O
29472C9F ; PVALID # COPTIC SMALL LETTER O
29482CA0 ; DISALLOWED # COPTIC CAPITAL LETTER PI
29492CA1 ; PVALID # COPTIC SMALL LETTER PI
29502CA2 ; DISALLOWED # COPTIC CAPITAL LETTER RO
29512CA3 ; PVALID # COPTIC SMALL LETTER RO
29522CA4 ; DISALLOWED # COPTIC CAPITAL LETTER SIMA
29532CA5 ; PVALID # COPTIC SMALL LETTER SIMA
29542CA6 ; DISALLOWED # COPTIC CAPITAL LETTER TAU
29552CA7 ; PVALID # COPTIC SMALL LETTER TAU
29562CA8 ; DISALLOWED # COPTIC CAPITAL LETTER UA
29572CA9 ; PVALID # COPTIC SMALL LETTER UA
29582CAA ; DISALLOWED # COPTIC CAPITAL LETTER FI
29592CAB ; PVALID # COPTIC SMALL LETTER FI
29602CAC ; DISALLOWED # COPTIC CAPITAL LETTER KHI
29612CAD ; PVALID # COPTIC SMALL LETTER KHI
29622CAE ; DISALLOWED # COPTIC CAPITAL LETTER PSI
29632CAF ; PVALID # COPTIC SMALL LETTER PSI
29642CB0 ; DISALLOWED # COPTIC CAPITAL LETTER OOU
29652CB1 ; PVALID # COPTIC SMALL LETTER OOU
29662CB2 ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P ALEF
2967
2968
2969
2970Faltstrom Standards Track [Page 53]
2971
2972RFC 5892 IDNA Code Points August 2010
2973
2974
29752CB3 ; PVALID # COPTIC SMALL LETTER DIALECT-P ALEF
29762CB4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC AIN
29772CB5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC AIN
29782CB6 ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC EIE
29792CB7 ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC EIE
29802CB8 ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P KAPA
29812CB9 ; PVALID # COPTIC SMALL LETTER DIALECT-P KAPA
29822CBA ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P NI
29832CBB ; PVALID # COPTIC SMALL LETTER DIALECT-P NI
29842CBC ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC NI
29852CBD ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC NI
29862CBE ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC OOU
29872CBF ; PVALID # COPTIC SMALL LETTER OLD COPTIC OOU
29882CC0 ; DISALLOWED # COPTIC CAPITAL LETTER SAMPI
29892CC1 ; PVALID # COPTIC SMALL LETTER SAMPI
29902CC2 ; DISALLOWED # COPTIC CAPITAL LETTER CROSSED SHEI
29912CC3 ; PVALID # COPTIC SMALL LETTER CROSSED SHEI
29922CC4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC SHEI
29932CC5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC SHEI
29942CC6 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC ESH
29952CC7 ; PVALID # COPTIC SMALL LETTER OLD COPTIC ESH
29962CC8 ; DISALLOWED # COPTIC CAPITAL LETTER AKHMIMIC KHEI
29972CC9 ; PVALID # COPTIC SMALL LETTER AKHMIMIC KHEI
29982CCA ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P HORI
29992CCB ; PVALID # COPTIC SMALL LETTER DIALECT-P HORI
30002CCC ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HORI
30012CCD ; PVALID # COPTIC SMALL LETTER OLD COPTIC HORI
30022CCE ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HA
30032CCF ; PVALID # COPTIC SMALL LETTER OLD COPTIC HA
30042CD0 ; DISALLOWED # COPTIC CAPITAL LETTER L-SHAPED HA
30052CD1 ; PVALID # COPTIC SMALL LETTER L-SHAPED HA
30062CD2 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HEI
30072CD3 ; PVALID # COPTIC SMALL LETTER OLD COPTIC HEI
30082CD4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HAT
30092CD5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC HAT
30102CD6 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC GANGIA
30112CD7 ; PVALID # COPTIC SMALL LETTER OLD COPTIC GANGIA
30122CD8 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC DJA
30132CD9 ; PVALID # COPTIC SMALL LETTER OLD COPTIC DJA
30142CDA ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC SHIMA
30152CDB ; PVALID # COPTIC SMALL LETTER OLD COPTIC SHIMA
30162CDC ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN SHIMA
30172CDD ; PVALID # COPTIC SMALL LETTER OLD NUBIAN SHIMA
30182CDE ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN NGI
30192CDF ; PVALID # COPTIC SMALL LETTER OLD NUBIAN NGI
30202CE0 ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN NYI
30212CE1 ; PVALID # COPTIC SMALL LETTER OLD NUBIAN NYI
30222CE2 ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN WAU
3023
3024
3025
3026Faltstrom Standards Track [Page 54]
3027
3028RFC 5892 IDNA Code Points August 2010
3029
3030
30312CE3..2CE4 ; PVALID # COPTIC SMALL LETTER OLD NUBIAN WAU..COPTIC S
30322CE5..2CEB ; DISALLOWED # COPTIC SYMBOL MI RO..COPTIC CAPITAL LETTER C
30332CEC ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC SHEI
30342CED ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC GANGIA
30352CEE..2CF1 ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC GANGIA..CO
30362CF2..2CF8 ; UNASSIGNED # <reserved>..<reserved>
30372CF9..2CFF ; DISALLOWED # COPTIC OLD NUBIAN FULL STOP..COPTIC MORPHOLO
30382D00..2D25 ; PVALID # GEORGIAN SMALL LETTER AN..GEORGIAN SMALL LET
30392D26..2D2F ; UNASSIGNED # <reserved>..<reserved>
30402D30..2D65 ; PVALID # TIFINAGH LETTER YA..TIFINAGH LETTER YAZZ
30412D66..2D6E ; UNASSIGNED # <reserved>..<reserved>
30422D6F ; DISALLOWED # TIFINAGH MODIFIER LETTER LABIALIZATION MARK
30432D70..2D7F ; UNASSIGNED # <reserved>..<reserved>
30442D80..2D96 ; PVALID # ETHIOPIC SYLLABLE LOA..ETHIOPIC SYLLABLE GGW
30452D97..2D9F ; UNASSIGNED # <reserved>..<reserved>
30462DA0..2DA6 ; PVALID # ETHIOPIC SYLLABLE SSA..ETHIOPIC SYLLABLE SSO
30472DA7 ; UNASSIGNED # <reserved>
30482DA8..2DAE ; PVALID # ETHIOPIC SYLLABLE CCA..ETHIOPIC SYLLABLE CCO
30492DAF ; UNASSIGNED # <reserved>
30502DB0..2DB6 ; PVALID # ETHIOPIC SYLLABLE ZZA..ETHIOPIC SYLLABLE ZZO
30512DB7 ; UNASSIGNED # <reserved>
30522DB8..2DBE ; PVALID # ETHIOPIC SYLLABLE CCHA..ETHIOPIC SYLLABLE CC
30532DBF ; UNASSIGNED # <reserved>
30542DC0..2DC6 ; PVALID # ETHIOPIC SYLLABLE QYA..ETHIOPIC SYLLABLE QYO
30552DC7 ; UNASSIGNED # <reserved>
30562DC8..2DCE ; PVALID # ETHIOPIC SYLLABLE KYA..ETHIOPIC SYLLABLE KYO
30572DCF ; UNASSIGNED # <reserved>
30582DD0..2DD6 ; PVALID # ETHIOPIC SYLLABLE XYA..ETHIOPIC SYLLABLE XYO
30592DD7 ; UNASSIGNED # <reserved>
30602DD8..2DDE ; PVALID # ETHIOPIC SYLLABLE GYA..ETHIOPIC SYLLABLE GYO
30612DDF ; UNASSIGNED # <reserved>
30622DE0..2DFF ; PVALID # COMBINING CYRILLIC LETTER BE..COMBINING CYRI
30632E00..2E2E ; DISALLOWED # RIGHT ANGLE SUBSTITUTION MARKER..REVERSED QU
30642E2F ; PVALID # VERTICAL TILDE
30652E30..2E31 ; DISALLOWED # RING POINT..WORD SEPARATOR MIDDLE DOT
30662E32..2E7F ; UNASSIGNED # <reserved>..<reserved>
30672E80..2E99 ; DISALLOWED # CJK RADICAL REPEAT..CJK RADICAL RAP
30682E9A ; UNASSIGNED # <reserved>
30692E9B..2EF3 ; DISALLOWED # CJK RADICAL CHOKE..CJK RADICAL C-SIMPLIFIED
30702EF4..2EFF ; UNASSIGNED # <reserved>..<reserved>
30712F00..2FD5 ; DISALLOWED # KANGXI RADICAL ONE..KANGXI RADICAL FLUTE
30722FD6..2FEF ; UNASSIGNED # <reserved>..<reserved>
30732FF0..2FFB ; DISALLOWED # IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO RI
30742FFC..2FFF ; UNASSIGNED # <reserved>..<reserved>
30753000..3004 ; DISALLOWED # IDEOGRAPHIC SPACE..JAPANESE INDUSTRIAL STAND
30763005..3007 ; PVALID # IDEOGRAPHIC ITERATION MARK..IDEOGRAPHIC NUMB
30773008..3029 ; DISALLOWED # LEFT ANGLE BRACKET..HANGZHOU NUMERAL NINE
3078302A..302D ; PVALID # IDEOGRAPHIC LEVEL TONE MARK..IDEOGRAPHIC ENT
3079
3080
3081
3082Faltstrom Standards Track [Page 55]
3083
3084RFC 5892 IDNA Code Points August 2010
3085
3086
3087302E..303B ; DISALLOWED # HANGUL SINGLE DOT TONE MARK..VERTICAL IDEOGR
3088303C ; PVALID # MASU MARK
3089303D..303F ; DISALLOWED # PART ALTERNATION MARK..IDEOGRAPHIC HALF FILL
30903040 ; UNASSIGNED # <reserved>
30913041..3096 ; PVALID # HIRAGANA LETTER SMALL A..HIRAGANA LETTER SMA
30923097..3098 ; UNASSIGNED # <reserved>..<reserved>
30933099..309A ; PVALID # COMBINING KATAKANA-HIRAGANA VOICED SOUND MAR
3094309B..309C ; DISALLOWED # KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKAN
3095309D..309E ; PVALID # HIRAGANA ITERATION MARK..HIRAGANA VOICED ITE
3096309F..30A0 ; DISALLOWED # HIRAGANA DIGRAPH YORI..KATAKANA-HIRAGANA DOU
309730A1..30FA ; PVALID # KATAKANA LETTER SMALL A..KATAKANA LETTER VO
309830FB ; CONTEXTO # KATAKANA MIDDLE DOT
309930FC..30FE ; PVALID # KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATA
310030FF ; DISALLOWED # KATAKANA DIGRAPH KOTO
31013100..3104 ; UNASSIGNED # <reserved>..<reserved>
31023105..312D ; PVALID # BOPOMOFO LETTER B..BOPOMOFO LETTER IH
3103312E..3130 ; UNASSIGNED # <reserved>..<reserved>
31043131..318E ; DISALLOWED # HANGUL LETTER KIYEOK..HANGUL LETTER ARAEAE
3105318F ; UNASSIGNED # <reserved>
31063190..319F ; DISALLOWED # IDEOGRAPHIC ANNOTATION LINKING MARK..IDEOGRA
310731A0..31B7 ; PVALID # BOPOMOFO LETTER BU..BOPOMOFO FINAL LETTER H
310831B8..31BF ; UNASSIGNED # <reserved>..<reserved>
310931C0..31E3 ; DISALLOWED # CJK STROKE T..CJK STROKE Q
311031E4..31EF ; UNASSIGNED # <reserved>..<reserved>
311131F0..31FF ; PVALID # KATAKANA LETTER SMALL KU..KATAKANA LETTER SM
31123200..321E ; DISALLOWED # PARENTHESIZED HANGUL KIYEOK..PARENTHESIZED K
3113321F ; UNASSIGNED # <reserved>
31143220..32FE ; DISALLOWED # PARENTHESIZED IDEOGRAPH ONE..CIRCLED KATAKAN
311532FF ; UNASSIGNED # <reserved>
31163300..33FF ; DISALLOWED # SQUARE APAATO..SQUARE GAL
31173400..4DB5 ; PVALID # <CJK Ideograph Extension A>..<CJK Ideograph
31184DB6..4DBF ; UNASSIGNED # <reserved>..<reserved>
31194DC0..4DFF ; DISALLOWED # HEXAGRAM FOR THE CREATIVE HEAVEN..HEXAGRAM F
31204E00..9FCB ; PVALID # <CJK Ideograph>..<CJK Ideograph>
31219FCC..9FFF ; UNASSIGNED # <reserved>..<reserved>
3122A000..A48C ; PVALID # YI SYLLABLE IT..YI SYLLABLE YYR
3123A48D..A48F ; UNASSIGNED # <reserved>..<reserved>
3124A490..A4C6 ; DISALLOWED # YI RADICAL QOT..YI RADICAL KE
3125A4C7..A4CF ; UNASSIGNED # <reserved>..<reserved>
3126A4D0..A4FD ; PVALID # LISU LETTER BA..LISU LETTER TONE MYA JEU
3127A4FE..A4FF ; DISALLOWED # LISU PUNCTUATION COMMA..LISU PUNCTUATION FUL
3128A500..A60C ; PVALID # VAI SYLLABLE EE..VAI SYLLABLE LENGTHENER
3129A60D..A60F ; DISALLOWED # VAI COMMA..VAI QUESTION MARK
3130A610..A62B ; PVALID # VAI SYLLABLE NDOLE FA..VAI SYLLABLE NDOLE DO
3131A62C..A63F ; UNASSIGNED # <reserved>..<reserved>
3132A640 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZEMLYA
3133A641 ; PVALID # CYRILLIC SMALL LETTER ZEMLYA
3134A642 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZELO
3135
3136
3137
3138Faltstrom Standards Track [Page 56]
3139
3140RFC 5892 IDNA Code Points August 2010
3141
3142
3143A643 ; PVALID # CYRILLIC SMALL LETTER DZELO
3144A644 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED DZE
3145A645 ; PVALID # CYRILLIC SMALL LETTER REVERSED DZE
3146A646 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTA
3147A647 ; PVALID # CYRILLIC SMALL LETTER IOTA
3148A648 ; DISALLOWED # CYRILLIC CAPITAL LETTER DJERV
3149A649 ; PVALID # CYRILLIC SMALL LETTER DJERV
3150A64A ; DISALLOWED # CYRILLIC CAPITAL LETTER MONOGRAPH UK
3151A64B ; PVALID # CYRILLIC SMALL LETTER MONOGRAPH UK
3152A64C ; DISALLOWED # CYRILLIC CAPITAL LETTER BROAD OMEGA
3153A64D ; PVALID # CYRILLIC SMALL LETTER BROAD OMEGA
3154A64E ; DISALLOWED # CYRILLIC CAPITAL LETTER NEUTRAL YER
3155A64F ; PVALID # CYRILLIC SMALL LETTER NEUTRAL YER
3156A650 ; DISALLOWED # CYRILLIC CAPITAL LETTER YERU WITH BACK YER
3157A651 ; PVALID # CYRILLIC SMALL LETTER YERU WITH BACK YER
3158A652 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED YAT
3159A653 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED YAT
3160A654 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED YU
3161A655 ; PVALID # CYRILLIC SMALL LETTER REVERSED YU
3162A656 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED A
3163A657 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED A
3164A658 ; DISALLOWED # CYRILLIC CAPITAL LETTER CLOSED LITTLE YUS
3165A659 ; PVALID # CYRILLIC SMALL LETTER CLOSED LITTLE YUS
3166A65A ; DISALLOWED # CYRILLIC CAPITAL LETTER BLENDED YUS
3167A65B ; PVALID # CYRILLIC SMALL LETTER BLENDED YUS
3168A65C ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED CLOSED LITT
3169A65D ; PVALID # CYRILLIC SMALL LETTER IOTIFIED CLOSED LITTLE
3170A65E ; DISALLOWED # CYRILLIC CAPITAL LETTER YN
3171A65F ; PVALID # CYRILLIC SMALL LETTER YN
3172A660..A661 ; UNASSIGNED # <reserved>..<reserved>
3173A662 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT DE
3174A663 ; PVALID # CYRILLIC SMALL LETTER SOFT DE
3175A664 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT EL
3176A665 ; PVALID # CYRILLIC SMALL LETTER SOFT EL
3177A666 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT EM
3178A667 ; PVALID # CYRILLIC SMALL LETTER SOFT EM
3179A668 ; DISALLOWED # CYRILLIC CAPITAL LETTER MONOCULAR O
3180A669 ; PVALID # CYRILLIC SMALL LETTER MONOCULAR O
3181A66A ; DISALLOWED # CYRILLIC CAPITAL LETTER BINOCULAR O
3182A66B ; PVALID # CYRILLIC SMALL LETTER BINOCULAR O
3183A66C ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE MONOCULAR O
3184A66D..A66F ; PVALID # CYRILLIC SMALL LETTER DOUBLE MONOCULAR O..CO
3185A670..A673 ; DISALLOWED # COMBINING CYRILLIC TEN MILLIONS SIGN..SLAVON
3186A674..A67B ; UNASSIGNED # <reserved>..<reserved>
3187A67C..A67D ; PVALID # COMBINING CYRILLIC KAVYKA..COMBINING CYRILLI
3188A67E ; DISALLOWED # CYRILLIC KAVYKA
3189A67F ; PVALID # CYRILLIC PAYEROK
3190A680 ; DISALLOWED # CYRILLIC CAPITAL LETTER DWE
3191
3192
3193
3194Faltstrom Standards Track [Page 57]
3195
3196RFC 5892 IDNA Code Points August 2010
3197
3198
3199A681 ; PVALID # CYRILLIC SMALL LETTER DWE
3200A682 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZWE
3201A683 ; PVALID # CYRILLIC SMALL LETTER DZWE
3202A684 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHWE
3203A685 ; PVALID # CYRILLIC SMALL LETTER ZHWE
3204A686 ; DISALLOWED # CYRILLIC CAPITAL LETTER CCHE
3205A687 ; PVALID # CYRILLIC SMALL LETTER CCHE
3206A688 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZZE
3207A689 ; PVALID # CYRILLIC SMALL LETTER DZZE
3208A68A ; DISALLOWED # CYRILLIC CAPITAL LETTER TE WITH MIDDLE HOOK
3209A68B ; PVALID # CYRILLIC SMALL LETTER TE WITH MIDDLE HOOK
3210A68C ; DISALLOWED # CYRILLIC CAPITAL LETTER TWE
3211A68D ; PVALID # CYRILLIC SMALL LETTER TWE
3212A68E ; DISALLOWED # CYRILLIC CAPITAL LETTER TSWE
3213A68F ; PVALID # CYRILLIC SMALL LETTER TSWE
3214A690 ; DISALLOWED # CYRILLIC CAPITAL LETTER TSSE
3215A691 ; PVALID # CYRILLIC SMALL LETTER TSSE
3216A692 ; DISALLOWED # CYRILLIC CAPITAL LETTER TCHE
3217A693 ; PVALID # CYRILLIC SMALL LETTER TCHE
3218A694 ; DISALLOWED # CYRILLIC CAPITAL LETTER HWE
3219A695 ; PVALID # CYRILLIC SMALL LETTER HWE
3220A696 ; DISALLOWED # CYRILLIC CAPITAL LETTER SHWE
3221A697 ; PVALID # CYRILLIC SMALL LETTER SHWE
3222A698..A69F ; UNASSIGNED # <reserved>..<reserved>
3223A6A0..A6E5 ; PVALID # BAMUM LETTER A..BAMUM LETTER KI
3224A6E6..A6EF ; DISALLOWED # BAMUM LETTER MO..BAMUM LETTER KOGHOM
3225A6F0..A6F1 ; PVALID # BAMUM COMBINING MARK KOQNDON..BAMUM COMBININ
3226A6F2..A6F7 ; DISALLOWED # BAMUM NJAEMLI..BAMUM QUESTION MARK
3227A6F8..A6FF ; UNASSIGNED # <reserved>..<reserved>
3228A700..A716 ; DISALLOWED # MODIFIER LETTER CHINESE TONE YIN PING..MODIF
3229A717..A71F ; PVALID # MODIFIER LETTER DOT VERTICAL BAR..MODIFIER L
3230A720..A722 ; DISALLOWED # MODIFIER LETTER STRESS AND HIGH TONE..LATIN
3231A723 ; PVALID # LATIN SMALL LETTER EGYPTOLOGICAL ALEF
3232A724 ; DISALLOWED # LATIN CAPITAL LETTER EGYPTOLOGICAL AIN
3233A725 ; PVALID # LATIN SMALL LETTER EGYPTOLOGICAL AIN
3234A726 ; DISALLOWED # LATIN CAPITAL LETTER HENG
3235A727 ; PVALID # LATIN SMALL LETTER HENG
3236A728 ; DISALLOWED # LATIN CAPITAL LETTER TZ
3237A729 ; PVALID # LATIN SMALL LETTER TZ
3238A72A ; DISALLOWED # LATIN CAPITAL LETTER TRESILLO
3239A72B ; PVALID # LATIN SMALL LETTER TRESILLO
3240A72C ; DISALLOWED # LATIN CAPITAL LETTER CUATRILLO
3241A72D ; PVALID # LATIN SMALL LETTER CUATRILLO
3242A72E ; DISALLOWED # LATIN CAPITAL LETTER CUATRILLO WITH COMMA
3243A72F..A731 ; PVALID # LATIN SMALL LETTER CUATRILLO WITH COMMA..LAT
3244A732 ; DISALLOWED # LATIN CAPITAL LETTER AA
3245A733 ; PVALID # LATIN SMALL LETTER AA
3246A734 ; DISALLOWED # LATIN CAPITAL LETTER AO
3247
3248
3249
3250Faltstrom Standards Track [Page 58]
3251
3252RFC 5892 IDNA Code Points August 2010
3253
3254
3255A735 ; PVALID # LATIN SMALL LETTER AO
3256A736 ; DISALLOWED # LATIN CAPITAL LETTER AU
3257A737 ; PVALID # LATIN SMALL LETTER AU
3258A738 ; DISALLOWED # LATIN CAPITAL LETTER AV
3259A739 ; PVALID # LATIN SMALL LETTER AV
3260A73A ; DISALLOWED # LATIN CAPITAL LETTER AV WITH HORIZONTAL BAR
3261A73B ; PVALID # LATIN SMALL LETTER AV WITH HORIZONTAL BAR
3262A73C ; DISALLOWED # LATIN CAPITAL LETTER AY
3263A73D ; PVALID # LATIN SMALL LETTER AY
3264A73E ; DISALLOWED # LATIN CAPITAL LETTER REVERSED C WITH DOT
3265A73F ; PVALID # LATIN SMALL LETTER REVERSED C WITH DOT
3266A740 ; DISALLOWED # LATIN CAPITAL LETTER K WITH STROKE
3267A741 ; PVALID # LATIN SMALL LETTER K WITH STROKE
3268A742 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DIAGONAL STROKE
3269A743 ; PVALID # LATIN SMALL LETTER K WITH DIAGONAL STROKE
3270A744 ; DISALLOWED # LATIN CAPITAL LETTER K WITH STROKE AND DIAGO
3271A745 ; PVALID # LATIN SMALL LETTER K WITH STROKE AND DIAGONA
3272A746 ; DISALLOWED # LATIN CAPITAL LETTER BROKEN L
3273A747 ; PVALID # LATIN SMALL LETTER BROKEN L
3274A748 ; DISALLOWED # LATIN CAPITAL LETTER L WITH HIGH STROKE
3275A749 ; PVALID # LATIN SMALL LETTER L WITH HIGH STROKE
3276A74A ; DISALLOWED # LATIN CAPITAL LETTER O WITH LONG STROKE OVER
3277A74B ; PVALID # LATIN SMALL LETTER O WITH LONG STROKE OVERLA
3278A74C ; DISALLOWED # LATIN CAPITAL LETTER O WITH LOOP
3279A74D ; PVALID # LATIN SMALL LETTER O WITH LOOP
3280A74E ; DISALLOWED # LATIN CAPITAL LETTER OO
3281A74F ; PVALID # LATIN SMALL LETTER OO
3282A750 ; DISALLOWED # LATIN CAPITAL LETTER P WITH STROKE THROUGH D
3283A751 ; PVALID # LATIN SMALL LETTER P WITH STROKE THROUGH DES
3284A752 ; DISALLOWED # LATIN CAPITAL LETTER P WITH FLOURISH
3285A753 ; PVALID # LATIN SMALL LETTER P WITH FLOURISH
3286A754 ; DISALLOWED # LATIN CAPITAL LETTER P WITH SQUIRREL TAIL
3287A755 ; PVALID # LATIN SMALL LETTER P WITH SQUIRREL TAIL
3288A756 ; DISALLOWED # LATIN CAPITAL LETTER Q WITH STROKE THROUGH D
3289A757 ; PVALID # LATIN SMALL LETTER Q WITH STROKE THROUGH DES
3290A758 ; DISALLOWED # LATIN CAPITAL LETTER Q WITH DIAGONAL STROKE
3291A759 ; PVALID # LATIN SMALL LETTER Q WITH DIAGONAL STROKE
3292A75A ; DISALLOWED # LATIN CAPITAL LETTER R ROTUNDA
3293A75B ; PVALID # LATIN SMALL LETTER R ROTUNDA
3294A75C ; DISALLOWED # LATIN CAPITAL LETTER RUM ROTUNDA
3295A75D ; PVALID # LATIN SMALL LETTER RUM ROTUNDA
3296A75E ; DISALLOWED # LATIN CAPITAL LETTER V WITH DIAGONAL STROKE
3297A75F ; PVALID # LATIN SMALL LETTER V WITH DIAGONAL STROKE
3298A760 ; DISALLOWED # LATIN CAPITAL LETTER VY
3299A761 ; PVALID # LATIN SMALL LETTER VY
3300A762 ; DISALLOWED # LATIN CAPITAL LETTER VISIGOTHIC Z
3301A763 ; PVALID # LATIN SMALL LETTER VISIGOTHIC Z
3302A764 ; DISALLOWED # LATIN CAPITAL LETTER THORN WITH STROKE
3303
3304
3305
3306Faltstrom Standards Track [Page 59]
3307
3308RFC 5892 IDNA Code Points August 2010
3309
3310
3311A765 ; PVALID # LATIN SMALL LETTER THORN WITH STROKE
3312A766 ; DISALLOWED # LATIN CAPITAL LETTER THORN WITH STROKE THROU
3313A767 ; PVALID # LATIN SMALL LETTER THORN WITH STROKE THROUGH
3314A768 ; DISALLOWED # LATIN CAPITAL LETTER VEND
3315A769 ; PVALID # LATIN SMALL LETTER VEND
3316A76A ; DISALLOWED # LATIN CAPITAL LETTER ET
3317A76B ; PVALID # LATIN SMALL LETTER ET
3318A76C ; DISALLOWED # LATIN CAPITAL LETTER IS
3319A76D ; PVALID # LATIN SMALL LETTER IS
3320A76E ; DISALLOWED # LATIN CAPITAL LETTER CON
3321A76F ; PVALID # LATIN SMALL LETTER CON
3322A770 ; DISALLOWED # MODIFIER LETTER US
3323A771..A778 ; PVALID # LATIN SMALL LETTER DUM..LATIN SMALL LETTER U
3324A779 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR D
3325A77A ; PVALID # LATIN SMALL LETTER INSULAR D
3326A77B ; DISALLOWED # LATIN CAPITAL LETTER INSULAR F
3327A77C ; PVALID # LATIN SMALL LETTER INSULAR F
3328A77D..A77E ; DISALLOWED # LATIN CAPITAL LETTER INSULAR G..LATIN CAPITA
3329A77F ; PVALID # LATIN SMALL LETTER TURNED INSULAR G
3330A780 ; DISALLOWED # LATIN CAPITAL LETTER TURNED L
3331A781 ; PVALID # LATIN SMALL LETTER TURNED L
3332A782 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR R
3333A783 ; PVALID # LATIN SMALL LETTER INSULAR R
3334A784 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR S
3335A785 ; PVALID # LATIN SMALL LETTER INSULAR S
3336A786 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR T
3337A787..A788 ; PVALID # LATIN SMALL LETTER INSULAR T..MODIFIER LETTE
3338A789..A78B ; DISALLOWED # MODIFIER LETTER COLON..LATIN CAPITAL LETTER
3339A78C ; PVALID # LATIN SMALL LETTER SALTILLO
3340A78D..A7FA ; UNASSIGNED # <reserved>..<reserved>
3341A7FB..A827 ; PVALID # LATIN EPIGRAPHIC LETTER REVERSED F..SYLOTI N
3342A828..A82B ; DISALLOWED # SYLOTI NAGRI POETRY MARK-1..SYLOTI NAGRI POE
3343A82C..A82F ; UNASSIGNED # <reserved>..<reserved>
3344A830..A839 ; DISALLOWED # NORTH INDIC FRACTION ONE QUARTER..NORTH INDI
3345A83A..A83F ; UNASSIGNED # <reserved>..<reserved>
3346A840..A873 ; PVALID # PHAGS-PA LETTER KA..PHAGS-PA LETTER CANDRABI
3347A874..A877 ; DISALLOWED # PHAGS-PA SINGLE HEAD MARK..PHAGS-PA MARK DOU
3348A878..A87F ; UNASSIGNED # <reserved>..<reserved>
3349A880..A8C4 ; PVALID # SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VI
3350A8C5..A8CD ; UNASSIGNED # <reserved>..<reserved>
3351A8CE..A8CF ; DISALLOWED # SAURASHTRA DANDA..SAURASHTRA DOUBLE DANDA
3352A8D0..A8D9 ; PVALID # SAURASHTRA DIGIT ZERO..SAURASHTRA DIGIT NINE
3353A8DA..A8DF ; UNASSIGNED # <reserved>..<reserved>
3354A8E0..A8F7 ; PVALID # COMBINING DEVANAGARI DIGIT ZERO..DEVANAGARI
3355A8F8..A8FA ; DISALLOWED # DEVANAGARI SIGN PUSHPIKA..DEVANAGARI CARET
3356A8FB ; PVALID # DEVANAGARI HEADSTROKE
3357A8FC..A8FF ; UNASSIGNED # <reserved>..<reserved>
3358A900..A92D ; PVALID # KAYAH LI DIGIT ZERO..KAYAH LI TONE CALYA PLO
3359
3360
3361
3362Faltstrom Standards Track [Page 60]
3363
3364RFC 5892 IDNA Code Points August 2010
3365
3366
3367A92E..A92F ; DISALLOWED # KAYAH LI SIGN CWI..KAYAH LI SIGN SHYA
3368A930..A953 ; PVALID # REJANG LETTER KA..REJANG VIRAMA
3369A954..A95E ; UNASSIGNED # <reserved>..<reserved>
3370A95F..A97C ; DISALLOWED # REJANG SECTION MARK..HANGUL CHOSEONG SSANGYE
3371A97D..A97F ; UNASSIGNED # <reserved>..<reserved>
3372A980..A9C0 ; PVALID # JAVANESE SIGN PANYANGGA..JAVANESE PANGKON
3373A9C1..A9CD ; DISALLOWED # JAVANESE LEFT RERENGGAN..JAVANESE TURNED PAD
3374A9CE ; UNASSIGNED # <reserved>
3375A9CF..A9D9 ; PVALID # JAVANESE PANGRANGKEP..JAVANESE DIGIT NINE
3376A9DA..A9DD ; UNASSIGNED # <reserved>..<reserved>
3377A9DE..A9DF ; DISALLOWED # JAVANESE PADA TIRTA TUMETES..JAVANESE PADA I
3378A9E0..A9FF ; UNASSIGNED # <reserved>..<reserved>
3379AA00..AA36 ; PVALID # CHAM LETTER A..CHAM CONSONANT SIGN WA
3380AA37..AA3F ; UNASSIGNED # <reserved>..<reserved>
3381AA40..AA4D ; PVALID # CHAM LETTER FINAL K..CHAM CONSONANT SIGN FIN
3382AA4E..AA4F ; UNASSIGNED # <reserved>..<reserved>
3383AA50..AA59 ; PVALID # CHAM DIGIT ZERO..CHAM DIGIT NINE
3384AA5A..AA5B ; UNASSIGNED # <reserved>..<reserved>
3385AA5C..AA5F ; DISALLOWED # CHAM PUNCTUATION SPIRAL..CHAM PUNCTUATION TR
3386AA60..AA76 ; PVALID # MYANMAR LETTER KHAMTI GA..MYANMAR LOGOGRAM K
3387AA77..AA79 ; DISALLOWED # MYANMAR SYMBOL AITON EXCLAMATION..MYANMAR SY
3388AA7A..AA7B ; PVALID # MYANMAR LETTER AITON RA..MYANMAR SIGN PAO KA
3389AA7C..AA7F ; UNASSIGNED # <reserved>..<reserved>
3390AA80..AAC2 ; PVALID # TAI VIET LETTER LOW KO..TAI VIET TONE MAI SO
3391AAC3..AADA ; UNASSIGNED # <reserved>..<reserved>
3392AADB..AADD ; PVALID # TAI VIET SYMBOL KON..TAI VIET SYMBOL SAM
3393AADE..AADF ; DISALLOWED # TAI VIET SYMBOL HO HOI..TAI VIET SYMBOL KOI
3394AAE0..ABBF ; UNASSIGNED # <reserved>..<reserved>
3395ABC0..ABEA ; PVALID # MEETEI MAYEK LETTER KOK..MEETEI MAYEK VOWEL
3396ABEB ; DISALLOWED # MEETEI MAYEK CHEIKHEI
3397ABEC..ABED ; PVALID # MEETEI MAYEK LUM IYEK..MEETEI MAYEK APUN IYE
3398ABEE..ABEF ; UNASSIGNED # <reserved>..<reserved>
3399ABF0..ABF9 ; PVALID # MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT
3400ABFA..ABFF ; UNASSIGNED # <reserved>..<reserved>
3401AC00..D7A3 ; PVALID # <Hangul Syllable>..<Hangul Syllable>
3402D7A4..D7AF ; UNASSIGNED # <reserved>..<reserved>
3403D7B0..D7C6 ; DISALLOWED # HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARA
3404D7C7..D7CA ; UNASSIGNED # <reserved>..<reserved>
3405D7CB..D7FB ; DISALLOWED # HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEO
3406D7FC..D7FF ; UNASSIGNED # <reserved>..<reserved>
3407D800..FA0D ; DISALLOWED # <Non Private Use High Surrogate>..CJK COMPAT
3408FA0E..FA0F ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA0E..CJK COMPAT
3409FA10 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA10
3410FA11 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA11
3411FA12 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA12
3412FA13..FA14 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA13..CJK COMPAT
3413FA15..FA1E ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA15..CJK COMPAT
3414FA1F ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA1F
3415
3416
3417
3418Faltstrom Standards Track [Page 61]
3419
3420RFC 5892 IDNA Code Points August 2010
3421
3422
3423FA20 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA20
3424FA21 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA21
3425FA22 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA22
3426FA23..FA24 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA23..CJK COMPAT
3427FA25..FA26 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA25..CJK COMPAT
3428FA27..FA29 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA27..CJK COMPAT
3429FA2A..FA2D ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA2A..CJK COMPAT
3430FA2E..FA2F ; UNASSIGNED # <reserved>..<reserved>
3431FA30..FA6D ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA30..CJK COMPAT
3432FA6E..FA6F ; UNASSIGNED # <reserved>..<reserved>
3433FA70..FAD9 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPAT
3434FADA..FAFF ; UNASSIGNED # <reserved>..<reserved>
3435FB00..FB06 ; DISALLOWED # LATIN SMALL LIGATURE FF..LATIN SMALL LIGATUR
3436FB07..FB12 ; UNASSIGNED # <reserved>..<reserved>
3437FB13..FB17 ; DISALLOWED # ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SM
3438FB18..FB1C ; UNASSIGNED # <reserved>..<reserved>
3439FB1D ; DISALLOWED # HEBREW LETTER YOD WITH HIRIQ
3440FB1E ; PVALID # HEBREW POINT JUDEO-SPANISH VARIKA
3441FB1F..FB36 ; DISALLOWED # HEBREW LIGATURE YIDDISH YOD YOD PATAH..HEBRE
3442FB37 ; UNASSIGNED # <reserved>
3443FB38..FB3C ; DISALLOWED # HEBREW LETTER TET WITH DAGESH..HEBREW LETTER
3444FB3D ; UNASSIGNED # <reserved>
3445FB3E ; DISALLOWED # HEBREW LETTER MEM WITH DAGESH
3446FB3F ; UNASSIGNED # <reserved>
3447FB40..FB41 ; DISALLOWED # HEBREW LETTER NUN WITH DAGESH..HEBREW LETTER
3448FB42 ; UNASSIGNED # <reserved>
3449FB43..FB44 ; DISALLOWED # HEBREW LETTER FINAL PE WITH DAGESH..HEBREW L
3450FB45 ; UNASSIGNED # <reserved>
3451FB46..FBB1 ; DISALLOWED # HEBREW LETTER TSADI WITH DAGESH..ARABIC LETT
3452FBB2..FBD2 ; UNASSIGNED # <reserved>..<reserved>
3453FBD3..FD3F ; DISALLOWED # ARABIC LETTER NG ISOLATED FORM..ORNATE RIGHT
3454FD40..FD4F ; UNASSIGNED # <reserved>..<reserved>
3455FD50..FD8F ; DISALLOWED # ARABIC LIGATURE TEH WITH JEEM WITH MEEM INIT
3456FD90..FD91 ; UNASSIGNED # <reserved>..<reserved>
3457FD92..FDC7 ; DISALLOWED # ARABIC LIGATURE MEEM WITH JEEM WITH KHAH INI
3458FDC8..FDCF ; UNASSIGNED # <reserved>..<reserved>
3459FDD0..FDFD ; DISALLOWED # <noncharacter>..ARABIC LIGATURE BISMILLAH AR
3460FDFE..FDFF ; UNASSIGNED # <reserved>..<reserved>
3461FE00..FE19 ; DISALLOWED # VARIATION SELECTOR-1..PRESENTATION FORM FOR
3462FE1A..FE1F ; UNASSIGNED # <reserved>..<reserved>
3463FE20..FE26 ; PVALID # COMBINING LIGATURE LEFT HALF..COMBINING CONJ
3464FE27..FE2F ; UNASSIGNED # <reserved>..<reserved>
3465FE30..FE52 ; DISALLOWED # PRESENTATION FORM FOR VERTICAL TWO DOT LEADE
3466FE53 ; UNASSIGNED # <reserved>
3467FE54..FE66 ; DISALLOWED # SMALL SEMICOLON..SMALL EQUALS SIGN
3468FE67 ; UNASSIGNED # <reserved>
3469FE68..FE6B ; DISALLOWED # SMALL REVERSE SOLIDUS..SMALL COMMERCIAL AT
3470FE6C..FE6F ; UNASSIGNED # <reserved>..<reserved>
3471
3472
3473
3474Faltstrom Standards Track [Page 62]
3475
3476RFC 5892 IDNA Code Points August 2010
3477
3478
3479FE70..FE72 ; DISALLOWED # ARABIC FATHATAN ISOLATED FORM..ARABIC DAMMAT
3480FE73 ; PVALID # ARABIC TAIL FRAGMENT
3481FE74 ; DISALLOWED # ARABIC KASRATAN ISOLATED FORM
3482FE75 ; UNASSIGNED # <reserved>
3483FE76..FEFC ; DISALLOWED # ARABIC FATHA ISOLATED FORM..ARABIC LIGATURE
3484FEFD..FEFE ; UNASSIGNED # <reserved>..<reserved>
3485FEFF ; DISALLOWED # ZERO WIDTH NO-BREAK SPACE
3486FF00 ; UNASSIGNED # <reserved>
3487FF01..FFBE ; DISALLOWED # FULLWIDTH EXCLAMATION MARK..HALFWIDTH HANGUL
3488FFBF..FFC1 ; UNASSIGNED # <reserved>..<reserved>
3489FFC2..FFC7 ; DISALLOWED # HALFWIDTH HANGUL LETTER A..HALFWIDTH HANGUL
3490FFC8..FFC9 ; UNASSIGNED # <reserved>..<reserved>
3491FFCA..FFCF ; DISALLOWED # HALFWIDTH HANGUL LETTER YEO..HALFWIDTH HANGU
3492FFD0..FFD1 ; UNASSIGNED # <reserved>..<reserved>
3493FFD2..FFD7 ; DISALLOWED # HALFWIDTH HANGUL LETTER YO..HALFWIDTH HANGUL
3494FFD8..FFD9 ; UNASSIGNED # <reserved>..<reserved>
3495FFDA..FFDC ; DISALLOWED # HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
3496FFDD..FFDF ; UNASSIGNED # <reserved>..<reserved>
3497FFE0..FFE6 ; DISALLOWED # FULLWIDTH CENT SIGN..FULLWIDTH WON SIGN
3498FFE7 ; UNASSIGNED # <reserved>
3499FFE8..FFEE ; DISALLOWED # HALFWIDTH FORMS LIGHT VERTICAL..HALFWIDTH WH
3500FFEF..FFF8 ; UNASSIGNED # <reserved>..<reserved>
3501FFF9..FFFF ; DISALLOWED # INTERLINEAR ANNOTATION ANCHOR..<noncharacter
350210000..1000B; PVALID # LINEAR B SYLLABLE B008 A..LINEAR B SYLLABLE
35031000C ; UNASSIGNED # <reserved>
35041000D..10026; PVALID # LINEAR B SYLLABLE B036 JO..LINEAR B SYLLABLE
350510027 ; UNASSIGNED # <reserved>
350610028..1003A; PVALID # LINEAR B SYLLABLE B060 RA..LINEAR B SYLLABLE
35071003B ; UNASSIGNED # <reserved>
35081003C..1003D; PVALID # LINEAR B SYLLABLE B017 ZA..LINEAR B SYLLABLE
35091003E ; UNASSIGNED # <reserved>
35101003F..1004D; PVALID # LINEAR B SYLLABLE B020 ZO..LINEAR B SYLLABLE
35111004E..1004F; UNASSIGNED # <reserved>..<reserved>
351210050..1005D; PVALID # LINEAR B SYMBOL B018..LINEAR B SYMBOL B089
35131005E..1007F; UNASSIGNED # <reserved>..<reserved>
351410080..100FA; PVALID # LINEAR B IDEOGRAM B100 MAN..LINEAR B IDEOGRA
3515100FB..100FF; UNASSIGNED # <reserved>..<reserved>
351610100..10102; DISALLOWED # AEGEAN WORD SEPARATOR LINE..AEGEAN CHECK MAR
351710103..10106; UNASSIGNED # <reserved>..<reserved>
351810107..10133; DISALLOWED # AEGEAN NUMBER ONE..AEGEAN NUMBER NINETY THOU
351910134..10136; UNASSIGNED # <reserved>..<reserved>
352010137..1018A; DISALLOWED # AEGEAN WEIGHT BASE UNIT..GREEK ZERO SIGN
35211018B..1018F; UNASSIGNED # <reserved>..<reserved>
352210190..1019B; DISALLOWED # ROMAN SEXTANS SIGN..ROMAN CENTURIAL SIGN
35231019C..101CF; UNASSIGNED # <reserved>..<reserved>
3524101D0..101FC; DISALLOWED # PHAISTOS DISC SIGN PEDESTRIAN..PHAISTOS DISC
3525101FD ; PVALID # PHAISTOS DISC SIGN COMBINING OBLIQUE STROKE
3526101FE..1027F; UNASSIGNED # <reserved>..<reserved>
3527
3528
3529
3530Faltstrom Standards Track [Page 63]
3531
3532RFC 5892 IDNA Code Points August 2010
3533
3534
353510280..1029C; PVALID # LYCIAN LETTER A..LYCIAN LETTER X
35361029D..1029F; UNASSIGNED # <reserved>..<reserved>
3537102A0..102D0; PVALID # CARIAN LETTER A..CARIAN LETTER UUU3
3538102D1..102FF; UNASSIGNED # <reserved>..<reserved>
353910300..1031E; PVALID # OLD ITALIC LETTER A..OLD ITALIC LETTER UU
35401031F ; UNASSIGNED # <reserved>
354110320..10323; DISALLOWED # OLD ITALIC NUMERAL ONE..OLD ITALIC NUMERAL F
354210324..1032F; UNASSIGNED # <reserved>..<reserved>
354310330..10340; PVALID # GOTHIC LETTER AHSA..GOTHIC LETTER PAIRTHRA
354410341 ; DISALLOWED # GOTHIC LETTER NINETY
354510342..10349; PVALID # GOTHIC LETTER RAIDA..GOTHIC LETTER OTHAL
35461034A ; DISALLOWED # GOTHIC LETTER NINE HUNDRED
35471034B..1037F; UNASSIGNED # <reserved>..<reserved>
354810380..1039D; PVALID # UGARITIC LETTER ALPA..UGARITIC LETTER SSU
35491039E ; UNASSIGNED # <reserved>
35501039F ; DISALLOWED # UGARITIC WORD DIVIDER
3551103A0..103C3; PVALID # OLD PERSIAN SIGN A..OLD PERSIAN SIGN HA
3552103C4..103C7; UNASSIGNED # <reserved>..<reserved>
3553103C8..103CF; PVALID # OLD PERSIAN SIGN AURAMAZDAA..OLD PERSIAN SIG
3554103D0..103D5; DISALLOWED # OLD PERSIAN WORD DIVIDER..OLD PERSIAN NUMBER
3555103D6..103FF; UNASSIGNED # <reserved>..<reserved>
355610400..10427; DISALLOWED # DESERET CAPITAL LETTER LONG I..DESERET CAPIT
355710428..1049D; PVALID # DESERET SMALL LETTER LONG I..OSMANYA LETTER
35581049E..1049F; UNASSIGNED # <reserved>..<reserved>
3559104A0..104A9; PVALID # OSMANYA DIGIT ZERO..OSMANYA DIGIT NINE
3560104AA..107FF; UNASSIGNED # <reserved>..<reserved>
356110800..10805; PVALID # CYPRIOT SYLLABLE A..CYPRIOT SYLLABLE JA
356210806..10807; UNASSIGNED # <reserved>..<reserved>
356310808 ; PVALID # CYPRIOT SYLLABLE JO
356410809 ; UNASSIGNED # <reserved>
35651080A..10835; PVALID # CYPRIOT SYLLABLE KA..CYPRIOT SYLLABLE WO
356610836 ; UNASSIGNED # <reserved>
356710837..10838; PVALID # CYPRIOT SYLLABLE XA..CYPRIOT SYLLABLE XE
356810839..1083B; UNASSIGNED # <reserved>..<reserved>
35691083C ; PVALID # CYPRIOT SYLLABLE ZA
35701083D..1083E; UNASSIGNED # <reserved>..<reserved>
35711083F..10855; PVALID # CYPRIOT SYLLABLE ZO..IMPERIAL ARAMAIC LETTER
357210856 ; UNASSIGNED # <reserved>
357310857..1085F; DISALLOWED # IMPERIAL ARAMAIC SECTION SIGN..IMPERIAL ARAM
357410860..108FF; UNASSIGNED # <reserved>..<reserved>
357510900..10915; PVALID # PHOENICIAN LETTER ALF..PHOENICIAN LETTER TAU
357610916..1091B; DISALLOWED # PHOENICIAN NUMBER ONE..PHOENICIAN NUMBER THR
35771091C..1091E; UNASSIGNED # <reserved>..<reserved>
35781091F ; DISALLOWED # PHOENICIAN WORD SEPARATOR
357910920..10939; PVALID # LYDIAN LETTER A..LYDIAN LETTER C
35801093A..1093E; UNASSIGNED # <reserved>..<reserved>
35811093F ; DISALLOWED # LYDIAN TRIANGULAR MARK
358210940..109FF; UNASSIGNED # <reserved>..<reserved>
3583
3584
3585
3586Faltstrom Standards Track [Page 64]
3587
3588RFC 5892 IDNA Code Points August 2010
3589
3590
359110A00..10A03; PVALID # KHAROSHTHI LETTER A..KHAROSHTHI VOWEL SIGN V
359210A04 ; UNASSIGNED # <reserved>
359310A05..10A06; PVALID # KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SI
359410A07..10A0B; UNASSIGNED # <reserved>..<reserved>
359510A0C..10A13; PVALID # KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI LET
359610A14 ; UNASSIGNED # <reserved>
359710A15..10A17; PVALID # KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA
359810A18 ; UNASSIGNED # <reserved>
359910A19..10A33; PVALID # KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTT
360010A34..10A37; UNASSIGNED # <reserved>..<reserved>
360110A38..10A3A; PVALID # KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN D
360210A3B..10A3E; UNASSIGNED # <reserved>..<reserved>
360310A3F ; PVALID # KHAROSHTHI VIRAMA
360410A40..10A47; DISALLOWED # KHAROSHTHI DIGIT ONE..KHAROSHTHI NUMBER ONE
360510A48..10A4F; UNASSIGNED # <reserved>..<reserved>
360610A50..10A58; DISALLOWED # KHAROSHTHI PUNCTUATION DOT..KHAROSHTHI PUNCT
360710A59..10A5F; UNASSIGNED # <reserved>..<reserved>
360810A60..10A7C; PVALID # OLD SOUTH ARABIAN LETTER HE..OLD SOUTH ARABI
360910A7D..10A7F; DISALLOWED # OLD SOUTH ARABIAN NUMBER ONE..OLD SOUTH ARAB
361010A80..10AFF; UNASSIGNED # <reserved>..<reserved>
361110B00..10B35; PVALID # AVESTAN LETTER A..AVESTAN LETTER HE
361210B36..10B38; UNASSIGNED # <reserved>..<reserved>
361310B39..10B3F; DISALLOWED # AVESTAN ABBREVIATION MARK..LARGE ONE RING OV
361410B40..10B55; PVALID # INSCRIPTIONAL PARTHIAN LETTER ALEPH..INSCRIP
361510B56..10B57; UNASSIGNED # <reserved>..<reserved>
361610B58..10B5F; DISALLOWED # INSCRIPTIONAL PARTHIAN NUMBER ONE..INSCRIPTI
361710B60..10B72; PVALID # INSCRIPTIONAL PAHLAVI LETTER ALEPH..INSCRIPT
361810B73..10B77; UNASSIGNED # <reserved>..<reserved>
361910B78..10B7F; DISALLOWED # INSCRIPTIONAL PAHLAVI NUMBER ONE..INSCRIPTIO
362010B80..10BFF; UNASSIGNED # <reserved>..<reserved>
362110C00..10C48; PVALID # OLD TURKIC LETTER ORKHON A..OLD TURKIC LETTE
362210C49..10E5F; UNASSIGNED # <reserved>..<reserved>
362310E60..10E7E; DISALLOWED # RUMI DIGIT ONE..RUMI FRACTION TWO THIRDS
362410E7F..1107F; UNASSIGNED # <reserved>..<reserved>
362511080..110BA; PVALID # KAITHI SIGN CANDRABINDU..KAITHI SIGN NUKTA
3626110BB..110C1; DISALLOWED # KAITHI ABBREVIATION SIGN..KAITHI DOUBLE DAND
3627110C2..11FFF; UNASSIGNED # <reserved>..<reserved>
362812000..1236E; PVALID # CUNEIFORM SIGN A..CUNEIFORM SIGN ZUM
36291236F..123FF; UNASSIGNED # <reserved>..<reserved>
363012400..12462; DISALLOWED # CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NU
363112463..1246F; UNASSIGNED # <reserved>..<reserved>
363212470..12473; DISALLOWED # CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD
363312474..12FFF; UNASSIGNED # <reserved>..<reserved>
363413000..1342E; PVALID # EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYP
36351342F..1CFFF; UNASSIGNED # <reserved>..<reserved>
36361D000..1D0F5; DISALLOWED # BYZANTINE MUSICAL SYMBOL PSILI..BYZANTINE MU
36371D0F6..1D0FF; UNASSIGNED # <reserved>..<reserved>
36381D100..1D126; DISALLOWED # MUSICAL SYMBOL SINGLE BARLINE..MUSICAL SYMBO
3639
3640
3641
3642Faltstrom Standards Track [Page 65]
3643
3644RFC 5892 IDNA Code Points August 2010
3645
3646
36471D127..1D128; UNASSIGNED # <reserved>..<reserved>
36481D129..1D1DD; DISALLOWED # MUSICAL SYMBOL MULTIPLE MEASURE REST..MUSICA
36491D1DE..1D1FF; UNASSIGNED # <reserved>..<reserved>
36501D200..1D245; DISALLOWED # GREEK VOCAL NOTATION SYMBOL-1..GREEK MUSICAL
36511D246..1D2FF; UNASSIGNED # <reserved>..<reserved>
36521D300..1D356; DISALLOWED # MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
36531D357..1D35F; UNASSIGNED # <reserved>..<reserved>
36541D360..1D371; DISALLOWED # COUNTING ROD UNIT DIGIT ONE..COUNTING ROD TE
36551D372..1D3FF; UNASSIGNED # <reserved>..<reserved>
36561D400..1D454; DISALLOWED # MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL IT
36571D455 ; UNASSIGNED # <reserved>
36581D456..1D49C; DISALLOWED # MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SC
36591D49D ; UNASSIGNED # <reserved>
36601D49E..1D49F; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL
36611D4A0..1D4A1; UNASSIGNED # <reserved>..<reserved>
36621D4A2 ; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL G
36631D4A3..1D4A4; UNASSIGNED # <reserved>..<reserved>
36641D4A5..1D4A6; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL J..MATHEMATICAL
36651D4A7..1D4A8; UNASSIGNED # <reserved>..<reserved>
36661D4A9..1D4AC; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL N..MATHEMATICAL
36671D4AD ; UNASSIGNED # <reserved>
36681D4AE..1D4B9; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL S..MATHEMATICAL
36691D4BA ; UNASSIGNED # <reserved>
36701D4BB ; DISALLOWED # MATHEMATICAL SCRIPT SMALL F
36711D4BC ; UNASSIGNED # <reserved>
36721D4BD..1D4C3; DISALLOWED # MATHEMATICAL SCRIPT SMALL H..MATHEMATICAL SC
36731D4C4 ; UNASSIGNED # <reserved>
36741D4C5..1D505; DISALLOWED # MATHEMATICAL SCRIPT SMALL P..MATHEMATICAL FR
36751D506 ; UNASSIGNED # <reserved>
36761D507..1D50A; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL D..MATHEMATICAL
36771D50B..1D50C; UNASSIGNED # <reserved>..<reserved>
36781D50D..1D514; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL J..MATHEMATICAL
36791D515 ; UNASSIGNED # <reserved>
36801D516..1D51C; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL S..MATHEMATICAL
36811D51D ; UNASSIGNED # <reserved>
36821D51E..1D539; DISALLOWED # MATHEMATICAL FRAKTUR SMALL A..MATHEMATICAL D
36831D53A ; UNASSIGNED # <reserved>
36841D53B..1D53E; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL D..MATHEM
36851D53F ; UNASSIGNED # <reserved>
36861D540..1D544; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL I..MATHEM
36871D545 ; UNASSIGNED # <reserved>
36881D546 ; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL O
36891D547..1D549; UNASSIGNED # <reserved>..<reserved>
36901D54A..1D550; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL S..MATHEM
36911D551 ; UNASSIGNED # <reserved>
36921D552..1D6A5; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK SMALL A..MATHEMAT
36931D6A6..1D6A7; UNASSIGNED # <reserved>..<reserved>
36941D6A8..1D7CB; DISALLOWED # MATHEMATICAL BOLD CAPITAL ALPHA..MATHEMATICA
3695
3696
3697
3698Faltstrom Standards Track [Page 66]
3699
3700RFC 5892 IDNA Code Points August 2010
3701
3702
37031D7CC..1D7CD; UNASSIGNED # <reserved>..<reserved>
37041D7CE..1D7FF; DISALLOWED # MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL M
37051D800..1EFFF; UNASSIGNED # <reserved>..<reserved>
37061F000..1F02B; DISALLOWED # MAHJONG TILE EAST WIND..MAHJONG TILE BACK
37071F02C..1F02F; UNASSIGNED # <reserved>..<reserved>
37081F030..1F093; DISALLOWED # DOMINO TILE HORIZONTAL BACK..DOMINO TILE VER
37091F094..1F0FF; UNASSIGNED # <reserved>..<reserved>
37101F100..1F10A; DISALLOWED # DIGIT ZERO FULL STOP..DIGIT NINE COMMA
37111F10B..1F10F; UNASSIGNED # <reserved>..<reserved>
37121F110..1F12E; DISALLOWED # PARENTHESIZED LATIN CAPITAL LETTER A..CIRCLE
37131F12F..1F130; UNASSIGNED # <reserved>..<reserved>
37141F131 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER B
37151F132..1F13C; UNASSIGNED # <reserved>..<reserved>
37161F13D ; DISALLOWED # SQUARED LATIN CAPITAL LETTER N
37171F13E ; UNASSIGNED # <reserved>
37181F13F ; DISALLOWED # SQUARED LATIN CAPITAL LETTER P
37191F140..1F141; UNASSIGNED # <reserved>..<reserved>
37201F142 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER S
37211F143..1F145; UNASSIGNED # <reserved>..<reserved>
37221F146 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER W
37231F147..1F149; UNASSIGNED # <reserved>..<reserved>
37241F14A..1F14E; DISALLOWED # SQUARED HV..SQUARED PPV
37251F14F..1F156; UNASSIGNED # <reserved>..<reserved>
37261F157 ; DISALLOWED # NEGATIVE CIRCLED LATIN CAPITAL LETTER H
37271F158..1F15E; UNASSIGNED # <reserved>..<reserved>
37281F15F ; DISALLOWED # NEGATIVE CIRCLED LATIN CAPITAL LETTER P
37291F160..1F178; UNASSIGNED # <reserved>..<reserved>
37301F179 ; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER J
37311F17A ; UNASSIGNED # <reserved>
37321F17B..1F17C; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER L..NEG
37331F17D..1F17E; UNASSIGNED # <reserved>..<reserved>
37341F17F ; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER P
37351F180..1F189; UNASSIGNED # <reserved>..<reserved>
37361F18A..1F18D; DISALLOWED # CROSSED NEGATIVE SQUARED LATIN CAPITAL LETTE
37371F18E..1F18F; UNASSIGNED # <reserved>..<reserved>
37381F190 ; DISALLOWED # SQUARE DJ
37391F191..1F1FF; UNASSIGNED # <reserved>..<reserved>
37401F200 ; DISALLOWED # SQUARE HIRAGANA HOKA
37411F201..1F20F; UNASSIGNED # <reserved>..<reserved>
37421F210..1F231; DISALLOWED # SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED
37431F232..1F23F; UNASSIGNED # <reserved>..<reserved>
37441F240..1F248; DISALLOWED # TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRA
37451F249..1FFFD; UNASSIGNED # <reserved>..<reserved>
37461FFFE..1FFFF; DISALLOWED # <noncharacter>..<noncharacter>
374720000..2A6D6; PVALID # <CJK Ideograph Extension B>..<CJK Ideograph
37482A6D7..2A6FF; UNASSIGNED # <reserved>..<reserved>
37492A700..2B734; PVALID # <CJK Ideograph Extension C>..<CJK Ideograph
37502B735..2F7FF; UNASSIGNED # <reserved>..<reserved>
3751
3752
3753
3754Faltstrom Standards Track [Page 67]
3755
3756RFC 5892 IDNA Code Points August 2010
3757
3758
37592F800..2FA1D; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPA
37602FA1E..2FFFD; UNASSIGNED # <reserved>..<reserved>
37612FFFE..2FFFF; DISALLOWED # <noncharacter>..<noncharacter>
376230000..3FFFD; UNASSIGNED # <reserved>..<reserved>
37633FFFE..3FFFF; DISALLOWED # <noncharacter>..<noncharacter>
376440000..4FFFD; UNASSIGNED # <reserved>..<reserved>
37654FFFE..4FFFF; DISALLOWED # <noncharacter>..<noncharacter>
376650000..5FFFD; UNASSIGNED # <reserved>..<reserved>
37675FFFE..5FFFF; DISALLOWED # <noncharacter>..<noncharacter>
376860000..6FFFD; UNASSIGNED # <reserved>..<reserved>
37696FFFE..6FFFF; DISALLOWED # <noncharacter>..<noncharacter>
377070000..7FFFD; UNASSIGNED # <reserved>..<reserved>
37717FFFE..7FFFF; DISALLOWED # <noncharacter>..<noncharacter>
377280000..8FFFD; UNASSIGNED # <reserved>..<reserved>
37738FFFE..8FFFF; DISALLOWED # <noncharacter>..<noncharacter>
377490000..9FFFD; UNASSIGNED # <reserved>..<reserved>
37759FFFE..9FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3776A0000..AFFFD; UNASSIGNED # <reserved>..<reserved>
3777AFFFE..AFFFF; DISALLOWED # <noncharacter>..<noncharacter>
3778B0000..BFFFD; UNASSIGNED # <reserved>..<reserved>
3779BFFFE..BFFFF; DISALLOWED # <noncharacter>..<noncharacter>
3780C0000..CFFFD; UNASSIGNED # <reserved>..<reserved>
3781CFFFE..CFFFF; DISALLOWED # <noncharacter>..<noncharacter>
3782D0000..DFFFD; UNASSIGNED # <reserved>..<reserved>
3783DFFFE..DFFFF; DISALLOWED # <noncharacter>..<noncharacter>
3784E0000 ; UNASSIGNED # <reserved>
3785E0001 ; DISALLOWED # LANGUAGE TAG
3786E0002..E001F; UNASSIGNED # <reserved>..<reserved>
3787E0020..E007F; DISALLOWED # TAG SPACE..CANCEL TAG
3788E0080..E00FF; UNASSIGNED # <reserved>..<reserved>
3789E0100..E01EF; DISALLOWED # VARIATION SELECTOR-17..VARIATION SELECTOR-25
3790E01F0..EFFFD; UNASSIGNED # <reserved>..<reserved>
3791EFFFE..10FFFF; DISALLOWED # <noncharacter>..<noncharacter>
3792
3793
3794
3795
3796
3797
3798
3799
3800
3801
3802
3803
3804
3805
3806
3807
3808
3809
3810Faltstrom Standards Track [Page 68]
3811
3812RFC 5892 IDNA Code Points August 2010
3813
3814
38158. References
3816
38178.1. Normative References
3818
3819 [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
3820 Requirement Levels", BCP 14, RFC 2119, March 1997.
3821
3822 [TR15] Davis, M. and M. Duerst, "Unicode Standard Annex #15,
3823 Unicode Normalization Forms, an integral part of the
3824 Unicode Standard",
3825 <http://unicode.org/unicode/reports/tr15/>.
3826
3827 [Unicode] The Unicode Consortium, "The Unicode Standard, Version
3828 5.0", 2007. Boston, MA, USA: Addison-Wesley. ISBN
3829 0-321-48091-0. This printed reference has now been
3830 updated online to reflect additional code points. For
3831 code points, the reference at the time this document was
3832 published is to Unicode 5.2.
3833
3834 [Unicode52] The Unicode Consortium. The Unicode Standard, Version
3835 5.2.0, defined by: "The Unicode Standard, Version
3836 5.2.0", (Mountain View, CA: The Unicode Consortium,
3837 2009. ISBN 978-1-936213-00-9).
3838 <http://www.unicode.org/versions/Unicode5.2.0/>.
3839
38408.2. Informative References
3841
3842 [BlockNames] "Blocks-5.2.0.txt", Unicode Character Database,
3843 May 2009,
3844 <http://unicode.org/Public/5.2.0/ucd/Blocks.txt>.
3845
3846 [DerivedCoreProperties]
3847 "DerivedCoreProperties-5.2.0.txt", Unicode Character
3848 Database, August 2009, <http://unicode.org/Public/5.2.0/
3849 ucd/DerivedCoreProperties.txt>.
3850
3851 [RFC3454] Hoffman, P. and M. Blanchet, "Preparation of
3852 Internationalized Strings ("stringprep")", RFC 3454,
3853 December 2002.
3854
3855 [RFC3491] Hoffman, P. and M. Blanchet, "Nameprep: A Stringprep
3856 Profile for Internationalized Domain Names (IDN)",
3857 RFC 3491, March 2003.
3858
3859 [RFC4690] Klensin, J., Faltstrom, P., Karp, C., and IAB, "Review
3860 and Recommendations for Internationalized Domain Names
3861 (IDNs)", RFC 4690, September 2006.
3862
3863
3864
3865
3866Faltstrom Standards Track [Page 69]
3867
3868RFC 5892 IDNA Code Points August 2010
3869
3870
3871 [RFC5226] Narten, T. and H. Alvestrand, "Guidelines for Writing an
3872 IANA Considerations Section in RFCs", BCP 26, RFC 5226,
3873 May 2008.
3874
3875 [RFC5890] Klensin, J., "Internationalized Domain Names for
3876 Applications (IDNA): Definitions and Document
3877 Framework", RFC 5890, August 2010.
3878
3879 [RFC5891] Klensin, J., "Internationalized Domain Names in
3880 Applications (IDNA): Protocol", RFC 5891, August 2010.
3881
3882 [RFC5893] Alvestrand, H., Ed. and C. Karp, "Right-to-Left Scripts
3883 for Internationalized Domain Names for Applications
3884 (IDNA)", RFC 5893, August 2010.
3885
3886 [RFC5894] Klensin, J., "Internationalized Domain Names for
3887 Applications (IDNA): Background, Explanation, and
3888 Rationale", RFC 5894, August 2010.
3889
3890Author's Address
3891
3892 Patrik Faltstrom (editor)
3893 Cisco
3894
3895 EMail: paf@cisco.com
3896
3897
3898
3899
3900
3901
3902
3903
3904
3905
3906
3907
3908
3909
3910
3911
3912
3913
3914
3915
3916
3917
3918
3919
3920
3921
3922Faltstrom Standards Track [Page 70]
3923
3924