This repository has been archived by the owner on Jun 16, 2021. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 0
/
index.html
595 lines (545 loc) · 51.2 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
<!DOCTYPE HTML>
<html lang="en"><head>
<meta http-equiv="Content-type" content="text/html;charset=UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no">
<title>XML Entity Definitions for Characters (3rd Edition)</title>
<style type="text/css">
.base {background-color: #EEEEEE;}
.kwlabel {font-weight:normal}
</style>
<style>
a.loc {
padding-bottom: .1em; /* align underline with the rest of the text */
color: hsla(203, 90%, 30%,.8);
}
</style>
<link rel="stylesheet" type="text/css" href="https://www.w3.org/StyleSheets/TR/2016/W3C-ED">
<script src="//www.w3.org/scripts/TR/2016/fixup.js"> </script>
</head>
<body>
<div class="head"><p><a href="https://www.w3.org/"><img src="https://www.w3.org/StyleSheets/TR/2016/logos/W3C" alt="W3C" height="48" width="72"></a></p>
<h1><a id="title"></a>XML Entity Definitions for Characters (3rd Edition)</h1>
<h2><a id="w3c-doctype"></a>W3C Recommendation 07 August 2017</h2><dl>
<dt>This version:</dt><dd><a href="https://www.w3.org/TR/2017/REC-xml-entity-names-20170807/">https://www.w3.org/TR/2017/REC-xml-entity-names-20170807/</a></dd>
<dt>Latest version:</dt><dd><a href="https://www.w3.org/TR/2017/WD-xml-entity-names-20170807/">https://www.w3.org/TR/2017/REC-xml-entity-names-20170807/</a></dd>
<dt>Previous version:</dt><dd><a href="https://www.w3.org/TR/2014/REC-xml-entity-names-20140410/">https://www.w3.org/TR/2014/REC-xml-entity-names-20140410/</a></dd><dt>
Editors' version:</dt><dd><a href="https://www.w3.org/2003/entities/2007doc/">https://www.w3.org/2003/entities/2007doc/</a></dd><dt>Editors:</dt><dd>David Carlisle, NAG</dd><dd>Patrick Ion, Mathematical Reviews, American Mathematical Society</dd></dl><p>Please refer to the <a href="errata.html"><strong>errata</strong></a>
for this document, which may include some normative corrections.</p><p>See also <a href="https://www.w3.org/2005/11/Translations/Query?titleMatch=XML+Entity+definitions+for+Characters"><strong>translations</strong></a>.</p><p class="copyright"><a href="https://www.w3.org/Consortium/Legal/ipr-notice#Copyright">Copyright</a> © 1998-2017 <a href="https://www.w3.org/"><abbr title="World Wide Web Consortium">W3C</abbr></a><sup>®</sup> (<a href="https://www.csail.mit.edu/"><abbr title="Massachusetts Institute of Technology">MIT</abbr></a>, <a href="https://www.ercim.eu/"><abbr title="European Research Consortium for Informatics and Mathematics">ERCIM</abbr></a>, <a href="https://www.keio.ac.jp/">Keio</a>, <a href="http://ev.buaa.edu.cn/">Beihang</a>). W3C <a href="https://www.w3.org/Consortium/Legal/ipr-notice#Legal_Disclaimer">liability</a>, <a href="https://www.w3.org/Consortium/Legal/ipr-notice#W3C_Trademarks">trademark</a> and <a href="https://www.w3.org/Consortium/Legal/copyright-documents">document use</a> rules apply.</p><hr title="Separator for header"></div><div>
<h2><a id="abstract"></a>Abstract</h2>
<p>
This document defines several sets of names, so that to each name is assigned
a Unicode character or sequence of characters.
Each of these sets is expressed as a file of XML entity declarations.
</p>
</div><div>
<h2><a id="status"></a>Status of this Document</h2>
<p><strong>This document is an editors' copy that has no official standing.</strong></p>
<p><em> This section describes the status of this document at the time
of its publication. Other documents may supersede this document. A
list of current W3C publications and the latest revision of this
technical report can be found in the <a href="https://www.w3.org/TR/">W3C technical reports index</a> at
https://www.w3.org/TR/.</em>
</p>
<p>This document has been reviewed by W3C Members, by software
developers, and by other W3C groups and interested parties, and is
endorsed by the Director as a W3C Recommendation. It is a stable
document and may be used as reference material or cited from another
document. W3C's role in making the Recommendation is to draw attention
to the specification and to promote its widespread deployment. This
enhances the functionality and interoperability of the Web.</p>
<p>This third edition is based on Unicode 13 and
incorporates changes to Unicode since Unicode 5.2 and 6.3,
on which the first and second editions of this document were based. Note these updates only affect
the non normative descriptions of the Unicode blocks. There are no changes to the Normative entity definitions.
It also has been updated and restructured slightly to note that <a href="#HTML5">[HTML5]</a> now uses these definitions.
and to more clearly highlight that the HTML-MathML entity set should be used in preference to the older ISO sets
that are also defined in this document.</p>
<p>
This document was produced by the
<a href="https://www.w3.org/Math/">W3C Math Working Group</a>
as a Recommendation and as part of the W3C
<a href="https://www.w3.org/Math/Activity">Math Activity</a>.
The goals of the W3C Math Working Group are discussed in the
<a href="https://www.w3.org/Math/Documents/Charter2006.html"> W3C Math WG Charter</a>.
The authors of this document are W3C Math Working Group members.
</p>
<p>
Comments should be sent to the
<a href="mailto:[email protected]">Public W3C Math mailing list</a>
(<a href="https://lists.w3.org/Archives/Public/www-math/">list archives</a>;
see also <a href="https://www.w3.org/Mail/Request">instructions</a>).
When sending an e-mail comment on the XML Entity Definitions for Characters, please
put the text “XML-Entities” in the subject line, preferably like this:
“[XML-Entities] …summary of comment ”.
Alternatively, report an issue at this specification's <a href="https://github.com/w3c/xml-entities">GitHub repository</a>.
</p>
<p>This document is governed by the <a href="https://www.w3.org/2017/Process-20170301/" id="w3c_process_revision">1 March 2017 W3C Process Document</a>. </p>
<p>This document was produced by a group operating under the
<a href="https://www.w3.org/Consortium/Patent-Policy-20040205/">5 February 2004 W3C Patent Policy</a>.
W3C maintains a
<a href="https://www.w3.org/2004/01/pp-impl/35549/status" rel="disclosure">public list of any patent disclosures</a> made in connection with
the deliverables of the group; that page also includes instructions
for disclosing a patent. An individual who has actual knowledge of
a patent which the individual believes contains
<a href="https://www.w3.org/Consortium/Patent-Policy-20040205/#def-essential">Essential Claim(s)</a>
must disclose the information in accordance with
<a href="https://www.w3.org/Consortium/Patent-Policy-20040205/#sec-Disclosure">section 6 of the W3C Patent Policy</a>.
</p>
<p> Appendix <a href="#changes"><b>B Changes</b></a> details the changes since earlier versions of this document.</p>
</div>
<nav id="toc">
<h2 id="tochead">Table of Contents</h2>
<ol class="toc">
<li><a href="#chars_intro"><span class="secno">1 </span>Introduction</a></li>
<li><a href="#sets"><span class="secno">2 </span>Sets of names</a>
<ol class="toc">
<li><a href="#htmlmathml"><span class="secno">2.1 </span>The HTML MathML entity set</a></li>
<li><a href="#legacysets"><span class="secno">2.2 </span>Legacy Entity sets</a></li></ol></li>
<li><a href="#blocks"><span class="secno">3 </span>Unicode Character Ranges for Scientific Documents</a></li>
<li><a href="#alphabets"><span class="secno">4 </span>Mathematical Alphanumeric Characters</a></li>
<li><a href="#combining"><span class="secno">5 </span>Entities for Negated and Variant Characters</a>
<ol class="toc">
<li><a href="#chars_math-negated-tables"><span class="secno">5.1 </span>Negated Mathematical Characters</a></li>
<li><a href="#chars_math-variant-tables"><span class="secno">5.2 </span>Variant
Mathematical Characters</a></li></ol></li>
<li><a href="#oddities"><span class="secno">A </span>Special Considerations</a>
<ol class="toc">
<li><a href="#epsilon"><span class="secno">A.1 </span>Epsilon</a></li>
<li><a href="#phi"><span class="secno">A.2 </span>Phi</a></li>
<li><a href="#chars_math-multiple-tables"><span class="secno">A.3 </span>Multiple Character Entities</a></li>
<li><a href="#chars_math-combining-tables"><span class="secno">A.4 </span>Entities Defined to be a Combining Character</a></li></ol></li>
<li><a href="#changes"><span class="secno">B </span>Changes</a>
<ol class="toc">
<li><a href="#changes20140410"><span class="secno">B.1 </span>Changes since 2014-04-10 (Second Edition Recommendation)</a></li>
<li><a href="#changes20100401"><span class="secno">B.2 </span>Changes between 2010-04-01 and 2014-04-10 (First and Second Edition Recommendations)</a></li>
<li><a href="#changes20100211"><span class="secno">B.3 </span>Changes between 2010-04-01 and 2010-02-11</a></li>
<li><a href="#changes20091117"><span class="secno">B.4 </span>Changes between 2010-02-11 and 2009-11-17</a></li>
<li><a href="#changes20080721"><span class="secno">B.5 </span>Changes between 2009-11-17 and 2008-07-21</a></li>
<li><a href="#changes20071214"><span class="secno">B.6 </span>Changes between 2008-07-21 and 2007-12-14</a></li></ol></li>
<li><a href="#diffs"><span class="secno">C </span>Differences between these entities and earlier W3C DTDs</a>
<ol class="toc">
<li><a href="#diff-xhtml1"><span class="secno">C.1 </span>Differences from XHTML 1.0</a></li>
<li><a href="#diff-mathml2"><span class="secno">C.2 </span>Differences from MathML 2.0 (second edition)</a></li></ol></li>
<li><a href="#source"><span class="secno">D </span>Source Files</a></li>
<li><a href="#references"><span class="secno">E </span>References</a></li></ol>
</nav>
<div class="body">
<div class="div1">
<h2><a id="chars_intro"></a>1 Introduction</h2>
<p>
Notation and symbols have proved very important for human communication,
especially in scientific documents. Mathematics has
grown in part because its notation continually changes toward being succinct
and suggestive. There have been many new signs
developed for use in mathematical notation, and
mathematicians have not held back from making use of many symbols
originally introduced elsewhere. The result is that
science in general, and particularly mathematics, makes use of
a very large collection of symbols. It is
difficult to write science fluently if these characters are not
available for use. It is difficult to read science if
corresponding glyphs are not available for presentation on specific
display devices. In the majority of cases it is preferable to store
characters directly as Unicode character data or as XML numeric
character references.
</p>
<p>However, in some environments it is more
convenient to use the ASCII input mechanism provided by XML entity
references. Many entity names are in common use, and this
specification aims to provide standard mappings to Unicode for each of
these names. It introduces no names that have not already been used in
earlier specifications. Note that these names are short mnemonic names
designed for input methods such as XML entity references, not the longer formal names
that form part of the Unicode standard.
</p>
<p>
Specifically, the entity names in the sets
starting with the letters "iso" were first standardized in SGML (<a href="#SGML">[SGML]</a>) and updated in <a href="#ISO9573-13-1991">[ISO9573-13-1991]</a>.
The W3C Math Working Group has been
invited to take over the maintenance and development of these sets by the
original standards committee (ISO/IEC JTC1 SC34). The sets with names
starting "mml" were first standardized in
MathML <a href="#MathML2">[MathML2]</a> and those starting
with "xhtml" were first standardized in HTML <a href="#HTML4">[HTML4]</a>.
</p>
<p>This document is the result of years of employing entity names on the Web. There were
always a few named entities used for special characters in HTML, and many more names
used for MathML. This means that this document can be
viewed as an extension and final revision of Chapter 6 of the MathML 2.0
<a href="#MathML2">[MathML2]</a> recommendation. Now it presents a completed listing harmonizing
the known uses of character entity names in XML and HTML, together with defined mappings to Unicode.
</p>
<p>Since there are so many character entity names, and the files specifying them
are resources that may be subject to frequent lookup, a template catalog file has also been
provided. Users are strongly encouraged to design their implementations so that
relevant entity name tables are cached locally, since it is not expected that
the listings provided with this specification will need changing for some long time.
</p>
</div><div id="setsdiv">
<h2><a id="sets"></a>2 Sets of names</h2><div class="div2">
<h3><a id="htmlmathml"></a>2.1 The HTML MathML entity set</h3>
<p>Historically the entity sets have been split into relatively small groups of related characters
however for any new document type that is being defined it is strongly recommended that the combined
<b>htmlmathml</b> set is used. This defines an identical set of names to the names built in to
the HTML parser (derived from the same source materials as this document see <a href="#source"><b>D Source Files</b></a>).</p>
<p>To incorporate the <b>htmlmathml</b> set into an XML DTD, a typical construct is:</p>
<div class="exampleInner"><pre class="example">
<!ENTITY % htmlmathml-f PUBLIC
"-//W3C//ENTITIES HTML MathML Set//EN//XML"
"https://www.w3.org/2003/entities/2007/htmlmathml-f.ent"
>
%htmlmathml-f;
</pre></div>
<p>
The public identifier should always be used verbatim,
The system identifier should be changed to suit local requirements.
</p>
<p>The entity set is available in two forms: </p>
<ul><li><a href="https://www.w3.org/2003/entities/2007/htmlmathml-f.ent">htmlmathml-f</a> the expanded set of HTML and MathML entity definitions</li><li><a href="https://www.w3.org/2003/entities/2007/htmlmathml.ent">htmlmathml</a> the HTML and MathML entities defined via reference to the legacy entity set definitions as listed in the following section.</li></ul>
<p>The information is also available in JSON format. The JSON arrays encode the entity names and mappings to Unicode and also a list of those entity references for which the HTML (but not XML) parser allows the trailing semicolon to be omitted. So <code>&amp</code> may be used as well as <code>&amp;</code> when using HTML.</p>
<ul><li><a href="2007/htmlmathml.json">htmlmathml.json</a></li></ul>
<p>An XSLT2 stylesheet is available which performs the reverse mapping, replacing Unicode characters by entity references.</p>
<ul><li><a href="2007/htmlmathml.xsl">htmlmathml.xsl</a></li></ul></div><div class="div2">
<h3><a id="legacysets"></a>2.2 Legacy Entity sets</h3>
<p>This specification defines mappings to Unicode of many sets of names
that have been defined by earlier specifications.</p>
<p>We present two tables listing all the sets combined, first in
Unicode order and then in alphabetic order:</p>
<ul><li>All in <a href="2007doc/bycodes.html">Unicode order</a></li><li>All in <a href="2007doc/byalpha.html">alphabetic order</a>.</li></ul>
<p>Then there come tables
documenting each of the entity sets. Each set has a link to the DTD
entity declaration for the corresponding entity set, and also a link
to an XSLT2 stylesheet that will implement a reverse mapping from
characters to entity names (this is, of course, only possible for entity names
that map to a single Unicode code point).
</p>
</div><ul><li><a href="2007doc/isobox.html">isobox</a> Box and Line Drawing</li><li><a href="2007doc/isocyr1.html">isocyr1</a> Russian Cyrillic</li><li><a href="2007doc/isocyr2.html">isocyr2</a> Non-Russian Cyrillic</li><li><a href="2007doc/isodia.html">isodia</a> Diacritical Marks</li><li><a href="2007doc/isolat1.html">isolat1</a> Added Latin 1</li><li><a href="2007doc/isolat2.html">isolat2</a> Added Latin 2</li><li><a href="2007doc/isonum.html">isonum</a> Numeric and Special Graphic</li><li><a href="2007doc/isopub.html">isopub</a> Publishing</li><li><a href="2007doc/isoamsa.html">isoamsa</a> Added Math Symbols: Arrow Relations</li><li><a href="2007doc/isoamsb.html">isoamsb</a> Added Math Symbols: Binary Operators</li><li><a href="2007doc/isoamsc.html">isoamsc</a> Added Math Symbols: Delimiters</li><li><a href="2007doc/isoamsn.html">isoamsn</a> Added Math Symbols: Negated Relations</li><li><a href="2007doc/isoamso.html">isoamso</a> Added Math Symbols: Ordinary</li><li><a href="2007doc/isoamsr.html">isoamsr</a> Added Math Symbols: Relations</li><li><a href="2007doc/isogrk1.html">isogrk1</a><span style="background-color: #FFCCCC"> Greek Letters (not in MathML3 / HTML5)</span></li><li><a href="2007doc/isogrk2.html">isogrk2</a><span style="background-color: #FFCCCC"> Monotoniko Greek (not in MathML3 / HTML5)</span></li><li><a href="2007doc/isogrk3.html">isogrk3</a> Greek Symbols</li><li><a href="2007doc/isogrk4.html">isogrk4</a><span style="background-color: #FFCCCC"> Alternative Greek Symbols (not in MathML3 / HTML5)</span></li><li><a href="2007doc/isomfrk.html">isomfrk</a> Math Alphabets: Fraktur</li><li><a href="2007doc/isomopf.html">isomopf</a> Math Alphabets: Open Face</li><li><a href="2007doc/isomscr.html">isomscr</a> Math Alphabets: Script</li><li><a href="2007doc/isotech.html">isotech</a> General Technical</li><li><a href="2007doc/mmlextra.html">mmlextra</a> Additional MathML Symbols</li><li><a href="2007doc/mmlalias.html">mmlalias</a> MathML Aliases</li><li><a href="2007doc/xhtml1-lat1.html">xhtml1-lat1</a> Latin for HTML</li><li><a href="2007doc/xhtml1-special.html">xhtml1-special</a> Special for HTML</li><li><a href="2007doc/xhtml1-symbol.html">xhtml1-symbol</a> Symbol for HTML</li><li><a href="2007doc/html5-uppercase.html">html5-uppercase</a> uppercase aliases for HTML</li><li><a href="2007doc/predefined.html">predefined</a> Predefined XML</li></ul><p>In addition to the stylesheets and entity files corresponding
to each individual entity set, a <a href="https://www.w3.org/2003/entities/2007/entitynamesmap.xsl">combined
stylesheet</a> is provided, as well as a combined entity set,
in two formats, as for the HTML MathML set described above.</p><ul><li><a href="https://www.w3.org/2003/entities/2007/w3centities.ent">w3centities</a> W3C entities collection; referencing all entity sets listed above</li><li><a href="https://www.w3.org/2003/entities/2007/w3centities-f.ent">w3centities-f</a> the same set of entity definitions, expanded into a single file, with duplicates removed</li></ul></div><div id="blocksdiv">
<h2><a id="blocks"></a>3 Unicode Character Ranges for Scientific Documents</h2>
<p>
Certain characters are of particular relevance to scientific document production. The following
tables display Unicode ranges containing the characters that are most used in mathematics.
</p>
<p>Note that each of the tables linked from this section contains 256 images and may take a
while to load if the images have not been cached locally.</p>
<ul><li><span>000</span> <a href="2007doc/000.html">C0 Controls and Basic Latin, C1 Controls and Latin-1 Supplement</a></li><li><span>001</span> <a href="2007doc/001.html">Latin Extended-A, Latin Extended-B</a></li><li><span>002</span> <a href="2007doc/002.html">IPA Extensions, Spacing Modifier Letters</a></li><li><span>003</span> <a href="2007doc/003.html">Combining Diacritical Marks, Greek and Coptic</a></li><li><span>004</span> <a href="2007doc/004.html">Cyrillic</a></li><li><span>006</span> <a href="2007doc/006.html">Arabic</a></li><li><span>020</span> <a href="2007doc/020.html">General Punctuation, Superscripts and Subscripts, Currency Symbols, Combining Diacritical Marks for Symbols</a></li><li><span>021</span> <a href="2007doc/021.html">Letterlike Symbols, Number Forms, Arrows</a></li><li><span>022</span> <a href="2007doc/022.html">Mathematical Operators</a></li><li><span>023</span> <a href="2007doc/023.html">Miscellaneous Technical</a></li><li><span>024</span> <a href="2007doc/024.html">Control Pictures, Optical Character Recognition, Enclosed Alphanumerics</a></li><li><span>025</span> <a href="2007doc/025.html">Box Drawing, Block Elements, Geometric Shapes</a></li><li><span>026</span> <a href="2007doc/026.html">Miscellaneous Symbols</a></li><li><span>027</span> <a href="2007doc/027.html">Dingbats, Miscellaneous Mathematical Symbols-A, Supplemental Arrows-A</a></li><li><span>029</span> <a href="2007doc/029.html">Supplemental Arrows-B, Miscellaneous Mathematical Symbols-B</a></li><li><span>02A</span> <a href="2007doc/02A.html">Supplemental Mathematical Operators</a></li><li><span>02B</span> <a href="2007doc/02B.html">Miscellaneous Symbols and Arrows</a></li><li><span>0FB</span> <a href="2007doc/0FB.html">Alphabetic Presentation Forms, Arabic Presentation Forms-A</a></li><li><span>0FE</span> <a href="2007doc/0FE.html">Variation Selectors, Vertical Forms, Combining Half Marks, CJK Compatibility Forms, Small Form Variants, Arabic Presentation Forms-B</a></li><li><span>1D4</span> <a href="2007doc/1D4.html">Mathematical Alphanumeric Symbols</a></li><li><span>1D5</span> <a href="2007doc/1D5.html">Mathematical Alphanumeric Symbols (continued)</a></li><li><span>1D6</span> <a href="2007doc/1D6.html">Mathematical Alphanumeric Symbols (continued)</a></li><li><span>1D7</span> <a href="2007doc/1D7.html">Mathematical Alphanumeric Symbols (continued)</a></li><li><span>1EE</span> <a href="2007doc/1EE.html">Arabic Mathematical Alphabetic Symbols</a></li><li><span>1F7</span> <a href="2007doc/1F7.html">Alchemical Symbols, Geometric Shapes Extended</a></li><li><span>1F8</span> <a href="2007doc/1F8.html">Supplemental Arrows-C</a></li></ul></div><div id="alphabetsdiv">
<h2><a id="alphabets"></a>4 Mathematical Alphanumeric Characters</h2>
<p>
Many of the entities defined by this specification relate to the
mathematical alphanumeric characters contained in the letter-like
symbols block of Unicode Plane 0, or in the Mathematical Alphanumeric Symbols
block in Unicode Plane 1. The following tables list all these symbols,
highlighting those that are not in Plane 1, and giving entity names
where appropriate.
</p>
<ul><li><a href="2007doc/bold.html">Bold (Serif)</a></li><li><a href="2007doc/italic.html">Italic or Slanted</a></li><li><a href="2007doc/bold-italic.html">Bold Italic or Slanted</a></li><li><a href="2007doc/script.html">Script (or Calligraphic)</a></li><li><a href="2007doc/bold-script.html">Bold Script</a></li><li><a href="2007doc/fraktur.html">Fraktur</a></li><li><a href="2007doc/bold-fraktur.html">Bold Fraktur</a></li><li><a href="2007doc/sans-serif.html">Sans Serif</a></li><li><a href="2007doc/bold-sans-serif.html">Bold Sans Serif</a></li><li><a href="2007doc/sans-serif-italic.html">Slanted Sans Serif</a></li><li><a href="2007doc/sans-serif-bold-italic.html">Slanted Bold Sans Serif</a></li><li><a href="2007doc/monospace.html">Monospace</a></li><li><a href="2007doc/double-struck.html">Double Struck (Open Face, Blackboard Bold)</a></li><li><a href="2007doc/initial.html">Arabic Initial Form</a></li><li><a href="2007doc/tailed.html">Arabic Tailed Form</a></li><li><a href="2007doc/looped.html">Arabic Looped Form</a></li><li><a href="2007doc/stretched.html">Arabic Stretched Form</a></li></ul></div><div class="div1">
<h2><a id="combining"></a>5 Entities for Negated and Variant Characters</h2>
<p>
Each of the entity definitions in a majority of the specification expands
to a single Unicode character. The definitions that expand to a sequence of
two or more characters are outlined in this section.
</p>
<div class="div2">
<h3><a id="chars_math-negated-tables"></a>5.1 Negated Mathematical Characters</h3>
<p>
In addition to the Unicode Characters so far listed, one may use the
combining characters U+0338 (/),
U+20D2 (|) and
U+20E5 (\) to produce
negated or canceled forms of
characters. A combining character
should be placed immediately after its "base" character, with no
intervening markup or space, just as is the case for combining accents.
</p>
<p>
In principle, the negation characters may be applied to any Unicode
character, although fonts designed for mathematics typically have some
negated glyphs ready composed. A MathML renderer should be able to use
these pre-composed glyphs in these cases. A compound character code
either represents a UCS character that is already available, as in the
case of U+003D U+0338 which amounts to
U+2260, or it does not, as is the
case for U+2202 U+0338. The common cases of
negations, of the latter type,
that have been identified are listed in the tables.</p>
<ul><li><a href="2007doc/U00338.html">combining long solidus overlay</a></li><li><a href="2007doc/U020D2.html">combining long vertical line overlay</a></li><li><a href="2007doc/U020E5.html">combining reverse solidus overlay</a></li></ul>
<p>
Note that it is the policy of the W3C and of Unicode that if a single
character is already defined for what can be achieved with a combining
character, that character must be used instead of the decomposed form.
It is also intended that no new single characters representing what
can be done by with existing compositions will be introduced.
For further information on these matters see
the Unicode Standard Annex 15, Unicode Normalization Forms
<a href="#Unicode15">[Unicode15]</a>, especially
the discussion of Normalization Form C.
</p>
</div><div class="div2">
<h3><a id="chars_math-variant-tables"></a>5.2 Variant
Mathematical Characters</h3>
<p>
Unicode attempts to avoid having several character codes for simple
font variants. For a code point to be assigned there should be
more than a nuance in glyphs to be recorded. To record
variants worth noting there is a special character in
Unicode 3.2, U+FE00 (VARIATION SELECTOR-1), which
acts as a postfix modifier. However the legally allowed
combinations with this variation selector are restricted to a
list recorded as part of Unicode. The VARIATION SELECTOR-1
character may only be applied to the characters listed here.
The resulting combination is not regarded by Unicode as a separate
character, but a variation on the base character. Unicode aware systems
may render the combination as the base if the available fonts do not
support the variant glyph shape.
</p>
<ul><li><a href="2007doc/U0FE00.html">variation selector-1</a></li></ul></div></div>
</div>
<div class="back"><div class="div1">
<h2><a id="oddities"></a>A Special Considerations</h2><div class="div2">
<h3><a id="epsilon"></a>A.1 Epsilon</h3>
<p>Historically there has been much confusion and lack of
agreement over variant forms for lower case epsilon.</p>
<p>This specification uses the definitions below. Note that the
name <b>epsilon</b> is used for the character used in textual
Greek (U+03B5) and <b>varepsilon</b> used for the epsilon
symbol character more commonly used in mathematics
(U+03F5). Note that this usage is compatible with the naming of
similar pairs of characters (for example <b>theta</b>,
<b>vartheta</b>) but <em>incompatible</em> with the naming
convention used in TeX, MathML2 and some earlier mappings of the
ISO entity sets to Unicode.</p>
<table class="complex data"><thead><tr><th>Entity</th><th>Set</th><th>Description</th><th colspan="3">Unicode Character</th></tr></thead><tbody><tr><td>eacgr</td><td>isogrk2</td><td>=small epsilon, accent, Greek</td><td rowspan="1">U+03AD</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/003/U003AD.png" alt="U003AD"></td><td rowspan="1">GREEK SMALL LETTER EPSILON WITH TONOS</td></tr><tr><td>egr</td><td>isogrk1</td><td>=small epsilon, Greek</td><td rowspan="3">U+03B5</td><td rowspan="3"><img height="32" width="32" src="2007doc/glyphs/003/U003B5.png" alt="U003B5"></td><td rowspan="3">GREEK SMALL LETTER EPSILON</td></tr><tr><td>epsi</td><td>isogrk3</td><td>/epsilon</td></tr><tr><td>epsilon</td><td>xhtml1-symbol</td><td></td></tr><tr><td>epsiv</td><td>isogrk3</td><td>/straightepsilon, small epsilon, Greek</td><td rowspan="3">U+03F5</td><td rowspan="3"><img height="32" width="32" src="2007doc/glyphs/003/U003F5.png" alt="U003F5"></td><td rowspan="3">GREEK LUNATE EPSILON SYMBOL</td></tr><tr><td>straightepsilon</td><td>mmlalias</td><td>alias ISOGRK3 epsiv</td></tr><tr><td>varepsilon</td><td>mmlalias</td><td>alias ISOGRK3 epsiv</td></tr><tr><td>bepsi</td><td>isoamsr</td><td>/backepsilon R: such that</td><td rowspan="2">U+03F6</td><td rowspan="2"><img height="32" width="32" src="2007doc/glyphs/003/U003F6.png" alt="U003F6"></td><td rowspan="2">GREEK REVERSED LUNATE EPSILON SYMBOL</td></tr><tr><td>backepsilon</td><td>mmlalias</td><td>alias ISOAMSR bepsi</td></tr><tr><td>b.epsi</td><td>isogrk4</td><td>small epsilon, Greek</td><td rowspan="1">U+1D6C6</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/1D6/U1D6C6.png" alt="U1D6C6"></td><td rowspan="1">MATHEMATICAL BOLD SMALL EPSILON</td></tr><tr><td>b.epsiv</td><td>isogrk4</td><td>variant epsilon</td><td rowspan="1">U+1D6DC</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/1D6/U1D6DC.png" alt="U1D6DC"></td><td rowspan="1">MATHEMATICAL BOLD EPSILON SYMBOL</td></tr></tbody></table></div><div class="div2">
<h3><a id="phi"></a>A.2 Phi</h3>
<p>The situation for phi is very similar to that of epsilon,
although with the further complication that early versions of
Unicode had the sample glyphs for U+03C6 and U+03D5 swapped
from the current usage, and some older fonts still in use follow
that older convention. The definitions used in this
specification are as listed below.</p>
<table class="complex data"><thead><tr><th>Entity</th><th>Set</th><th>Description</th><th colspan="3">Unicode Character</th></tr></thead><tbody><tr><td>phi</td><td>isogrk3</td><td>/phi - small phi, Greek</td><td rowspan="3">U+03C6</td><td rowspan="3"><img height="32" width="32" src="2007doc/glyphs/003/U003C6.png" alt="U003C6"></td><td rowspan="3">GREEK SMALL LETTER PHI</td></tr><tr><td>phi</td><td>xhtml1-symbol</td><td>greek small letter phi</td></tr><tr><td>phgr</td><td>isogrk1</td><td>=small phi, Greek</td></tr><tr><td>straightphi</td><td>mmlalias</td><td>alias ISOGRK3 phiv</td><td rowspan="3">U+03D5</td><td rowspan="3"><img height="32" width="32" src="2007doc/glyphs/003/U003D5.png" alt="U003D5"></td><td rowspan="3">GREEK PHI SYMBOL</td></tr><tr><td>phiv</td><td>isogrk3</td><td>/varphi - straight phi</td></tr><tr><td>varphi</td><td>mmlalias</td><td>alias ISOGRK3 phiv</td></tr><tr><td>b.phi</td><td>isogrk4</td><td>small phi, Greek</td><td rowspan="1">U+1D6D7</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/1D6/U1D6D7.png" alt="U1D6D7"></td><td rowspan="1">MATHEMATICAL BOLD SMALL PHI</td></tr><tr><td>b.phiv</td><td>isogrk4</td><td>variant phi</td><td rowspan="1">U+1D6DF</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/1D6/U1D6DF.png" alt="U1D6DF"></td><td rowspan="1">MATHEMATICAL BOLD PHI SYMBOL</td></tr></tbody></table></div><div class="div2">
<h3><a id="chars_math-multiple-tables"></a>A.3 Multiple Character Entities</h3>
<p>
In addition to the combining and variant character combinations
listed in the previous sections,
the following table lists the remaining entity replacement texts that
consist of more than one character.
</p>
<table class="complex data"><thead><tr><th>Entity</th><th>Set</th><th>Description</th><th colspan="3">Unicode Character</th></tr></thead><tbody><tr class="combine"><td>fjlig</td><td>isopub</td><td>small fj ligature</td><td rowspan="1">U+0066 U+006A</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/000/U00066-0006A.png" alt="U00066-0006A"></td><td rowspan="1">fj ligature</td></tr><tr class="combine"><td>ThickSpace</td><td>mmlextra</td><td>space of width 5/18 em</td><td rowspan="1">U+205F U+200A</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/020/U0205F-0200A.png" alt="U0205F-0200A"></td><td rowspan="1">space of width 5/18 em</td></tr><tr class="combine"><td>race</td><td>isoamsb</td><td>reverse most positive, line below</td><td rowspan="1">U+223D U+0331</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/022/U0223D-00331.png" alt="U0223D-00331"></td><td rowspan="1">REVERSED TILDE with underline</td></tr><tr class="combine"><td>acE</td><td>isoamsb</td><td>most positive, two lines below</td><td rowspan="1">U+223E U+0333</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/022/U0223E-00333.png" alt="U0223E-00333"></td><td rowspan="1">INVERTED LAZY S with double underline</td></tr><tr class="combine"><td>DownBreve</td><td>mmlextra</td><td>breve, inverted (non-spacing)</td><td rowspan="1">U+0020 U+0311</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/003/U00311.png" alt="U00311"></td><td rowspan="1">COMBINING INVERTED BREVE</td></tr><tr class="combine"><td>tdot</td><td>isotech</td><td>three dots above</td><td rowspan="2">U+0020 U+20DB</td><td rowspan="2"><img height="32" width="32" src="2007doc/glyphs/020/U020DB.png" alt="U020DB"></td><td rowspan="2">COMBINING THREE DOTS ABOVE</td></tr><tr class="combine"><td>TripleDot</td><td>mmlalias</td><td>alias ISOTECH tdot</td></tr><tr class="combine"><td>DotDot</td><td>isotech</td><td>four dots above</td><td rowspan="1">U+0020 U+20DC</td><td rowspan="1"><img height="32" width="32" src="2007doc/glyphs/020/U020DC.png" alt="U020DC"></td><td rowspan="1">COMBINING FOUR DOTS ABOVE</td></tr></tbody></table>
<p>Unicode does not have an fj character, although the other common f ligatures
such as fi (U+FB01) are contained in the Alphabetic Presentation Forms block.
The <b>fjlig</b> entity is mapped to the pair of characters "fj";
modern typesetting engines should automatically use the fj ligature for this
combination if the font supplies such a ligature.</p>
<p>Unicode has a range of space characters (including all multiples of
1/18 em up to 6/18, except for 5/18 em) thus the <b>ThickSpace</b> entity is
mapped to a pair of space characters. An alternative would have been to use
U+2005 (1/4 em), but 1/4 em is not equal to 5/18 em, so the above definition was
chosen, despite the fact that the difference is unlikely to be visibly
noticeable at most typeset font sizes.</p>
<p>The entities <b>race</b> and <b>acE</b> denote underlined
characters for which Unicode does not have codepoints, thus combining
underline characters have been used, in a way analogous to the use of
combining strokes for negated operators.</p>
</div><div class="div2">
<h3><a id="chars_math-combining-tables"></a>A.4 Entities Defined to be a Combining Character</h3>
<p>
The following table lists the entity replacement texts that
consist of a combining character.
</p>
<p id="combining-start"></p>
<p>For reasons explained further in <a href="#Charmod-norm">[Charmod-norm]</a>, it is
not advisable to to start the replacement text of an entity with a
combining character, as then potentially different results may be
produced depending on the order in which entity expansion and Unicode
normalisation are performed. As far as possible this specification
uses non-combining characters, however, in the cases <b>DownBreve</b>,
<b>tdot</b>, <b>TripleDot</b> and <b>DotDot</b>
Unicode only has combining forms of the accents.</p>
<p>Earlier versions of this specification defined these entities
with the replacement text starting with a space, to avoid the possibility that
the expansion of the entity combined with preceding text. However for various reasons
the entities as incorporated in HTML do not have a space here, and so the
definitions now consist just of the combining character so that HTML and XHTML
are consistent with any specifications using these definitions.</p>
</div></div><div class="div1">
<h2><a id="changes"></a>B Changes</h2><div class="div2">
<h3><a id="changes20140410"></a>B.1 Changes since 2014-04-10 (Second Edition Recommendation)</h3>
<p>Reference added to the 1 March 2017 W3C Process Document.</p>
<p>Some changes to the front matter including link to GitHub as
required by the latest W3C publication process.</p>
<p>Adjustments to CSS styling to match new W3C document style.</p>
<p>Source files updated to Unicode 11.0, affecting the character tables,
but with no changes to generated entity files or stylesheets.</p>
<p>The source repository has been moved to github so the log is now public.</p>
<p>As detailed in <a href="#chars_math-combining-tables"><b>A.4 Entities Defined to be a Combining Character</b></a> <b>DownBreve</b>,
<b>tdot</b>, <b>TripleDot</b> and <b>DotDot</b> are no longer prefixed by a space.</p>
</div><div class="div2">
<h3><a id="changes20100401"></a>B.2 Changes between 2010-04-01 and 2014-04-10 (First and Second Edition Recommendations)</h3>
<p>Source files updated to Unicode 6.3, affecting the character tables,
but with no changes to generated entity files or stylesheets.</p>
<p>Source files updated Unicode 6.1 data on Arabic math alphabets (U+1EE??). Additional tables shown in Sections 3 and 4.</p>
<p>Section <a href="#sets"><b>2 Sets of names</b></a> reorganized to highlight the <code>htmlmathml</code> set which is used in MathML and HTML. Also link to XSL and JSON formats for the HTML MathML set.</p>
<p>References updated: <a href="#MathML3">[MathML3]</a>, <a href="#HTML5">[HTML5]</a> and <a href="#Unicode">[Unicode]</a>.</p>
</div><div class="div2">
<h3><a id="changes20100211"></a>B.3 Changes between 2010-04-01 and 2010-02-11</h3>
<p>Several example images improved, bringing them more in line with the Unicode reference images.</p>
</div><div class="div2">
<h3><a id="changes20091117"></a>B.4 Changes between 2010-02-11 and 2009-11-17</h3>
<p>Various editorial improvements, including using Unicode U+1234
notation more consistently rather than displaying the internal
IDs of the form U01234.</p>
<p>The combined entities file distributed with the 2009-11-17
draft introduced an error that if two entity names differed only
by case, only one was included. This has been corrected.</p>
<p>The combined entity set htmlmathml corresponding to the
entities usable in HTML and MathML is now explicitly provided. The
predefined set, corresponding to the entities predefined in XML
is now documented (it was previously used internally).</p>
<p>The entities <b>xvee</b> and <b>xwedge</b> had the correct
Unicode assignments (U+22C1 and U+22C0) but the entity descriptions
have been swapped, <b>xvee</b> is logical or and <b>xwedge</b> is logical and.
This error in <a href="#ISO9573-13-1991">[ISO9573-13-1991]</a> was reported in 1999,
in a Proposed Technical Corrigendum,
but not previously fixed. The entity files are unaffected by this change.</p>
<p>The entity <b>NotGreaterFullEqual</b> which had been erroneously assigned to
a negated less than operator (U+2266 U+0338) has been corrected to be the negated greater than operator (U+2267 U+0338).</p>
<p>A sample <a href="#catalog">catalog</a> is now provided to redirect references to the entity files to copies on the local machine rather than the W3C server.</p>
</div><div class="div2">
<h3><a id="changes20080721"></a>B.5 Changes between 2009-11-17 and 2008-07-21</h3>
<p>The html5-uppercase set is now documented.</p>
<p>The entities <b>ohm</b> and <b>angst</b> have changed to U+03A9 and U+00C5 to match NFC. See
<a href="https://www.w3.org/Bugs/Public/show_bug.cgi?id=5897">w3c bugzilla entry</a>.</p>
<p>The entity <b>race</b>, which had been erroneously assigned U+29DA,
is now assigned the combination U+223D U+0331. (U+223D isn't
quite the shape shown in the original ISO document which is a
rotated S rather than a rotated tilde, but this appears to be
the closest character in Unicode 5.2.)</p>
<p>The entities <b>bsolhsub</b> and <b>suphsol</b> which were previously
mapped to two-character combinations U+005C U+2282 and U+2283 U+002F
are now mapped to the Unicode 5 characters that were added
specifically to support these entities, U+27C8 and U+27C9.</p>
<p>The source files have all been updated to match Unicode 5.2.</p>
<p>The entity <b>ThickSpace</b> now maps to the pair
U+205F U+200A rather than the triple U+2009 U+200A U+200A
(4/18 + 1/18)em rather than (3/18 + 1/18 + 1/18)em.</p>
<p>The entity <b>UnderBar</b> maps to the spacing character
_ rather than the combining character U+0332.</p>
<p>The entity <b>OverBar</b> maps to the spacing character
U+203E (like the XHTML entity <b>oline</b>) rather than the macron character U+00AF.</p>
<p>The entities <b>epsiv</b> and <b>varepsilon</b> are now mapped to the epsilon symbol
U+03F5 rather than being aliases for the entity <b>epsilon</b>, U+03B5.</p>
<p>The entities <b>phiv</b> and <b>varphi</b> are now mapped to the phi symbol U+03D5
rather than being aliases for the entity <b>phi</b>, U+03C6.</p>
</div><div class="div2">
<h3><a id="changes20071214"></a>B.6 Changes between 2008-07-21 and 2007-12-14</h3>
<p>The following entity definitions have changed at this draft:</p>
<p><b>phi</b>, <b>lang</b>, <b>rang</b>,
<b>OverParenthesis</b>, <b>UnderParenthesis</b>,
<b>OverBrace</b>, <b>UnderBrace</b>,
<b>lbbrk</b>, <b>rbbrk</b>.</p>
</div></div><div class="div1">
<h2><a id="diffs"></a>C Differences between these entities and earlier W3C DTDs</h2><div class="div2">
<h3><a id="diff-xhtml1"></a>C.1 Differences from XHTML 1.0</h3>
<p>Differences between the XHTML entity definitions described here and the entity set
described in the <a href="https://www.w3.org/TR/xhtml1/dtds.html">XHTML 1.0 DTD</a>.</p>
<dl><dt class="kwlabel"><b>lang</b> and <b>rang</b></dt><dd>U+27E8 and
U+27E9; XHTML 1.0 used U+2329 and U+232A (which have canonical
decomposition to U+3008 and U+3009).</dd></dl><div class="note"><p class="prefix"><b>Note:</b></p>
<p>The current drafts of <a href="#HTML5">[HTML5]</a> use entity definitions derived from
this specification.</p>
</div></div><div class="div2">
<h3><a id="diff-mathml2"></a>C.2 Differences from MathML 2.0 (second edition)</h3>
<p>The differences between MathML 2 and the current entity
definitions are listed below.</p>
<dl><dt class="kwlabel"><b>fjlig</b></dt><dd>ISOPUB (and MathML 1) defined an fj ligature;
Unicode does not have a specific character and the entity was dropped from MathML2.
It is re-instated here for maximum compatibility with <a href="#SGML">[SGML]</a>.</dd><dt class="kwlabel"><b>phi</b></dt><dd>U+03C6 GREEK SMALL LETTER PHI
(the definition used in HTML4);
MathML2 used U+03D5 GREEK PHI SYMBOL. </dd><dt class="kwlabel"><b>epsiv</b>, <b>varepsilon</b>, <b>phiv</b>,
<b>varphi</b></dt><dd>these have been changed to map to the symbol character
(to match other uses of the var prefix such as <b>vartheta</b>).</dd><dt class="kwlabel"><b>jmath</b></dt><dd>U+0237; MathML 2 used U+006A (j) as
there was no dotless j before Unicode 4.1.</dd><dt class="kwlabel"><b>trpezium</b>, <b>elinters</b></dt><dd>U+23E2 and U+23E7;
MathML 2 used U+FFFD (REPLACEMENT CHARACTER) as these characters were added at Unicode 5.0
specifically to support these entities. </dd><dt class="kwlabel"><b>ohm</b>, <b>angst</b></dt><dd>As noted above, the
definitions of these entities have been changed so that the
definitions use characters that are in NFC normal
form.</dd><dt class="kwlabel"><b>bsolhsub</b> and <b>suphsol</b></dt><dd> U+27C8 and
U+27C9;
MathML2 used U+005C U+02282 and U+2283 U+002F.</dd><dt class="kwlabel"><b>NotGreaterFullEqual</b></dt><dd> U+2267 U+0338 ;
MathML2 used the erroneous definition U+2266 U+0338.</dd></dl>
<p>The following bracket symbols have been added to the Mathematical
symbols block in Unicode versions between 3.1 and 5.1. MathML2 used
similar characters intended for CJK punctuation.</p>
<dl><dt class="kwlabel"><b>lang</b>, <b>langle</b>, <b>LeftAngleBracket</b> and
<b>rang</b>, <b>rangle</b>, <b>RightAngleBracket</b></dt><dd>U+27E8 and
U+27E9; MathML2 used U+2329 and U+232A (which have canonical
decomposition to U+3008 and U+3009).</dd><dt class="kwlabel"><b>Lang</b> and <b>Rang</b></dt><dd>U+27EA and U+27EB;
MathML2 used U+300A and U+300B.</dd><dt class="kwlabel"><b>lbbrk</b> and <b>rbbrk</b></dt><dd>U+2772 and
U+2773; MathML2 used U+3014 and U+3015.</dd><dt class="kwlabel"><b>loang</b> and <b>roang</b></dt><dd>U+27EC and
U+27ED; MathML2 used U+3018 and U+3019.</dd><dt class="kwlabel"><b>lobrk</b> and <b>robrk</b></dt><dd>U+27E6 and
U+27E7; MathML2 used U+301A and U+301B.</dd><dt class="kwlabel"><b>OverBrace</b> and <b>UnderBrace</b></dt><dd>U+23DE and U+23DF; MathML2
used U+FE37 and U+FE38.</dd><dt class="kwlabel"><b>OverParenthesis</b> and
<b>UnderParenthesis</b></dt><dd>U+23DC and U+23DD;
MathML2 used U+FE35 and U+FE36.</dd><dt class="kwlabel"><b>LeftDoubleBracket</b> and
<b>RightDoubleBracket</b></dt><dd>U+27E6 and U+27E7;
MathML2 used U+301A and U+301B.</dd></dl><div class="note"><p class="prefix"><b>Note:</b></p>
<p><a href="#MathML3">[MathML3]</a> uses the entity sets defined by this specification.</p>
</div></div></div><div class="div1">
<h2><a id="source"></a>D Source Files</h2>
<p>All data files used to construct the entity declarations, XSLT character maps, and
HTML tables referenced from this document are available from
<a href="https://github.com/w3c/xml-entities/">https://github.com/w3c/xml-entities/</a>.</p>
<ul><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/unicode.xml">unicode.xml</a> master file detailing all Unicode characters with names in various entity sets and applications, TeX equivalents and other data. This file has been maintained for many years, originally by Sebastian Rahtz as part of the jadetex distribution and since around 1999 as part of the MathML specification sources by David Carlisle. The current version encodes data for all characters in Unicode 13.
<em>Note: unicode.xml is over 5MB in size and may not really be suitable for direct viewing in a browser. You may prefer to save the file rather than follow the above link to unicode.xml in a browser.</em></li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/charlist.rnc">charlist.rnc</a> relax NG schema for unicode.xml.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/unicode.xsl">unicode.xsl</a> XSLT stylesheet that renders unicode.xml as an HTML table.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/character-set.xml">character-set.xml</a> the source file for this document.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/xmlspec.xsl">xmlspec.xsl</a> a copy of the standard xmlspec stylesheet.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/run">run</a> small script file that builds this collection.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/xhtml1.xml">xhtml1.xml</a> record of XHTML 1.0 entity definitions.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/mml2.xml">mml2.xml</a> record of MathML 2.0 (second edition) entity definitions.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/unicodedata.xsl">unicodedata.xsl</a> stylesheet that generates a new copy of unicode.xml, incorporating data from the Unicode data file, used to update unicode.xml as new versions of Unicode are released.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/entities.xsl">entities.xsl</a> stylesheet to generate the DTD declarations for the entities.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/charmap.xsl">charmap.xsl</a> stylesheet to generate the XSLT character maps.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/characters.xsl">characters.xsl</a> stylesheet to generate this document, including the referenced HTML tables.</li><li><a href="https://github.com/w3c/xml-entities/blob/gh-pages/schemas.xml">schemas.xml</a> file associating XML documents with appropriate Relax NG schema.</li><li id="catalog"><a href="https://github.com/w3c/xml-entities/blob/gh-pages/catalog">catalog</a> Sample OASIS XML catalog that redirects references to the entity or stylesheet files at https://www.w3.org/2003/entities/2007/ to the local file system at
/etc/xml/w3c-entities. It should be edited to refer to the location of a local copy of the files. Many XML parsers may be configured to read this catalog format, but the specific options depend on the parser being used.</li></ul></div><div class="div1">
<h2><a id="references"></a>E References</h2><dl><dt class="label" id="SGML">SGML</dt><dd>ISO/IEC 8879:1986, Information processing — Text and office
systems — Standard Generalized Markup Language (SGML)</dd><dt class="label" id="ISO9573-13-1991">ISO9573-13-1991</dt><dd>ISO/IEC TR :1991, Information
technology — SGML support facilities
Techniques for using
SGML — Part 13: Public entity sets for
mathematics and science</dd><dt class="label" id="Unicode">Unicode</dt><dd>The Unicode Consortium. The Unicode Standard, Version 5.2.0, defined by:
The Unicode Standard, Version 5.2 (Mountain View, CA: The Unicode Consortium, 2009. ISBN 978-1-936213-00-9). Unicode 6.3 update
(<a href="https://www.unicode.org/versions/Unicode6.3.0/">https://www.unicode.org/versions/Unicode6.3.0/</a>)
</dd><dt class="label" id="Unicode15">Unicode15</dt><dd>Unicode Standard Annex 15, Version 6.3.0;
<em><a href="https://www.unicode.org/reports/tr15/">Unicode Normalization Forms</a></em>,
The Unicode Consortium, 2013-09-20.
(<a href="https://www.unicode.org/reports/tr15/">https://www.unicode.org/reports/tr15/</a>)
</dd><dt class="label" id="Unicode25">Unicode25</dt><dd>Barbara Beeton, Asmus Freytag, Murray Sargent III,
<em><a href="https://www.unicode.org/reports/tr25/">Unicode Support for Mathematics</a></em>,
Unicode Technical Report #25 2012-04-02.
(<a href="https://www.unicode.org/reports/tr25/">https://www.unicode.org/reports/tr25/</a>)
</dd><dt class="label" id="MathML2">MathML2</dt><dd>David Carlisle, Patrick Ion, Robert Miner, Nico Poppelier,
<em><a href="https://www.w3.org/TR/MathML2/">Mathematical Markup Language (MathML)
Version 2.0 (Second Edition)</a></em>
W3C Recommendation 21 October 2003
(<a href="https://www.w3.org/TR/2003/REC-MathML2-20031021/">https://www.w3.org/TR/2003/REC-MathML2-20031021/</a>)
</dd><dt class="label" id="MathML3">MathML3</dt><dd>David Carlisle, Patrick Ion, Robert Miner,
<em><a href="https://www.w3.org/TR/MathML3/">Mathematical Markup Language (MathML) Version 3.0 2nd Edition</a></em>
W3C Recommendation 10 April 2014
(<a href="https://www.w3.org/TR/2014/REC-MathML3-20140410/">https://www.w3.org/TR/2014/REC-MathML3-20140410/</a>)
</dd><dt class="label" id="HTML4">HTML4</dt><dd>Dave Raggett, Arnaud Le Hors, Ian Jacobs,
<em><a href="https://www.w3.org/TR/html4/"></a>HTML 4.01 Specification</em>
W3C Recommendation 24 December 1999
(<a href="https://www.w3.org/TR/1999/REC-html401-19991224/">https://www.w3.org/TR/1999/REC-html401-19991224</a>)</dd><dt class="label" id="HTML5">HTML5</dt><dd>Robin Berjon,
Steve Faulkner,
Travis Leithead,
Erika Doyle Navara,
Edward O'Connor,
Silvia Pfeiffer,
Ian Hickson
<em><a href="https://www.w3.org/TR/html5/">HTML 5,
A vocabulary and associated APIs for HTML and XHTML</a></em>
W3C Candidate Recommendation 6 August 2013
(<a href="https://www.w3.org/TR/html5/">https://www.w3.org/TR/html5/</a>)</dd><dt class="label" id="Charmod-norm">Charmod-norm</dt><dd> François Yergeau,
Martin J. Dürst,
Richard Ishida,
Addison Phillips,
Misha Wolf,
Tex Texin,
<em><a href="https://www.w3.org/TR/charmod-norm/">Character Model
for the World Wide Web 1.0: Normalization</a></em>
W3C Working Draft 1 May 2012
(<a href="https://www.w3.org/TR/charmod-norm/">https://www.w3.org/TR/charmod-norm/</a>)
</dd></dl></div></div></body></html>