author Michael Kerrisk <mtk.manpages@gmail.com> 2017-08-18 00:59:04 +0200
committer Michael Kerrisk <mtk.manpages@gmail.com> 2017-08-18 00:59:04 +0200
commit a721e8b25faf1a33d32961f5b22fdfa0f4a82515
tree bbf2ffdfa587d418ca2da922a66d06ec7f78a575 /man7/unicode.7
parent 38db2ef4d0af619b8108fecd2e4b7b7062d59da6
aio.7, arp.7, attributes.7, boot.7, cgroups.7, cpuset.7, credentials.7, fanotify.7, fifo.7, glob.7, hier.7, hostname.7, icmp.7, inode.7, inotify.7, keyrings.7, libc.7, mailaddr.7, mount_namespaces.7, mq_overview.7, nptl.7, numa.7, path_resolution.7, persistent-keyring.7, pid_namespaces.7, pipe.7, pkeys.7, process-keyring.7, pthreads.7, pty.7, random.7, sched.7, sem_overview.7, session-keyring.7, shm_overview.7, signal-safety.7, signal.7, spufs.7, standards.7, symlink.7, termio.7, thread-keyring.7, time.7, unicode.7, user-keyring.7, user-session-keyring.7, user_namespaces.7, utf-8.7, xattr.7: ffix
Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>
Diffstat (limited to 'man7/unicode.7')
-rw-r--r-- man7/unicode.7 10
1 files changed, 5 insertions, 5 deletions
diff --git a/man7/unicode.7 b/man7/unicode.7
index 371cd227fc..934f8aeb56 100644
--- a/man7/unicode.7
+++ b/man7/unicode.7
@@ -37,7 +37,7 @@ It also guarantees "round-trip compatibility";
in other words,
conversion tables can be built such that no information is lost
when a string is converted from any other encoding to UCS and back.
-
+.PP
UCS contains the characters required to represent practically all
known languages.
This includes not only the Latin, Greek, Cyrillic,
@@ -59,7 +59,7 @@ graphical, typographical, mathematical, and scientific symbols,
including those provided by TeX, Postscript, APL, MS-DOS, MS-Windows,
Macintosh, OCR fonts, as well as many word processing and publishing
systems, and more are being added.
-
+.PP
The UCS standard (ISO 10646) describes a
31-bit character set architecture
consisting of 128 24-bit
@@ -166,7 +166,7 @@ code values (in all locales), a convention that is signaled by the GNU
C library to applications by defining the constant
.B __STDC_ISO_10646__
as specified in the ISO C99 standard.
-
+.PP
UCS/Unicode can be used just like ASCII in input/output streams,
terminal communication, plaintext files, filenames, and environment
variables in the ASCII compatible UTF-8 multibyte encoding.
@@ -216,7 +216,7 @@ Information technology \(em Universal Multiple-Octet Coded Character
Set (UCS) \(em Part 1: Architecture and Basic Multilingual Plane.
International Standard ISO/IEC 10646-1, International Organization
for Standardization, Geneva, 2000.
-
+.IP
This is the official specification of UCS .
Available from
.UR http://www.iso.ch/
@@ -228,7 +228,7 @@ Reading, MA, 2000, ISBN 0-201-61633-5.
.IP *
S. Harbison, G. Steele. C: A Reference Manual. Fourth edition,
Prentice Hall, Englewood Cliffs, 1995, ISBN 0-13-326224-3.
-
+.IP
A good reference book about the C programming language.
The fourth
edition covers the 1994 Amendment 1 to the ISO C90 standard, which