English version
Accueil     À propos     Téléchargement     Ressources     Contactez-nous    

Ce site est rarement mis à jour. Pour les informations les plus récentes, rendez-vous sur le nouveau site OCaml à l'adresse ocaml.org.

Browse thread
Unicode (was RE: JIT-compilation for OCaml?)
[ Home ] [ Index: by date | by threads ]
[ Search: ]

[ Message by date: previous | next ] [ Message in thread: previous | next ] [ Thread: previous | next ]
Date: 2001-01-12 (09:11)
From: Pierpaolo BERNARDI <bernardp@c...>
Subject: Re: Unicode (was RE: JIT-compilation for OCaml?)

On Thu, 11 Jan 2001, Dave Berry wrote:

> I thought Unicode was a recognised subset of ISO-10646, corresponding to the
> range 0-2^16.  

No. ISO-10646 and Unicode contains exactly the same code points.
Unicode has room for about 2^20 code points. The ISO committee has
agreed to limit ISO-10646 to the same range.

The current version of Unicode 3.0.1 (and consequently of ISO-10646) has
less than 2^16 code points assigned.  The next version, due
out in a couple of months will contain about 100.000 characters 

> My knowledge of C/C++ is probably out of date, but I thought they just used
> the wide character type, without requiring a particular internal
> representation.  

This is correct.

> In what way do ISO C/C++ support ISO-10646?

I have not been following this discussion, so I missed the message you are
replying to. One can say that C supports ISO-10646 in the sense that a C
environment *can* use ISO-10646 as its internal representation for wide
chars, if the C implementor so chooses.  Many compilers do just this, in