Version française
Home     About     Download     Resources     Contact us    
Browse thread
[Caml-list] Supporting unicode in ocaml...
[ Home ] [ Index: by date | by threads ]
[ Search: ]

[ Message by date: previous | next ] [ Message in thread: previous | next ] [ Thread: previous | next ]
Date: -- (:)
From: Richard Jones <rich@a...>
Subject: Re: [Caml-list] Supporting unicode in ocaml...
On Sat, Aug 19, 2006 at 11:37:47AM +1200, Jonathan Roewen wrote:
> Does the ocaml team ever plan on supporting unicode to some degree?
> 
> What about being able to parse utf-8 encoded files, but keeping the
> ascii only grammar? Then with the only change that if it's a utf-8
> file, that the utf-8 encoding of string constants are maintained. With
> this scheme, you could theoretically bail on non-ascii characters
> everywhere else. And a 3rd-party library like camomile could be used
> for higher-level processing of the utf8-encoded string constants (from
> the camomile docs, utf8 strings use the ocaml string type too).

Have a look at Camomile:

http://camomile.sourceforge.net/

Generally speaking, though, I just always use string == UTF-8 string
and avoid using some of the unsafe functions from the standard
library, such as String.lowercase.

Rich.

-- 
Richard Jones, CTO Merjis Ltd.
Merjis - web marketing and technology - http://merjis.com
Team Notepad - intranets and extranets for business - http://team-notepad.com