[
Home
]
[ Index:
by date
|
by threads
]
[ Message by date: previous | next ] [ Message in thread: previous | next ] [ Thread: previous | next ]
[ Message by date: previous | next ] [ Message in thread: previous | next ] [ Thread: previous | next ]
| Date: | -- (:) |
| From: | Richard Jones <rich@a...> |
| Subject: | Re: [Caml-list] Supporting unicode in ocaml... |
On Sat, Aug 19, 2006 at 11:37:47AM +1200, Jonathan Roewen wrote: > Does the ocaml team ever plan on supporting unicode to some degree? > > What about being able to parse utf-8 encoded files, but keeping the > ascii only grammar? Then with the only change that if it's a utf-8 > file, that the utf-8 encoding of string constants are maintained. With > this scheme, you could theoretically bail on non-ascii characters > everywhere else. And a 3rd-party library like camomile could be used > for higher-level processing of the utf8-encoded string constants (from > the camomile docs, utf8 strings use the ocaml string type too). Have a look at Camomile: http://camomile.sourceforge.net/ Generally speaking, though, I just always use string == UTF-8 string and avoid using some of the unsafe functions from the standard library, such as String.lowercase. Rich. -- Richard Jones, CTO Merjis Ltd. Merjis - web marketing and technology - http://merjis.com Team Notepad - intranets and extranets for business - http://team-notepad.com