[Caml-list] ulex: lexer generator for Unicode
Date: -- (:)
From: Alain.Frisch@e...
Subject: [Caml-list] ulex: lexer generator for Unicode
Hello list,

I have started working on a lexer generator for Unicode. The architecture is
similar to ocamllex, except that ulex lexers use a new kind of lexbuf that
holds Unicode code points. Any Unicode stream can be fed into these lexbufs
(for the moment, adapters are provided only for Latin-1/UTF-8
streams/strings/channels, but you can also pass Unicode code points as

Lexer specifications are embedded in OCaml code, and parsed with a Camlp4
syntax extension.
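
To make the idea of a lexbuf holding Unicode code points more concrete, here
is a minimal sketch of what Latin-1 and UTF-8 adapters might do: turn a byte
string into an array of integer code points. This is not ulex's code. The
names code_points_of_latin1, code_points_of_utf8 and Malformed_utf8 are
hypothetical and used only for illustration, and the decoder assumes
well-formed input apart from basic continuation-byte checks (overlong forms
and surrogates are not rejected).

```ocaml
(* Hypothetical helpers, not part of ulex: convert encoded bytes into an
   array of Unicode code points, the kind of data a code-point lexbuf holds. *)

(* Raised with the byte offset at which decoding failed. *)
exception Malformed_utf8 of int

(* Latin-1: every byte is already its own code point (0..255). *)
let code_points_of_latin1 (s : string) : int array =
  Array.init (String.length s) (fun i -> Char.code s.[i])

(* UTF-8: decode 1- to 4-byte sequences. Simplified sketch: overlong
   encodings and surrogate code points are not rejected. *)
let code_points_of_utf8 (s : string) : int array =
  let n = String.length s in
  let acc = ref [] in
  (* Read a continuation byte (10xxxxxx) at offset i, return its 6 low bits. *)
  let cont i =
    if i >= n then raise (Malformed_utf8 i);
    let c = Char.code s.[i] in
    if c land 0xC0 <> 0x80 then raise (Malformed_utf8 i);
    c land 0x3F
  in
  let rec loop i =
    if i < n then begin
      let c = Char.code s.[i] in
      let (cp, len) =
        if c < 0x80 then (c, 1)
        else if c land 0xE0 = 0xC0 then
          (let b1 = cont (i + 1) in
           (((c land 0x1F) lsl 6) lor b1, 2))
        else if c land 0xF0 = 0xE0 then
          (let b1 = cont (i + 1) in
           let b2 = cont (i + 2) in
           (((c land 0x0F) lsl 12) lor (b1 lsl 6) lor b2, 3))
        else if c land 0xF8 = 0xF0 then
          (let b1 = cont (i + 1) in
           let b2 = cont (i + 2) in
           let b3 = cont (i + 3) in
           (((c land 0x07) lsl 18) lor (b1 lsl 12) lor (b2 lsl 6) lor b3, 4))
        else raise (Malformed_utf8 i)
      in
      acc := cp :: !acc;
      loop (i + len)
    end
  in
  loop 0;
  Array.of_list (List.rev !acc)
```

Both functions produce the same representation, so a lexer working on code
points does not need to know which encoding the input originally used.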

Since several people have shown interest in Unicode support for Caml, I
thought I would make a preliminary release to collect feedback on the design
of ulex. Here is the tarball:

-- Alain
