| Date: | -- (:) |
| From: | Jan Kybic <kybic@f...> |
| Subject: | Re: [Caml-list] ANN: cfind 0.0.0 |
> Description: cfind is a UNIX tool that provides functionality similar
> to that of Google Desktop from the command line. It is written
> entirely in OCaml.
>
> Homepage: http://cfind.sourceforge.net/
>
> I'll appreciate any input from the OCaml community.

It definitely looks very useful. Proposed extensions and changes:

- Configurable choice of lexer. For example, there could be a table
  (read from a configuration file) of regular expressions matching
  path and file names, associating them with parsers.

- If I understand your code correctly, only command names are indexed
  in TeX files. Is that correct? If so, I might prefer a different
  lexer, which ignores comments and command names and indexes the
  words in the text.

- It should also be possible to apply other configurable filters to
  the files before indexing. An example would be decompressing all
  "*.gz" or "*.bz2" files before indexing.

- More complicated logical expressions defining a match, in the spirit
  of: "functional" AND "lazy" AND NOT "Haskell"

- It would be nice to be able to break files into smaller units and to
  find the units which match, not the whole file. A typical example
  would be email in mbox format, or perhaps functions in a program.

Good luck,

Jan

--
-------------------------------------------------------------------------
Jan Kybic <kybic@fel.cvut.cz>                  tel. +420 2 2435 5721
http://cmp.felk.cvut.cz/~kybic                 ICQ 200569450
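
To make the suggestion about logical match expressions concrete, here is a minimal sketch of how such a query could be represented and evaluated in OCaml. It is not taken from cfind: the `query` type, `words_of_string`, and `matches` are hypothetical names, and the word splitting (lowercase ASCII letters only) is a simplifying assumption rather than cfind's lexer.

```ocaml
(* Hypothetical query representation; none of these names come from cfind. *)
type query =
  | Word of string
  | And of query * query
  | Or of query * query
  | Not of query

module Words = Set.Make (String)

(* Assumption: a "word" is a maximal run of ASCII letters, compared
   case-insensitively. A real lexer would be more careful. *)
let words_of_string (s : string) : Words.t =
  let buf = Buffer.create 16 in
  let acc = ref Words.empty in
  let flush () =
    if Buffer.length buf > 0 then begin
      acc := Words.add (Buffer.contents buf) !acc;
      Buffer.clear buf
    end
  in
  String.iter
    (fun c ->
      match Char.lowercase_ascii c with
      | 'a' .. 'z' as l -> Buffer.add_char buf l
      | _ -> flush ())
    s;
  flush ();
  !acc

(* Evaluate a query against the set of words found in one file or unit. *)
let rec matches (ws : Words.t) (q : query) : bool =
  match q with
  | Word w -> Words.mem (String.lowercase_ascii w) ws
  | And (a, b) -> matches ws a && matches ws b
  | Or (a, b) -> matches ws a || matches ws b
  | Not a -> not (matches ws a)

(* "functional" AND "lazy" AND NOT "Haskell" *)
let example =
  And (Word "functional", And (Word "lazy", Not (Word "Haskell")))
```

With an inverted index, `And` and `Not` would more likely be evaluated as set intersection and difference over lists of matching files; the per-document evaluation above is only meant to show the shape of the expression tree.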