Version française
Home     About     Download     Resources     Contact us    

The Caml Hump: Science :: Linguistics

Recently updated contribs

Older contribs

Native OCaml libraries
Zen [25-Sep-2006, version 2.2, Stable] Has a documentation
This toolkit offers a few fundamental structures useful for computational linguistics. It is implemented in Pidgin ML, which is a core subset of the Objective Caml programming language under the so-called revised syntax.
Author: Gérard Huet.
Native OCaml libraries
Sentence Segmenter [03-Jul-2006, Alpha]
This library can be used to segment Newspaper style (english) text articles into sentences. It uses the Ocaml lexer with a simple set of lexical rules to classify those periods ('.') in the text that end sentences.
Author: Ramu Ramamurthy.
Bindings with C libraries
Ocaml Link Grammar [29-May-2006, version 0.1, Beta]
OCaml interface to Link Grammar. This package contains the OCaml interface to Link Grammar. It enables Ocaml applications to use Link Grammar to parse English sentences.
Author: Ramu Ramamurthy.
Native OCaml libraries
OCaml WordNet [19-Apr-2006, version 0.1, Beta] Has a tutorial
This package contains the OCaml Interface to WordNet 2.1. It enables Ocaml programs to use the Wordnet dictionary for (english) word forms and meanings. This library directly parses the WordNet dictionary files, and does not depend on any other libraries. This library requires WordNet 2.1 to be installed (WordNet is available as a free download).
Author: Ramu Ramamurthy.
Scientific software
Mana [23-May-2005, version 0.0.2, Alpha]
Mana is a Japanese input engine, which converts phonetic representation to proper written forms. Mana models Japanese language as a HMM (Hidden Markovian Model), and its output is quite natural even for a long sentence. In addition to Japanese, mana can be used for Chinese once we have a necessary data.
Courses
A course on Computational Linguistics, using Caml.
Author: John Hale.