English version
Accueil     À propos     Téléchargement     Ressources     Contactez-nous    

Ce site est rarement mis à jour. Pour les informations les plus récentes, rendez-vous sur le nouveau site OCaml à l'adresse ocaml.org.

Browse thread
[Caml-list] Ocaml-Weblib?
[ Home ] [ Index: by date | by threads ]
[ Search: ]

[ Message by date: previous | next ] [ Message in thread: previous | next ] [ Thread: previous | next ]
Date: 2002-09-03 (12:10)
From: Oliver Bandel <oliver@f...>
Subject: Re: [Caml-list] Ocaml-Weblib?
On Tue, 3 Sep 2002, Gerd Stolpmann wrote:

> Am 2002.09.03 01:24 schrieb(en) Oliver Bandel:
> > On Tue, 3 Sep 2002, Gerd Stolpmann wrote:

> HTML parsing can be done with Nethtml. Simple example:
> Nethtml.parse 
>   (new Netchannels.input_string "<HTML><HEAD>...</HEAD><BODY>...</BODY></HTML>")
> Returns something like
> [ Element("html",[], [ Element("head",[], [ ... ]);
>                        Element("body",[], [ ... ]) ) ]
> just try it in the toploop.

OK, done. Is good... very good.
Yields expected results similar to the Perl-Libs for
such work; but it's soooo much better here, because it's
cleaner ( typesafety and clearer structure than the Perl-stuff :) ).

So, this is, what I have looked for. :)

Are there "opposite" tools, to out back the datastructure,
which will get here from html-parsing back into a html-file?
I think about reading a html-file, throwing out unnecessary
contents, adding interesting stuff, re-layouting the page
and create new pages, better for viewing as output.

(Often webpages are messy (called "good designed";-)) and
 I only need two or three parts of it. Creating a pagelaout,
 which matches my needs is intended here.)

Can this be done by the cgi-module?
Or is it part of the nethtml-lib?

> As far as I know there is no ready-to-use NNTP client. There are important
> components for an NNTP client, though. For example, there are parsers for
> messages in email format, and there is the working implementation for the
> POP protocol that has some similarities.

OK, if I can rely on the ocamlnet's sub-libraries, then I may write
such a client based on it.

> There is another object involved (input_stream) that has no effect if you
> read only from a string, but that allows you to read the header from 
> non-seekable files (e.g. pipelines or sockets). But this is definitely
> a feature for experts.

Well, I have done socket-programming .... in C. :)

> See the sources in netpop.ml for an example how to write a "telnet-style"
> client. Note that netpop.ml does not use sockets, it expects that the
> user of this module passes channels that are already connected sockets.

Hmhhh... how can be sockets read like channels?
Sockets must be handled unbuffered... Unix.read and such
functions... hmhhh is this done by Stream-module or similar
things? (Or could it be done in this way?)

> I hope this short introduction gives you the right impression of
> the library.

Yes. It's powerful (and wonderful:)).

With each more of impression of Ocaml's features (e.g.
module system) and such libraries, I see, that my interest
in perl and other languages fades away....


To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners