[Caml-list] Parse crazy HTML, output XML
Date: 2004-06-25 (07:17)
From: Paul Snively <psnively@m...>
Subject: Re: [Caml-list] Parse crazy HTML, output XML

On Jun 21, 2004, at 9:19 AM, Shawn Wagner wrote:

> On Mon, Jun 21, 2004 at 05:03:28PM +0100, Richard Jones wrote:
>>
>> The problem is the parsing phase. Both PXP and XmlLight will only
>> parse valid XML (as far as I can see). Is there any simple pure OCaml
>> library for parsing HTML and producing a DOM?
>>
>
> There's a html parser in the ocamlnet library.
>

I've recently found the OCamlNet HTML parser also. Does anyone know if
Alain Frisch's XPath implementation, which is modularized and
functorized, has been/can be used on the resulting tree from Nethtml?

> --
> Shawn Wagner
> shawnw@speakeasy.org
>

Many thanks and best regards,
Paul Snively