From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on yquem.inria.fr X-Spam-Level: X-Spam-Status: No, score=0.0 required=5.0 tests=AWL autolearn=disabled version=3.1.3 Received: from discorde.inria.fr (discorde.inria.fr [192.93.2.38]) by yquem.inria.fr (Postfix) with ESMTP id 1B616BC0A for ; Tue, 27 Feb 2007 12:28:05 +0100 (CET) Received: from mail15.bluewin.ch (mail15.bluewin.ch [195.186.18.63]) by discorde.inria.fr (8.13.6/8.13.6) with ESMTP id l1RBS41C025901 for ; Tue, 27 Feb 2007 12:28:04 +0100 Received: from [10.0.1.2] (85.2.126.110) by mail15.bluewin.ch (Bluewin 7.3.121) id 45C31B2B007409DD; Tue, 27 Feb 2007 11:28:04 +0000 In-Reply-To: <20070227082805.GA4631@takhisis.invalid> References: <20070227082805.GA4631@takhisis.invalid> Mime-Version: 1.0 (Apple Message framework v752.2) Content-Type: text/plain; charset=ISO-8859-1; delsp=yes; format=flowed Message-Id: <21AB6F18-E8C2-4D00-8067-F11D8DCD00FD@epfl.ch> Cc: caml-list@yquem.inria.fr Content-Transfer-Encoding: quoted-printable From: =?ISO-8859-1?Q?Daniel_B=FCnzli?= Subject: Re: [Caml-list] ANNOUNCE: Xmlm Date: Tue, 27 Feb 2007 12:29:49 +0100 To: Stefano Zacchiroli X-Mailer: Apple Mail (2.752.2) X-Miltered: at discorde with ID 45E415C4.000 by Joe's j-chkmail (http://j-chkmail . ensmp . fr)! X-Spam: no; 0.00; bunzli:01 buenzli:01 zacchiroli:01 ocaml:01 macos:01 in-memory:01 28,:98 caml-list:01 epfl:02 tree:02 tree:02 parse:02 parse:02 benchmark:02 sys:03 Le 27 f=E9vr. 07 =E0 09:28, Stefano Zacchiroli a =E9crit : > What about the performance? I don't know, I didn't invest a lot of time in profiling, maybe it =20 can be improved. But if you want an unscientific benchmark here you have : I compare the two programs xmllint (which comes on my system) and =20 xmltrip (compiled without any special option) respectively =20 distributed with libxml and xmlm. Note xmllint is a C program so we =20 are not comparing to libxml's ocaml interface. Besides I have no idea =20= how xmllint is written and what it does internally, maybe it does =20 more than xmlm does, so we may well be comparing the uncomparable. =20 The files are , uncompressed this =20= is 144 mb of xml files. On macos 10.4.8, G4 1Ghz, 512mo ram. Parse only, without building the tree. > > time ./xmltrip.opt -p -ename ~/tmp/xmls/*.xml > > real 0m53.567s > user 0m51.562s > sys 0m1.043s > > > time xmllint --noent --nocdata --noout --nonet --stream ~/tmp/=20 > xmls/*.xml > > real 0m25.264s > user 0m24.314s > sys 0m0.725s Parse only, building an in-memory tree. > > time ./xmltrip.opt -t -p -ename ~/tmp/xmls/*.xml > > real 2m2.099s > user 1m44.821s > sys 0m8.215s > > > time xmllint --noblanks --noent --nocdata --noout --nonet ~/tmp/=20 > xmls/*.xml > real 1m4.590s > user 0m47.561s > sys 0m3.193s Best, Daniel