Bug#92959: volunteering to package Larbin (webcrawler)

Basile STARYNKEVITCH Sun, 15 Sep 2002 15:33:38 -0500

* Package name    : larbin
  Version         : 2.6.2
  Upstream Author : Sebastien Ailleret =?us-ascii?q?<[EMAIL PROTECTED]>?=
* URL             : http://larbin.sourceforge.net/use-eng.html
* License         : GPL
  Description     : Larbin is an efficient web crawler =?us-ascii?q?(but?=
        not an =?us-ascii?q?indexer)?=
Bcc: "Basile STARYNKEVITCH" <[EMAIL PROTECTED]>
X-Mailer: reportbug 1.99.58
Date: Sun, 15 Sep 2002 22:21:56 +0200


Package: wnpp
Version: unavailable; reported 2002-09-15
Followup-For: Bug #92959

(Include the long description here.)

Larbin is a web crawler (also called (web) robot, spider,
scooter...). It is intended to fetch a large number of web pages to
fill the database of a search engine. With a network fast enough,
Larbin should be able to fetch more than 100 millions pages on a
standard PC.

Larbin is (just) a web crawler (coded in C & C++), NOT an indexer. You
have to write some code yourself in order to save pages or index them
in a database.

Larbin was initially developped for the XYLEME project in the VERSO
team at INRIA. The goal of Larbin was to go and fetch xml pages on the
web to fill the database of an xml-oriented search engine. Thanks to
its origins, Larbin is very generalistic (and easy to customize).

I (Basile) am not the author of Larbin, but I did use and patch it.

my email is  basile<at>starynkevitch<dot>net; 
my web page is http://starynkevitch.net/Basile/index_en.html
-- System Information:
Debian Release: testing/unstable
Architecture: i386
Kernel: Linux hector.lesours 2.4.18 #11-basile_portable Mon Jun 3 07:09:41 CEST 
2002 i686
Locale: LANG=C, LC_CTYPE=C

Bug#92959: volunteering to package Larbin (webcrawler)

Reply via email to