https://issues.apache.org/bugzilla/show_bug.cgi?id=55976

--- Comment #10 from Rainer Jung <rainer.j...@kippdata.de> ---
Created attachment 31190
  --> https://issues.apache.org/bugzilla/attachment.cgi?id=31190&action=edit
multi-threaded WebCrawler

Slightly enhanced version of the provided WebCrawler class:

- Parameters: URL ITERATIONS THREADS
  (crawl recursively starting at URL, then retrieve the resulting list of
  pages ITERATIONS times, distributed across THREADS threads)
- Log a timestamp when an error occurs
- Don't convert bytes to strings when in replay mode

Example: 

/path/to/bin/java webcrawler.WebCrawler http://myserver:8080/docs/ 1000 10
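
For readers without the attachment at hand, the replay phase described above can be sketched roughly as follows. This is not the code from attachment 31190: the class name ReplaySketch, the replay() method and the pluggable fetcher are hypothetical, and the stand-in fetcher in main() only counts calls instead of issuing HTTP requests. It assumes the crawl phase has already produced the list of pages.

```java
import java.time.Instant;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Consumer;

// Hypothetical sketch, not the attached WebCrawler class.
final class ReplaySketch {

    /**
     * Passes every page to the fetcher, ITERATIONS times over, with the
     * passes spread across a fixed pool of THREADS worker threads.
     * Returns the number of fetches that failed with a RuntimeException.
     */
    static int replay(List<String> pages, int iterations, int threads,
                      Consumer<String> fetcher) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        AtomicInteger errors = new AtomicInteger();
        for (int i = 0; i < iterations; i++) {
            // One task per pass over the page list; the pool runs at most
            // `threads` passes concurrently.
            pool.submit(() -> {
                for (String page : pages) {
                    try {
                        fetcher.accept(page);
                    } catch (RuntimeException e) {
                        // Log a timestamp with every error.
                        System.err.println(Instant.now() + " " + page + ": " + e);
                        errors.incrementAndGet();
                    }
                }
            });
        }
        pool.shutdown();
        try {
            // Generous upper bound; a real tool might wait indefinitely.
            pool.awaitTermination(1, TimeUnit.HOURS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return errors.get();
    }

    public static void main(String[] args) {
        // Placeholder page list; the real crawler would discover it.
        List<String> pages = List.of("http://myserver:8080/docs/");
        AtomicInteger calls = new AtomicInteger();
        // Stand-in fetcher: counts calls, performs no network I/O.
        int errors = replay(pages, 1000, 10, page -> calls.incrementAndGet());
        System.out.println(calls.get() + " fetches, " + errors + " errors");
    }
}
```

Submitting one task per pass keeps the work distribution simple. The attached class may well split the work differently, for example by giving each thread a fixed share of the iterations.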

Code tunables:

    public static final int BUFSZ = 1000; // Buffer size for reading responses
    public static final int CONNECT_TIMEOUT = 5000; // Connect timeout
    public static final int READ_TIMEOUT = 30000; // Read timeout
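
As a rough illustration of where such tunables typically end up, the sketch below applies them to a java.net.HttpURLConnection and reads a response in BUFSZ-sized chunks without ever building a String. It is an assumption that the attached class uses HttpURLConnection and that the timeouts are in milliseconds (the unit HttpURLConnection expects); the names TunablesSketch, open() and drain() are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.net.HttpURLConnection;
import java.net.URI;

// Hypothetical sketch, not the attached WebCrawler class.
final class TunablesSketch {
    public static final int BUFSZ = 1000;            // Buffer size for reading responses
    public static final int CONNECT_TIMEOUT = 5000;  // Connect timeout (assumed ms)
    public static final int READ_TIMEOUT = 30000;    // Read timeout (assumed ms)

    /** Creates a connection object with both timeouts set; does not connect yet. */
    static HttpURLConnection open(String url) {
        try {
            HttpURLConnection conn =
                    (HttpURLConnection) URI.create(url).toURL().openConnection();
            conn.setConnectTimeout(CONNECT_TIMEOUT);
            conn.setReadTimeout(READ_TIMEOUT);
            return conn;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Reads the stream to its end in BUFSZ chunks and returns the byte count. */
    static long drain(InputStream in) {
        byte[] buf = new byte[BUFSZ];
        long total = 0;
        int n;
        try {
            // Bytes are only counted, never decoded into strings.
            while ((n = in.read(buf)) != -1) {
                total += n;
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return total;
    }

    public static void main(String[] args) {
        HttpURLConnection conn = open("http://myserver:8080/docs/");
        System.out.println("connect timeout: " + conn.getConnectTimeout());
        System.out.println("read timeout: " + conn.getReadTimeout());
        // In-memory stand-in for a response body, so no server is needed.
        System.out.println("bytes read: " + drain(new ByteArrayInputStream(new byte[2500])));
    }
}
```

In a real fetch, drain() would be called on conn.getInputStream(), which is the point at which the connect and read timeouts take effect.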
