Yonik Seeley wrote:
If the actual schema was null, then that was probably some problem
parsing the schema.
If that's the case, hopefully you saw an exception in the logs on
startup?
Using apache-solr-1.1.0-incubating.
Actually not at first, but now I do. I went back and re-created the
(or a similar) error, and the problem turned out to be the way I was
watching my logs. At first I was just doing a tail -F on catalina.out,
but the exception was being thrown to the logfile
localhost.2007-03-01.log. Ah, tomcat, my best old buddy old pal. I've
learned to just do a "tail -F *". I've obviously grown desensitized by
other java projects throwing exceptions to logs, and by so much logging
duplication between catalina.out and the tomcat contextual logs.
I almost didn't notice the exception fly by because there's so much
log output. Yay for scrollback! (Hrm, watching the logging for 4
instances of solr all at once might explain why there was so much of
it.)
Another helpful modification would be returning 500 error codes in the
HTTP header. This would let a script detect errors without needing to
grep or DOM-process the result element. The output of my php script to
load documents was showing me the snippet below. Making the error code
configurable might also help (I can see cases where forcing a 200
response is useful).
Array
(
[errno] => 0
[errstr] =>
[response] => HTTP/1.1 200 OK
Server: Apache-Coyote/1.1
Content-Type: text/xml;charset=UTF-8
Content-Length: 1329
Date: Sat, 03 Mar 2007 02:04:12 GMT
Connection: close
<result status="1">java.lang.NullPointerException
at org.apache.solr.core.SolrCore.update(SolrCore.java:763)
at
org.apache.solr.servlet.SolrUpdateServlet.doPost(SolrUpdateServlet.java:53)
--snip--
</result>
)
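To illustrate the point: since Solr 1.1 answers 200 OK even when the update blows up, a client has to dig the status out of the response body. Here's a minimal sketch of what the client-side check looks like today, in Python for brevity (the update_succeeded helper is hypothetical, not part of any Solr client library):

```python
import re

def update_succeeded(http_status: int, body: str) -> bool:
    """Decide whether a Solr update worked.

    If the server returned a 5xx status, trust the status line.
    Otherwise (Solr 1.1 returns 200 even on failures) fall back to
    parsing the status attribute of the <result> element, where
    0 means success and non-zero means an error occurred.
    """
    if http_status >= 500:
        return False
    match = re.search(r'<result status="(\d+)"', body)
    return match is not None and match.group(1) == "0"
```

Against the failing response pasted above, update_succeeded(200, body) comes back False only because of the body parse; if the server sent a 500 instead, the first check alone would catch it and the regex could go away entirely.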
Anyway, I agree that some config errors could be handled in a more
user-friendly manner, and it would be nice if config failures could
make it to the front-page admin screen or something.
That would be groovy!
I was able to see instances where a field was not defined. Now that I'm
looking at all the log files, I'm seeing the error I should have seen
earlier.
Thanks guys!
Jed
PS Last night I was able to index about 180,000 documents in about 2.5
hours. The resulting index is a bit over 800M. Compared to my
self-crawling with Nutch, this is 1/4 the time to index and 1/30th the
disk space used by the indices. I am really impressed. I threw four
concurrent scripts making 50,000 distinct (select distinct tag from
taglist;) requests at this solr instance, and solr was serving
50 requests per second per script with a server load average of
about 3.2. That's 200 requests per second against a 4-core box. The
tomcat instance was taking 606M of RAM, resident.