RE: Solr wiki link broken

2010-01-30 Thread Chris Hostetter
: According to this... : http://wiki.apache.org/solr/HelpOnLanguages : : ...we should be deleting all those unused pages, or have INFRA change or : wiki config so that something other then FrontPage is out default (which Hmm... i attempted to crawl the TitleIndex to build a list of all these

DataImportHandler problem - reading XML from a file

2010-01-30 Thread Lance Norskog
This DataImportHandler script does not find any documents in this HTML file. The DIH definitely opens the file, but the either the xpathprocessor gets no data or it does not recognize the xpaths described. Any hints? (I'm using Solr 1.5-dev, sometime recent.) Thanks! Lance xhtml-data-config.xml

Re: Getting the Lucene Document ID for TermVectorComponent

2010-01-30 Thread Chris Hostetter
: using the TermVectorComponent for a special set of documents, I will need : the Lucene Document IDs.[1] ... : http://wiki.apache.org/solr/TermVectorComponent?highlight=(tv.docIds) internal doc ids are generally not exposed to the end user and shouldn't be trusted 99% of the time ... th

SnapShooter exception during optimize()

2010-01-30 Thread Peter S
Hi, I've been doing some testing with asynchronous index optimization (i.e. performing an optimize in a separate thread while indexing carries on - using its own CommonsHttpSolrServer instance of course :-) -- I've come across this intermittent problem with the snapshooter: 30-Jan-2

Re: sort items by whether the user has viewed it or not

2010-01-30 Thread Wangsheng Mei
You can try to use boost query(bq) instead of sort. They should have similar effect. e.g. http://solr?q=baseball&bq=isbn:ABC+isbn:DEF+isbn:GHI&qt=dismax 2010/1/30 > hi, > > i want to query for documents that have certain values but i want it first > sorted by documents that this person has view

Re: Is optimizing always necessary?

2010-01-30 Thread Marcus Herou
Sure, that seems fair if I would have thousands of segments. However I have started a total rebuild of the index which started 2 days ago. I only have about 100-150 segments right now per core. Is that much at all ? Cheers //Marcus 2010/1/29 Wangsheng Mei > In addition to destory the "holes"

Re: Solr usage with Auctions/Classifieds?

2010-01-30 Thread Jan Høydahl / Cominvent
A follow-up on the auction use case. How do you handle the need for frequent updates of only one field, such as the last bid field (needed for sort on price, facets or range)? For high traffic sites, the document update rate becomes very high if you re-send the whole document every time the bid

Getting the Lucene Document ID for TermVectorComponent

2010-01-30 Thread MitchK
Hello community, using the TermVectorComponent for a special set of documents, I will need the Lucene Document IDs.[1] How do I get them from a response? I have searched the Solr- and the Lucene-wiki but can't find anything to solve this problem. By the way: Are there any known bugs with using