Re: Tree Faceting in Solr 1.4

2010-07-24 Thread SR
Hi Geert-Jan, What did you mean by this: > Also, just a suggestion, consider using id's instead of names for filtering; Thanks, -S

Re: a bug of solr distributed search

2010-07-24 Thread MitchK
Okay, but then LiLi did something wrong, right? I mean, if the document exists on only one shard, it should get the same score whenever one requests it, no? Of course, this only applies if nothing changes between the requests. The only remaining problem here would be that you need distribut

SolrCloud in production?

2010-07-24 Thread Andrew Clegg
Is anyone using ZooKeeper-based SolrCloud in production yet? Any war stories? Any problematic missing features? Thanks, Andrew.

Re: Tree Faceting in Solr 1.4

2010-07-24 Thread Geert-Jan Brits
Perhaps completely unnecessary when you have a controlled domain, but I meant to use IDs for places instead of names, because names quickly become ambiguous, e.g. there are numerous different places around the world called Washington. 2010/7/24 SR > Hi Geert-Jan, > > What did you mean b

RE: Novice seeking help to change filters to search without diacritics

2010-07-24 Thread Steven A Rowe
Hi HSingh, Usually people set up two fields, one with diacritics and one without. Then searches are against both fields. If you think a match against the field with diacritics is more valuable, you can give that field a boost. Steve > -Original Message- > From: HSingh [mailto:hsin...
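A rough sketch of the two-field idea described above, written as client-side Python purely for illustration. The field names `name_exact` and `name_folded` and the boost value are assumptions, not anything from the thread, and in Solr the folding would more typically be done by an accent-folding token filter in the field type's analysis chain rather than in client code.

```python
import unicodedata


def fold_diacritics(text: str) -> str:
    """Return text with combining marks removed, e.g. 'café' -> 'cafe'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def build_doc(doc_id: str, name: str) -> dict:
    """Build a document carrying both variants (field names are hypothetical)."""
    return {
        "id": doc_id,
        "name_exact": name,                    # keeps diacritics
        "name_folded": fold_diacritics(name),  # diacritics stripped
    }


def build_query(term: str, exact_boost: int = 2) -> str:
    """Search both fields, boosting the field that keeps diacritics.

    Assumes `term` contains no characters that need query-syntax escaping.
    """
    folded = fold_diacritics(term)
    return f'name_exact:"{term}"^{exact_boost} OR name_folded:"{folded}"'
```

A query for "café" would then match documents containing either "café" or "cafe", with the exact-diacritics match ranked higher.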

Re: Performance issues when querying on large documents

2010-07-24 Thread dc tech
Are you storing the full 1,000 pages in the index? If so, that is probably not helping either. On 7/23/10, ahammad wrote: > > Hello, > > I have an index with lots of different types of documents. One of those > types basically contains extracts of PDF docs. Some of those PDFs can have > 1000+ pag

RE: Tree Faceting in Solr 1.4

2010-07-24 Thread Jonathan Rochkind
> Perhaps completely unnecessary when you have a controlled domain, but I > meant to use IDs for places instead of names, because names quickly > become ambiguous, e.g. there are numerous different places around the world > called Washington. This is related to something I've been thinkin

Re: Tree Faceting in Solr 1.4

2010-07-24 Thread Stefan Moises
Hi Jonathan, I too am using IDs instead of names, one reason being that the URLs are easier to read and safer, because special characters in names could break the URLs, etc. I am keeping the id-to-name lookups in Solr though; I just use some "lookup fields" where I put id and name into one

RE: Tree Faceting in Solr 1.4

2010-07-24 Thread Jonathan Rochkind
> I am keeping the id-to-name lookups in SOLR though, I just use some > "lookup fields" where I put id and name into one field, separated by > some fixed delimiter, e.g. > "134982__Some name I am going to lookup later" > The separator here would be two underscores ("__"). > So I can query for that
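A minimal sketch of the "lookup field" format quoted above, assuming the ID is an integer and that it never contains the delimiter itself. The helper names `encode_lookup` and `decode_lookup` are made up for illustration.

```python
from typing import Tuple

DELIMITER = "__"  # the two-underscore separator from the quoted message


def encode_lookup(item_id: int, name: str) -> str:
    """Pack an id and a display name into one field value."""
    return f"{item_id}{DELIMITER}{name}"


def decode_lookup(value: str) -> Tuple[int, str]:
    """Split a lookup value back into (id, name).

    partition() splits on the first delimiter only, so a name that
    happens to contain '__' is returned intact.
    """
    raw_id, sep, name = value.partition(DELIMITER)
    if not sep:
        raise ValueError(f"missing delimiter in lookup value: {value!r}")
    return int(raw_id), name
```

Splitting on the first delimiter only is a deliberate choice here: IDs are under the application's control, while names are not.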

Re: Tree Faceting in Solr 1.4

2010-07-24 Thread Geert-Jan Brits
I believe we use an in-process WeakHashMap to store the id-name relationship. It's not as if we're talking about billions of values here. For anything more memory-intensive we use NoSQL (Tokyo Tyrant through the memcached protocol at the moment). 2010/7/24 Jonathan Rochkind > > Perhaps completely unnecessary

Re: SolrCloud in production?

2010-07-24 Thread Dennis Gearon
Boy, if it does what it says it does, it's really a powerful tool. How is such a thing hosted, I wonder? Dennis Gearon --- On Sat, 7/24/1

Which is a good XPath generator?

2010-07-24 Thread Savannah Beckett
Hi, I am looking for an XPath generator that can generate an XPath by picking a specific tag inside an HTML document. Do you know a good XPath generator? If possible, a free XPath generator would be great. Thanks.

Re: Performance issues when querying on large documents

2010-07-24 Thread Erick Erickson
What are you returning? I'd be quite surprised if it was the search, so first I'd look elsewhere. In particular, are you returning all 1,000 pages? What happens if you return only a small field (the fl= parameter)? Also, look at the debug output of the query; it breaks down the various phase
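To make the suggestion above concrete, here is a small sketch that builds a select URL restricted to a few small fields with debug output enabled. The base URL, the query, and the field names are placeholders; `fl` and `debugQuery` are the standard Solr request parameters.

```python
from urllib.parse import urlencode


def build_select_url(base_url, query, fields, debug=False):
    """Build a Solr select URL that returns only the listed fields."""
    params = {"q": query, "fl": ",".join(fields)}
    if debug:
        # Ask Solr to include its debug section in the response.
        params["debugQuery"] = "on"
    return f"{base_url}/select?{urlencode(params)}"


# Placeholder host and field names, for illustration only.
url = build_select_url(
    "http://localhost:8983/solr", "text:report", ["id", "title"], debug=True
)
```

Comparing response times with and without the large stored field in `fl` should show whether retrieval, rather than the search itself, is the slow part.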

RE: Novice seeking help to change filters to search without diacritics

2010-07-24 Thread HSingh
: Usually people set up two fields, one with diacritics and one without. : Then searches are against both fields. If you think a match against the field : with diacritics is more valuable, you can give that field a boost. Hi Steve, where can one set up these two fields? Thank you for your k

Re: Which is a good XPath generator?

2010-07-24 Thread Li Li
This isn't really a Solr topic. Maybe you should read some papers about wrapper generation or automatic web data extraction. If you want to generate XPath, you could read Bing Liu's papers, such as "Structured Data Extraction from the Web based on Partial Tree Alignment". Besides dom tre

Tika, Solr running under Tomcat 6 on Debian

2010-07-24 Thread Tim AtLee
Hello, I desperately hope someone can help me here... I'm a bit out of my league. I am trying to implement content extraction using Tika and Solr as part of a search package for a product I am using. I have been successful in getting Solr to work as far as indexing text, and returning searc