Hi Geert-Jan,
What did you mean by this:
> Also, just a suggestion, consider using id's instead of names for filtering;
Thanks,
-S
Okay, but than LiLi did something wrong, right?
I mean, if the document exists only at one shard, it should get the same
score whenever one requests it, no?
Of course, this only applies if nothing gets changed between the requests.
The only remaining problem here would be, that you need distribut
Is anyone using ZooKeeper-based Solr Cloud in production yet? Any war
stories? Any problematic missing features?
Thanks,
Andrew.
--
View this message in context:
http://lucene.472066.n3.nabble.com/SolrCloud-in-production-tp991995p991995.html
Sent from the Solr - User mailing list archive at N
Perhaps completely unnessecery when you have a controlled domain, but I
meant to use ids for places instead of names, because names will quickly
become ambiguous, e.g.: there are numerous different places over the world
called washington, etc.
2010/7/24 SR
> Hi Geert-Jan,
>
> What did you mean b
Hi HSingh,
Usually people set up two fields, one with diacritics and one without. Then
searches are against both fields. If you think a match against the field with
diacritics is more valuable, you can give that field a boost.
Steve
> -Original Message-
> From: HSingh [mailto:hsin...
Are you storing the full 1,000 pages in the index? If so, that is
probably not helping either.
On 7/23/10, ahammad wrote:
>
> Hello,
>
> I have an index with lots of different types of documents. One of those
> types basically contains extracts of PDF docs. Some of those PDFs can have
> 1000+ pag
> Perhaps completely unnessecery when you have a controlled domain, but I
> meant to use ids for places instead of names, because names will quickly
> become ambiguous, e.g.: there are numerous different places over the world
> called washington, etc.
This is related to something I've been thinkin
Hi Jonathan,
I too am using IDs instead of names, one reason being that URLs are
easier to read and they are more safe, because special chars in names
could break the URLs etc.
I am keeping the id-to-name lookups in SOLR though, I just use some
"lookup fields" where I put id and name into one
> I am keeping the id-to-name lookups in SOLR though, I just use some
> "lookup fields" where I put id and name into one field, separated by
> some fixed delimiter, e.g.
> "134982__Some name I am going to lookup later"
> The separator here would be two underscores ("__").
> So I can query for that
I believe we use an in-process weakhashmap to store the id-name
relationship. It's not that we're talking billions of values here.
For anything more mem-intensive we use no-sql (tokyo tyrant through
memcached protocol at the moment)
2010/7/24 Jonathan Rochkind
> > Perhaps completely unnessecery
Boy, if it does what it says it does, it's really a powerful tool.
How is such a thing hosted, I wonder?
Dennis Gearon
Signature Warning
EARTH has a Right To Life,
otherwise we all die.
Read 'Hot, Flat, and Crowded'
Laugh at http://www.yert.com/film.php
--- On Sat, 7/24/1
Hi,
I am looking for a XPath generator that can generate xpath by picking a
specific tag inside a html. Do you know a good xpath generator? If possible,
free xpath generator would be great.
Thanks.
What are you returning? I'd be quite surprised if it was the search, so
first I'd look elsewhere. In particular, are you returning all 1,000 pages?
What happens if you specify returning a small field (the fl= parameter).
Also, look at the debug output of the query, it breaks down the various
phase
: Usually people set up two fields, one with diacritics and one without.
: Then searches are against both fields. If you think a match against the
field
: with diacritics is more valuable, you can give that field a boost.
Hi Steve, where can one setup these two fields? Thank you for your k
it's not a related topic in solr. maybe you should read some papers
about wrapper generation or automatical web data extraction. If you
want to generate xpath, you could possibly read liubing's papers such
as "Structured Data Extraction from the Web based on Partial Tree
Alignment". Besides dom tre
Hello
I desperately hope someone can help me here... I'm a bit out of my league
here.
I am trying to implement content extraction using Tika and Solr as part of a
search package for a product I am using. I have been successful in getting
Solr to work so far as indexing text, and returning searc
16 matches
Mail list logo