I have a function that works well in 3.x, but when I tried to
re-implement in 4.x it runs very very slow (~20ms vs 45s on an index w
~100K items).
Big picture, I am trying to calculate a bounding box for items that
match the query. To calculate this, I have two fields bboxNS, and
bboxEW that get
On 8/23/2010 12:07 AM, Shawn Heisey wrote:
I have a field named "keywords" in my index. The schema browser page
is not able to deal with this, so I have trouble getting statistical
information on this field. When I click on the field, Firefox hangs
for a minute and then gives the "unrespons
Hi Savannah,
Check out the patches I just threw up for SOLR-2073, SOLR-2074, SOLR-2075,
SOLR-2076 and SOLR-2077.
There's code in there to deal with Geonames.org data. There's more patches
coming so hopefully it will be clearer as I add them. Thanks to W. Quach for
leading the charge on these p
Hi,
I am using nutch to do the crawling and solr to do the searching. The index
has City and State. I want to able to get all nearby cities by entering city
name. e.g. when I type New York, I want to get the following as facet:
New York, NY (1905)
Brooklyn, NY (89)
Jersey City, NJ (55)
N
I have a field named "keywords" in my index. The schema browser page
is not able to deal with this, so I have trouble getting statistical
information on this field. When I click on the field, Firefox hangs for
a minute and then gives the "unresponsive script" warning. I assume
(without actu
On 8/20/2010 8:56 PM, Lance Norskog wrote:
The first question is about your use cases. How many words are in the
eventual 3GB spelling index? Do you really need that many?
Spell-checking is a more controllable UI if you make it from a
dictionary.
It's built from an index-only field that combin
On Sat, Aug 21, 2010 at 5:56 PM, Andrew Clegg wrote:
>
> Hi,
>
> First off, sorry about previous accidental post, had a sausage-fingered
> moment.
>
> Anyway...
>
> If I merge two indices with CoreAdmin, as detailed here...
>
> http://wiki.apache.org/solr/MergingSolrIndexes
>
> What happens to dup
Thanks. It worked out fine.
On Fri, Aug 20, 2010 at 7:08 PM, Jan Høydahl / Cominvent <
jan@cominvent.com> wrote:
> Check out the luke request handler:
>
> http://localhost:8983/solr/admin/luke?fl=my_ad_field&numTerms=100 - you'll
> find topTerms for the fields specified
>
> --
> Jan Høydahl,
I think I didn't state my problem very well, allow me rephrase my case here:
1. We have over ten million news articles to build into Solr index.
2. We copy several fields, such as title, author, body, caption of attahed
photos into a new field for default search.
3. We then wanna use shingle fi
okay, thx. but it want work =(
i checkout solr1.4.1 as dynamic web project into eclipse. startet jetty with
XDebug. In eclpise i add WebLogic exactly how the tutorial shows but eclipse
cannot connect =(
any idea what im doing wrong ?
--
View this message in context:
http://lucene.472066.n3.na
After a build of what you have checked out, there's an instance you
can run in the examples directory within the solr directory.
1.4.1 can be checked out from
http://svn.apache.org/repos/asf/lucene/solr/tags/release-1.4.1/
On Sun, Aug 22, 2010 at 6:01 PM, stockii wrote:
>
> im stupid. XD
>
> how
Hmmm, is it then really acceptable for the document display to look like
title crew cast
cvalue
mcast value
?
Presuming that each document has a title, I know of no built-in way to say
"only sort on the title if there was a title match
im stupid. XD
how get i the revsion of the solr 1.4.1 or do i nee build an war from my
snv-checkout ?
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-tp1262050p1281253.html
Sent from the Solr - User mailing list archive at Nabble.com.
I didn't look at them closely now, but look at:
https://issues.apache.org/jira/browse/SOLR-1093
https://issues.apache.org/jira/browse/SOLR-2026
Incidentally, I found them with:
http://search-lucene.com/?q=multiple+queries&fc_project=Solr&fc_type=jira
Otis
Sematext :: http://sematext.com/ ::
wich revision is the revision of 1.4.1 ?
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-tp1262050p1280956.html
Sent from the Solr - User mailing list archive at Nabble.com.
Could you fill us in a little more on the behavior you're after? Because I'm
having
trouble understanding what "sort across title and multi-valued fields"
means...
If every document has a title, and title is unique, then there's no need to
sort by
anything else. Sub-sorts only make sense if you ha
Make sure you're clear on the concepts though. "Unique ID" is kind of fuzzy.
Unique Key is a field YOU define in your schema. This will cause SOLR to
overwrite any document already in the index with a particular unique key
when
another document is added. Examples would be, say, "inventory_number" o
Hi Eric,
I think this query explains what I'm trying to do to an extent minus the
sorting:
> http://localhost:8983/solr/core/select/?q=(titleac:dr or castac:dr or
> crewac:dr)&version=2.2&start=0&rows=100&indent=on&fl=title,cast,crew
If I get a match in the title field or the cast field
I suspect (though I can't say for sure since you didn't include your
schema definition, both type and actual field def) that your
problem stems from WordDelimiterFilterFactory options. The
default in the schema usually has catenateall=0. In which case
you have the tokens "ads" and "12" but not "ads
Hello Mitch,
Agree. Basically you described the same context/needs. Your suggestion about
adding possibility to create more than one resultset for a single
solr-request is exactly what I meant in last paragraph of my initial message
(I called it "specifying extra group(s) of rows" and "we could ex
Well, you can't really do a *blah* without arranging your index
appropriately
(n-grams come to mind). Queries of this form are really, really expensive.
You can do leading and trailing wildcards (i.e. *blah or blah*), but even
leading wildcards require some changes to your schema...
The short ans
Hi,
thanks for your suggestion (so simple, why didn't i think of that).
When using AND i didn't get the desired result, probably because
"work" matches work* and work.
I changed the query to something like q=title:work OR title:work?* which works.
I'll have to try that with real documents/examp
oha, thx i will try it =)
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-tp1262050p1278704.html
Sent from the Solr - User mailing list archive at Nabble.com.
On Sun, Aug 22, 2010 at 8:22 AM, stockii wrote:
>
> thx for you reply.
>
> i dont want to test my own classes in unittest. i try to understand how solr
> works , because i write a little text about solr and lucene. so i want go
> through the code, step by step and find out on which places is solr
Hi,
keepword-filter is no solution for this problem, since this would lead to
the problematic that one has to manage a word-dictionary. As explained, this
would lead to too much effort.
You can easily add outputUnigrams=true and check out the analysis.jsp for
this field. So you can see how much
> Isn't set outputUnigrams="true" will
> make index size about twice than when it's set to false?
Sure index will be bigger. I didn't know that this is problem for you. But if
you have a list of special single words that you want to keep, keepwordfilter
can eliminate other tokens. So index size
Alex,
it sounds like it would make sense.
Use cases could be i.e. clustering or similar techniques.
However, in my opinion the point of view for such a modification is not the
right.
I.e. one wants to have got several resultsets. I could imagine that one does
a primary-query (the query for the d
Hi,
Try a wildcard term with lower score:
q=title:work AND title:work*&debugQuery=true
You will now see from the debug printout that you get an extra boost for
workload.
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Training in Europe - www.solrtraining.com
On 22.
1. did you mean solr war ? or all libs in /trunk/solr/lib/
2. what is the revision of the actual 1.4.1 ?
3. EmbededSolrServer ? what should do this little programm ?
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-tp1262050p1275295.html
Sent
Isn't set outputUnigrams="true" will make index size about twice than when
it's set to false?
Scott
- Original Message -
From: "Ahmet Arslan"
To:
Sent: Saturday, August 21, 2010 1:15 AM
Subject: Re: Doing Shingle but also keep special single word
I am building index with Shingle
Hello,
i need some help adjusting the scoring of documents.
Assume i have two documents with only one field "title", fieldtype is
text (nothing changed), index and stored are true. Doc1's title is
"work", doc2's title is "work and workload". A search for "work" will
now return the following order
1. download solr lib and import them in your project.
2. download solr source-code of the same version and attach in to the
libraries. (I haven't got eclipse open but it is something like project ->
settings -> jre/libraries?)
3. write a small program yourself which calls EmbededSolrServer and
step
thx for you reply.
i dont want to test my own classes in unittest. i try to understand how solr
works , because i write a little text about solr and lucene. so i want go
through the code, step by step and find out on which places is solr using
lucene.
when i can debug the code its easyer ;-)
--
Hello Solr users and devs!
Is there a way to limit number of rows to which highlighting applies? I
don't see any "hl.rows" or similar parameter description, so it looks like I
need to enhance HighlightComponent to enable that. If it is not possible
currently, do you think it's worth adding such po
34 matches
Mail list logo