hi,
lukeresponse api's in solr provide info about the maximum no of documents
etc..
but does any one know how to get the same programatically.???
I have used the piece of code below:
LukeResponse lukeResponse = new LukeResponse();
String indexD
Thanks it worked now. I think building clean ant is what made the difference.
I'll work on this a bit more and give you feedbacks.
--
View this message in context:
http://n3.nabble.com/Wildcard-search-in-phrase-query-using-spanquery-tp729275p741988.html
Sent from the Solr - User mailing list arc
: > For instance a request like: "tomcat servlet" should return document
: which >>> have "tomcat is a servlet container" rather than a
: document that have"tomcat offers the last specification implementaion of
: the servlet >>> technology", at least this last should not come
: before
:
http://localhost:8983/solr/admin/cores?action=CREATE&name=temp&instanceDir=
:
: where is the solr_home/home/core
:
: The core is created fine but no directory shows for the new core name "temp"
a dta directory is definitley there somewhere, but if you use relative
paths in your con
: However, maybe I missunderstood your point:
: "- Pick MAX_LEN Based On Number Of Query Clauses From Super"
: since I thought, that the number of query clauses depends on the number of
: whitespaces in my query. If I am wrong, and it depends on the result of my
: analyzer-chain, there is no prob
Having poked around little it doesn't look like there's an query param
to turn this on but it'd be really useful if highlighted fragments could
have a character offset return somehow - maybe something like
Lorem ipsum dolor sit amet, consectetur adipisicing
: I need to change a parameter from within a query string.
:
: :* AND requestid:100 AND timestamp:[2010-04-13T20:30:00.000Z TO
: 2010-04-13T21:00:00.000Z] AND
:
source:"LogCollector-risidev3was2.201002020100._opt_ISI_logs.FNM.stdout_ISIREG_10.02.01_02.00.00.txt.tar.gz-stdout_ISIREG_10.02.01_02.0
: I'm starting using SolR 1.4 queried by SolRJ 1.4
: (all official release that I've downloaded from the main link on the web
...
: The servers start OK
: and the class CommonsHttpSolrServer works fine.
: But when running the class EmbeddedSolrServer to do the same basic test,
: it fails w
: Once I have the jar file compiled and I include it in my home/lib directory
: how do I go about using it? Will this override the existing behavior or will
it depends on what Plugin you've implemented in your jar. Based on this
thread, it's not clear to me if you wrote a custom UpdateProcessor
: We don't mind having an occasional long delay between commiting data and
: being able to find that data, as long as the average is somewhere south of a
: second or so, and Lucene's NRS looks like it will provides that level of
: 'realtimeness'.
an average 500ms "lag until visible" is totally fe
: Subject: DIH won't run Script Transformer-Complains need java6 or above
:
: Hi,
:
: ScriptTransformer is working on my local box fine, but its failing on a
: integrated development environment . Any clues how to debug what's going
: on.
i'm going to ask the obvious question here: are you su
: Subject: Solr Index Lock Issue
: Date: Fri, 16 Apr 2010 22:43:57 +0530
: In-Reply-To:
: References: <1271363131217-722255.p...@n3.nabble.com>
:
http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists
When starting a new discussion on a mailing list, please do no
: Subject: XSD for Solrv1.4
: In-Reply-To: <1271362384448-78.p...@n3.nabble.com>
http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists
When starting a new discussion on a mailing list, please do not reply to
an existing message, instead start a fresh email. Even
Hey there! Wanted to let you all know about our next meetup, April
28th. We've got a killer new venue thanks to Amazon.
Check out the details at the link:
http://www.meetup.com/Seattle-Hadoop-HBase-NoSQL-Meetup/calendar/13072272/
Our Speakers this month:
1. Nick Dimiduk, Drawn to Scale: Intro to
: // a quick hack to get rid of get-file.jsp -- note this still spits out
invalid HTML
: out.write(
org.apache.solr.handler.admin.ShowFileRequestHandler.getFileContents(
"admin-extra.html" ) );
: %>
:
: Instead of resolving with the core.getName() path ...
the problem isn't that it should r
:
: Solr works fine with jetty. But with the same solr i get the error "HTTP
: Status 404 - missing core name in path". I see the first site, but when i
what do your log files say?
this is a fairly generic error message that can pop up in Solr 1.4 if
there was a problem initalizing Solr, you ha
: Subject: strange behaviour with literal.xxx and solr cell
: References: <193487.51444...@web52902.mail.re2.yahoo.com>
: <427eac7e-3e7b-44e7-9b76-cfac1f0d1...@gmail.com>
http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists
When starting a new discussion on a mai
It should work (on the trunk anyway) ... SolrCore uses PluginInfos for the
SOlrEventListeners.
If it's not working that way please open a bug.
(BTW: what gives you the impression that "It doesn't seem that listeners
are part of PluginInfo scheme though" ?)
: Before digging through src ...
Hi,
I have some issues with stemming on spell corrections and auto completes.
Given below is a sample record from my docs.
K-82
22570
Extension IPM Coordination Program
US Department of Agriculture
2009-06-11T00:00:00Z
180
Extension Integrated Management Coordination
Program
Agronomy
Entomolo
I like this discussion pretty much.
It is a really complex topic.
I want to add another example.
In english, you are saying "it is a red dress".
In german it would mean "es ist ein rotes Kleid" (words can be translated in
the same order).
However the basic form of "rotes" is "rot".
If your user
On Wed, Apr 21, 2010 at 3:29 PM, Mark Miller wrote:
>
> Stemming/lematization will pretty much always improve recall at the cost of
> precision - that's nothing new. If you stem instead, are you going to want
> documents that had run and water when you searched for running water? I just
> don't s
On 4/21/10 3:22 PM, Robert Muir wrote:
On Wed, Apr 21, 2010 at 2:26 PM, Mark Miller wrote:
Its an orthogonal issue - running will have that problem no matter what. It
doesn't affect whether a user that types running may be just as interested
in a doc that matches all of their other terms bu
Is it possible to issue some kind of query to a Solr core that will
return the last time the index was optimized? Every day, one of my
shards should get optimized, so I would like my monitoring system to
tell me when the newest optimize date is more than 24 hours ago. I
could not find a way t
On Wed, Apr 21, 2010 at 2:26 PM, Mark Miller wrote:
>
> Its an orthogonal issue - running will have that problem no matter what. It
> doesn't affect whether a user that types running may be just as interested
> in a doc that matches all of their other terms but has ran instead of
> running. Its al
IMHO, a 'stemmer' (being a specific 'thing') is exactly that. An
algorithm for stemming. A database or lexicon is not referred to as a
'stemmer'. One can perform "stemming" using a lexicon if that's their
need.
For me, its more than just stemming because some words have morphology
totally separat
On 4/21/10 2:20 PM, Robert Muir wrote:
On Wed, Apr 21, 2010 at 2:09 PM, Mark Miller wrote:
Right - I agree they both have their strengths and weakness' - but you
usually don't get things like running->ran with stemming. Like most things,
its a tradeoff. There is always a hybrid approach as
On Wed, Apr 21, 2010 at 2:09 PM, Mark Miller wrote:
>
> Right - I agree they both have their strengths and weakness' - but you
> usually don't get things like running->ran with stemming. Like most things,
> its a tradeoff. There is always a hybrid approach as well.
>
>
I think running/ran has mor
Below is the official announcement for our exciting upcoming webinar.
This one is near and dear to my heart, so I'll be eagerly listening
too, and participating with my experiences as it fits with the flow of
the webinar.
I'm a card-carrying library geek, and I've had the pleasure of worki
On 4/21/10 2:02 PM, Robert Muir wrote:
On Wed, Apr 21, 2010 at 1:49 PM, Mark Miller wrote:
I believe that's covered by morphology?
The problem is typically a morphological analyzer emits multiple solutions,
which include POS.
So morphology can tell you that "building" has two s
On Wed, Apr 21, 2010 at 1:49 PM, Mark Miller wrote:
>
> I believe that's covered by morphology?
>
>
The problem is typically a morphological analyzer emits multiple solutions,
which include POS.
So morphology can tell you that "building" has two solutions: the gerund
form which you might stem t
What is the unique id set in schema?
From: Jason Rutherglen [via Lucene]
[mailto:ml-node+740744-1209892083-124...@n3.nabble.com]
Sent: Wednesday, April 21, 2010 10:56 AM
To: caman
Subject: Re: Problem with DataImportHandler and embedded entities
The other issue now is full-import i
The other issue now is full-import is only importing 1 document, and
that's all. Despite no limits etc... Odd...
On Wed, Apr 21, 2010 at 10:48 AM, Jason Rutherglen
wrote:
> I think it's working, it was the lack of the seemingly innocuous
> sub-entity pk="application_id". After adding that I'm s
On 4/21/10 1:43 PM, Walter Underwood wrote:
On Apr 21, 2010, at 10:30 AM, Mark Miller wrote:
But they don't usually call 'non algorithmic' stemming 'stemming'. Stemming
usually means using a simple heuristic process. When you use vocabulary and
morphology, its usually called lemmatization
On 4/21/10 1:43 PM, Robert Muir wrote:
On Wed, Apr 21, 2010 at 1:30 PM, Mark Miller wrote:
But they don't usually call 'non algorithmic' stemming 'stemming'.
Stemming usually means using a simple heuristic process. When you use
vocabulary and morphology, its usually called lemmatization
I think it's working, it was the lack of the seemingly innocuous
sub-entity pk="application_id". After adding that I'm seeing some
data returned.
On Wed, Apr 21, 2010 at 10:44 AM, Jason Rutherglen
wrote:
> Something's off, for each row, it's performing the following 5
> sub-queries. Weird. Bel
Something's off, for each row, it's performing the following 5
sub-queries. Weird. Below is the updated data-config.xml (compared
to the original email I changed the field from comment to added).
--- row #1-
876
2009-11-02T06:36:28Z
--
On Wed, Apr 21, 2010 at 1:30 PM, Mark Miller wrote:
>
> But they don't usually call 'non algorithmic' stemming 'stemming'.
> Stemming usually means using a simple heuristic process. When you use
> vocabulary and morphology, its usually called lemmatization rather than
> stemming.
>
>
Lemmatizati
On Apr 21, 2010, at 10:30 AM, Mark Miller wrote:
> But they don't usually call 'non algorithmic' stemming 'stemming'. Stemming
> usually means using a simple heuristic process. When you use vocabulary and
> morphology, its usually called lemmatization rather than stemming.
>
"stemmer" is jargo
Why do these approaches have to be mutually exclusive?
Do a dictionary lookup, if no satisfactory match found use an
algorithmic stemmer. Would probably save a few CPU cycles by
algorithmic stemming iff necessary.
On Wed, Apr 21, 2010 at 1:31 PM, Robert Muir wrote:
> sy to look at the "faults" o
Hard to tell.
Did you try putting the child entity part of main query with subquery. Don't
think that is the issue though but worth a try
Select id, updated,( SELECT comment FROM ratings WHERE app = appParent.id)
as comment FROM applications appParent limit 10
From: Jason Rutherglen [
Caman,
I'm storing it. This is what I see when DataImportHandler verbose is turned on.
While the field names don't match, I am seeing that sub-queries are
being performed, data is being returned. It's just not making it into
the document.
-
-
SELECT id, updated FROM applications limit 10
0:
On Wed, Apr 21, 2010 at 1:18 PM, Chris Hostetter
wrote:
>
> Strictly speaking: you haven't "ditched" stemmers altogether -- you've
> ditched *algorithmic* stemmers and moved to a *dictionary* based stemmer
> -- but it's still a stemmer.
>
> (i just don't want people reading this thread to be confus
I pretty sure that function queries always work off of indexed values,
rather than stored values. So, for example, if you want to use a field in a
function query, it needs to be indexed. I want to add this fact to the wiki,
where it's not currently stated explicitly, but I wanted to first confirm
t
On 4/21/10 1:18 PM, Chris Hostetter wrote:
: Regarding stemmers, I ditched them altogether a long time ago in favor
: of a dictionary of morphologies of all known words (for any given
: language). A simple lookup of any word morphology thus produces the set,
: including the correct stem.
Strictl
: Regarding stemmers, I ditched them altogether a long time ago in favor
: of a dictionary of morphologies of all known words (for any given
: language). A simple lookup of any word morphology thus produces the set,
: including the correct stem.
Strictly speaking: you haven't "ditched" stemmers a
Are you storing the comment field or indexing it?
will not appear in the document.
From: Jason Rutherglen [via Lucene]
[mailto:ml-node+740624-966329660-124...@n3.nabble.com]
Sent: Wednesday, April 21, 2010 10:15 AM
To: caman
Subject: Problem with DataImportHandler and embedded entities
I'm using the following data-config.xml with DataImportHandler. I've
never used embedded entities before however I'm not seeing the comment
show up in the document... I'm not sure what's up.
I've been looking around some of the posts, wiki and such and haven't fully
found the answer. So, if someone might take a moment to explain some of the
nuances given the following.
So say you have a large number of datasets with some common fields,
Additionally, the each dataset may or may no
we try to index some pdf and other documents with solr cell/tika.
so far our crawler downloads the documents and post them to solr cell.
this works, the documents get indexed, some fields are filled etc.
our crawler is written in perl. we prepare the following params to post
to solr:
my $
Greetings!
I'm using StreamingUpdateSolrServer to index my daily Solr shards.
However, at midnight when I need to start indexing the next day's shard, is
there a way to reset the StreamingUpdateSolrServer URL to point to my new
shard, or is there a way to flush the current StreamingUpdateSolrServ
This is my last shot ! your solr webapp doesn't know where to locate solr
home, solr home must have at least this two folders:
conf/
solrconfig.xml
schema.xml
data/
(after indexing) index/
spellchecker/
try to ping
This is the package-description:
http://commons.apache.org/codec/api-release/org/apache/commons/codec/language/package-summary.html
However, there is no example given. Maybe you will find something by
searching in one of your favorite search engines.
Another solution for your problem might be N
Is also correct.
Abdelhamid ABID schrieb:
Well, check your solr config file if the folder "Data" is correctly defined,
you may set it like this: ${solr.data.dir:}
On 4/21/10, Jan Kammer wrote:
Exactly that i got in the file but it doesn't work :-(
Abdelhamid ABID schrieb:
The pa
Well, check your solr config file if the folder "Data" is correctly defined,
you may set it like this: ${solr.data.dir:}
On 4/21/10, Jan Kammer wrote:
>
> Exactly that i got in the file but it doesn't work :-(
>
> Abdelhamid ABID schrieb:
>
> The path paramater defines your context path, "/s
Hello MitchK,
I tried Refined Soundex (sorry I forgot to mention it) but It wasn't much
better. And I didn't find any refence about the caverphone filter even here
:
http://wiki.apache.org/solr/caverphone?action=fullsearch&context=180&value=caverphone&fullsearch=Texte
Thanks
--
View this messag
Exactly that i got in the file but it doesn't work :-(
Abdelhamid ABID schrieb:
The path paramater defines your context path, "/solr" for example, what you
are missing, I think, is the environment variable 'solr/home' which points
to solr home (again) and not the solr web-app. solr/home may
> you can see in the response if you append
> &debugQuery=on to your search url.
Thanks, I guess that would work, but it also returns a lot of information that
I am not interested in. In particular under
One line with this would really be enough, but I guess that is not possible? :
indextim
The path paramater defines your context path, "/solr" for example, what you
are missing, I think, is the environment variable 'solr/home' which points
to solr home (again) and not the solr web-app. solr/home may be set in your
xml-file like this
PATH_TO_SOLR_HOME must contains "conf" and "d
Hi Sophie,
the only thing which I would suggest is to create more fields but with other
filters.
What about soundex and caverphone? Maybe those ones are working a little bit
different from DoubleMetaphone and fit your needs?
Kind regards
- Mitch
--
View this message in context:
http://n3.nabbl
Hello,
Did you try not to write the full path in instanceDir ? like :
core name="cis.products" instanceDir="core_cis_products">
--
View this message in context:
http://n3.nabble.com/Core-admin-pages-broken-after-deployment-on-different-container-WAS-6-1-tp739897p739905.html
Sent from the So
hi,
thanks for your answer.
I defined the path in an xml-file in Tomcat/conf/Catalina/localhost/ I
read that this must be the right place. Could there be anywhere an other
mistake?
greetz, Jan
Abdelhamid ABID schrieb:
You are missing the "solr/home" context paramater , which points to the
f
Hello !
Firstly, english is not my native language.
I use solr to find artists and tracks. I built a new field type for my did
you mean that checks words. That field type has some standard filter for
query and analysis ans the phoneticFilterFactory (please see file, there is
the type definition
You are missing the "solr/home" context paramater , which points to the
folder (solr) holding "data" and "conf"
On 4/21/10, Jan Kammer wrote:
>
> Hi there,
>
> i got a problem with the combination solr + tomcat under mac osx server
> (snow leopard).
> Versions: tomcat 6.0 and newest solr.
>
> So
> I tried it by placing ComplexPhrase-1.0.jar in
> apache-solr-1.4.0\lib ;
> apache-solr-1.4.0\example\lib ; and
> apache-solr-1.4.0\example\solr\lib with
> the same error
You need to copy it to only apache-solr-1.4.0\lib
Maybe it is better to get a fresh copy of apache-solr-1.4.0.zip and contin
Hi there,
i got a problem with the combination solr + tomcat under mac osx server
(snow leopard).
Versions: tomcat 6.0 and newest solr.
Solr works fine with jetty. But with the same solr i get the error "HTTP
Status 404 - missing core name in path". I see the first site, but when
i click on
Ahmet Arslan wrote:
>
>
> Before calling 'ant dist' where did you copy the ComplexPhrase-1.0.jar?
>
> apache-solr-1.4.0\lib or apache-solr-1.4.0\example\lib?
>
>
>
I tried it by placing ComplexPhrase-1.0.jar in apache-solr-1.4.0\lib ;
apache-solr-1.4.0\example\lib ; and apache-solr-1.4.0\e
Thanks Ahmet, I used that approach and it is working well.
-Original Message-
From: Ahmet Arslan [mailto:iori...@yahoo.com]
Sent: Wednesday, April 21, 2010 2:36 PM
To: solr-user@lucene.apache.org
Subject: Re: sorting on text field fails
> I have a field configured as text type (default t
>
> I used command line to build ant this time.
>
Before calling 'ant dist' where did you copy the ComplexPhrase-1.0.jar?
apache-solr-1.4.0\lib or apache-solr-1.4.0\example\lib?
> Yes I ran solr using "java -jar start.jar". I did the above
> mentioned tasks
> but the results were the same.
I used command line to build ant this time.
Ahmet Arslan wrote:
>
>
> Are you running solr using "java -jar start.jar"?
> If yes you need to re-name
> apache-solr-1.4.0\dist\apache-solr-1.4.1-dev.war to solr.war and put it
> under apache-solr-1.4.0\example\webapps
>
> Also you may need to de
> I use the query {NOW -1DAY to NOW} on
> a date field, an it works just fine.
>
> I am however also interested in the actual value that the
> server substituted for NOW, how can I have that returned in
> the query response?
you can see in the response if you append
&debugQuery=on to your sear
Can somebody suggest something similar or this is not possible to autofill
'id' using configuration only?
--
View this message in context:
http://n3.nabble.com/Autofill-id-field-with-the-URL-of-files-posted-to-Solr-tp727985p739606.html
Sent from the Solr - User mailing list archive at Nabble.com
Can somebody please guide me here?
On Tue, Apr 20, 2010 at 10:53 AM, Praveen Agrawal wrote:
> I'm using Solr 1.4 distribution, with Solr cell. Can i update only new
> version of Tika in Solr 1.4 distn? If yes, any guide etc?
> Thanks.
>
>
>
> On Mon, Apr 19, 2010 at 4:36 PM, Koji Sekiguchi wrot
I use the query {NOW -1DAY to NOW} on a date field, an it works just fine.
I am however also interested in the actual value that the server substituted
for NOW, how can I have that returned in the query response?
--
jo
> I used eclipse-jee-galileo-SR2-win32 to build the ant and
> selected dist-war
> for execution in build. I got the following message.
I use command prompt to invoke ant so I am not sure about this.
> The solr performed as usual and when I tried adding
> &defType=complexphrase
> to search url th
I used eclipse-jee-galileo-SR2-win32 to build the ant and selected dist-war
for execution in build. I got the following message.
Buildfile: D:\apache-solr-1.4.0\build.xml
init-forrest-entities:
compile-solrj:
compile:
[javac] Compiling 1 source file to D:\apache-solr-1.4.0\build\solr
[jav
While appying Alphabetic range bucketing i have to map it with
mb_letterBuckets.txt can anyone please guide me wts the content of this file
(mb_letterBuckets.txt)
--
View this message in context:
http://n3.nabble.com/Alphabetic-range-bucketing-tp739467p739467.html
Sent from the Solr - User mail
> I have a field configured as text type (default text type -
> with stemming enabled on both index and query time):
>
> stored="true" />
>
> When I try to sort on this field, it is throwing the
> exception:
>
> HTTP Status 500 - there are more terms than documents in
> field "MyTitle", but it'
Hi,
I have a field configured as text type (default text type - with stemming
enabled on both index and query time):
When I try to sort on this field, it is throwing the exception:
HTTP Status 500 - there are more terms than documents in field "MyTitle", but
it's impossible to sort on tokeni
Hi,
You can use wildcards but I suppose it would only work with one word (though
maybe if you use tokenization you could use something like "field:sun* AND
field:hot*")
You could also use N-grams to achieve partial searchs. For example, if you
use 3-grams for "hotel" you'll index "hot", "ote" and
> I'm quite new to solr 1.4. I have a requirement to be able
> to search partial
> words ("sun hot" => "Sunway Hotel") and to search full
> word("sunway hotel"
> => "Sunway Hotel"). Currently, I could be able to search
> only full word.
> Anyone has any suggestions?
Looks like a PrefixQuery. sun*
Hi everyone,
I'm quite new to solr 1.4. I have a requirement to be able to search partial
words ("sun hot" => "Sunway Hotel") and to search full word("sunway hotel"
=> "Sunway Hotel"). Currently, I could be able to search only full word.
Anyone has any suggestions?
--
Chhorn Chamnap
http://cham
81 matches
Mail list logo