Stemmer bug?

2007-07-10 Thread Andrew Stromnov

Working config (with proper russian stemming):


  
  



Non-working config (no russian stemming):


  



  
  



  


-- 
View this message in context: 
http://www.nabble.com/Problem-with-Russian-stemmer-in-Solr-1.2-tf4049948.html#a11516099
Sent from the Solr - User mailing list archive at Nabble.com.



query parameter question

2007-07-10 Thread David Ginzburg
Hi,

I am trying to search a solr index and I don't understand whether the query
has to conform to a lucene query structure

maybe even generated by the lucene api  such as QueryParser. If that is the
case then what happens with the tokens in my query? Are they further
tokenised by the tokenizer classes used for the different fields defined in
the solr scehma?

Can someone ,please, direct me to explicit examples of searching where the
query is more complex than a single token in each field?



RE: Solr Server Configuration

2007-07-10 Thread nithyavembu

Hi Kijiji Xu,

   Thanks a lot for your clear guidance. I tried as you said. But i was
unclear with the environment varible values in solr.xml  So atlast i didnt
touch the configuration parts.
These are the steps i followed for the solr server working..

1. Used tomcat 5 with windows.
2. Deployed the war apache-solr-1.2.0 in to the webapp directory and rename
it as solr.war
3. Then i created one folder named solr in tomcat - > bin.
4. Copy the conf folder in example folder and placed that conf in the solr
folder in tomcat bin.
5. Then start the server and and tried the url http://localhost:8080/solr/
in browser.

Is there anything wrong i did? If so please tell me..

with Regards,
V.Nithya.


Kijiji Xu, Ping wrote:
> 
> i) In lucene we add document  by using java. But here they given in xml
> format and posting them using  POST.jar. How can i add document in java?
> In java ,there have a class named org.apache.solr.util.SimplePostTool ,you
> can look at the source code of it,and you'll know how it worked for
> posting xml data to solr
> 
> ii) How can i send the http request n get response in a standalone java
> program?
> In my opinion, you can use the java HttpURLConnection class to request and
> get response from the solr server
> 
> iii) Whether i have to add any xml or i have to modify the xml
> (server.xml,web.xml) files for solr server configuration?
> NO, but there have two type of configuration for solr .
> First of all, the solr configuration file locate in the solr/conf,there
> are solrconfig.xml and schema.xml ,you would change them for your
> application .
> Second, you have to deploy solr application itself to Tomcat, and you
> would put a configuration file like this in the tomcat ,the path is 
> /home/tomcat/conf/Catalina/localhost/:
>  crossContext="true" >
> value="/home/tomcat/solr" override="true" />
> 
>  
> That's all,if you have any question,feel free to ask me:)
> -Original Message-
> From: nithyavembu [mailto:[EMAIL PROTECTED] 
> Sent: 2007年7月6日 13:27
> To: solr-user@lucene.apache.org
> Subject: Solr Server Configuration
> 
> 
> Hi All,
> 
>   I am new to Solr. But i am familiar with lucene.
>  I am stuggling in solr server configuation.I am using Tomcat 5. I have
> worked with the "example"   standalone given with solr while i download.
>  But my doubt is 
>   i) In lucene we add document  by using java. But here they given in xml
> format and posting them using  POST.jar. How can i add document in java?
>  ii) How can i send the http request n get response in a standalone java
> program?
>  iii) Whether i have to add any xml or i have to modify the xml
> (server.xml,web.xml) files for solr server configuration?
> 
> 
> Best,
> Nithya.V.
> -- 
> View this message in context:
> http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11458997
> Sent from the Solr - User mailing list archive at Nabble.com.
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11517331
Sent from the Solr - User mailing list archive at Nabble.com.



RE: Solr Server Configuration

2007-07-10 Thread Kijiji Xu, Ping
I'm sorry I don't quite understand why you create a folder named solr in 
tomcat/bin in your step 3
The first of all ,you should understand this solr.xml file which I put in 
tomcat/conf/Catalina/localhost/,its contents follows like this:

   


When tomcat was start,tomcat will read this file,and then,tomcat know that he 
should unzip the /home/tomcat/webapps/solr.war,deploy it ,you'll see a folder 
named solr in the tomcat webapps folder when tomcat be started at last.
And  means solr's home path is /home/solr, in the folder 
'/home/solr',it contains folder 'conf' with solr's configuration 
file:schema.xml and solrconfig.xml ,so at last file structure will be like this:

/home/tomcat/conf/Catalina/localhost/solr.xml
/home/tomcat/webapps/solr.war
/home/solr/conf/schema.xml
/home/solr/conf/solrconfig.xml

I attacked a home.rar file in this mail,it is solr's files in the tomcat 
directory structure,FYI

My msn: [EMAIL PROTECTED] if there is problem,pls contact with me.

-Original Message-
From: nithyavembu [mailto:[EMAIL PROTECTED] 
Sent: 2007年7月10日 17:14
To: solr-user@lucene.apache.org
Subject: RE: Solr Server Configuration


Hi Kijiji Xu,

   Thanks a lot for your clear guidance. I tried as you said. But i was
unclear with the environment varible values in solr.xml  So atlast i didnt
touch the configuration parts.
These are the steps i followed for the solr server working..

1. Used tomcat 5 with windows.
2. Deployed the war apache-solr-1.2.0 in to the webapp directory and rename
it as solr.war
3. Then i created one folder named solr in tomcat - > bin.
4. Copy the conf folder in example folder and placed that conf in the solr
folder in tomcat bin.
5. Then start the server and and tried the url http://localhost:8080/solr/
in browser.

Is there anything wrong i did? If so please tell me..

with Regards,
V.Nithya.


Kijiji Xu, Ping wrote:
> 
> i) In lucene we add document  by using java. But here they given in xml
> format and posting them using  POST.jar. How can i add document in java?
> In java ,there have a class named org.apache.solr.util.SimplePostTool ,you
> can look at the source code of it,and you'll know how it worked for
> posting xml data to solr
> 
> ii) How can i send the http request n get response in a standalone java
> program?
> In my opinion, you can use the java HttpURLConnection class to request and
> get response from the solr server
> 
> iii) Whether i have to add any xml or i have to modify the xml
> (server.xml,web.xml) files for solr server configuration?
> NO, but there have two type of configuration for solr .
> First of all, the solr configuration file locate in the solr/conf,there
> are solrconfig.xml and schema.xml ,you would change them for your
> application .
> Second, you have to deploy solr application itself to Tomcat, and you
> would put a configuration file like this in the tomcat ,the path is 
> /home/tomcat/conf/Catalina/localhost/:
>  crossContext="true" >
> value="/home/tomcat/solr" override="true" />
> 
>  
> That's all,if you have any question,feel free to ask me:)
> -Original Message-
> From: nithyavembu [mailto:[EMAIL PROTECTED] 
> Sent: 2007年7月6日 13:27
> To: solr-user@lucene.apache.org
> Subject: Solr Server Configuration
> 
> 
> Hi All,
> 
>   I am new to Solr. But i am familiar with lucene.
>  I am stuggling in solr server configuation.I am using Tomcat 5. I have
> worked with the "example"   standalone given with solr while i download.
>  But my doubt is 
>   i) In lucene we add document  by using java. But here they given in xml
> format and posting them using  POST.jar. How can i add document in java?
>  ii) How can i send the http request n get response in a standalone java
> program?
>  iii) Whether i have to add any xml or i have to modify the xml
> (server.xml,web.xml) files for solr server configuration?
> 
> 
> Best,
> Nithya.V.
> -- 
> View this message in context:
> http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11458997
> Sent from the Solr - User mailing list archive at Nabble.com.
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11517331
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Problems running SOLR 1.2 - documents not being indexed properly

2007-07-10 Thread Daniel Alheiros
Hi Hoss

Yes, no error and that strange behaviour on the numbers shown by the admin
console. I'll try an see how to make my SOLR logging better, because so far
it's not that good.

Regards,
Daniel


On 9/7/07 19:16, "Chris Hostetter" <[EMAIL PROTECTED]> wrote:

> 
> : After I removed manually it worked correctly and I've restarted a few times
> : since the "lost lock" was there... Isn't that lock removal on start-up
> : optional?
> 
> it is, it's controlled by the...
> false
> ...option in the mainIndex or indexDefaults section.
> 
> : >> The main problem to me is that instead of having some failure
> : >> logging or
> : >> console information about it I just had those misleading information.
> 
> are you sure you didn't get an error?
> 
> I just tested this out by touching the write.lock file in solr/data/index,
> then started the server up, and on any attempt to add a document i got
> this in the Solr logs...
> 
> SEVERE: org.apache.lucene.store.LockObtainFailedException: Lock obtain
> timed out: 
> SimpleFSLock@/home/chrish/svn/solr/example/solr/data/index/write.lock
> 
> ...and i got a status 500 HTTP response code to my client.
> 
> 
> 
> -Hoss
> 


http://www.bbc.co.uk/
This e-mail (and any attachments) is confidential and may contain personal 
views which are not the views of the BBC unless specifically stated.
If you have received it in error, please delete it from your system.
Do not use, copy or disclose the information in any way nor act in reliance on 
it and notify the sender immediately.
Please note that the BBC monitors e-mails sent or received.
Further communication will signify your consent to this.



Re: Problem with Russian stemmer in Solr 1.2

2007-07-10 Thread Daniel Alheiros
Hi Andrew

Yes, I saw that. As I'm not knowledgeable in Russian I had to infer it was
adequate. But as you have much more to add to it, it could be interesting if
you could contribute that.

The problem is Russian analyzer and it's filters are all final class, don't
allowing an elegant extension. But you can create an analyzer that reuse
what is interesting for you (in this case, the stemmer) and customize the
other filters. I would propose you to do that creating the Solr factories so
you can point to your files containing your stopwords. Any chance you could
contribute with this stopwords list?

One of my reasons to not use directly the RussianAnalyzer was that I need to
use an WhitespaceTokenizer removing HTML code... So I created my factories.

Regards,
Daniel 


On 9/7/07 19:36, "Andrew Stromnov" <[EMAIL PROTECTED]> wrote:

> 
> Hi, Daniel
> 
> Stemmer in RussianAnalyser works as expected. But this analyser doesn't
> allow any Solr customization. All stopwords are hardcoded, no support for
> custom tokenizer, no synonym support.
> 
> RussianAnalyser is similar to this scheme:
>   standard tokenizer
>   standard filter factory
>   word delimeter filter factory
>   lowercase filter factory
>   stop filter factory (with hardcoded stopwords)
>   russian stem filter
>  
> 
> Regards,
> Andrew
> 
> 
> Daniel Alheiros wrote:
>> 
>> Hi Andrew
>> 
>> In fact I did it creating all the Factories for Solr, but I think you can
>> use it directly, changing your index like this:
>> 
>> > positionIncrementGap="100">
>> > class=²org.apache.lucene.analysis.ru.RussianAnalyzer²>
>> 
>> > class=²org.apache.lucene.analysis.ru.RussianAnalyzer²>
>> 
>> 
>> 
>> I¹ve not tested that, but I saw something like this.
>> 
>> Please tell me if it works as expected and if it solves your problem (I¹m
>> indexing Russian content and as you seem to be knowledgeable of Russian
>> language your comments are very useful).
>> 
>> Regards,
>> Daniel
>> 


http://www.bbc.co.uk/
This e-mail (and any attachments) is confidential and may contain personal 
views which are not the views of the BBC unless specifically stated.
If you have received it in error, please delete it from your system.
Do not use, copy or disclose the information in any way nor act in reliance on 
it and notify the sender immediately.
Please note that the BBC monitors e-mails sent or received.
Further communication will signify your consent to this.



RE: Solr Server Configuration

2007-07-10 Thread nithyavembu

Hi,

 I tried as you said and got the result without any error. So we can make
the solr home anywhere. But we have to give the path correctly in solr.xml.
Am i correct?

 Now i am one step further.. :)

Best,
V.Nithya.



Kijiji Xu, Ping wrote:
> 
> I'm sorry I don't quite understand why you create a folder named solr in
> tomcat/bin in your step 3
> The first of all ,you should understand this solr.xml file which I put in
> tomcat/conf/Catalina/localhost/,its contents follows like this:
>  crossContext="true" >
> value="/home/solr" override="true" />
> 
> 
> When tomcat was start,tomcat will read this file,and then,tomcat know that
> he should unzip the /home/tomcat/webapps/solr.war,deploy it ,you'll see a
> folder named solr in the tomcat webapps folder when tomcat be started at
> last.
> And  value="/home/solr" override="true" /> means solr's home path is
> /home/solr, in the folder '/home/solr',it contains folder 'conf' with
> solr's configuration file:schema.xml and solrconfig.xml ,so at last file
> structure will be like this:
> 
> /home/tomcat/conf/Catalina/localhost/solr.xml
> /home/tomcat/webapps/solr.war
> /home/solr/conf/schema.xml
> /home/solr/conf/solrconfig.xml
> 
> I attacked a home.rar file in this mail,it is solr's files in the tomcat
> directory structure,FYI
> 
> My msn: [EMAIL PROTECTED] if there is problem,pls contact with me.
> 
> -Original Message-
> From: nithyavembu [mailto:[EMAIL PROTECTED] 
> Sent: 2007年7月10日 17:14
> To: solr-user@lucene.apache.org
> Subject: RE: Solr Server Configuration
> 
> 
> Hi Kijiji Xu,
> 
>Thanks a lot for your clear guidance. I tried as you said. But i was
> unclear with the environment varible values in solr.xml  So atlast i didnt
> touch the configuration parts.
> These are the steps i followed for the solr server working..
> 
> 1. Used tomcat 5 with windows.
> 2. Deployed the war apache-solr-1.2.0 in to the webapp directory and
> rename
> it as solr.war
> 3. Then i created one folder named solr in tomcat - > bin.
> 4. Copy the conf folder in example folder and placed that conf in the solr
> folder in tomcat bin.
> 5. Then start the server and and tried the url http://localhost:8080/solr/
> in browser.
> 
> Is there anything wrong i did? If so please tell me..
> 
> with Regards,
> V.Nithya.
> 
> 
> Kijiji Xu, Ping wrote:
>> 
>> i) In lucene we add document  by using java. But here they given in xml
>> format and posting them using  POST.jar. How can i add document in java?
>> In java ,there have a class named org.apache.solr.util.SimplePostTool
>> ,you
>> can look at the source code of it,and you'll know how it worked for
>> posting xml data to solr
>> 
>> ii) How can i send the http request n get response in a standalone java
>> program?
>> In my opinion, you can use the java HttpURLConnection class to request
>> and
>> get response from the solr server
>> 
>> iii) Whether i have to add any xml or i have to modify the xml
>> (server.xml,web.xml) files for solr server configuration?
>> NO, but there have two type of configuration for solr .
>> First of all, the solr configuration file locate in the solr/conf,there
>> are solrconfig.xml and schema.xml ,you would change them for your
>> application .
>> Second, you have to deploy solr application itself to Tomcat, and you
>> would put a configuration file like this in the tomcat ,the path is 
>> /home/tomcat/conf/Catalina/localhost/:
>> > crossContext="true" >
>>> value="/home/tomcat/solr" override="true" />
>> 
>>  
>> That's all,if you have any question,feel free to ask me:)
>> -Original Message-
>> From: nithyavembu [mailto:[EMAIL PROTECTED] 
>> Sent: 2007年7月6日 13:27
>> To: solr-user@lucene.apache.org
>> Subject: Solr Server Configuration
>> 
>> 
>> Hi All,
>> 
>>   I am new to Solr. But i am familiar with lucene.
>>  I am stuggling in solr server configuation.I am using Tomcat 5. I have
>> worked with the "example"   standalone given with solr while i download.
>>  But my doubt is 
>>   i) In lucene we add document  by using java. But here they given in xml
>> format and posting them using  POST.jar. How can i add document in java?
>>  ii) How can i send the http request n get response in a standalone java
>> program?
>>  iii) Whether i have to add any xml or i have to modify the xml
>> (server.xml,web.xml) files for solr server configuration?
>> 
>> 
>> Best,
>> Nithya.V.
>> -- 
>> View this message in context:
>> http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11458997
>> Sent from the Solr - User mailing list archive at Nabble.com.
>> 
>> 
> 
> -- 
> View this message in context:
> http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11517331
> Sent from the Solr - User mailing list archive at Nabble.com.
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11518289
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Add 2 stemmers to a textfield?

2007-07-10 Thread Daniel Alheiros
Hi Thierry.

I'm not sure this is the best approach. What I've adopted an so far is
working really well is to have one field per language (like text_french and
text_dutch) and in your schema you declare both plus one that just receives
the copy of them.

Your index/query analysis have to be compatible or else it's not possible to
match results. Take a look at the Lucene documentation ("Lucene in Action"
is a good book and talks about that).

schema:










And in the Solr config you can create dismax request handlers to handle each
language defining boost in a language relative way.


Regards,
Daniel


On 10/7/07 07:38, "Thierry Collogne" <[EMAIL PROTECTED]> wrote:

> Hello,
> 
> Our index contains 2 languages : dutch and french. I was wondering if it is
> possible to add 2 solr.SnowballPorterFilterFactory filters to one text field
> like this :
> 
> 
>   
> 
> 
>  generateNumberParts="1" catenateWords="1" catenateNumbers="1"
> catenateAll="0"/>
> 
> 
> 
> 
> 
> 
>   
>   
> 
>  ignoreCase="true" expand="true"/>
> 
>  generateNumberParts="1" catenateWords="0" catenateNumbers="0"
> catenateAll="0"/>
> 
> 
> 
> 
> 
> 
>   
> 
> 
> 
> Also can someone explain to me, why sometimes a filter is used at index time
> and sometimes at query time. It is not entirely clear to me what the
> difference is.
> 
> Thank you,
> 
> Thierry


http://www.bbc.co.uk/
This e-mail (and any attachments) is confidential and may contain personal 
views which are not the views of the BBC unless specifically stated.
If you have received it in error, please delete it from your system.
Do not use, copy or disclose the information in any way nor act in reliance on 
it and notify the sender immediately.
Please note that the BBC monitors e-mails sent or received.
Further communication will signify your consent to this.



Re: Solr Server Configuration

2007-07-10 Thread James liu

u can find configuration datadir in solrconfig.xml(solr 1.2)

2007/7/10, nithyavembu <[EMAIL PROTECTED]>:



Hi,

I tried as you said and got the result without any error. So we can make
the solr home anywhere. But we have to give the path correctly in solr.xml
.
Am i correct?

Now i am one step further.. :)

Best,
V.Nithya.



Kijiji Xu, Ping wrote:
>
> I'm sorry I don't quite understand why you create a folder named solr in
> tomcat/bin in your step 3
> The first of all ,you should understand this solr.xml file which I put
in
> tomcat/conf/Catalina/localhost/,its contents follows like this:
>  crossContext="true" >
> value="/home/solr" override="true" />
> 
>
> When tomcat was start,tomcat will read this file,and then,tomcat know
that
> he should unzip the /home/tomcat/webapps/solr.war,deploy it ,you'll see
a
> folder named solr in the tomcat webapps folder when tomcat be started at
> last.
> And  value="/home/solr" override="true" /> means solr's home path is
> /home/solr, in the folder '/home/solr',it contains folder 'conf' with
> solr's configuration file:schema.xml and solrconfig.xml ,so at last file
> structure will be like this:
>
> /home/tomcat/conf/Catalina/localhost/solr.xml
> /home/tomcat/webapps/solr.war
> /home/solr/conf/schema.xml
> /home/solr/conf/solrconfig.xml
>
> I attacked a home.rar file in this mail,it is solr's files in the tomcat
> directory structure,FYI
>
> My msn: [EMAIL PROTECTED] if there is problem,pls contact with me.
>
> -Original Message-
> From: nithyavembu [mailto:[EMAIL PROTECTED]
> Sent: 2007年7月10日 17:14
> To: solr-user@lucene.apache.org
> Subject: RE: Solr Server Configuration
>
>
> Hi Kijiji Xu,
>
>Thanks a lot for your clear guidance. I tried as you said. But i was
> unclear with the environment varible values in solr.xml  So atlast i
didnt
> touch the configuration parts.
> These are the steps i followed for the solr server working..
>
> 1. Used tomcat 5 with windows.
> 2. Deployed the war apache-solr-1.2.0 in to the webapp directory and
> rename
> it as solr.war
> 3. Then i created one folder named solr in tomcat - > bin.
> 4. Copy the conf folder in example folder and placed that conf in the
solr
> folder in tomcat bin.
> 5. Then start the server and and tried the url
http://localhost:8080/solr/
> in browser.
>
> Is there anything wrong i did? If so please tell me..
>
> with Regards,
> V.Nithya.
>
>
> Kijiji Xu, Ping wrote:
>>
>> i) In lucene we add document  by using java. But here they given in xml
>> format and posting them using  POST.jar. How can i add document in
java?
>> In java ,there have a class named org.apache.solr.util.SimplePostTool
>> ,you
>> can look at the source code of it,and you'll know how it worked for
>> posting xml data to solr
>>
>> ii) How can i send the http request n get response in a standalone java
>> program?
>> In my opinion, you can use the java HttpURLConnection class to request
>> and
>> get response from the solr server
>>
>> iii) Whether i have to add any xml or i have to modify the xml
>> (server.xml,web.xml) files for solr server configuration?
>> NO, but there have two type of configuration for solr .
>> First of all, the solr configuration file locate in the solr/conf,there
>> are solrconfig.xml and schema.xml ,you would change them for your
>> application .
>> Second, you have to deploy solr application itself to Tomcat, and you
>> would put a configuration file like this in the tomcat ,the path is
>> /home/tomcat/conf/Catalina/localhost/:
>> > crossContext="true" >
>>> value="/home/tomcat/solr" override="true" />
>> 
>>
>> That's all,if you have any question,feel free to ask me:)
>> -Original Message-
>> From: nithyavembu [mailto:[EMAIL PROTECTED]
>> Sent: 2007年7月6日 13:27
>> To: solr-user@lucene.apache.org
>> Subject: Solr Server Configuration
>>
>>
>> Hi All,
>>
>>   I am new to Solr. But i am familiar with lucene.
>>  I am stuggling in solr server configuation.I am using Tomcat 5. I have
>> worked with the "example"   standalone given with solr while i
download.
>>  But my doubt is
>>   i) In lucene we add document  by using java. But here they given in
xml
>> format and posting them using  POST.jar. How can i add document in
java?
>>  ii) How can i send the http request n get response in a standalone
java
>> program?
>>  iii) Whether i have to add any xml or i have to modify the xml
>> (server.xml,web.xml) files for solr server configuration?
>>
>>
>> Best,
>> Nithya.V.
>> --
>> View this message in context:
>>
http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11458997
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>>
>
> --
> View this message in context:
> http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11517331
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>

--
View this message in context:
http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11518289
Sent from the Solr - User mailing list archive at Nabbl

wildcard searches standard request handler

2007-07-10 Thread Karen Loughran


Hi all,

I understand from browsing through the mailing list that I won't be able to 
perform wildcard searches using disMax request handler.  But why doesn't any 
of the following wildcard searches work when using the standard request 
handler:

Firstly the following query accurately returns 27 documents:
PublishedStart:Whenever

Attempted Wildcard searches:-->
Whene*
URL:
http://localhost:8080/solr/select/?q=Whene*&version=2.2&start=0&rows=10&indent=on

PublishedStart:When*
URL:
http://localhost:8080/solr/select/?q=PublishedStart%3AWhen*&version=2.2&start=0&rows=10&indent=on

PublishedStart:"When*
URL:
"http://localhost:8080/solr/select/?q=PublishedStart%3A%22When*%22&version=2.2&start=0&rows=10&indent=on
--

PublishedStart is declared as :


"text" is as defined in the default supplied schema.xml.
I am using tomcat 5.5.23 with solr 1.2


Many thanks

Karen


SOLR using sort but not sorting

2007-07-10 Thread Alessandro Ferrucci

Yo guys

so I have a very simplistic solr installation.  The relevant schema portion
is:


   
   
   
   
   
   

everything else is set to default.

now I do

last-name:a*
I get all docs wiht last-name starting with a.

now I add sort=last-name asc

this DOES change the ordering of the docs but it's not exactly
alphabetically.

I looked at each relevancy score and all records have 1.0, so it's not using
the relevancy to boost at all.

Any ideas?

thx guys

--
ale =]


Re: wildcard searches standard request handler

2007-07-10 Thread Yonik Seeley

On 7/10/07, Karen Loughran <[EMAIL PROTECTED]> wrote:

I understand from browsing through the mailing list that I won't be able to
perform wildcard searches using disMax request handler.  But why doesn't any
of the following wildcard searches work when using the standard request
handler:

Firstly the following query accurately returns 27 documents:
PublishedStart:Whenever

Attempted Wildcard searches:-->
Whene*


http://www.nabble.com/partial-word-searches-tf4046948.html#a11495509
Try whene*

And a potential start for a fix:
https://issues.apache.org/jira/browse/SOLR-219

-Yonik


Re: wildcard searches standard request handler

2007-07-10 Thread Karen Loughran

Hi Yonik, whene* does indeed work thanks.  Though the Context diff patch fails 
against my 1.2 download:
 
patch -p0 < lowercase_prefix.patch
patching file src/java/org/apache/solr/schema/FieldType.java
Hunk #1 FAILED at 24.
Hunk #2 FAILED at 387.
2 out of 2 hunks FAILED -- saving rejects to file 
src/java/org/apache/solr/schema/FieldType.java.rej
patching file src/java/org/apache/solr/schema/FieldType.java
Hunk #1 FAILED at 24.
Hunk #2 FAILED at 387.
2 out of 2 hunks FAILED -- saving rejects to file 
src/java/org/apache/solr/schema/FieldType.java.rej
patching file src/java/org/apache/solr/schema/TextField.java
Hunk #1 FAILED at 18.
Hunk #2 FAILED at 52.
2 out of 2 hunks FAILED -- saving rejects to file 
src/java/org/apache/solr/schema/TextField.java.rej
patching file src/java/org/apache/solr/search/SolrQueryParser.java
Hunk #1 FAILED at 23.
Hunk #2 FAILED at 89.
2 out of 2 hunks FAILED -- saving rejects to file 
src/java/org/apache/solr/search/SolrQueryParser.java.rej

Should this be applied to a different version ?

Thanks
Karen


On Tuesday 10 July 2007 16:21:10 Yonik Seeley wrote:
> On 7/10/07, Karen Loughran <[EMAIL PROTECTED]> wrote:
> > I understand from browsing through the mailing list that I won't be able
> > to perform wildcard searches using disMax request handler.  But why
> > doesn't any of the following wildcard searches work when using the
> > standard request handler:
> >
> > Firstly the following query accurately returns 27 documents:
> > PublishedStart:Whenever
> >
> > Attempted Wildcard
> > searches:--> Whene*
>
> http://www.nabble.com/partial-word-searches-tf4046948.html#a11495509
> Try whene*
>
> And a potential start for a fix:
> https://issues.apache.org/jira/browse/SOLR-219
>
> -Yonik




Facet names coming back without decimal points

2007-07-10 Thread Matthew Runo
Hello!I was wondering if any of you had any ideas on why my "size" facets are coming back without any decimal points in their names. For example, my 7.5s come back as 75. Very strange. You can see, right below the size facet is a brand facet with a name of "Vans". As far as I can tell, both the brand_exact and the size are stored in the index the same exact way, except that the sizes are multivalued. I've also attached my schema file. I'm using SOLR 1.2. You can see the "front end display" of this install at https://prelive.canada.zappos.com/n/solr.cgi  (check it out, let me know if you find anything weird... heh) ++ | Matthew Runo | Zappos Development | [EMAIL PROTECTED] | 702-943-7833                                                 ++ 

02true15100price:[1 TO 50]price:[50 TO 100]price:[150 TO 200]price:[200 TO 250]price:[250 TO 300]price:[300 TO *]0bluesizebrand_exactdismaxbrand_exact:"Vans"country_code:USprice:[50 TO 100]brand_exact:"Vans"1.1481537Vans171Navy/Della Blue LeatherCAUS
  • A contemporary classic with rock star attitude.
  • The TNT redefines a vintage design with quality materials and supporting construction.
  • Endorsed by Thrasher Skater of the Year all-terrain destroyer Tony Trujillo.
TNT59.95114373FootwearClosed FootwearGeneral Closed Footwear/n/p/p/114373/c/59967.html/images/114/114373/3832-236500-t.jpg1.1481537Vans171Navy/Della Blue LeatherCAUS
  • A contemporary classic with rock star attitude.
  • The TNT redefines a vintage design with quality materials and supporting construction.
  • Endorsed by Thrasher Skater of the Year all-terrain destroyer Tony Trujillo.
TNT59.95114373FootwearClosed FootwearGeneral Closed Footwear/n/p/p/114373/c/54979.html/images/114/114373/1733-192169-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Hit the streets in style while rippin' in these skate sneaks.
  • Leather upper with per detail for breathability.
  • Cupsole design.
  • Mesh lined.
  • Durable rubber outsole.
Fastplant54.957242176FootwearClosed FootwearGeneral Closed Footwear/n/p/p/7242176/c/3.html/images/724/7242176/6220-326959-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Hit the streets in style while rippin' in these skate sneaks.
  • Leather upper with per detail for breathability.
  • Cupsole design.
  • Mesh lined.
  • Durable rubber outsole.
Fastplant54.957242176FootwearClosed FootwearGeneral Closed Footwear/n/p/p/7242176/c/742.html/images/724/7242176/4974-262346-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Hit the streets in style while rippin' in these skate sneaks.
  • Leather upper with per detail for breathability.
  • Cupsole design.
  • Mesh lined.
  • Durable rubber outsole.
Fastplant54.957242176FootwearClosed FootwearGeneral Closed Footwear/n/p/p/7242176/c/1247.html/images/724/7242176/985-262347-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Hit the streets in style while rippin' in these skate sneaks.
  • Leather upper with per detail for breathability.
  • Cupsole design.
  • Mesh lined.
  • Durable rubber outsole.
Fastplant54.957242176FootwearClosed FootwearGeneral Closed Footwear/n/p/p/7242176/c/2125.html/images/724/7242176/4974-326961-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Hit the streets in style while rippin' in these skate sneaks.
  • Leather upper with per detail for breathability.
  • Cupsole design.
  • Mesh lined.
  • Durable rubber outsole.
Fastplant54.957242176FootwearClosed FootwearGeneral Closed Footwear/n/p/p/7242176/c/48880.html/images/724/7242176/3034-262348-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Hit the streets in style while rippin' in these skate sneaks.
  • Leather upper with per detail for breathability.
  • Cupsole design.
  • Mesh lined.
  • Durable rubber outsole.
Fastplant54.957242176FootwearClosed FootwearGeneral Closed Footwear/n/p/p/7242176/c/133.html/images/724/7242176/1733-262345-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Suede or suede/leather upper.
  • Quarter underlay detail.
  • Embroidered quarter logo detail.
  • Ghillie lacing system with underlay detail.
  • Cupsole construction.
  • Grippy rubber outsole.
Hayes54.957277297FootwearAthleticSkate/n/p/p/7277297/c/1247.html/images/727/7277297/4998-326970-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Suede or suede/leather upper.
  • Quarter underlay detail.
  • Embroidered quarter logo detail.
  • Ghillie lacing system with underlay detail.
  • Cupsole construction.
  • Grippy rubber outsole.
Hayes54.957277297FootwearAthleticSkate/n/p/p/7277297/c/126.html/images/727/7277297/4998-326971-t.jpg1.1481482Vans171Black/White/Royal BlueCAUS
  • Suede or suede/leather upper.
  • Quarter underlay detail.
  • Embroidered quarter logo detail.
  • Ghillie lacing system with underlay detail.
  • Cupsole construction.
  • Grippy rubber outsole.
Hayes54.957277297FootwearAthlet

Re: Facet names coming back without decimal points

2007-07-10 Thread Daniel Alheiros
Hi Matthew.

It¹s probably caused by the way you are processing this field. As you have
defined it as a ³text² that has a Whitespace tokenizer and a set of filters
related to it. You could create a new field type or just use a numeric type
(like sfloat) for that.
Anyway you can always see how your content is being processed by using Solr
analysis command available on the web console (/solr/admin/analysis.jsp).

Regards,
Daniel

On 10/7/07 17:23, "Matthew Runo" <[EMAIL PROTECTED]> wrote:

> Hello!
> 
> I was wondering if any of you had any ideas on why my "size" facets are coming
> back without any decimal points in their names. For example, my 7.5s come back
> as 75. 
> 
> Very strange. 
> 
> You can see, right below the size facet is a brand facet with a name of
> "Vans". As far as I can tell, both the brand_exact and the size are stored in
> the index the same exact way, except that the sizes are multivalued. 
> 
> I've also attached my schema file. I'm using SOLR 1.2. You can see the "front
> end display" of this install at https://prelive.canada.zappos.com/n/solr.cgi
>  (check it out, let me know if you find anything weird... heh)
> 
>  
> ++
>  | Matthew Runo
>  | Zappos Development
>  | [EMAIL PROTECTED]
>  | 702-943-7833                                                 
> ++
> 
>  
> 
> 
> 



http://www.bbc.co.uk/
This e-mail (and any attachments) is confidential and may contain personal 
views which are not the views of the BBC unless specifically stated.
If you have received it in error, please delete it from your system.
Do not use, copy or disclose the information in any way nor act in reliance on 
it and notify the sender immediately.
Please note that the BBC monitors e-mails sent or received.
Further communication will signify your consent to this.



Re: query parameter question

2007-07-10 Thread Chris Hostetter

: I am trying to search a solr index and I don't understand whether the query
: has to conform to a lucene query structure

Fundementally, there is no single query structure -- anyone can write a
request handler that parses any syntax they want.

Practically speaking, Solr comes with two request handlers for doing
searches out of the box: the StandardREquestHandler supports the lucene
query syntax, with some tweaks which are documented in the wiki...
   http://wiki.apache.org/solr/StandardRequestHandler
   http://wiki.apache.org/solr/SolrQuerySyntax
...and the DisMaxRequestHandler which has a much more limited syntax, only
the plus, minus, nad quote characters have special meaning.  this is also
described in the wiki (although there aren't any good examples)
   http://wiki.apache.org/solr/DisMaxRequestHandler

: maybe even generated by the lucene api  such as QueryParser. If that is the

i'm not sure i follow you there ... the lucene QueryParser doesn't
generate query strings ... it parses them, but i think what you mean is
"can i give solr any expression i can give the lucene QueryParser ... the
answer is yes, with the small caveat that depending on the FieldType you
use you might need to format the tokenshemselves slightly differnetly (ie:
if you are using Solr's DateField then your dates have to be in the
appropriate format, not the format you might be use to from Lucene's
QueryParser/DateTools support.

: Can someone ,please, direct me to explicit examples of searching where the
: query is more complex than a single token in each field?

The tutorial has about a dozen differnet example search urls that
excercise several features of the standard request handler query
parsing...

video
name:video
+video +price:[* TO 400]
id:SP2514N
power-shot
"1 gigabyte"

Specificly, to your question of "more complex than a single token in each
field" that would be any of the examples from the lucene query parser
syntax doc...

title:"The Right Way" AND text:go
title:"Do it right" AND right
title:(+return +"pink panther")




-Hoss



Re: wildcard searches standard request handler

2007-07-10 Thread Yonik Seeley

On 7/10/07, Karen Loughran <[EMAIL PROTECTED]> wrote:

Hi Yonik, whene* does indeed work thanks.  Though the Context diff patch fails
against my 1.2 download:


For now, I'd advise just lowercasing wildcard queries in the client if
you know that is how your field is indexed.

-Yonik


Re: SOLR using sort but not sorting

2007-07-10 Thread Chris Hostetter

: 

: now I add sort=last-name asc
:
: this DOES change the ordering of the docs but it's not exactly
: alphabetically.

Lucene sorting can't work on a field with more then one indexed term per
document, this was breifly covered in the "sort" param docs, but i have
beefed up the info

http://wiki.apache.org/solr/CommonQueryParameters#sort

...if you have any ideas for improving the docs, please feel free to edit
the wiki.


-Hoss



Re: Stemmer bug?

2007-07-10 Thread Chris Hostetter

: Subject: Stemmer bug?

can you elaborate on what exactly you view as a bug?

if the issue is just that one of the examples stemms something in a way
thta you think makes sense, but the other one does not that really isn't a
bug so much as it is a comment on the effectiveness of the Snowball
Stemmer for Russian vs the RussianStemmer class used by the
RussianAnalzer.  if you like the stemming that comes out of hte
RussianAnalyzer you can use the RussianStemFilter yourslf by creating a
simple FilterFactory arround it (there are lots of examples in teh Solr
code base)

Also keep in mind that the Snowball Stemmer is not designed to produce
"real" words when it stems ... it's an algorithmic stemmer designed to
produce artificial stems for common cases ... so if you think it's a bug
because it produces terms that aren't real words -- it's not, that's just
the way it works -- what matters is that it produces the same artificaial
stem for related words.



-Hoss



Re: Stemmer bug?

2007-07-10 Thread Andrew Stromnov

Hi

RussianAnalyzer produces russian stemmed forms, but
SnowballPorterFilterFactory with language="Russian" leaves _all_ russian
content unchanged.


hossman wrote:
> 
> 
> : Subject: Stemmer bug?
> 
> can you elaborate on what exactly you view as a bug?
> 
> if the issue is just that one of the examples stemms something in a way
> thta you think makes sense, but the other one does not that really isn't a
> bug so much as it is a comment on the effectiveness of the Snowball
> Stemmer for Russian vs the RussianStemmer class used by the
> RussianAnalzer.  if you like the stemming that comes out of hte
> RussianAnalyzer you can use the RussianStemFilter yourslf by creating a
> simple FilterFactory arround it (there are lots of examples in teh Solr
> code base)
> 
> Also keep in mind that the Snowball Stemmer is not designed to produce
> "real" words when it stems ... it's an algorithmic stemmer designed to
> produce artificial stems for common cases ... so if you think it's a bug
> because it produces terms that aren't real words -- it's not, that's just
> the way it works -- what matters is that it produces the same artificaial
> stem for related words.
> 
> -Hoss
> 

-- 
View this message in context: 
http://www.nabble.com/Problem-with-Russian-stemmer-in-Solr-1.2-tf4049948.html#a11530601
Sent from the Solr - User mailing list archive at Nabble.com.



RE: Solr Server Configuration

2007-07-10 Thread Kijiji Xu, Ping
Yes, that's right, just make the path in the solr.xml is right, everything will 
be ok!

-Original Message-
From: nithyavembu [mailto:[EMAIL PROTECTED] 
Sent: 2007年7月10日 18:28
To: solr-user@lucene.apache.org
Subject: RE: Solr Server Configuration


Hi,

 I tried as you said and got the result without any error. So we can make
the solr home anywhere. But we have to give the path correctly in solr.xml.
Am i correct?

 Now i am one step further.. :)

Best,
V.Nithya.



Kijiji Xu, Ping wrote:
> 
> I'm sorry I don't quite understand why you create a folder named solr in
> tomcat/bin in your step 3
> The first of all ,you should understand this solr.xml file which I put in
> tomcat/conf/Catalina/localhost/,its contents follows like this:
>  crossContext="true" >
> value="/home/solr" override="true" />
> 
> 
> When tomcat was start,tomcat will read this file,and then,tomcat know that
> he should unzip the /home/tomcat/webapps/solr.war,deploy it ,you'll see a
> folder named solr in the tomcat webapps folder when tomcat be started at
> last.
> And  value="/home/solr" override="true" /> means solr's home path is
> /home/solr, in the folder '/home/solr',it contains folder 'conf' with
> solr's configuration file:schema.xml and solrconfig.xml ,so at last file
> structure will be like this:
> 
> /home/tomcat/conf/Catalina/localhost/solr.xml
> /home/tomcat/webapps/solr.war
> /home/solr/conf/schema.xml
> /home/solr/conf/solrconfig.xml
> 
> I attacked a home.rar file in this mail,it is solr's files in the tomcat
> directory structure,FYI
> 
> My msn: [EMAIL PROTECTED] if there is problem,pls contact with me.
> 
> -Original Message-
> From: nithyavembu [mailto:[EMAIL PROTECTED] 
> Sent: 2007年7月10日 17:14
> To: solr-user@lucene.apache.org
> Subject: RE: Solr Server Configuration
> 
> 
> Hi Kijiji Xu,
> 
>Thanks a lot for your clear guidance. I tried as you said. But i was
> unclear with the environment varible values in solr.xml  So atlast i didnt
> touch the configuration parts.
> These are the steps i followed for the solr server working..
> 
> 1. Used tomcat 5 with windows.
> 2. Deployed the war apache-solr-1.2.0 in to the webapp directory and
> rename
> it as solr.war
> 3. Then i created one folder named solr in tomcat - > bin.
> 4. Copy the conf folder in example folder and placed that conf in the solr
> folder in tomcat bin.
> 5. Then start the server and and tried the url http://localhost:8080/solr/
> in browser.
> 
> Is there anything wrong i did? If so please tell me..
> 
> with Regards,
> V.Nithya.
> 
> 
> Kijiji Xu, Ping wrote:
>> 
>> i) In lucene we add document  by using java. But here they given in xml
>> format and posting them using  POST.jar. How can i add document in java?
>> In java ,there have a class named org.apache.solr.util.SimplePostTool
>> ,you
>> can look at the source code of it,and you'll know how it worked for
>> posting xml data to solr
>> 
>> ii) How can i send the http request n get response in a standalone java
>> program?
>> In my opinion, you can use the java HttpURLConnection class to request
>> and
>> get response from the solr server
>> 
>> iii) Whether i have to add any xml or i have to modify the xml
>> (server.xml,web.xml) files for solr server configuration?
>> NO, but there have two type of configuration for solr .
>> First of all, the solr configuration file locate in the solr/conf,there
>> are solrconfig.xml and schema.xml ,you would change them for your
>> application .
>> Second, you have to deploy solr application itself to Tomcat, and you
>> would put a configuration file like this in the tomcat ,the path is 
>> /home/tomcat/conf/Catalina/localhost/:
>> > crossContext="true" >
>>> value="/home/tomcat/solr" override="true" />
>> 
>>  
>> That's all,if you have any question,feel free to ask me:)
>> -Original Message-
>> From: nithyavembu [mailto:[EMAIL PROTECTED] 
>> Sent: 2007年7月6日 13:27
>> To: solr-user@lucene.apache.org
>> Subject: Solr Server Configuration
>> 
>> 
>> Hi All,
>> 
>>   I am new to Solr. But i am familiar with lucene.
>>  I am stuggling in solr server configuation.I am using Tomcat 5. I have
>> worked with the "example"   standalone given with solr while i download.
>>  But my doubt is 
>>   i) In lucene we add document  by using java. But here they given in xml
>> format and posting them using  POST.jar. How can i add document in java?
>>  ii) How can i send the http request n get response in a standalone java
>> program?
>>  iii) Whether i have to add any xml or i have to modify the xml
>> (server.xml,web.xml) files for solr server configuration?
>> 
>> 
>> Best,
>> Nithya.V.
>> -- 
>> View this message in context:
>> http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11458997
>> Sent from the Solr - User mailing list archive at Nabble.com.
>> 
>> 
> 
> -- 
> View this message in context:
> http://www.nabble.com/Solr-Server-Configuration-tf4033748.html#a11517331
> Sent from the Solr - User mailing list archive 

RE: wildcard searches standard request handler

2007-07-10 Thread Xuesong Luo
That's also what I did in my code, I search for * or ?, if exists,
lowercase the query string.

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Yonik
Seeley
Sent: Tuesday, July 10, 2007 12:13 PM
To: solr-user@lucene.apache.org
Subject: Re: wildcard searches standard request handler

On 7/10/07, Karen Loughran <[EMAIL PROTECTED]> wrote:
> Hi Yonik, whene* does indeed work thanks.  Though the Context diff
patch fails
> against my 1.2 download:

For now, I'd advise just lowercasing wildcard queries in the client if
you know that is how your field is indexed.

-Yonik


A few questions regarding multi-word synonyms and parameters encoding

2007-07-10 Thread climbingrose

Hi all,

I've been using Solr for the last few projects and the experience has been
great. I'll post the link to the website once it finishes. Just have a few
questions regarding synonyms and parameters encoding:

1) Is multi-word synonyms possible now in Solr? For example, can I have
things like synonyms like:
"I.T. & T", "IT & T", "Information Technologies", "Computer science"
I read the message on mailing list sometime ago (think back in mid 2006)
saying that there is no clean way to implement this. Is it possible now? In
my case, I have two field category and location in which category is of
default string type and location is of default text type:
+Category field is used only for faceting by category therefore, no anylasis
needs to be done. Can I use the synonyms config above to do facet query on
category field and the Solr will combine items having one of these category
into one facet category? For example:

I.T. & T (10)
IT & T (20)
Information Technologies (30)
Computer science (40)

Can I have something like:

I.T. & T (100)

Or do I have to manually filter query on for each category:"I.T. & T" and
count the results?

+Location field is used for searching by city, state and post code. Since I
collect the data from different sources, there might be mix & match
information. For example, on one record I might have "Inner Sydney, NSW"
while the other record I might have "Inner Sydney, New South Wales". In
Australia, NSW & New South Wales are interchangeable used so when the users
search for "NSW", I want "New South Wales" record to be returned and vice
versa. How could I achieve this? The "location" field is of the default text
type.

2) I'm having trouble with using facet values in my url. For example, I have
"title" facet field in my query and it returns something like:

Software engineer
C++ Programmer
C Programmer & PHP developer

Now I want create a link for each of these value so that the user can filter
the results by that title by clicking on the link. For example, if I click
on "Software Engineer", the results are now narrowed down to just include
records with "Software Engineer" in their title. Since "title" field can
contain special chars like '+', '&' ..., I really can't find a clean way to
do this. At the moment, I replace all the space by '+' and it seems to work
for words like "Software engineer" (converted to "Software+Engineer").
However, "C++ Programmer" is converted to "C+++Programmer", and it doesn't
seem to work (return no results). Any ideas?

Looking back, this is such a long email. If you reach this point, thanks a
lot for your time!!!

--
Regards,

Cuong Hoang


How to run the Embedded Solr Sample

2007-07-10 Thread nithyavembu

Hi All,

  I am using Netbeans IDE for compiling and running java.
  I want to run the EmbeddedSolr sample given in apache.
  I have set the solr home also. 
  But how can i run that program? I am getting the following errors while
running. 

 Jul 11, 2007 11:14:48 AM org.apache.solr.core.Config setInstanceDir
INFO: Solr home set to 'D:/jakarta Tomcat/solr/'
Jul 11, 2007 11:14:48 AM org.apache.solr.core.SolrConfig initConfig
INFO: Loaded SolrConfig: solrconfig.xml
Exception in thread "main" java.lang.NoSuchMethodError:
org.apache.lucene.index.IndexWriter.setDefaultWriteLockTimeout(J)V
at org.apache.solr.core.SolrCore.(SolrCore.java:88)
at EmbeddedSolr.main(EmbeddedSolr.java:40)
Java Result: 1


 How can i run this program? 
 In apache site they said its like sample "example" program. If so where i
have to place this file in tomcat?

 If any help, appreciated.

Thanks,
V.Nithya.

-- 
View this message in context: 
http://www.nabble.com/How-to-run-the-Embedded-Solr-Sample-tf4060072.html#a11534704
Sent from the Solr - User mailing list archive at Nabble.com.