Re: Facets and sort

2007-09-24 Thread Guillaume Smet
Hi Chris,

On 9/24/07, Chris Hostetter <[EMAIL PROTECTED]> wrote:
> 1) 
> 2) 

In fact both were typos in my email: the code is right. It was a bit
late here. After a few hours of sleep, I think the problem is that I
sort on a tokenized value. Without a facet filter, I have enough
results to have only results containing one word beginning by A on the
first page but as soon as I apply a filter I don't have enough results
to do so and then I begin to see results sorted on other tokens.
I'll try to create a new strField used only for sorting to see if it
solves my problem and let you know. I didn't think that was it as it
seemed to work without facet filtering but I'm more and more convinced
it's the problem.

Thanks for your answer.

--
Guillaume


How to get all the search results - python

2007-09-24 Thread Roopesh P Raj

Hi,

I am using solr setup in Tomcat 5.5 with python 2.4 using python client solr.py. 

When I search, all the results are not returned. 


The method call for searching is as follows : rows specifies the number of rows.
data = c.search(q='query', fl='id score unique_id Message-ID To From 
Subject',rows=50, wt='python')

I want to specify that I want all the rows. How can I do that ?

Regards
Roopesh




--
DigitalGlue, India





Re: How to get all the search results - python

2007-09-24 Thread Thorsten Scherler
On Mon, 2007-09-24 at 14:34 +0530, Roopesh P Raj wrote:
> Hi,
> 
> I am using solr setup in Tomcat 5.5 with python 2.4 using python client 
> solr.py. 
> 
> When I search, all the results are not returned. 
> 
> The method call for searching is as follows : rows specifies the number of 
> rows.
> data = c.search(q='query', fl='id score unique_id Message-ID To From 
> Subject',rows=50, wt='python')
> 
> I want to specify that I want all the rows. How can I do that ?

Hi Roopesh,

I am not sure whether I understand your problem. 

Is it the limitation of rows/pagination? 
If so why not using a real high number (like rows=100)?

salu2
-- 
Thorsten Scherler thorsten.at.apache.org
Open Source Java  consulting, training and solutions



Re: How to get all the search results - python

2007-09-24 Thread Roopesh P Raj

Hi Roopesh,


I am not sure whether I understand your problem. 


Is it the limitation of rows/pagination? 
If so why not using a real high number (like rows=100)?



salu2


Hi,

Assigning a high number will solve my problem. (I thought that there will 
something like rows='all' to do it).

Can I do pagination using the python client? How can I specify the starting position, offset etc for 
pagination through the python client? 


Regards
Roopesh


--
DigitalGlue, India





Re: How to get all the search results - python

2007-09-24 Thread Thorsten Scherler
On Mon, 2007-09-24 at 16:29 +0530, Roopesh P Raj wrote:
> > Hi Roopesh,
> 
> > I am not sure whether I understand your problem. 
> 
> > Is it the limitation of rows/pagination? 
> > If so why not using a real high number (like rows=100)?
> 
> > salu2
> 
> Hi,
> 
> Assigning a high number will solve my problem. (I thought that there will 
> something like rows='all' to do it).
> 
> Can I do pagination using the python client? 

I am not a python expert but I think so.

> How can I specify the starting position, offset etc for 
> pagination through the python client? 

http://wiki.apache.org/solr/CommonQueryParameters

It should work as described in the above document (with the start
parameter.

e.g. 
data = c.search(q='query', fl='id score unique_id Message-ID To From
Subject',rows=50, wt='python',start=50)

HTH
-- 
Thorsten Scherler thorsten.at.apache.org
Open Source Java  consulting, training and solutions



Re: How to get all the search results - python

2007-09-24 Thread Jérôme Etévé
By design, it's not very efficient to ask for a large number of
results with solr/lucene. I think you will face performance and memory
problems if you do that.


On 9/24/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote:
> On Mon, 2007-09-24 at 16:29 +0530, Roopesh P Raj wrote:
> > > Hi Roopesh,
> >
> > > I am not sure whether I understand your problem.
> >
> > > Is it the limitation of rows/pagination?
> > > If so why not using a real high number (like rows=100)?
> >
> > > salu2
> >
> > Hi,
> >
> > Assigning a high number will solve my problem. (I thought that there will 
> > something like rows='all' to do it).
> >
> > Can I do pagination using the python client?
>
> I am not a python expert but I think so.
>
> > How can I specify the starting position, offset etc for
> > pagination through the python client?
>
> http://wiki.apache.org/solr/CommonQueryParameters
>
> It should work as described in the above document (with the start
> parameter.
>
> e.g.
> data = c.search(q='query', fl='id score unique_id Message-ID To From
> Subject',rows=50, wt='python',start=50)
>
> HTH
> --
> Thorsten Scherler thorsten.at.apache.org
> Open Source Java  consulting, training and solutions
>
>


-- 
Jerome Eteve.
[EMAIL PROTECTED]
http://jerome.eteve.free.fr/


Re: real time search

2007-09-24 Thread Grant Ingersoll

Hi James,

Can you provide more information about what you are trying to do?  By  
real time search, do you mean you want indexed documents to be  
available immediately?  Or is a minute or two acceptable?  Do all  
users need to see them immediately, or just the current user?


We can better help you if you give us more details on what you are  
trying to accomplish.


Thanks,
Grant

On Sep 23, 2007, at 10:38 AM, James liu wrote:


i wanna do it.

Maybe someone did it, if so, give me some tips.

thks

--
regards
jl


--
Grant Ingersoll
http://lucene.grantingersoll.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ




Solrsharp culture problems

2007-09-24 Thread JP Genty - LibertySurf

I use solrsharp on a french XP and I have problems with the float
conversion to text.

I modified ExempleIndexDocument constructor to force the "en-US"
culture.

CultureInfo MyCulture = new CultureInfo("en-US");
.
.
this.Add(new IndexFieldValue("weight", weight.ToString(MyCulture)));
this.Add(new IndexFieldValue("price", price.ToString(MyCulture)));

   And I modified IndexFieldAttribute SetValue method

   CultureInfo MyCulture = new CultureInfo("en-US");

   this.PropertyInfo.SetValue(searchRecord,
Convert.ChangeType(xnodevalue.InnerText,
this.PropertyInfo.PropertyType, MyCulture), null);
   

valueArray.SetValue(Convert.ChangeType(xnlvalues[i].InnerText, basetype,
MyCulture), i);


  Now the example runs smoothly on a windows XP french.


 Is it the right solution ??

   Thanks
 Jean-Paul






Re: real time search

2007-09-24 Thread Matthew Runo

I assume you mean something like this:

http://addictedtonew.com/archives/145/wordpress-live-search-plugin/

Take a look at how the search box works - is that what you mean?


++
 | Matthew Runo
 | Zappos Development
 | [EMAIL PROTECTED]
 | 702-943-7833
++


On Sep 24, 2007, at 8:13 AM, Grant Ingersoll wrote:


Hi James,

Can you provide more information about what you are trying to do?   
By real time search, do you mean you want indexed documents to be  
available immediately?  Or is a minute or two acceptable?  Do all  
users need to see them immediately, or just the current user?


We can better help you if you give us more details on what you are  
trying to accomplish.


Thanks,
Grant

On Sep 23, 2007, at 10:38 AM, James liu wrote:


i wanna do it.

Maybe someone did it, if so, give me some tips.

thks

--
regards
jl


--
Grant Ingersoll
http://lucene.grantingersoll.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ






Re: Facets and sort

2007-09-24 Thread Chris Hostetter
: late here. After a few hours of sleep, I think the problem is that I
: sort on a tokenized value. Without a facet filter, I have enough

sorting on a tokenized field is definitely a no-no ... the only situation 
in which that will work is if your tokenizer produces only one token per 
document (ie: KeywrodTokenizer)



-Hoss



Re: Solrsharp culture problems

2007-09-24 Thread Jeff Rodenburg
Yes, that would be the right solution.  I'm not sure if, in order to use
french culture settings on xp, you would require corresponding changes in
culture settings for the solr instance.

Hope this helps.

-- j

On 9/24/07, JP Genty - LibertySurf <[EMAIL PROTECTED]> wrote:
>
>
> I use solrsharp on a french XP and I have problems with the float
> conversion to text.
>
> I modified ExempleIndexDocument constructor to force the "en-US"
> culture.
>
> CultureInfo MyCulture = new CultureInfo("en-US");
> .
> .
> this.Add(new IndexFieldValue("weight", weight.ToString(MyCulture)));
> this.Add(new IndexFieldValue("price", price.ToString(MyCulture)));
>
>And I modified IndexFieldAttribute SetValue method
>
>CultureInfo MyCulture = new CultureInfo("en-US");
>
>this.PropertyInfo.SetValue(searchRecord,
> Convert.ChangeType(xnodevalue.InnerText,
> this.PropertyInfo.PropertyType, MyCulture), null);
>
>
> valueArray.SetValue(Convert.ChangeType(xnlvalues[i].InnerText, basetype,
> MyCulture), i);
>
>
>   Now the example runs smoothly on a windows XP french.
>
>
>  Is it the right solution ??
>
>Thanks
> Jean-Paul
>
>
>
>
>


Re: Facets and sort

2007-09-24 Thread Guillaume Smet
On 9/24/07, Chris Hostetter <[EMAIL PROTECTED]> wrote:
> sorting on a tokenized field is definitely a no-no ... the only situation
> in which that will work is if your tokenizer produces only one token per
> document (ie: KeywrodTokenizer)

Yes, it works now. I have now based my sort field on the alphaOnlySort
type present in the example schema.xml.

Thanks for your help.

--
Guillaume


Index files not being deleted

2007-09-24 Thread AgentHubcap

I've got an issue with my index files not being deleted.  I have a cron job
set up to kick off the index process every 5 minutes (test box, with little
data).  The process is as follows:

- Delete all index files via a delete command
- Loop through all the records in the database and index them
- Commits are triggered every 250 records
- An optimize is done at the end (twice because I read there was an issue
with the optimize command)

What is happening is that the number of open files keeps growing every time
the cron job runs, and the disk space is eventually used up by deleted files
(something isn't closing the handle on the file, and thus the file is never
really deleted)

My question is what exactly do I need to do to close these files?  This is
for an index only -- no searching is taking place.  I'm also using the
embedded version if that makes a difference...
-- 
View this message in context: 
http://www.nabble.com/Index-files-not-being-deleted-tf4512068.html#a12869275
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Index files not being deleted

2007-09-24 Thread Ryan McKinley


- Delete all index files via a delete command


make sure to optimize after deleting the docs -- optimize has lucene get 
rid of deleted files rather then appending them to the end of the index.


what version of solr are you running?  if you are running 1.3-dev 
deleting *:* is fast -- if you aren't using 1.3, i don't suggest moving 
there just for that though


ryan


Re: Index files not being deleted

2007-09-24 Thread AgentHubcap

I'm running 1.2.

Acutally, i am doing an optimize after I delete the indexes.  (twice, as I
read there was an issue with the optimize).  Do I need to close something
manually?

Here's my optimize code:

private void optimize() throws IOException
{
UpdateHandler updateHandler = 
SolrCore.getSolrCore().getUpdateHandler();
CommitUpdateCommand commitcmd = new CommitUpdateCommand(false);
commitcmd.optimize = true;
updateHandler.commit(commitcmd);
updateHandler.close();
}




ryantxu wrote:
> 
>> 
>> - Delete all index files via a delete command
> 
> make sure to optimize after deleting the docs -- optimize has lucene get 
> rid of deleted files rather then appending them to the end of the index.
> 
> what version of solr are you running?  if you are running 1.3-dev 
> deleting *:* is fast -- if you aren't using 1.3, i don't suggest moving 
> there just for that though
> 
> ryan
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Index-files-not-being-deleted-tf4512068.html#a12870871
Sent from the Solr - User mailing list archive at Nabble.com.



Re: How to get all the search results - python

2007-09-24 Thread Roopesh P Raj

Thanks a lot for your replies. I will follow the paginated search.

Thanks and Regards
Roopesh

--
DigitalGlue, India





Re: real time search

2007-09-24 Thread James liu
it seems use somthing like ajax...

if so, it not what i wanna


2007/9/25, Matthew Runo <[EMAIL PROTECTED]>:
>
> I assume you mean something like this:
>
> http://addictedtonew.com/archives/145/wordpress-live-search-plugin/
>
> Take a look at how the search box works - is that what you mean?
>
>
> ++
> | Matthew Runo
> | Zappos Development
> | [EMAIL PROTECTED]
> | 702-943-7833
> ++
>
>
> On Sep 24, 2007, at 8:13 AM, Grant Ingersoll wrote:
>
> > Hi James,
> >
> > Can you provide more information about what you are trying to do?
> > By real time search, do you mean you want indexed documents to be
> > available immediately?  Or is a minute or two acceptable?  Do all
> > users need to see them immediately, or just the current user?
> >
> > We can better help you if you give us more details on what you are
> > trying to accomplish.
> >
> > Thanks,
> > Grant
> >
> > On Sep 23, 2007, at 10:38 AM, James liu wrote:
> >
> >> i wanna do it.
> >>
> >> Maybe someone did it, if so, give me some tips.
> >>
> >> thks
> >>
> >> --
> >> regards
> >> jl
> >
> > --
> > Grant Ingersoll
> > http://lucene.grantingersoll.com
> >
> > Lucene Helpful Hints:
> > http://wiki.apache.org/lucene-java/BasicsOfPerformance
> > http://wiki.apache.org/lucene-java/LuceneFAQ
> >
> >
>
>


-- 
regards
jl


Re: real time search

2007-09-24 Thread James liu
if new data come in and drive index it, load new search it.

if more docs, optimize time will cost much, so can't do search like real
time.

so i think new solr instance only for newest information. the docs will
be ~10K.
if it arrive 10k, it should be closed and rebuild new instance.
(if we have more memory and cpu,,,just open more instance ,not one by one.)

final, we merge result and show it to user.


that all i think, not test it.


2007/9/24, Grant Ingersoll <[EMAIL PROTECTED]>:
>
> Hi James,
>
> Can you provide more information about what you are trying to do?  By
> real time search, do you mean you want indexed documents to be
> available immediately?  Or is a minute or two acceptable?  Do all
> users need to see them immediately, or just the current user?
>
> We can better help you if you give us more details on what you are
> trying to accomplish.
>
> Thanks,
> Grant
>
> On Sep 23, 2007, at 10:38 AM, James liu wrote:
>
> > i wanna do it.
> >
> > Maybe someone did it, if so, give me some tips.
> >
> > thks
> >
> > --
> > regards
> > jl
>
> --
> Grant Ingersoll
> http://lucene.grantingersoll.com
>
> Lucene Helpful Hints:
> http://wiki.apache.org/lucene-java/BasicsOfPerformance
> http://wiki.apache.org/lucene-java/LuceneFAQ
>
>
>


-- 
regards
jl


can solr do it?

2007-09-24 Thread James liu
if use multi solr with one index, it will cache individually.

so i think can it share their cache.(they have same config)

-- 
regards
jl