Re: Indexing the same data in many records

2009-01-15 Thread philmccarthy

Hi,

Adding the same document many times is actually the scenario I wanted to
test -- indexing hits from Apache webserver logs along with the source of the
referring page.

My expectation would be that the majority of hits on a given day would
originate from a small number of referrers, so each of these referring pages
would be indexed multiple times. I really wanted to check that this would
scale better than indexing the same number of different documents--your
explanation regarding term distribution explains why this is the case.

Many thanks,
Phil


Otis Gospodnetic wrote:
> 
> Phil,
> 
> Note that adding the same document multiple times and looking at the index
> size is not a very good approach.  You are adding a fixed number of
> distinct terms over and over.  In real-life scenario you will have a much
> greater term distribution, and that will affect index size.
> 
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> 
> 
> 
> - Original Message 
>> From: philmccarthy 
>> To: solr-user@lucene.apache.org
>> Sent: Wednesday, January 14, 2009 7:36:38 PM
>> Subject: Re: Indexing the same data in many records
>> 
>> 
>> Thanks Otis. I tweaked the Solr example app a little and then uploaded a
>> ~55KB document to it a couple of thousand times (changing the ID each
>> time).
>> The solr/data directory was 72MB on disc after adding the document 2000
>> times, so it seems that the index is growing by approximately 36KB for
>> each
>> document. That seems reasonable.
>> 
>> I guess I need to do some research into expected data volumes now, and
>> limits on Lucene index size.
>> 
>> Cheers,
>> Phil
>> 
>> 
>> Otis Gospodnetic wrote:
>> > 
>> > Phil,
>> > 
>> > From what you described so far, I don't see any red flags.  I would pay
>> > attention to reading those timestamps (covered on the Wiki and ML
>> > archives), that's all.
>> > 
>> > 
>> > Otis
>> > --
>> > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>> > 
>> > 
>> > 
>> > - Original Message 
>> >> From: philmccarthy 
>> >> To: solr-user@lucene.apache.org
>> >> Sent: Tuesday, January 13, 2009 8:49:33 PM
>> >> Subject: Indexing the same data in many records
>> >> 
>> >> 
>> >> Hi,
>> >> 
>> >> I'd like to use Solr to index some webserver logs, in order to allow
>> easy
>> >> ad-hoc querying and analysis. Each Solr Document will represent a
>> single
>> >> request to the webserver, with fields for time, request URL, referring
>> >> URL
>> >> etc.
>> >> 
>> >> I'm also planning to fetch the page source of each referring URL, and
>> add
>> >> that as an indexed field in the Solr document. The aim is to allow
>> >> queries
>> >> like "find hits to /xyz.html where the referring page contains the
>> word
>> >> 'foobar'".
>> >> 
>> >> Since hundreds or even thousands of hits may all come from the same
>> >> referring page, would this approach be horribly inefficient? (Note the
>> >> page
>> >> source won't be stored in each Document, just indexed). Am I going to
>> >> dramatically increase the index size if I do this?
>> >> 
>> >> If so, is there a more elegant way to do what I want?
>> >> 
>> >> Many thanks,
>> >> Phil
>> >> 
>> >> 
>> >> 
>> >> -- 
>> >> View this message in context: 
>> >> 
>> http://www.nabble.com/Indexing-the-same-data-in-many-records-tp21448465p21448465.html
>> >> Sent from the Solr - User mailing list archive at Nabble.com.
>> > 
>> > 
>> > 
>> 
>> -- 
>> View this message in context: 
>> http://www.nabble.com/Indexing-the-same-data-in-many-records-tp21448465p21468706.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
> 
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Indexing-the-same-data-in-many-records-tp21448465p21475019.html
Sent from the Solr - User mailing list archive at Nabble.com.



wildcard with capital letters

2009-01-15 Thread pcu

Hello,

I am working on a simple prototype using solr, but I have not figured out how
to configure solr to give me the right results.

for example if I use this field:
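(Peter's field type definition was stripped by the list archive; a plausible
reconstruction, assuming an analyzer that lowercases tokens at index time --
which would explain the results below -- is something like:)

<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
<field name="name" type="text" indexed="true" stored="true"/>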
and put 'Koller, Julo' into this field,

then I get these results:

PASSED: search("koller")
PASSED: search("julo")
PASSED: search("KOLLER")
PASSED: search("JULO")
PASSED: search("Koller")
PASSED: search("Julo")
PASSED: search("kolle*")
PASSED: search("jul*")
FAILED: search("KOLLE*")
java.lang.AssertionError: trying found KOLLE* in Koller, Julo expected:<0>
but was:<1>
FAILED: search("JUL*")
java.lang.AssertionError: trying found JUL* in Koller, Julo expected:<0> but
was:<1>
FAILED: search("Kolle*")
java.lang.AssertionError: trying found Kolle* in Koller, Julo expected:<0>
but was:<1>
FAILED: search("Jul*")
java.lang.AssertionError: trying found Jul* in Koller, Julo expected:<0> but
was:<1>

Why does a wildcard with at least one capital letter not work?

thanks in advance

Peter
-- 
View this message in context: 
http://www.nabble.com/wildcard-with-capital-letters-tp21475396p21475396.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Import data from RSS Feed Question

2009-01-15 Thread Shalin Shekhar Mangar
On Thu, Jan 15, 2009 at 5:55 AM, Burt-Prior  wrote:

>
> Everything works and is setup correctly, but when I change the 'url'
> attribute in the entity declaration to a url on my intranet that requires
> basic authentication (username and password),  I get a HTTP 401 error when
> solr attempts to read the rss feed and update the index.
>
> Question: is there a way to specify a username and password for solr to use
> for an HttpDataSource?


No, not right now.

> Any suggestions on how to solve this issue?


HttpDataSource will need to be enhanced. Right now it is a very simple
implementation using UrlConnection. We can probably switch to
commons-httpclient and use its authentication capabilities.
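For reference, the kind of commons-httpclient (3.x) call being alluded to
might look like this -- only a sketch with placeholder URL and credentials,
not the actual DataImportHandler change:

import org.apache.commons.httpclient.HttpClient;
import org.apache.commons.httpclient.UsernamePasswordCredentials;
import org.apache.commons.httpclient.auth.AuthScope;
import org.apache.commons.httpclient.methods.GetMethod;

public class BasicAuthFetch {
    public static void main(String[] args) throws Exception {
        HttpClient client = new HttpClient();
        // register basic-auth credentials for any host/port (placeholders)
        client.getState().setCredentials(AuthScope.ANY,
                new UsernamePasswordCredentials("user", "secret"));
        GetMethod get = new GetMethod("http://intranet.example.com/feed.xml");
        get.setDoAuthentication(true);
        int status = client.executeMethod(get);
        System.out.println(status);
        System.out.println(get.getResponseBodyAsString());
        get.releaseConnection();
    }
}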


> I've been using Lucene for awhile, but am new to solr.  Solr is fantastic!
>
> Thanks for your help,
> .Burt
> --
> View this message in context:
> http://www.nabble.com/Import-data-from-RSS-Feed-Question-tp21468562p21468562.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


-- 
Regards,
Shalin Shekhar Mangar.


Re: Single facet on multiple attributes

2009-01-15 Thread Shalin Shekhar Mangar
On Wed, Jan 14, 2009 at 8:14 PM, prerna07  wrote:

>
> Hi,
>
> How can we create single facet on multiple attributes?


Do you mean to combine facets from multiple fields into one output? If yes,
you can create a copyField of all these fields and facet on that.
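For example (field names invented for illustration), in schema.xml:

<field name="allFacets" type="string" indexed="true" stored="false"
       multiValued="true"/>
<copyField source="brand" dest="allFacets"/>
<copyField source="category" dest="allFacets"/>

and then facet with &facet=true&facet.field=allFacets.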


>
>
> Thanks,
> --
> View this message in context:
> http://www.nabble.com/Single-facet-on-multiple-attributes-tp21457259p21457259.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


-- 
Regards,
Shalin Shekhar Mangar.


Re: wildcard with capital letters

2009-01-15 Thread Shalin Shekhar Mangar
On Thu, Jan 15, 2009 at 4:28 PM, pcu  wrote:

> Why wildcard with at least one capital letters does not work.


Prefix queries are not analysed. So the query you are making is of a
different case than the tokens in the index. Before sending the query to
Solr, you can lowercase it yourself.
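For example (an illustrative sketch only, not from the original reply):

import java.util.Locale;

public class LowercaseWildcard {
    public static void main(String[] args) {
        String userInput = "KOLLE*";
        // wildcard/prefix queries bypass the analyzer, so lowercase client-side
        String q = userInput.toLowerCase(Locale.ENGLISH);
        System.out.println("name:" + q); // prints name:kolle*
    }
}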


>
>
> thanks in advance
>
> Peter
> --
> View this message in context:
> http://www.nabble.com/wildcard-with-capital-letters-tp21475396p21475396.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


-- 
Regards,
Shalin Shekhar Mangar.


Re: Unwanted clustering of search results after sorting by score

2009-01-15 Thread Axel Tetzlaff

Hi,

I'm working on the problem Max described as well. We did try omitting the
norms, which led to the phenomenon that products with a very extensive
description were more likely to get a higher score, since they contained the
word more often. Due to the many expansions the SynonymFilter performs at
index time, this grew especially ugly. But as you already pointed out, we
should have a deeper look at how the score is assembled...

Nevertheless, the second problem of getting a good mix of shops can be
discussed separately. Say we have 5 products per result page and the 10 best
matches for a search all have the same score. 8 of the products are from one
shop (A), and the other two from two other shops (B, C).

What we often get is (letter indicating a product of this shop)
1.A
2.A
3.A
4.A
5.A
  second result page 
6.A
7.B
8.A
9.C
10.  A 

but what we want to get is something like this:

1.A
2.C
3.B
4.A
5.A
  second result page 
6.A
7.A
8.A
9.A
10.  A 

As you can imagine, there is no uniform distribution of products over shops,
so sorting by a random field does not work out: there are shops with tens of
thousands of products and shops with fewer than 100 products.

So theoretically I would sort by score and then by a magic factor which gets
greater the fewer products of this shop (possibly with that same score) are
already in the search result. Alternatively to a second sorting criterion,
the score itself could be diminished, I guess...

What really bothers me is that this requirement seems to need an extra
iteration over the search result which keeps track of the distribution of
products and shops in the search result.

We're really thankful for any hint on how to tackle this problem,
Axel
-- 
View this message in context: 
http://www.nabble.com/Unwanted-clustering-of-search-results-after-sorting-by-score-tp20977761p21477387.html
Sent from the Solr - User mailing list archive at Nabble.com.



Customizing Solr to handle Leading Wildcard queries

2009-01-15 Thread Jana, Kumar Raja
Hi,

 

Not being able to perform Leading Wildcard queries is a major handicap.
I want to be able to perform searches like *.pdf to fetch all pdf
documents from Solr.

 

I have found quite a few threads on this topic and one of the solutions
was that this feature can be enabled by adding:

parser.setAllowLeadingWildcards(true); at Line 92 in QueryParsing.java

Unfortunately, this did not work or may be I was using a different
parser and I don't know how to configure the parsers to make this work.

 

Can someone please tell me the steps to customize Solr to enable this
feature?

 

Thanks,

Kumar



Re: Customizing Solr to handle Leading Wildcard queries

2009-01-15 Thread Erik Hatcher


On Jan 15, 2009, at 8:23 AM, Jana, Kumar Raja wrote:
Not being able to perform Leading Wildcard queries is a major  
handicap.

I want to be able to perform searches like *.pdf to fetch all pdf
documents from Solr.


For this particular case, I recommend indexing the document type as a  
separate field.  Something like type:pdf (or use a MIME type string).   
Then you can do a very direct and fast query to search or facet by  
document types.


Erik



RE: Customizing Solr to handle Leading Wildcard queries

2009-01-15 Thread Jana, Kumar Raja
Hi Erik,

Thanks for the quick reply.
I want to enable leading wildcard query searches in general. The case
mentioned in the earlier mail is just one of the many instances I use
this feature.

-Kumar




-Original Message-
From: Erik Hatcher [mailto:e...@ehatchersolutions.com] 
Sent: Thursday, January 15, 2009 7:59 PM
To: solr-user@lucene.apache.org
Subject: Re: Customizing Solr to handle Leading Wildcard queries


On Jan 15, 2009, at 8:23 AM, Jana, Kumar Raja wrote:
> Not being able to perform Leading Wildcard queries is a major  
> handicap.
> I want to be able to perform searches like *.pdf to fetch all pdf
> documents from Solr.

For this particular case, I recommend indexing the document type as a  
separate field.  Something like type:pdf (or use a MIME type string).   
Then you can do a very direct and fast query to search or facet by  
document types.

Erik



Is it just me or multicore default is broken? Can't ping

2009-01-15 Thread Julian Davchev
Hi,
I am trying to set up multicore solr. So I just downloaded the default
distribution with jetty... go to example/
and run
java -Dsolr.solr.home=multicore -jar start.jar


All looks smooth, without errors on startup.
I can also open the admin at

http://localhost:8983/solr/core1/admin/


But then trying to ping
http://localhost:8983/solr/core1/admin/ping

I get  error 500 INTERNAL SERVER ERROR


And tons of exceptions in background starting with nullpointer

Anyone have a clue? Is solr stable to be used, or is multicore something
recently added and not to be trusted yet?




Re: Customizing Solr to handle Leading Wildcard queries

2009-01-15 Thread Glen Newton
If we are talking short single term fields (like a file field that has
a single term like "foo.pdf") then do what the DBMS b-tree indexes did
a long time ago: for every field you want a leading wildcard, insert
it in reverse order. So field file:"foo.pdf"  is also stored, indexed
as reverseField:"fdp.oof". Now when someone does a search on
reverseField, like reverseField:*oo.pdf, you reverse the query to be:
fdp.oo*

I believe some of the DBMSs kept a separate reverse b-tree to handle
leading wildcard queries.

And obviously this technique is harder to put in place for arbitrary
sections of text that have to be parsed. But a special parser could be
written to handle this as well.
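A tiny illustration of the rewrite Glen describes (the field names are
invented for the example):

public class ReverseFieldTrick {
    public static void main(String[] args) {
        // index time: store the value both ways
        String file = "foo.pdf";
        String reversed = new StringBuilder(file).reverse().toString();
        System.out.println("file:" + file + "  reverseFile:" + reversed); // reverseFile:fdp.oof

        // query time: a leading wildcard becomes a trailing one on the reversed field
        String userQuery = "*oo.pdf";
        String rewritten = new StringBuilder(userQuery).reverse().toString();
        System.out.println("reverseFile:" + rewritten); // reverseFile:fdp.oo*
    }
}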

-glen
http://zzzoot.blogspot.com/


2009/1/15 Jana, Kumar Raja :
> Hi Erik,
>
> Thanks for the quick reply.
> I want to enable leading wildcard query searches in general. The case
> mentioned in the earlier mail is just one of the many instances I use
> this feature.
>
> -Kumar
>
>
>
>
> -Original Message-
> From: Erik Hatcher [mailto:e...@ehatchersolutions.com]
> Sent: Thursday, January 15, 2009 7:59 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Customizing Solr to handle Leading Wildcard queries
>
>
> On Jan 15, 2009, at 8:23 AM, Jana, Kumar Raja wrote:
>> Not being able to perform Leading Wildcard queries is a major
>> handicap.
>> I want to be able to perform searches like *.pdf to fetch all pdf
>> documents from Solr.
>
> For this particular case, I recommend indexing the document type as a
> separate field.  Something like type:pdf (or use a MIME type string).
> Then you can do a very direct and fast query to search or facet by
> document types.
>
>Erik
>
>



-- 



Re: Is it just me or multicore default is broken? Can't ping

2009-01-15 Thread Otis Gospodnetic
Not sure, I'd have to try it.  But you didn't mention which version of Solr you 
are using.  Nightly build?


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 
> From: Julian Davchev 
> To: solr-user@lucene.apache.org
> Sent: Thursday, January 15, 2009 9:53:37 AM
> Subject: Is it just me or multicore default is broken? Can't ping
> 
> Hi,
> I am trying to setup multicore solr. So I just download default one with
> jetty...goto example/
> and run
> java -Dsolr.solr.home=multicore -jar start.jar
> 
> 
> All looks smooth without errors on startup.
> I can also open the admin at
> 
> http://localhost:8983/solr/core1/admin/
> 
> 
> But then trying to ping
> http://localhost:8983/solr/core1/admin/ping
> 
> I get  error 500 INTERNAL SERVER ERROR
> 
> 
> And tons of exceptions in background starting with nullpointer
> 
> Anyone have a clue? Is solr stable to be used or multicore is something
> recently added and not to be trusted yet?



Re: Customizing Solr to handle Leading Wildcard queries

2009-01-15 Thread Otis Gospodnetic
Hi ramuK,

I believe you can turn that "on" via the Lucene QueryParser, but of course such 
searches will be slo(oo)w.  You can also index reversed tokens (e.g. *kumar --> 
rakum*) or you could index n-grams with begin/end delim characters (e.g. kumar 
-> ^ k u m a r $, *kumar -> "k u m a r $")


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 
> From: "Jana, Kumar Raja" 
> To: solr-user@lucene.apache.org
> Sent: Thursday, January 15, 2009 9:49:24 AM
> Subject: RE: Customizing Solr to handle Leading Wildcard queries
> 
> Hi Erik,
> 
> Thanks for the quick reply.
> I want to enable leading wildcard query searches in general. The case
> mentioned in the earlier mail is just one of the many instances I use
> this feature.
> 
> -Kumar
> 
> 
> 
> 
> -Original Message-
> From: Erik Hatcher [mailto:e...@ehatchersolutions.com] 
> Sent: Thursday, January 15, 2009 7:59 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Customizing Solr to handle Leading Wildcard queries
> 
> 
> On Jan 15, 2009, at 8:23 AM, Jana, Kumar Raja wrote:
> > Not being able to perform Leading Wildcard queries is a major  
> > handicap.
> > I want to be able to perform searches like *.pdf to fetch all pdf
> > documents from Solr.
> 
> For this particular case, I recommend indexing the document type as a  
> separate field.  Something like type:pdf (or use a MIME type string).  
> Then you can do a very direct and fast query to search or facet by  
> document types.
> 
> Erik



Re: Searchable and Non Searchable Fields

2009-01-15 Thread Otis Gospodnetic
Con,

Sure.  You just have to specify the field name when searching:

FirstName:George (and not just: George)


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 
> From: con 
> To: solr-user@lucene.apache.org
> Sent: Thursday, January 15, 2009 12:20:55 AM
> Subject: Re: Searchable and Non Searchable Fields
> 
> 
> Thanks for the reply Otis
> Even if we dont get both George and Georgeon, Can we have only the firstname
> as searchable.
> That is, If I search George, I should get firstname, lastname, and country
> of the first row, and no values from the third row should be returned
> 
> Regards
> Con
> 
> 
> 
> Otis Gospodnetic wrote:
> > 
> > Hi,
> > 
> > Your schema setup looks fine.
> > George is not the same as Georgeon, so 2) won't match a search for
> > FirstName:George
> > 
> > Otis
> > --
> > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> > 
> > 
> > 
> > - Original Message 
> >> From: con 
> >> To: solr-user@lucene.apache.org
> >> Sent: Wednesday, January 14, 2009 1:23:06 AM
> >> Subject: Searchable and Non Searchable Fields
> >> 
> >> 
> >> Hi All
> >> 
> >> I am using dataimporthandler to index values from oracle db.
> >> 
> >> My sample rows are like:
> >> 
> >> 1) FirstName-> George,LastName-> Bush,  Country-> US
> >> 2) FirstName-> Georgeon, LastName-> Washington, Country-> US
> >> 3) FirstName-> Tony,   LastName-> George,   Country-> UK
> >> 4) FirstName-> Gordon,LastName-> Brown,Country-> UK
> >> 5) FirstName-> Vladimer,  LastName-> Putin,  Country-> Russia
> >> 
> >> How can i set only the FirstName field as searchable.
> >> For eg. if I search George, I should get FirstName, LastName and Country
> >> of
> >> first and second rows only, and if I search Bush no value should be
> >> returned.
> >> 
> >> I tried by providing various options for the <field> entries at schema.xml
> >> (the example field definitions were stripped by the archive).
> >> But it is not providing the exact results. 
> >> 
> >> How can I change the field attributes to get this result? Or is there
> >> someother configs for this?
> >> 
> >> Expecting reply
> >> Thanks in advance
> >> con
> >> -- 
> >> View this message in context: 
> >> 
> http://www.nabble.com/Searchable-and-Non-Searchable-Fields-tp21450664p21450664.html
> >> Sent from the Solr - User mailing list archive at Nabble.com.
> > 
> > 
> > 
> 
> -- 
> View this message in context: 
> http://www.nabble.com/Searchable-and-Non-Searchable-Fields-tp21450664p21471595.html
> Sent from the Solr - User mailing list archive at Nabble.com.



Re: Unwanted clustering of search results after sorting by score

2009-01-15 Thread Otis Gospodnetic
Axel,

Others may have better ideas, but the simplest idea that occurs to me right now 
is to really just go over the search results and re-sort them the way you 
described.  However, I don't think this is as scary as it sounds.  You don't 
really have to go through the whole result set - you only need to do this for 
the N hits you are displaying (10 in your example).  All of the data you need 
to access will already be in memory and cached, so this should be cheap, quick, 
and easy.  The magic factor that's inversely proportional to the number of 
products in a shop could be stored in a separate field at index time.

This should be doable with a function query, too.
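As a rough client-side sketch of that re-sort over one page of hits (the
"score" and "shop" field names are invented; this is not from the original
thread):

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class ShopDiversifier {
    // Greedily pick the highest-scored remaining hit; on score ties,
    // prefer the shop with the fewest products already emitted.
    public static List<Map<String, Object>> rerank(List<Map<String, Object>> hits) {
        Map<String, Integer> seen = new HashMap<String, Integer>();
        List<Map<String, Object>> pool = new ArrayList<Map<String, Object>>(hits);
        List<Map<String, Object>> out = new ArrayList<Map<String, Object>>();
        while (!pool.isEmpty()) {
            Map<String, Object> best = pool.get(0);
            for (Map<String, Object> h : pool) {
                float diff = (Float) h.get("score") - (Float) best.get("score");
                if (diff > 0 || (diff == 0 && count(seen, h) < count(seen, best))) {
                    best = h;
                }
            }
            pool.remove(best);
            String shop = (String) best.get("shop");
            Integer c = seen.get(shop);
            seen.put(shop, c == null ? 1 : c + 1);
            out.add(best);
        }
        return out;
    }

    private static int count(Map<String, Integer> seen, Map<String, Object> hit) {
        Integer c = seen.get((String) hit.get("shop"));
        return c == null ? 0 : c;
    }
}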


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 
> From: Axel Tetzlaff 
> To: solr-user@lucene.apache.org
> Sent: Thursday, January 15, 2009 8:15:29 AM
> Subject: Re: Unwanted clustering of search results after sorting by score
> 
> 
> Hi,
> 
> I'm working on the problem Max described as well. [...]



Re: place log4j.properties

2009-01-15 Thread Matthew Runo
Have you tried placing it up in /WEB-INF/classes/? I'd think that'd be  
the root of the classpath for solr, and maybe where it's looking for  
the file?


If you figure it out, could you update the wiki?
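For what it's worth, a minimal log4j.properties (a generic example, not taken
from this thread) that makes that WARN go away once the file is found on the
classpath:

log4j.rootLogger=INFO, stdout
log4j.appender.stdout=org.apache.log4j.ConsoleAppender
log4j.appender.stdout.layout=org.apache.log4j.PatternLayout
log4j.appender.stdout.layout.ConversionPattern=%d{ISO8601} %p [%c] %m%n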

--Matthew

On Jan 14, 2009, at 3:39 AM, Marc Sturlese wrote:



Hey there,
I have changed the log system in the nightly build to log4j  
following this

comment:

http://wiki.apache.org/solr/SolrLogging

Everything is loaded correclty but I am geting this INFO:

log4j:WARN No appenders could be found for logger
(org.apache.solr.servlet.SolrDispatchFilter).
log4j:WARN Please initialize the log4j system properly.

I think the problem is that the wepapp is not finding the  
log4j.properties.

I have tryed placing it in the firs class level:
./WEB-INF/classes/org/apache/solr/servlet/

But doesn't seem to recognize it... Any advice?

Thanks in advance

--
View this message in context: 
http://www.nabble.com/place-log4j.properties-tp21454379p21454379.html
Sent from the Solr - User mailing list archive at Nabble.com.





Help with Solr 1.3 lockups?

2009-01-15 Thread Jerome L Quinn

Hi, all.

I'm running solr 1.3 inside Tomcat 6.0.18.  I'm running a modified query
parser, tokenizer, highlighter, and have a CustomScoreQuery for dates.

After some amount of time, I see solr stop responding to update requests.
When crawling through the logs, I see the following pattern:

Jan 12, 2009 7:27:42 PM org.apache.solr.update.DirectUpdateHandler2 commit
INFO: start commit(optimize=false,waitFlush=false,waitSearcher=true)
Jan 12, 2009 7:28:11 PM org.apache.solr.common.SolrException log
SEVERE: Error during auto-warming of
key:org.apache.solr.search.queryresult...@ce0f92b9:java.lang.OutOfMemoryError
at org.apache.lucene.index.TermBuffer.toTerm(TermBuffer.java:122)
at org.apache.lucene.index.SegmentTermEnum.term
(SegmentTermEnum.java:167)
at org.apache.lucene.index.SegmentMergeInfo.next
(SegmentMergeInfo.java:66)
at org.apache.lucene.index.MultiSegmentReader$MultiTermEnum.next
(MultiSegmentReader.java:492)
at org.apache.lucene.search.FieldCacheImpl$7.createValue
(FieldCacheImpl.java:267)
at org.apache.lucene.search.FieldCacheImpl$Cache.get
(FieldCacheImpl.java:72)
at org.apache.lucene.search.FieldCacheImpl.getInts
(FieldCacheImpl.java:245)
at org.apache.solr.search.function.IntFieldSource.getValues
(IntFieldSource.java:50)
at org.apache.solr.search.function.SimpleFloatFunction.getValues
(SimpleFloatFunction.java:41)
at org.apache.solr.search.function.BoostedQuery$CustomScorer.
(BoostedQuery.java:111)
at org.apache.solr.search.function.BoostedQuery$CustomScorer.
(BoostedQuery.java:97)
at org.apache.solr.search.function.BoostedQuery
$BoostedWeight.scorer(BoostedQuery.java:88)
at org.apache.lucene.search.IndexSearcher.search
(IndexSearcher.java:132)
at org.apache.lucene.search.Searcher.search(Searcher.java:126)
at org.apache.lucene.search.Searcher.search(Searcher.java:105)
at org.apache.solr.search.SolrIndexSearcher.getDocListNC
(SolrIndexSearcher.java:966)
at org.apache.solr.search.SolrIndexSearcher.getDocListC
(SolrIndexSearcher.java:838)
at org.apache.solr.search.SolrIndexSearcher.access$000
(SolrIndexSearcher.java:56)
at org.apache.solr.search.SolrIndexSearcher$2.regenerateItem
(SolrIndexSearcher.java:260)
at org.apache.solr.search.LRUCache.warm(LRUCache.java:194)
at org.apache.solr.search.SolrIndexSearcher.warm
(SolrIndexSearcher.java:1518)
at org.apache.solr.core.SolrCore$3.call(SolrCore.java:1018)
at java.util.concurrent.FutureTask$Sync.innerRun
(FutureTask.java:314)
at java.util.concurrent.FutureTask.run(FutureTask.java:149)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask
(ThreadPoolExecutor.java:896)
at java.util.concurrent.ThreadPoolExecutor$Worker.run
(ThreadPoolExecutor.java:918)
at java.lang.Thread.run(Thread.java:735)

Jan 12, 2009 7:28:11 PM org.apache.tomcat.util.net.JIoEndpoint$Acceptor run
SEVERE: Socket accept failed
Throwable occurred: java.lang.OutOfMemoryError
at java.net.PlainSocketImpl.socketAccept(Native Method)
at java.net.PlainSocketImpl.accept(PlainSocketImpl.java:414)
at java.net.ServerSocket.implAccept(ServerSocket.java:464)
at java.net.ServerSocket.accept(ServerSocket.java:432)
at
org.apache.tomcat.util.net.DefaultServerSocketFactory.acceptSocket
(DefaultServerSocketFactory.java:61)
at org.apache.tomcat.util.net.JIoEndpoint$Acceptor.run
(JIoEndpoint.java:310)
at java.lang.Thread.run(Thread.java:735)

<<< Java dumps core and heap at this point >>>

Jan 12, 2009 7:28:21 PM org.apache.solr.common.SolrException log
SEVERE: org.apache.lucene.store.LockObtainFailedException: Lock obtain
timed out: SingleInstanceLock: write.lock
at org.apache.lucene.store.Lock.obtain(Lock.java:85)
at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:1140)
at org.apache.lucene.index.IndexWriter.(IndexWriter.java:938)
at org.apache.solr.update.SolrIndexWriter.
(SolrIndexWriter.java:116)
at org.apache.solr.update.UpdateHandler.createMainIndexWriter
(UpdateHandler.java:122)
at org.apache.solr.update.DirectUpdateHandler2.openWriter
(DirectUpdateHandler2.java:167)
at org.apache.solr.update.DirectUpdateHandler2.addDoc
(DirectUpdateHandler2.java:221)
at org.apache.solr.update.processor.RunUpdateProcessor.processAdd
(RunUpdateProcessorFactory.java:59)
at org.apache.solr.handler.XmlUpdateRequestHandler.processUpdate
(XmlUpdateRequestHandler.java:196)
at
org.apache.solr.handler.XmlUpdateRequestHandler.handleRequestBody
(XmlUpdateRequestHandler.java:123)
at org.apache.solr.handler.RequestHandlerBase.handleRequest
(RequestHandlerBase.java:131)
at org.apache.solr.core.SolrCore.execute(SolrCore.java:1204)
at org.apache.solr.servlet.SolrDispatchFilter.execute
(

Solr/Lucene capabilities--Newbie Question

2009-01-15 Thread kgrogan0321

Hello,
I have been tasked with evaluating a few open source tools for implementing
an Enterprise search in a new project(Solr/Lucene being one of them).  

Can anyone help to answer if Solr/Lucene can: 
1)Handle field/row level security?
2)implement DROOLS rules on a query of multiple records?  If so how does it
work internally and are there any performance hits?
3)Handle multiple data sources?
4)Break up and dispatch queries?


I do aplogize that my question(s) are a little general, as we are only in
the beginning stages of the project.  I appreciate any help or answers
anyone can give :)

Thanks,
Karen

-- 
View this message in context: 
http://www.nabble.com/Solr-Lucene-capabilities--Newbie-Question-tp21484427p21484427.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Import data from RSS Feed Question

2009-01-15 Thread Chris Hostetter
: Everything works and is setup correctly, but when I change the 'url'
: attribute in the entity declaration to a url on my intranet that requires
: basic authentication (username and password),  I get a HTTP 401 error when
: solr attempts to read the rss feed and update the index.
: 
: Question: is there a way to specify a username and password for solr to use
: for an HttpDataSource?

RFC1738 s3.1 specifies that a username:password pair can be included directly 
in URLs -- this has traditionally worked for http URLs when dealing with 
basic authentication, but some clients/servers reject URLs that utilize 
this feature as being "unsafe".

I haven't tried it using DataImportHandler, but i don't believe there's 
anything in the code that would outright reject such URLs...

   http://username:password@hostname.tld/rss-path.xml


http://tools.ietf.org/html/rfc1738#section-3.1


-Hoss



Having no luck with built-in replication and multicore

2009-01-15 Thread Jacob Singh
Hi folks,

Here's what I've got going:

Master Server with the following config:

<requestHandler name="/replication" class="solr.ReplicationHandler">
  <lst name="master">
    <str name="replicateAfter">commit</str>
    <str name="confFiles">schema.xml,stopwords.txt,elevate.xml</str>
  </lst>
</requestHandler>

Slave server with the following:

<requestHandler name="/replication" class="solr.ReplicationHandler">
  <lst name="slave">
    <str name="masterUrl">http://mydomain:8080/solr/065f079c24914a4103e2a57178164bbe/replication</str>
    <str name="pollInterval">00:00:20</str>
  </lst>
</requestHandler>

I think there is a bug in the JSP for the admin pages (which I can
post a patch if desired) where the replication link goes to
replication/ and index.jsp doesn't get loaded automatically (at least
on my box).  I managed to get to the dashboard by adding index.jsp,
and it seems that while the slave is polling constantly, it never
receives an update.

I tried the following:

curl 
'http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe/replication?command=snapshoot'


<response>
  <lst name="responseHeader"><int name="status">0</int><int name="QTime">1</int></lst>
  <str name="exception">java.lang.NullPointerException:java.lang.NullPointerException</str>
</response>

The index has about 400 docs in it, and old style replication used to
work just fine on it.

When I run the snappull command from the slave:

curl 
'http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe/replication?command=snappull'


<response>
  <lst name="responseHeader"><int name="status">0</int><int name="QTime">1</int></lst>
  <str name="status">OK</str>
</response>

The replication page also remains unchanged and there are no docs on the slave.

Any ideas?

Thanks,
Jacob






-- 

+1 510 277-0891 (o)
+91  33 7458 (m)

web: http://pajamadesign.com

Skype: pajamadesign
Yahoo: jacobsingh
AIM: jacobsingh
gTalk: jacobsi...@gmail.com


Re: Help with Solr 1.3 lockups?

2009-01-15 Thread Mark Miller
How much RAM are you giving the JVM? Thats running out of memory loading 
a FieldCache, which can be a more memory intensive data structure. It 
pretty much points to the JVM not having enough RAM to do what you want. 
How many fields do you sort on? How many fields do you facet on? How 
much RAM do you have available and how much have you given Solr? How 
many documents are you working with?


As far as rebooting a failed server, the best technique is generally 
external. I would recommend a script/program on another machine that 
hits the Solr instance with a simple query every now and again. If you 
don't get a valid response within a reasonable amount of time, or after 
a reasonable number of tries, fire off alert emails and issue a command 
to that server to reboot the JVM. Or something to that effect.
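Something along these lines, as a bare-bones sketch -- the URL, mail address
and restart command are all placeholders:

#!/bin/sh
# ping Solr; if it fails, alert and bounce the servlet container
URL="http://localhost:8983/solr/admin/ping"
if ! curl -sf --max-time 10 "$URL" > /dev/null 2>&1; then
    echo "solr ping failed on $(hostname) at $(date)" \
        | mail -s "solr down" ops@example.com
    /etc/init.d/tomcat6 restart
fi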


However, you should figure out why you are running out of memory. You 
don't want to use more resources than you have available if you can help it.


- Mark

Jerome L Quinn wrote:

> Hi, all.
>
> I'm running solr 1.3 inside Tomcat 6.0.18.  I'm running a modified query
> parser, tokenizer, highlighter, and have a CustomScoreQuery for dates. [...]

Request a specific document

2009-01-15 Thread roberto
Hello,

Is there any way to request a document (a field) using the doc id?

It would be very nice if I could use a response writer to return
the document in HTML form.

Has someone already done this?

Thanks,

-- 
"Without love, we are birds with broken wings."
Morrie


Re: Request a specifc document

2009-01-15 Thread Erik Hatcher


On Jan 15, 2009, at 3:04 PM, roberto wrote:

Is there any way to request a document (a field) using the doc id?


/select?q=id:<your-doc-id>

Append &fl=field,list to return only the desired fields.


It would be very nice if I could use a response writer to return
the document in HTML form.

Has someone already done this?


A couple of options "out of the box":

  1) the XSLT response writer, with a custom XSL to output HTML will  
work


  2) the new fangled VelocityResponseWriter (in trunk, see wiki for  
instructions)
 where you can supply a velocity template that you can customize  
to generate HTML
 [way more sensible, if you ask me, than XSLT for HTML generation  
like that - but I'm biased :)]


Erik



Re: Having no luck with built-in replication and multicore

2009-01-15 Thread Shalin Shekhar Mangar
What is the output of /replication?command=indexversion on the master?

On Fri, Jan 16, 2009 at 1:27 AM, Jacob Singh  wrote:

> Hi folks,
>
> Here's what I've got going:
>
> Master Server with the following config:
> 
>
>commit
>schema.xml,stopwords.txt,elevate.xml
>
> 
>
> Slave server with the following:
>
> 
>
>
> http://mydomain:8080/solr/065f079c24914a4103e2a57178164bbe/replication
> 
>00:00:20
> 
> 
>
> I think there is a bug in the JSP for the admin pages (which I can
> post a patch if desired) where the replication link goes to
> replication/ and index.jsp doesn't get loaded automatically (at least
> on my box).  I managed to get to the dashboard by adding index.jsp,
> and it seems that while the slave is polling constantly, it never
> receives an update.
>
> I tried the following:
>
> curl '
> http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe/replication?command=snapshoot
> '
> 
> 
> 0 name="QTime">1
> name="exception">java.lang.NullPointerException:java.lang.NullPointerException
> 
>
> The index has about 400 docs in it, and old style replication used to
> work just fine on it.
>
> When I run the snappull command from the slave:
>
> curl '
> http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe/replication?command=snappull
> '
> 
> 
> 0 name="QTime">1OK
> 
>
> The replication page also remains unchanged and there are no docs on the
> slave.
>
> Any ideas?
>
> Thanks,
> Jacob
>
>
>
>
>
>
> --
>
> +1 510 277-0891 (o)
> +91  33 7458 (m)
>
> web: http://pajamadesign.com
>
> Skype: pajamadesign
> Yahoo: jacobsingh
> AIM: jacobsingh
> gTalk: jacobsi...@gmail.com
>



-- 
Regards,
Shalin Shekhar Mangar.


Re: Is it just me or multicore default is broken? Can't ping

2009-01-15 Thread Julian Davchev
Hi,

I am trying with 1.3.0 from
http://apache.cbox.biz/lucene/solr/1.3.0/apache-solr-1.3.0.tgz

which I suppose is the stable release.

Otis Gospodnetic wrote:
> Not sure, I'd have to try it.  But you didn't mention which version of Solr 
> you are using.  Nightly build?
>
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> - Original Message 
>   
>> From: Julian Davchev 
>> To: solr-user@lucene.apache.org
>> Sent: Thursday, January 15, 2009 9:53:37 AM
>> Subject: Is it just me or multicore default is broken? Can't ping
>>
>> Hi,
>> I am trying to setup multicore solr. So I just download default one with
>> jetty...goto example/
>> and run
>> java -Dsolr.solr.home=multicore -jar start.jar
>>
>>
>> All looks smooth without errors on startup.
>> I can also open the admin at
>>
>> http://localhost:8983/solr/core1/admin/
>>
>>
>> But then trying to ping
>> http://localhost:8983/solr/core1/admin/ping
>>
>> I get  error 500 INTERNAL SERVER ERROR
>>
>>
>> And tons of exceptions in background starting with nullpointer
>>
>> Anyone have a clue? Is solr stable to be used or multicore is something
>> recently added and not to be trusted yet?
>> 
>
>   



Re: Having no luck with built-in replication and multicore

2009-01-15 Thread Jacob Singh
Hi Shalin,

Thanks for responding!  This used to be a 1.3 index (could that be the issue?)

curl 
'http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe//replication?command=indexversion'





Best,
Jacob


On Jan 15, 2009 3:32pm, Shalin Shekhar Mangar  wrote:
> What is the output of /replication?command=indexversion on the master?


Re: Date Stats support using Solr

2009-01-15 Thread Chris Hostetter

(Still catching up on holiday mail) ...

: I was searching for features in Solr which would give me the maximum and
: minimum values for various numeric and name fields. I found the Stats
: Component (Solr-680) and thanks a ton for that !!! J

: Is there a similar component for date fields too? I played a bit with

I don't think so, but for just getting the min/max date value (not sure 
what other stats would make sense for dates) this would probably be a 
fairly easy patch to StatsComponent if someone wanted to take  a stab at 
it.

the only way i know of to accomplish this now is to do two queries: 
asc/desc sort by date and see what comes back.
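for example, assuming a date field called "timestamp" and the standard
select handler (names are illustrative):

curl 'http://localhost:8983/solr/select?q=*:*&rows=1&fl=timestamp&sort=timestamp+asc'
curl 'http://localhost:8983/solr/select?q=*:*&rows=1&fl=timestamp&sort=timestamp+desc'

(the first returns the minimum date, the second the maximum)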



-Hoss



Re: Highlighting not working

2009-01-15 Thread Chris Hostetter

The problem here seems to be that SolrJ can't parse the XML response 
coming back from your solr server ... can you check your servlet 
container logs and let us know:

1) exactly what URL it says SolrJ is hitting.
2) the response you get when you hit that same url in your browser?

: Caused by: javax.xml.stream.XMLStreamException: ParseError at
: [row,col]:[3,1440]
: Message: requires 'name' attribute: lst
:   at
: 
org.apache.solr.client.solrj.impl.XMLResponseParser.readNamedList(XMLResponseParser.java:284)



-Hoss



Re: understanding queryNorm

2009-01-15 Thread Chris Hostetter
:i wanted to understand how the queryNorm is calculated. i did read 
: similarity documentation of lucene it says it is
...
: what would be default q.getBoost() ?  ( as i am not giving any value 
: specifically any where in solr). t.getBoost() is 1 in my case as i am 

all queries have a boost value, even if you don't specify one they have a 
default -- i believe it's "1" for every stock query, but a custom Impl 
could have an alternate default if it really wanted to.

the easiest way to visualize a lot of this is with debugQuery=true
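for example (URL and query are illustrative):

curl 'http://localhost:8983/solr/select?q=title:solr&debugQuery=true'

the "explain" section of the response then shows each clause's boost and the
queryNorm that was applied.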


-Hoss



Re: place log4j.properties

2009-01-15 Thread Marc Sturlese

>Have you tried placing it up in /WEB-INF/classes/? 
It worked. I was trying to set it in
/WEB-INF/classes/org/apache/solr/servlet, which was wrong.
Thanks!


Matthew Runo wrote:
> 
> Have you tried placing it up in /WEB-INF/classes/? I'd think that'd be  
> the root of the classpath for solr, and maybe where it's looking for  
> the file?
> 
> If you figure it out, could you update the wiki?
> 
> --Matthew
> 
> On Jan 14, 2009, at 3:39 AM, Marc Sturlese wrote:
> 
>>
>> Hey there,
>> I have changed the log system in the nightly build to log4j  
>> following this
>> comment:
>>
>> http://wiki.apache.org/solr/SolrLogging
>>
>> Everything is loaded correclty but I am geting this INFO:
>>
>> log4j:WARN No appenders could be found for logger
>> (org.apache.solr.servlet.SolrDispatchFilter).
>> log4j:WARN Please initialize the log4j system properly.
>>
>> I think the problem is that the wepapp is not finding the  
>> log4j.properties.
>> I have tryed placing it in the firs class level:
>> ./WEB-INF/classes/org/apache/solr/servlet/
>>
>> But doesn't seem to recognize it... Any advice?
>>
>> Thanks in advance
>>
>> -- 
>> View this message in context:
>> http://www.nabble.com/place-log4j.properties-tp21454379p21454379.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
> 
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Re%3A-place-log4j.properties-tp21482994p21487883.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: collectionDistribution vs SolrReplication

2009-01-15 Thread Chris Hostetter

: I would like to know the advantages of moving from:
: a master-slave system using CollectionDistribution with all their .sh
: scripts
: http://wiki.apache.org/solr/CollectionDistribution
: to:
: use SolrReplication and his solrconfig.xml configuration.
: http://wiki.apache.org/solr/SolrReplication

in addition to other comments posted it's important to keep in mind that 
one of the original motivations for the new style of replication was to 
have a 100% java based solution, as a result, it's is the only 
replication approach that works on windows.

(in particular: it has no dependency on being able to delete hardlinks, or 
on running rsync, or on using ssh, or on having external crons, etc..)

I still haven't had a chance to really kick the tires on the java based 
replication, so i have no real experience to base either of these claims 
on, but my hunch is that:
  1) new users will find the java based replication *much* easier to get 
up and running (a lot less moving parts and external processes to deal 
with)
  2) existing users who already have the script based replication working 
for them may find the java based replication less transparent and harder 
to manipulate in tricky ways.

...that second hunch comes from the fact that since the java replication 
is all self contained in solr, and doesn't use all of the various 
external processes (cron, rsync, snapshooter, snappuller, ssh, etc...) 
there are fewer places for people to manipulate the replication when doing 
'atypical' operations ... for example: during a phased rollout of some new 
code/schema, you might disable all replication by shutting down the rsyncd 
port; then disable it for a few slaves by commenting out the snappuller 
cron before turning rsyncd back on ... etc.

these types of tricks are probably unnecessary in 90% of the use cases, 
and people who aren't used to being able to do them probably won't care, 
but if you are used to having that level of control, you might miss them.

(but as i said: i haven't had a chance to try out the java replication at 
all, so for all i know it's just as tweakable and i'm just an idiot.)

-Hoss



Re: Help with Solr 1.3 lockups?

2009-01-15 Thread Stephen Weiss
I've been wondering about this one myself - most of the services we  
have installed work this way, if they crash out for whatever reason  
they restart automatically (Apache, MySQL, even the OS itself).   
Failures are detected and corrected by the load balancers and also in  
some cases by the machine itself (like with kernel panics).   But not  
SOLR, and I'm not quite sure what to do to get it there.  We use Jetty  
but it's the same story.  It's not like it fails out all that often,  
but when it does it will still respond to HTTP requests (because Jetty  
itself is still working), which makes it a lot harder to detect a  
failure... I've tried writing something for nagios but the problem is  
that most responses solr would give to a request vary depending on  
index updates, so it's not like I can just take a checksum and compare  
it - and even then, it would only really alert us to the problem, we'd  
still have to go in and restart everything (personally I don't enjoy  
restarting servers from my blackberry nearly as much as I should).


I'd have to come up with something that can intelligently interpret  
the response and decide if the server's still working properly or not,  
and the processing time on that alone might make it too inefficient to  
run every few seconds, but at least with that we'd be able to tell the  
cluster "don't send anything to this server for now".  Is there some  
really obvious way to track if a particular servlet is still running  
properly (in either Tomcat or Jetty, because if Tomcat has this I'd  
switch) and restart the container if it's not?


Thanks!!

--
Steve

On Jan 15, 2009, at 1:57 PM, Jerome L Quinn wrote:



An even bigger problem is the fact that once Solr is wedged, it  
stays that
way until a human notices and restarts things.  The tomcat stays  
running

and there's no automatic detection that will either restart Solr, or
restart the Tomcat container.

Any suggestions on either front?

Thanks,
Jerry Quinn





Re: Delete / filter / hide query results

2009-01-15 Thread Chris Hostetter

: can't be part of a field or something like this. So let's say that the only
: way to know if a user has access rights is by calling something like
: accessRights(sessionID, docID) where docID is stored in a field.

first tip: 'stored' values are going to be really inefficient to deal 
with on every request; at a bare minimum make sure this field is indexed 
and make all of your custom code access it using the FieldCache.

: I then decided to use a custom SearchComponent called right after index
: querying (before faceting is done) but for what I have read it's not a good
: idea to delete results because they are stored in more than one place and it
: could break the caching system (I suppose that if I delete results for user
: A they will be deleted for user B too if he makes the same query although he
: does have access rights). Anyway I don't really understand where results are
: stored in ResponseBuilder; DocSet / DocList are pretty obscur.

A DocSet is an unordered set of documents -- in the context of a query 
it's the set of all documents matching that query.  A DocList is an 
ordered (sub-)list of documents with some metadata about the whole list -- 
in the context of a query it's the "page" of documents being returned to 
the user; ie: docs 11-20 of 5478.  (this is all pretty well mentioned in 
the docs)

if you want to modify the DocList/DocSet included in query response, it's 
fairly easy to do -- the key is just that you shouldn't modify the 
existing DocSet/DocList objects becuase they are probably stored in the 
cache, but you are free to construct new instances and replace the ones in 
the response ... the FacetComponent will use your replacement DocSet when 
it comes.

note that applying your access control to the DocSet will be easy, because 
it's a complete set of unordered docs; you can remove anything you want.  
but the DocList has a lot more interesting use cases to worry about.  if 
the DocList is 11-20 of 5478 total matches, and you want to remove 2 you 
have to go search for what the next 2 would be to make sure you still 
return 10.  but you also have to worry about whether the original 11-20 
that the QueryComponent generated were right in the first place.  when the 
user made his first request for 1-10, your security component might have 
pulled out 3, but the QueryComponent didn't know that when it picked 
11-20, so you are already off by 3 from where you should be.

this is why post-processing access control tends to be a bad idea (beyond 
just extra goodies like faceting) ... things get a lot cleaner if you 
ensure your access controls get applied at query time.

you should consider implementing your access controls as a new type of 
query and using it as a filter ... with the new ValueSource parser hooks 
you could implement your logic as a "function" that takes a sessionId as 
input and reuse all of the existing query code.


-Hoss



Re: Questions about UUID type

2009-01-15 Thread Chris Hostetter

1) please don't cc both solr-user and solr-dev ... if you are confused 
about how something works, or having problems, please just email 
solr-user.

2) ...

: I'm confused by the UUID type comment. It's said that
...
: However, i found that if i don't specify the field it will report the
: exception
: 
: org.apache.solr.common.SolrException: Document [test] missing required
: field: id

...the javadoc you quoted is correct and accurate for what that method 
does, it behaves as documented when the value passed to it is null, empty 
or "NEW".

but at a higher level you still need to supply a value for any 
required="true" field when indexing a doc ... that value can be null, or 
empty, or "NEW" but you have to provide one.  If you want solr to just 
take care of it for you, then specify a default value in your schema...

   <field name="id" type="uuid" indexed="true" stored="true" default="NEW" />
The current approach allows Solr to be flexible to what the user wants...

* if you specify a default in the schema, solr won't complain if it's not 
in the input and will pass that value on to UUIDField
* if you don't specify a default, you take responsibility for sending 
some type of value for the UUIDField to use
* regardless of where the input comes from UUIDField will do the right 
thing if the input is null, empty, or "NEW"



-Hoss

Re: Solr/Lucene capabilities--Newbie Question

2009-01-15 Thread Grant Ingersoll


On Jan 15, 2009, at 1:53 PM, kgrogan0321 wrote:



Hello,
I have been tasked with evaluating a few open source tools for  
implementing

an Enterprise search in a new project(Solr/Lucene being one of them).

Can anyone help to answer if Solr/Lucene can:
1)Handle field/row level security?


Yes.  This is typically handled with a Filter.
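For example (field name and group values invented), each document can be
indexed with the groups allowed to see it, and every request gets a filter
query appended:

/select?q=quarterly+report&fq=acl:(sales OR managers)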



2)implement DROOLS rules on a query of multiple records?  If so how  
does it

work internally and are there any performance hits?


Not out of the box.  You would probably have to implement your own  
SearchComponent/RequestHandler to do so.  I don't know what would be  
involved here, but it sounds interesting.




3)Handle multiple data sources?


Yes.



4)Break up and dispatch queries?


In what way?  Do you mean for distributed search?  If so, then yes.





I do aplogize that my question(s) are a little general, as we are  
only in

the beginning stages of the project.  I appreciate any help or answers
anyone can give :)


No worries, all good questions.


Re: Dismax query parser with different field classes

2009-01-15 Thread Chris Hostetter

: I have a small problem with using a boost query, which is that I would like
: documents found in the boost query to be returned even if the main query
: does not include those results. So what I am effectively looking for is an
: OR between the dismax query and the boost query, rather than a required main
: query or'd with the boost query. Does anything currently exist which can
: facilitate this?

i don't think so.

I think you would need to write a custom version of QueryComponent to do 
this.

A lot of cool magic (that is as yet still largely undocumented) got added 
when the QParser and QParserPlugin apis were added for doing local params 
and even variable substitution using other request params -- but that 
still all happens at a string level.  i can't think of any way to say that 
you want multiple querys generated by different QParsers to be combined in 
a particular way


-Hoss



Re: Query about NOT (-) operator

2009-01-15 Thread Chris Hostetter

: But below query does not work

: 2.   (NOT(IBA60019_l:1) AND NOT(IBA60019_l:0)) AND
: businessType:wt.doc.WTDocument

boolean queries must have at least one "positive" expression (ie: MUST or 
SHOULD) in order to match.  The Solr query parser tries to help with this: 
if the *outermost* BooleanQuery contains only negated 
clauses, it adds a match-all-docs query (ie: *:*) ... but in your case, 
you have a nested BooleanQuery which contains only negated clauses ... so 
you need to include the match-all-docs query explicitly...

   +(*:* -IBA60019_l:1 -IBA60019_l:0) +businessType:wt.doc.WTDocument


-Hoss



Re: Using Lucene index in Solr

2009-01-15 Thread Chris Hostetter

: My data is stored in a database, I want Solr to look up the data in that
: database using my existing index. At the moment, I have set the 

you seem to be confusing two issues

: <dataDir> element in my solrconfig to point at my existing index, and checked the
: schema on my existing index using Luke but I can't get any results when
: searching in Solr.
: 
: My index was created using hibernate-search. 

...if you have an existing index, and you want to search in Solr, you have 
to create a schema.xml file that tells solr what the fields are that you 
have and what datatypes to treat them as -- in particular what analysers 
to use when querying them.

if hibernate-search built your index, you'll need to look at how it was 
configured to build the index to figure some of this out (I'm not familiar 
with hibernate-search so I can't help you there) ... the 
LukeRequestHandler can help you spot-check the raw index if you need to 
(ie: "oh, look, all of the terms are lowercased, so I guess I would use a 
LowerCaseFilterFactory")

: How can I retrieve my data in Solr, using the existing Lucene index? I think
: I need to set the database connection details somewhere, just not sure
: where. I have set up a dataImport handler, but I don't want that to
: overwrite my exising index.

If you give Solr an existing index, it doesn't care what database it 
was built from -- just what analysis rules were used when it was built.  
The only thing in Solr that cares about databases is the DataImportHandler, 
which you could use to update your index as new data gets added to your 
database if you want -- but first you have to create a schema.xml that 
makes sense for your index.

Alternately: create the schema.xml that you *want* to have, abandon your 
existing index, and use DataImportHandler to build a new index and keep it 
up to date.


-Hoss



Re: Using Solr with an existing Lucene index

2009-01-15 Thread Chris Hostetter

: My first attempt to do this resulted in my Java program throwing a
: CorruptIndex exception. It appears as though Solr has somehow modified my
: index files in some way which causes the Lucene code to see them as corrupt
: (even though I did not, at least intentionally, try to post any documents or
: otherwise update the index through Lucene).

knowing the specifics of the exception would be helpful ... for example, 
if the exception message was "unknown format version: -X", that typically 
just means it was last touched by a newer version of Lucene than the 
one you are trying to read it with ... if the version of Lucene in Solr is 
newer than the one you are using, I can easily imagine this happening just 
from Solr opening and closing an IndexWriter, even if you never use Solr to 
add/commit any docs.

: If so, how? Is it just a matter of changing your data directory to your
: existing index data in the solrconfig.xml, for example:
:   <dataDir>/my/existing/lucene/index/data</dataDir> ?

Solr expects the index to be named "index" inside the data directory -- 
but beyond that you also need to make sure your schema.xml is compatible 
(as mentioned in another thread I just replied to).
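For example, if the Lucene files live in /my/existing/lucene/index,
solrconfig.xml would point at the parent directory (a sketch):

  <dataDir>/my/existing/lucene</dataDir>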



-Hoss



CoreAdmin for replication STATUS

2009-01-15 Thread Jacob Singh
Hi,

How do I find out the status of a slave's index?  I have the following scenario:

1. Boot up the slave.  I give it a core name of boot-$CoreName.

2. I call boot-$CoreName/replication?command=snappull

3. I check back every minute using cron and I want to see if the slave
has actually gotten the data.

4. When it gets the data I call
solr/admin/cores?action=RENAME&core=boot-$CoreName&other=$CoreName.

I do this because the balancer will start hitting the slave before it
has the full index otherwise.  Step 3 is the problem.  I don't have a
reliable way to know it has finished replication, AFAIK.  I see in
?action=STATUS for the CoreAdmin there is a field called "current".
Is this useful for this?  If not, what is recommended?  I could hit
the admin/replication/index.jsp URL and screen-scrape the HTML, but I
imagine there is a better way.

Thanks,
Jacob

-- 

+1 510 277-0891 (o)
+91  33 7458 (m)

web: http://pajamadesign.com

Skype: pajamadesign
Yahoo: jacobsingh
AIM: jacobsingh
gTalk: jacobsi...@gmail.com


Re: delta index produces multiple results?

2009-01-15 Thread Chris Hostetter

: Full indexing is working fine; in schema.xml I implemented a uniqueKey field
: (which is of type 'text').

using "text" as the fieldtype for a uniqueKey is almost never a good idea.  
it could easily explain the behavior you are seeing.

DataImportHandler (and all of hte update handlers) relies on the 
underlying UpdateProcessor to delete docs with identical uniqueKeys when 
you "update" an existing document ... if the uniqueKey field has an 
analyzer that produces multiple tokens (TextField frequently does) then 
the behavior becomes undefined.

Stick with something like StrField or IntField for your uniqueKey field ... or 
if you must use TextField, make sure you are using the KeywordTokenizer.
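For example, the stock example schema's pattern (a sketch; the "id" field
and "string" type are the example's names, not necessarily yours):

  <field name="id" type="string" indexed="true" stored="true" required="true"/>
  <uniqueKey>id</uniqueKey>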

if changing this still causes problems, then we'll need to see your 
schema.xml, your data-config.xml, and the output of doing a search 
where you get some duplications like this, to help figure out what else 
might be going wrong.


-Hoss



RE: Solr FAQ entry about "Dynamically calculated range facet" topic

2009-01-15 Thread Chris Hostetter

: So did anyone put together a FAQ on this subject? I am also interested in
: seeing the different ways to get dynamic faceting to work.

in past discussions, one of the big pre-reqs for doing anything interesting 
was generating stats across the field ... the new StatsComponent can give 
you the min/mean/max/stddev for any field, so you can now make rough 
guesses at some good ranges on the client and then request them.
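For example (a sketch; "price" is an illustrative field, and URL-encoding
is omitted for readability):

  /select?q=*:*&rows=0&stats=true&stats.field=price

...then turn the returned min/max into facet queries:

  /select?q=*:*&rows=0&facet=true&facet.query=price:[0 TO 20]&facet.query=price:[20 TO 75]&facet.query=price:[75 TO 123]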

: In this post, Chris Hostetter dropped a piece of handler code. Is it still
: the right path to take for those generated ranges:
: $0..$20 (3)
: $20..$75 (15)
: $75..$123 (8)
: 
: "Re: Dynamically calculated range facet"
: http://www.mail-archive.com/solr-user@lucene.apache.org/msg04727.html

these days you'd want to do this in a SearchComponent ... 
probably a subclass of FacetComponent ... but the same basic pattern 
still applies.  You're going to have a DocSet to work with, and you can do 
whatever you want to generate your facet metadata.

the really interesting part would be getting 
SearchComponent.distributedProcess to work, because individual shards 
aren't necessarily going to pick the same ranges based on their local 
stats ... I guess you'd make your new Component depend on the 
StatsComponent completing, and then have the coordinator compute the ranges 
and tell the shards what they should be, then aggregate ... that seems 
like it might work.




-Hoss



Re: Date Range query in Solr

2009-01-15 Thread Chris Hostetter

:   I too have a similar question on getting the query results based on
: dateRange. I have both startDate and endDate fields in my schema and if I
: want to get the query results that fall between two date values, e.g. get all
: the docs whose date is between startDate and endDate -- how can I query?

searching the mailing list archive is always a good place to start...

http://www.nabble.com/forum/Search.jtp?forum=14479&local=y&query=startDate+endDate

http://www.nabble.com/Date-Range-Query-%2B-Fields-to16108517.html#a16132427
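The usual pattern for "docs whose [startDate, endDate] interval contains a
given date" is two range clauses, e.g. (a sketch, with the field names from
the question above):

  startDate:[* TO 2009-01-15T00:00:00Z] AND endDate:[2009-01-15T00:00:00Z TO *]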




-Hoss



Re: Does search query return specific result.?

2009-01-15 Thread Chris Hostetter

: I believe the reason is because, Solr returns the document with all of the
: "Tag" field's content.
: 
: Now, the question is: is there a way to make it return only the Tags that match
: the criteria from the same document?

not really ... highlighting with things like the NullFragmenter can 
probably make things like this work, but for an auto-suggest type 
application you're going to want to be fast -- the added processing of 
highlighting is probably not the best way to go.

the thing to remember is that you typically want one "document" for each 
"thing" that you are going to return from a "search" ... for an 
auto-suggest type application, you frequently want one doc per "word" that 
your auto-suggest queries are going to return.

There are alternate approaches however ... the new TermsComponent, for 
example, makes it easy to get direct access to the TermEnum for an 
arbitrary field based on things like a prefix or min/max doc frequency -- 
so a good basic auto-suggest can be implemented that way even with your 
docs as you have them indexed ... but if you want more control over the 
weighting/boosting of what comes back from an arbitrary query, you're going 
to want a special index of "Tags".
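A sketch of the kind of request TermsComponent supports (handler path and
parameter names per its wiki documentation; "Tag" is the field from this
thread):

  /terms?terms.fl=Tag&terms.prefix=ab&terms.limit=10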


-Hoss



Re: EmbeddedSolrServer in Single Core

2009-01-15 Thread qp19

Thanks ryan,

Works like a charm. This is roughly how I ended up doing it:

QP

SolrConfig solrConfig = new SolrConfig(SOLR_HOME, CONFIG_FILENAME, null);
IndexSchema indexSchema = new IndexSchema(solrConfig, SOLR_SCHEMA, null);

CoreContainer container = new CoreContainer(
    new SolrResourceLoader(SolrResourceLoader.locateInstanceDir()));
CoreDescriptor dcore = new CoreDescriptor(container, "",
    solrConfig.getResourceLoader().getInstanceDir());
dcore.setConfigName(solrConfig.getResourceName());
dcore.setSchemaName(indexSchema.getResourceName());
SolrCore core = new SolrCore(null, SOLR_DATA, solrConfig, indexSchema, dcore);
container.register("", core, false);
SolrServer server = new EmbeddedSolrServer(container, "");


ryantxu wrote:
> 
> 
> On Jan 9, 2009, at 8:12 PM, qp19 wrote:
> 
>>
>> Please bear with me. I am new to Solr. I have searched all the existing
>> posts about this and could not find an answer. I wanted to know how to go
>> about creating a SolrServer using EmbeddedSolrServer. I tried to initialize
>> this several ways but was unsuccessful. I do not have multi-core. I am
>> using solrj 1.3. I attempted to use the deprecated methods as mentioned in
>> the SolrJ documentation the following way, but it fails as well with
>> "unable to locate Core".
>>
>>
>> SolrCore core = SolrCore.getSolrCore();
> 
> This function is deprecated and *really* should not be used --  
> especially for embedded solr server.  (the only chance you would have  
> for it to work is if you start up Solr in a web app before calling this)
> 
>>
>>  SolrServer server = new EmbeddedSolrServer( core );
>>
> 
> Core initialization is kind of a mess, but this contains everything  
> you would need:
> 
>    CoreContainer container = new CoreContainer(new
>        SolrResourceLoader(SolrResourceLoader.locateInstanceDir()));
>    CoreDescriptor dcore = new CoreDescriptor(container, coreName,
>        solrConfig.getResourceLoader().getInstanceDir());
>    dcore.setConfigName(solrConfig.getResourceName());
>    dcore.setSchemaName(indexSchema.getResourceName());
>    SolrCore core = new SolrCore(null, dataDirectory, solrConfig,
>        indexSchema, dcore);
>    container.register(coreName, core, false);
> 
> 
> 
>> So far my installation is pretty basic, with Solr running on Tomcat as per
>> instructions in the wiki. My Solr home is outside of the webapps folder,
>> i.e. "c:/tomcat-solr/solr". I am able to connect using
>> CommonsHttpSolrServer("http://localhost:8080/solr") without a problem.
>> The question in a nutshell is, how do I instantiate EmbeddedSolrServer
>> using new EmbeddedSolrServer(CoreContainer coreContainer, String coreName)?
>> Initializing CoreContainer appears to be complicated when compared to
>> SolrCore.getSolrCore() as per the examples. Is there a simpler way to
>> initialize CoreContainer? Is a core (or CoreName) necessary even though I
>> don't use multi-core? Also, is it possible to initialize
>> EmbeddedSolrServer using Spring? Thanks in advance for the help.
>>
> 
> yes, I use this:
> 
>   <bean id="coreContainer" class="org.apache.solr.core.CoreContainer">
>     <constructor-arg value="${dir}"/>
>     <constructor-arg value="${dconfigFile}"/>
>   </bean>
> 
>   <bean id="server1"
>       class="org.apache.solr.client.solrj.embedded.EmbeddedSolrServer">
>     <constructor-arg ref="coreContainer"/>
>     <constructor-arg value="core1"/>
>   </bean>
> 
>   <bean id="server2"
>       class="org.apache.solr.client.solrj.embedded.EmbeddedSolrServer">
>     <constructor-arg ref="coreContainer"/>
>     <constructor-arg value="core2"/>
>   </bean>
> 
> ryan
> 
> 

-- 
View this message in context: 
http://www.nabble.com/EmbeddedSolrServer-in-Single-Core-tp21383525p21490222.html
Sent from the Solr - User mailing list archive at Nabble.com.



New Searcher / Commit / Cache Warming Time

2009-01-15 Thread David Giffin
Hi All,

I have been trying to reduce the CPU load and the time it takes to put a
new snapshot in place on our slave servers. I have tried tweaking many
of the system memory, JVM and cache size settings used by Solr. When
running a commit from the command line I'm seeing roughly 16 seconds
before the commit completes. This is a ~7GB index with no pending
changes, nothing else running, no load:

INFO: {commit=} 0 15771
Jan 15, 2009 11:29:35 PM org.apache.solr.core.SolrCore execute
INFO: [listings] webapp=/solr path=/update params={} status=0 QTime=15771

So I started disabling things. I disabled everything under the
event-listener sections of solrconfig.xml, and commit times went down:

INFO: {commit=} 0 103
Jan 15, 2009 11:35:22 PM org.apache.solr.core.SolrCore execute
INFO: [listings] webapp=/solr path=/update params={} status=0 QTime=103

So I started adding things back in, and found that adding the newSearcher
listener section back was causing the slowdown. When I comment that section
out, commit times go down and the CPU spikes go away. So I tried putting the
newSearcher section back in with no queries to run -- same thing...
times jump up:

INFO: {commit=} 0 16306
Jan 15, 2009 11:49:32 PM org.apache.solr.core.SolrCore execute
INFO: [listings] webapp=/solr path=/update params={} status=0 QTime=16306

Do you know what would be causing "newSearcher" to create such
delays and CPU spikes? Is there any reason not to disable the
"newSearcher" section?

Thanks,
David


Re: Querying Solr Index for date fields

2009-01-15 Thread Chris Hostetter

: You will have to URL-encode the string correctly and supply the date in the
: format Solr expects. Please check this: http://wiki.apache.org/solr/SolrQuerySyntax

beyond that, you may also need to worry about Lucene query syntax escaping 
... the query parser can see the ":" character and think you are searching 
for "mm:ss" in the field "yyyy-mm-ddThh"

this isn't something people typically need to worry about when range 
searching on dates (because the query parser doesn't treat ":" as special in 
a range query), but if you are looking for exact date matches you'll 
probably want to quote the date value...

date_field:"2009-01-09T00:00:00Z"




-Hoss



Re: Missing high-scoring results in 1.3

2009-01-15 Thread Chris Hostetter

Hmmm ... I'm wondering if the Lucene/Solr version changes are a red 
herring here ... at first blush all of these symptoms sound like invalid 
cache hits... 

: I'm seeing a really weird problem with Solr 1.3. The best match for a
: query will not show up with 10 rows, but will show up if I request more,
: sometimes 200, sometimes it takes 1000 rows.

if two queries that are functionally different (and produce different 
results) are mistakenly considered equivalent, you could see this exact 
behavior ... queryA gets cached, queryB results in a false cache hit and 
doesn't include the highest scoring document that it might if it had been 
executed w/o caches.  Increasing the rows param when re-executing queryB 
still results in a cache hit because of the queryResultWindowSize.  
Stopping/starting Solr "fixes" the problem because the caches are 
empty and queryB is one of the first things tried when the server restarts 
(before anyone has a chance to run queryA).

: Here is the relevant part of solrconfig. Note that we have added a
: JaroWinkler fuzzy search, so the dismax specs have extra decoration.

...can you elaborate on your JaroWinkler customizations?  is it possible 
that the Query objects getting generated have hashCode/equals methods 
that aren't aware of your customizations?
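For illustration, a minimal sketch of the contract a custom Query class has
to honor for Solr's caches to behave (the class name and fields here are
hypothetical, not the actual customization from this thread):

  import org.apache.lucene.search.Query;

  // Every field that affects matching must participate in equals() and
  // hashCode(), along with the boost, or two different queries can be
  // treated as the same entry by the queryResultCache.
  public class JaroWinklerQuery extends Query {
    private final String field;
    private final String text;
    private final float minSimilarity;

    public JaroWinklerQuery(String field, String text, float minSimilarity) {
      this.field = field;
      this.text = text;
      this.minSimilarity = minSimilarity;
    }

    @Override
    public String toString(String defaultField) {
      return field + ":" + text + "~jw" + minSimilarity + "^" + getBoost();
    }

    @Override
    public boolean equals(Object o) {
      if (!(o instanceof JaroWinklerQuery)) return false;
      JaroWinklerQuery q = (JaroWinklerQuery) o;
      return getBoost() == q.getBoost()
          && field.equals(q.field)
          && text.equals(q.text)
          && minSimilarity == q.minSimilarity;
    }

    @Override
    public int hashCode() {
      int h = Float.floatToIntBits(getBoost());
      h = 31 * h + field.hashCode();
      h = 31 * h + text.hashCode();
      h = 31 * h + Float.floatToIntBits(minSimilarity);
      return h;
    }
  }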


-Hoss



Re: Missing high-scoring results in 1.3

2009-01-15 Thread Yonik Seeley
On Thu, Jan 15, 2009 at 8:01 PM, Chris Hostetter
 wrote:
> : Here is the relevant part of solrconfig. Note that we have added a
> : JaroWinkler fuzzy search, so the dismax specs have extra decoration.
>
> ...can you elaborate on your JaroWinkler customizations?  is it possible
> that the Query objects getting generated have hashCode/equals methods
> that aren't aware of your customizations?

Nice catch - that could definitely cause this type of problem.
I just went back and re-examined the recent MultiPhraseQuery
equals/hashcode fix and verified that the bug could only result in a
cache miss and not a false hit.

-Yonik


Solrj + hl.usePhraseHighlighter

2009-01-15 Thread Sachit

Hi all,

SolrQuery provides all methods related to highlighting with the Solrj client,
such as setHighlight(), setHighlightFragSize(), etc.
But I didn't find any way to set hl.usePhraseHighlighter = true on the
query.  I can set the same in my solrconfig.xml, but my aim is to set this
attribute to "true" only in the case of an "exact match" search.
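One possible workaround, sketched under the assumption that there really is
no dedicated setter: SolrQuery extends ModifiableSolrParams, so a raw
parameter can still be set directly (the searchTerm value is illustrative):

  import org.apache.solr.client.solrj.SolrQuery;

  String searchTerm = "some exact phrase";  // illustrative
  SolrQuery query = new SolrQuery("wildcard:" + searchTerm);
  query.setHighlight(true);
  // no dedicated setter for this param, so set it as a raw parameter
  query.set("hl.usePhraseHighlighter", true);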

One more question I need to raise is about "highlighting the wildcard".
I went through the mailing list and found out that we can do this by
prefixing the "*" with a "?", but it is not working for me.
In fact, the wildcard "?" instead of "*" itself does not give me any results.

My relevant schema.xml:

   [the field definitions were stripped by the list archive; only a
    copyField destination "all" survives]

Note: I have other fields too, but am specifying only the relevant ones. I am
copyfielding every field except "wildcard" and "highlightFields". While
creating the query, I'm doing: query.setQuery("wildcard: " + searchTerm);.
Also, I'm using the default requestHandler (standard).



My relevant solrconfig.xml:

  <requestHandler name="standard" class="solr.SearchHandler" default="true">
    <lst name="defaults">
      <str name="echoParams">explicit</str>
      <int name="rows">10</int>
      <str name="fl">*</str>
      <str name="version">2.1</str>
      <str name="hl.fl">highlightFields</str>
      <int name="hl.fragsize">300</int>
    </lst>
  </requestHandler>


Please let me know where I'm going wrong. 

Thanks 
Sachit

-- 
View this message in context: 
http://www.nabble.com/Solrj-%2B-hl.usePhraseHighlighter-tp21492830p21492830.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Solrj + hl.usePhraseHighlighter

2009-01-15 Thread Sachit

Sorry I forgot to detail out the field type "wild_card" definition.

   [the wild_card field type definition was stripped by the list archive]


-- 
View this message in context: 
http://www.nabble.com/Solrj-%2B-hl.usePhraseHighlighter-tp21492830p21492981.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Having no luck with built-in replication and multicore

2009-01-15 Thread Shalin Shekhar Mangar
Hi Jacob,

You don't need to call snapshoot on the master. That is only used to create
a backup of the index files.

You are calling snappull on the master. It is only applicable for the
slaves. You don't need to issue these calls yourself at all. The
ReplicationHandler is designed to take care of these.

The master is showing indexversion as 0 because you haven't called commit on
the master yet. Can you call commit and see if replication happens on the
slave?
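For example, a standard way to issue that commit from the command line (the
URL is the core from the messages below):

  curl 'http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe/update' \
       -H 'Content-type:text/xml' --data-binary '<commit/>'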

On Fri, Jan 16, 2009 at 2:24 AM, Jacob Singh  wrote:

> Hi Shalin,
>
> Thanks for responding!  This used to be a 1.3 index (could that be the
> issue?)
>
> curl 'http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe//replication?command=indexversion'
> 
> <response>
>   <lst name="responseHeader"><int name="status">0</int><int name="QTime">0</int></lst>
>   <long name="indexversion">0</long><long name="generation">0</long>
> </response>
>
> Best,
> Jacob
>
>
> On Jan 15, 2009 3:32pm, Shalin Shekhar Mangar 
> wrote:
> > What is the output of /replication?command=indexversion on the master?
> >
> >
> >
> > On Fri, Jan 16, 2009 at 1:27 AM, Jacob Singh <jacobsi...@gmail.com>
> > wrote:
> >
> >
> >
> > > Hi folks,
> > >
> > > Here's what I've got going:
> > >
> > > Master Server with the following config:
> > >
> > >   <lst name="master">
> > >     <str name="replicateAfter">commit</str>
> > >     <str name="confFiles">schema.xml,stopwords.txt,elevate.xml</str>
> > >   </lst>
> > >
> > > Slave server with the following:
> > >
> > >   <lst name="slave">
> > >     <str name="masterUrl">http://mydomain:8080/solr/065f079c24914a4103e2a57178164bbe/replication</str>
> > >     <str name="pollInterval">00:00:20</str>
> > >   </lst>
> > >
> > > I think there is a bug in the JSP for the admin pages (which I can
> > > post a patch if desired) where the replication link goes to
> > > replication/ and index.jsp doesn't get loaded automatically (at least
> > > on my box).  I managed to get to the dashboard by adding index.jsp,
> > > and it seems that while the slave is polling constantly, it never
> > > receives an update.
> > >
> > > I tried the following:
> > >
> > > curl 'http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe/replication?command=snapshoot'
> >
> > >
> > > <response>
> > >   <lst name="responseHeader"><int name="status">0</int><int name="QTime">1</int></lst>
> > >   <str name="exception">java.lang.NullPointerException:java.lang.NullPointerException</str>
> > > </response>
> > >
> >
> > > The index has about 400 docs in it, and old style replication used to
> > > work just fine on it.
> > >
> > > When I run the snappull command from the slave:
> > >
> > > curl 'http://mydomain.com:8080/solr/065f079c24914a4103e2a57178164bbe/replication?command=snappull'
> > >
> > > <response>
> > >   <lst name="responseHeader"><int name="status">0</int><int name="QTime">1</int></lst>
> > >   <str name="status">OK</str>
> > > </response>
> > >
> > > The replication page also remains unchanged and there are no docs on the
> > > slave.
> > >
> > > Any ideas?
> > >
> > > Thanks,
> > > Jacob
> > >
> > > --
> > > +1 510 277-0891 (o)
> > > +91  33 7458 (m)
> > >
> > > web: http://pajamadesign.com
> > >
> > > Skype: pajamadesign
> > > Yahoo: jacobsingh
> > > AIM: jacobsingh
> > > gTalk: jacobsi...@gmail.com
> >
> > --
> > Regards,
> > Shalin Shekhar Mangar.
> >
>



-- 
Regards,
Shalin Shekhar Mangar.


Re: CoreAdmin for replication STATUS

2009-01-15 Thread Akshay
On Fri, Jan 16, 2009 at 4:57 AM, Jacob Singh  wrote:

> Hi,
>
> How do I find out the status of a slave's index?  I have the following
> scenario:
>
> 1. Boot up the slave.  I give it a core name of boot-$CoreName.
>
> 2. I call boot-$CoreName/replication?command=snappull
>
> 3. I check back every minute using cron and I want to see if the slave
> has actually gotten the data.
>
> 4. When it gets the data I call
> solr/admin/cores?action=RENAME&core=boot-$CoreName&other=$CoreName.
>
> I do this because the balancer will start hitting the slave before it
> has the full index otherwise.  Step 3 is the problem.  I don't have a
> reliable way to know it has finished replication AFAIK.  I see in
> ?action=STATUS for the CoreAdmin there is a field called "current".
> Is this useful for this?  If not, what is recommended?  I could hit
> the admin/replication/index.jsp URL and screen-scrape the HTML, but I
> imagine there is a better way.


From the slave you can issue an HTTP command:

  boot-$CoreName/replication?command=details

This returns XML containing a node "isReplicating" with a boolean value,
which will tell you whether replication is in progress or has completed.
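A rough sketch of the kind of cron-side check this enables (host and core
names are illustrative, and the grep string assumes the response node
described above):

  curl -s 'http://slavehost:8080/solr/boot-myCore/replication?command=details' \
    | grep '<str name="isReplicating">'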

Thanks,
> Jacob
>
> --
>
> +1 510 277-0891 (o)
> +91  33 7458 (m)
>
> web: http://pajamadesign.com
>
> Skype: pajamadesign
> Yahoo: jacobsingh
> AIM: jacobsingh
> gTalk: jacobsi...@gmail.com
>



-- 
Regards,
Akshay Ukey.