RE: Does SOLR supports multiple instances within the same webapplication?

2007-11-13 Thread Dilip.TS
Hi,

 I am using SOLR 1.2 stable build currently , and this version does not have
this feature (Support for Multiple SOLR cores).

 How do I get this feature? I tried to open the .patch file but I could not
understand.

 I was looking into the link http://svn.apache.org/viewvc/lucene/solr/ for
the solr1.3 version which supports this feature.

 If possible could you please point me to the link/instruction where I can
check out this feature or the latest source and build the SOLR binary.

 Thanks in advance,
 Dilip


-Original Message-
From: Ryan McKinley [mailto:[EMAIL PROTECTED]
Sent: Monday, November 12, 2007 9:23 PM
To: solr-user@lucene.apache.org
Subject: Re: Does SOLR supports multiple instances within the same
webapplication?


Dilip.TS wrote:
> Hello,
>
>   Does SOLR supports multiple instances within the same web application?
If
> so how is this achieved?
>

If you want multiple indices, you can run multiple web-apps.

If you need multiple indices in the same web-app, check SOLR-350 -- it
is still in development, and make sure you *really* need it before going
that route.

ryan



- I/O exception (java.net.BindException) caught when processing request: Address already in use: connect while updating

2007-11-13 Thread Daniele Salvatico


Hi all,

i have a Solr instance running under tomcat 5.5. My application uses a java
servlet to handle data. I have a problem with "add-update" process:  at the
first run the add process is ok and very fast . At this first run i add
around 2.000 docs. Then i commit and all is ok.
The problem is that if i run the same the "add - update" process again,
sometimes (not all the times) it fails with this error:

13-nov-2007 8.17.32 org.apache.solr.handler.XmlUpdateRequestHandler update
INFO: added id={2524} in 15ms
- I/O exception (java.net.BindException) caught when processing request:
Address already in use: connect
- Retrying request
- I/O exception (java.net.BindException) caught when processing request:
Address already in use: connect
- Retrying request
- I/O exception (java.net.BindException) caught when processing request:
Address already in use: connect
- Retrying request
Fatal transport error: Address already in use: connect
java.net.BindException: Address already in use: connect
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(Unknown Source)
at java.net.PlainSocketImpl.connectToAddress(Unknown Source)
at java.net.PlainSocketImpl.connect(Unknown Source)
at java.net.SocksSocketImpl.connect(Unknown Source)
at java.net.Socket.connect(Unknown Source)
at java.net.Socket.connect(Unknown Source)
at java.net.Socket.(Unknown Source)
at java.net.Socket.(Unknown Source)
at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:79)
at
org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:121)
at
org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:706)
at
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:386)
at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:170)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:396)
at
org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:324)
at
intellisemantic.servlet.solr.DWServlet.sendPostCommand(DWServlet.java:985)
at
intellisemantic.servlet.solr.DWServlet.processExecuteCommand(DWServlet.java:826)
at intellisemantic.servlet.solr.DWServlet.indexing(DWServlet.java:435)
at intellisemantic.servlet.solr.DWServlet.doGet(DWServlet.java:163)
at intellisemantic.servlet.solr.DWServlet.doPost(DWServlet.java:1113)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:709)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
at
org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:252)
at
org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
at
org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
at
org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178)
at
org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126)
at
org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
at
org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:107)
at
org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:148)
at
org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:869)
at
org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:664)
at
org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:527)
at
org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:80)
at
org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:684)
at java.lang.Thread.run(Unknown Source)
- Servlet.service() for servlet dw threw exception
java.lang.NullPointerException
at
intellisemantic.servlet.solr.DWServlet.sendPostCommand(DWServlet.java:1022)
at
intellisemantic.servlet.solr.DWServlet.processExecuteCommand(DWServlet.java:826)
at intellisemantic.servlet.solr.DWServlet.indexing(DWServlet.java:435)
at intellisemantic.servlet.solr.DWServlet.doGet(DWServlet.java:163)
at intellisemantic.servlet.solr.DWServlet.doPost(DWServlet.java:1113)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:709)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
at
org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:252)
at
org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
at
org.apache.catalina.core.StandardWra

Re: Does SOLR supports multiple instances within the same webapplication?

2007-11-13 Thread Walter Ferrara
take a look @:
http://lucene.apache.org/solr/version_control.html#Anonymous+Access+%28read-only%29
and http://www.apache.org/dev/version-control.html#anon-svn

You may want to use an IDE (eclipse/netbeans/...) to svn there (look for
"trunk" dir); this way you could easily download the trunk and compile
via build.xml maybe even without touching the keyboard.
You can apply the patch via the IDE or via shell with the classic patch
command from project main dir (the one that contains build.xml)

Walter

Dilip.TS wrote:
> Hi,
>
>  I am using SOLR 1.2 stable build currently , and this version does not have
> this feature (Support for Multiple SOLR cores).
>
>  How do I get this feature? I tried to open the .patch file but I could not
> understand.
>
>  I was looking into the link http://svn.apache.org/viewvc/lucene/solr/ for
> the solr1.3 version which supports this feature.
>
>  If possible could you please point me to the link/instruction where I can
> check out this feature or the latest source and build the SOLR binary.
>
>  Thanks in advance,
>  Dilip
>
>
> -Original Message-
> From: Ryan McKinley [mailto:[EMAIL PROTECTED]
> Sent: Monday, November 12, 2007 9:23 PM
> To: solr-user@lucene.apache.org
> Subject: Re: Does SOLR supports multiple instances within the same
> webapplication?
>
>
> Dilip.TS wrote:
>   
>> Hello,
>>
>>   Does SOLR supports multiple instances within the same web application?
>> 
> If
>   
>> so how is this achieved?
>>
>> 
>
> If you want multiple indices, you can run multiple web-apps.
>
> If you need multiple indices in the same web-app, check SOLR-350 -- it
> is still in development, and make sure you *really* need it before going
> that route.
>
> ryan
>
>
>   


Re: - I/O exception (java.net.BindException) caught when processing request: Address already in use: connect while updating

2007-11-13 Thread Ian Lea
Daniele


>From the stack trace it looks like the java.net.BindException is
happening when you are calling HttpClient code.  Searching google for
"java.net.BindException httpclient" finds various hits.  Maybe one of
those will help.  I don't think this has anything directly to do with
solr or tomcat.


--
Ian.




On Nov 13, 2007 9:16 AM, Daniele Salvatico <[EMAIL PROTECTED]> wrote:
>
>
> Hi all,
>
> i have a Solr instance running under tomcat 5.5. My application uses a java
> servlet to handle data. I have a problem with "add-update" process:  at the
> first run the add process is ok and very fast . At this first run i add
> around 2.000 docs. Then i commit and all is ok.
> The problem is that if i run the same the "add - update" process again,
> sometimes (not all the times) it fails with this error:
>
> 13-nov-2007 8.17.32 org.apache.solr.handler.XmlUpdateRequestHandler update
> INFO: added id={2524} in 15ms
> - I/O exception (java.net.BindException) caught when processing request:
> Address already in use: connect
> - Retrying request
> - I/O exception (java.net.BindException) caught when processing request:
> Address already in use: connect
> - Retrying request
> - I/O exception (java.net.BindException) caught when processing request:
> Address already in use: connect
> - Retrying request
> Fatal transport error: Address already in use: connect
> java.net.BindException: Address already in use: connect
> at java.net.PlainSocketImpl.socketConnect(Native Method)
> at java.net.PlainSocketImpl.doConnect(Unknown Source)
> at java.net.PlainSocketImpl.connectToAddress(Unknown Source)
> at java.net.PlainSocketImpl.connect(Unknown Source)
> at java.net.SocksSocketImpl.connect(Unknown Source)
> at java.net.Socket.connect(Unknown Source)
> at java.net.Socket.connect(Unknown Source)
> at java.net.Socket.(Unknown Source)
> at java.net.Socket.(Unknown Source)
> at
> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:79)
> at
> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:121)
> at
> org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:706)
> at
> org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:386)
> at
> org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:170)
> at
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:396)
> at
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:324)
> at
> intellisemantic.servlet.solr.DWServlet.sendPostCommand(DWServlet.java:985)
> at
> intellisemantic.servlet.solr.DWServlet.processExecuteCommand(DWServlet.java:826)
> at intellisemantic.servlet.solr.DWServlet.indexing(DWServlet.java:435)
> at intellisemantic.servlet.solr.DWServlet.doGet(DWServlet.java:163)
> at intellisemantic.servlet.solr.DWServlet.doPost(DWServlet.java:1113)
> at javax.servlet.http.HttpServlet.service(HttpServlet.java:709)
> at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
> at
> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:252)
> at
> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
> at
> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
> at
> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178)
> at
> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126)
> at
> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
> at
> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:107)
> at
> org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:148)
> at
> org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:869)
> at
> org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:664)
> at
> org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:527)
> at
> org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:80)
> at
> org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:684)
> at java.lang.Thread.run(Unknown Source)
> - Servlet.service() for servlet dw threw exception
> java.lang.NullPointerException
> at
> intellisemantic.servlet.solr.DWServlet.sendPostCommand(DWServlet.java:1022)
> at
> intellisemantic.servlet.solr.DWServlet.processExecuteCommand(DWServlet.java:826)
> at intellisemantic.servlet.solr.DWServlet.indexing(DWServlet.jav

Re: - I/O exception (java.net.BindException) caught when processing request: Address already in use: connect while updating

2007-11-13 Thread Daniele Salvatico

Thanks Ian,

I'll search for HttpClient related problem.


Daniele


Ian Lea wrote:
> 
> Daniele
> 
> 
> From the stack trace it looks like the java.net.BindException is
> happening when you are calling HttpClient code.  Searching google for
> "java.net.BindException httpclient" finds various hits.  Maybe one of
> those will help.  I don't think this has anything directly to do with
> solr or tomcat.
> 
> 
> --
> Ian.
> 
> 
> 
> 
> On Nov 13, 2007 9:16 AM, Daniele Salvatico <[EMAIL PROTECTED]>
> wrote:
>>
>>
>> Hi all,
>>
>> i have a Solr instance running under tomcat 5.5. My application uses a
>> java
>> servlet to handle data. I have a problem with "add-update" process:  at
>> the
>> first run the add process is ok and very fast . At this first run i add
>> around 2.000 docs. Then i commit and all is ok.
>> The problem is that if i run the same the "add - update" process again,
>> sometimes (not all the times) it fails with this error:
>>
>> 13-nov-2007 8.17.32 org.apache.solr.handler.XmlUpdateRequestHandler
>> update
>> INFO: added id={2524} in 15ms
>> - I/O exception (java.net.BindException) caught when processing request:
>> Address already in use: connect
>> - Retrying request
>> - I/O exception (java.net.BindException) caught when processing request:
>> Address already in use: connect
>> - Retrying request
>> - I/O exception (java.net.BindException) caught when processing request:
>> Address already in use: connect
>> - Retrying request
>> Fatal transport error: Address already in use: connect
>> java.net.BindException: Address already in use: connect
>> at java.net.PlainSocketImpl.socketConnect(Native Method)
>> at java.net.PlainSocketImpl.doConnect(Unknown Source)
>> at java.net.PlainSocketImpl.connectToAddress(Unknown Source)
>> at java.net.PlainSocketImpl.connect(Unknown Source)
>> at java.net.SocksSocketImpl.connect(Unknown Source)
>> at java.net.Socket.connect(Unknown Source)
>> at java.net.Socket.connect(Unknown Source)
>> at java.net.Socket.(Unknown Source)
>> at java.net.Socket.(Unknown Source)
>> at
>> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:79)
>> at
>> org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:121)
>> at
>> org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:706)
>> at
>> org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:386)
>> at
>> org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:170)
>> at
>> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:396)
>> at
>> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:324)
>> at
>> intellisemantic.servlet.solr.DWServlet.sendPostCommand(DWServlet.java:985)
>> at
>> intellisemantic.servlet.solr.DWServlet.processExecuteCommand(DWServlet.java:826)
>> at
>> intellisemantic.servlet.solr.DWServlet.indexing(DWServlet.java:435)
>> at
>> intellisemantic.servlet.solr.DWServlet.doGet(DWServlet.java:163)
>> at
>> intellisemantic.servlet.solr.DWServlet.doPost(DWServlet.java:1113)
>> at javax.servlet.http.HttpServlet.service(HttpServlet.java:709)
>> at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
>> at
>> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:252)
>> at
>> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
>> at
>> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
>> at
>> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178)
>> at
>> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126)
>> at
>> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
>> at
>> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:107)
>> at
>> org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:148)
>> at
>> org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:869)
>> at
>> org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:664)
>> at
>> org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:527)
>> at
>> org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:80)
>> at
>> org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:684)
>> at java.lang.Thread.run(Unknown Source)
>> - Servlet.service() for servlet dw threw exception
>> java.lang.NullPointerException
>> at
>> intellisemantic.se

solr internals

2007-11-13 Thread Dwarak R
Hi All

When a word document or excel is sent to solr, I can see my indexes are getting 
updated [via UpdateRichDocuments patch]. I wish to know what really happens 
behind the screen. Can anyone explain internals of solr on adding word 
documents ?. How do we fix fields for word doc which is 5 - 10 pages ?. Any 
advise will be great  

Regards

Dwarak R

This message is for the designated recipient only and may contain privileged, 
proprietary, or otherwise private information. If you have received it in 
error, please notify the sender&[EMAIL PROTECTED]  immediately and delete the 
original. Any other use of the email by you is prohibited.


Re: PriceJunkie.com using solr!

2007-11-13 Thread William Silva
Hi Mike,
I´m checking out www.pricejunkie.com and Í would like to know how do
you group the products and find the price range. Is it a batch process
? Are you using MoreLikeThis to do it ?
Thanks,
William.



>ahait is wonderful.

>2007/5/24, Mike Austin <[EMAIL PROTECTED]>:

>Just one.

>-Original Message-
>From: James liu [EMAIL PROTECTED]
>Sent: Wednesday, May 16, 2007 10:30 PM
>To: solr-user@lucene.apache.org
>Subject: Re: PriceJunkie.com using solr!


>how many solr instance?


>2007/5/17, Yonik Seeley <[EMAIL PROTECTED]>:
>
> Congrats, very nice job!
> It's fast too.
>
> -Yonik
>
> On 5/16/07, Mike Austin <[EMAIL PROTECTED]> wrote:
> > I just wanted to say thanks to everyone for the creation of
solr.  I've
> been
> > using it for a while now and I have recently brought one of my side
> projects
> > online.  I have several other projects that will be using solr for
it's
> > search and facets.
> >
> > Please check out www.pricejunkie.com and let us know what you think..
> You
> > can give feedback and/or sign up on the mailing list for future
updates.
> > The site is very basic right now and many new and useful features plus
> > merchants and product categories will be coming soon!  I thought it
> would be
> > a good idea to at least have a few people use it to get some feedback
> early
> > and often.
> >
> > Some of the nice things behind the scenes that we did with solr:
> > - created custom request handlers that have category to facet to
> attribute
> > caching built in
> > - category to facet management
> > - ability to manage facet groups (attributes within a set
facet)
> and assign
> > them to categories
> > - ability to create any category structure and share facet
> groups
> >
> > - facet inheritance for any category (a facet group can be defined on
a
> > parent category and pushed down to all children)
> > - ability to create sub-categories as facets instead of normal sub
> > categories
> > - simple xml configuration for the final outputted category
> configuration
> > file
> >
> >
> > I'm sure there are more cool things but that is all for now.  Join the
> > mailing list to see more improvements in the future.
> >
> > Also.. how do I get added to the Using Solr wiki page?
> >
> >
> > Thanks,
> > Mike Austin
>


Solr/bin script - Solaris bash version?

2007-11-13 Thread Jae Joo
Hi,

Is there Solaris bash based script available? The couple of command is not
working, and wondering any available scripts I can use before I update it.

For ex. snapshooter, snappuller, snapinstaller



Thanks,

Jae


snapshot files

2007-11-13 Thread Jae Joo
Hi,

I have successfully built generated the snalshot files but have a question.
Does each snapshot file has all files in the index directory?

Here is the file list in the index
_0.fdt_0.fnm_0.nrm_0.tii_1.fdt
_1.fnm_1.nrm_1.tiisegments.gen
_0.fdx_0.frq_0.prx_0.tis_1.fdx
_1.frq_1.prx_1.tissegments_3

And here is the file list of 2 snapshot files.

 snapshot.20071113094936
_0.fdt_0.fdx_0.fnm_0.frq_0.nrm
_0.prx_0.tii_0.tissegments.gen  segments_2


 snapshot.20071113095508
_0.fdt_0.fnm_0.nrm_0.tii_1.fdt
_1.fnm_1.nrm_1.tiisegments.gen
_0.fdx_0.frq_0.prx_0.tis_1.fdx
_1.frq_1.prx_1.tissegments_3

The later one have all files same as index directory.

I have changed the snapshooter script because the bash in solaris do not
have cp -l option.

#cp -lr ${data_dir}/index ${temp} --> original
mkdir ${temp}
ln  ${data_dir}/index/* ${temp}



Thanks,

Jae Joo


two solr instances - index and commit

2007-11-13 Thread Jae Joo
Hi,

I have two solr instance running under different tomcat environment.
One solr instance is for indexing and would like to commit to the other solr
instance.

This is what I tried, but failed.
using post.sh (without commit), the docs are  indexed in solr-1 instance.
After indexed,
call commit command with the attribute of solr-2.

Can any help me?

Jae


LuceneInAction.zip?

2007-11-13 Thread David Neubert
I purchased Lucene In Action (really great book by the way, one of the best 
technical books (if not the best) that I can ever read.

Its making me embarrassed about some of the questions I have already posted :)  

That said, here is another one -- I found LuceneInAction.zip on 
www.lucenebook.com and downloaded to get the examples in LIA.

Is there any copy of this upgraded to Lucene2.2.0 code?  While the conversion 
to 2.2.0 has been fairly obvious for all the cases I have looked at so far, 
there seems to be allot of use of deprecated methods in external libraries as 
well.

Just wondering if there was a short cut -- and an updated version of this zip 
that I have not located?

Very well done though -- this book and zip is extremely helpful -- even as I 
tried to get grounded in SOLR as well.

Thanks,

Dave




  

Never miss a thing.  Make Yahoo your home page. 
http://www.yahoo.com/r/hs

Multiple uniqueKey fields

2007-11-13 Thread Dave C .
Hello, I just started using Solr and have a question regarding unique keys.

Is it somehow possible to specify more than 1 unique key?
The
problem is, some of my indexed data has one particular unique field,
while other data does not have this field at all and uses another field
for unique-ness.
Any suggestions on how to overcome this?

Thanks,
David
_
Peek-a-boo FREE Tricks & Treats for You!
http://www.reallivemoms.com?ocid=TXT_TAGHM&loc=us

Re: LuceneInAction.zip?

2007-11-13 Thread Mike Klaas

On 13-Nov-07, at 9:08 AM, David Neubert wrote:

I purchased Lucene In Action (really great book by the way, one of  
the best technical books (if not the best) that I can ever read.


Its making me embarrassed about some of the questions I have  
already posted :)


That said, here is another one -- I found LuceneInAction.zip on  
www.lucenebook.com and downloaded to get the examples in LIA.


Is there any copy of this upgraded to Lucene2.2.0 code?  While the  
conversion to 2.2.0 has been fairly obvious for all the cases I  
have looked at so far, there seems to be allot of use of deprecated  
methods in external libraries as well.


I don't think so, but I suggest asking this question on java- 
[EMAIL PROTECTED], which has a much broader lucene-related  
audience.


-Mike


RE: Exception in SOLR when querying for fields of type string

2007-11-13 Thread Stu Hood
The first question is, what version of Solr are you using?

Thanks,
Stu


-Original Message-
From: Kasi Sankaralingam <[EMAIL PROTECTED]>
Sent: Tuesday, November 13, 2007 2:27pm
To: solr-user@lucene.apache.org 
Subject: Exception in SOLR when querying for fields of type string

Hi,

I am running into nullpointerexception on the SOLR side, when I do the 
following:

a)  Define a dynamic field in the schema of type string (say title_s)

b)  Do a query in the SOLR admin tool title_s: photo book

I get a null pointer exception when I run a search query on this.

If I enclose the search term within double quotes like "photo book", it works 
fine.

Any ideas?

Thanks,

kasi




Re: Faceting over limited result set

2007-11-13 Thread Mike Klaas

On 12-Nov-07, at 8:03 AM, Chris Hostetter wrote:



if what you are interested in is stats on the first N docs  
according to a
specific sort (score or otherwise) then you could write a custom  
request

handler that executed a search with a limit of N, got the DocList,
iterated over it to build a DocSet, and then used that DocSet to do
faceting ... but that would probably take even longer then just  
using the

full DocSet matching the entire query.


An implementation might look like:

DocList superlist;
int facetDocLimit = params.getInt(DMP.FACET_DOCLIMIT, -1);
if(facetDocLimit > 0 && facetDocLimit != req.getLimit()) {
  superlist = s.getDocList(query, restrictions,
   SolrPluginUtils.getSort(req),
   req.getStart(), facetDocLimit,
   flags);
  results.docSet = SearcherUtils.getDocSetFromDocList 
(superlist, s);

  results.docList = superlist.subset(0, req.getLimit());
} else {

Where getDocSetFromDocList() uses DocSetHitCollector to build a DocSet.

To answer the performance question: There is a gain to be had when  
doing lots of faceting on huge indices, if N is low (say, 500-1000).   
One problem with the implementation above is that it stymies the  
query caching in SolrIndexSearcher (since the generated DocList is >  
the cache upper bound).


-Mike


RE: Multiple uniqueKey fields

2007-11-13 Thread Will Johnson
key = sometimesUniqueField + "_" + otherTimesUniqueuField;

If one of them is always guaranteed to be unique you should be fine.

- will

-Original Message-
From: Dave C. [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, November 13, 2007 2:07 PM
To: solr-user@lucene.apache.org
Subject: Multiple uniqueKey fields

Hello, I just started using Solr and have a question regarding unique keys.

Is it somehow possible to specify more than 1 unique key?
The
problem is, some of my indexed data has one particular unique field,
while other data does not have this field at all and uses another field
for unique-ness.
Any suggestions on how to overcome this?

Thanks,
David
_
Peek-a-boo FREE Tricks & Treats for You!
http://www.reallivemoms.com?ocid=TXT_TAGHM&loc=us



Index/Query question

2007-11-13 Thread Marc Jones

Hello all,

I am in the process of learning Solr (and the underlying Lucene  
engine).  I have been unable to successfully implement the following  
scenario, so I was hoping someone on the list might have some insight.


- I want to index data values for different data types (fields) that  
are currently housed in a database.

- I want to search across the fields.
- I need (I think) to store the string representation (actual value)  
AND a tokenized/parsed representation -- which is where I fall down.


Example:

I have an email address ([EMAIL PROTECTED]), a database name  
(5_user_db), and a username (user).  Here is what I would like to  
see happen (dynamic search as keys are typed) ..


1.  I enter 'user' and I get all three above in the search result.  I  
can accomplish this today using the solr.WordDelimiterFilterFactory.
2.  I then add '@' so I have 'user@', which I want to return only the  
email address record.  However, this will not happen with  
solr.WordDelimiterFilterFactory, but I can get this result to work if  
I only index it as a string.  But, then my search results for 'user'  
will not return the database record.


Any thoughts?  Let me know if I need to clarify any points.

Thanks!
Marc



RE: Multiple uniqueKey fields

2007-11-13 Thread Dave C .
Thanks, I thought of this shortly after I posted the question!

- david



> From: [EMAIL PROTECTED]
> To: solr-user@lucene.apache.org
> Subject: RE: Multiple uniqueKey fields
> Date: Tue, 13 Nov 2007 15:37:46 -0500
> 
> key = sometimesUniqueField + "_" + otherTimesUniqueuField;
> 
> If one of them is always guaranteed to be unique you should be fine.
> 
> - will
> 
> -Original Message-
> From: Dave C. [mailto:[EMAIL PROTECTED] 
> Sent: Tuesday, November 13, 2007 2:07 PM
> To: solr-user@lucene.apache.org
> Subject: Multiple uniqueKey fields
> 
> Hello, I just started using Solr and have a question regarding unique keys.
> 
> Is it somehow possible to specify more than 1 unique key?
> The
> problem is, some of my indexed data has one particular unique field,
> while other data does not have this field at all and uses another field
> for unique-ness.
> Any suggestions on how to overcome this?
> 
> Thanks,
> David
> _
> Peek-a-boo FREE Tricks & Treats for You!
> http://www.reallivemoms.com?ocid=TXT_TAGHM&loc=us
> 

_
Help yourself to FREE treats served up daily at the Messenger Café. Stop by 
today.
http://www.cafemessenger.com/info/info_sweetstuff2.html?ocid=TXT_TAGLM_OctWLtagline

Solr java tutorial

2007-11-13 Thread Dave C .
Hi everyone,

My next question is: is there a beginner's tutorial on how to programmatically 
query solr?
I have seen some users' do things like:


SolrServer server = getSolrServer();

SolrQuery
solrQuery = new SolrQuery();

   
solrQuery.setQuery(..);

   
QueryResponse rsp = server.query(solrQuery);

But is there some tutorial that outlines the basic classes to use, etc?
I know there is a javadoc but it's not helpful if I don't know what to use.

Thanks a lot,
David

_
Help yourself to FREE treats served up daily at the Messenger Café. Stop by 
today.
http://www.cafemessenger.com/info/info_sweetstuff2.html?ocid=TXT_TAGLM_OctWLtagline

RE: Exception in SOLR when querying for fields of type string

2007-11-13 Thread Kasi Sankaralingam
I am using the latest version of solr (1.2)

-Original Message-
From: Stu Hood [mailto:[EMAIL PROTECTED]
Sent: Tuesday, November 13, 2007 11:32 AM
To: solr-user@lucene.apache.org
Subject: RE: Exception in SOLR when querying for fields of type string

The first question is, what version of Solr are you using?

Thanks,
Stu


-Original Message-
From: Kasi Sankaralingam <[EMAIL PROTECTED]>
Sent: Tuesday, November 13, 2007 2:27pm
To: solr-user@lucene.apache.org 
Subject: Exception in SOLR when querying for fields of type string

Hi,

I am running into nullpointerexception on the SOLR side, when I do the 
following:

a)  Define a dynamic field in the schema of type string (say title_s)

b)  Do a query in the SOLR admin tool title_s: photo book

I get a null pointer exception when I run a search query on this.

If I enclose the search term within double quotes like "photo book", it works 
fine.

Any ideas?

Thanks,

kasi




Re: Solr java tutorial

2007-11-13 Thread Ryan McKinley


Not yet, but there should be!

Currently people learn it from looking at the source and tests.  I 
started to add something to:


 http://wiki.apache.org/solr/Solrj

it (obviously) still needs work.

If you are using eclipse (or similar), after typing "solrQuery." you 
should get a drop down of all the available options - i find this 
invaluable for learning what is possible.


ryan



Dave C. wrote:

Hi everyone,

My next question is: is there a beginner's tutorial on how to programmatically 
query solr?
I have seen some users' do things like:


SolrServer server = getSolrServer();

SolrQuery
solrQuery = new SolrQuery();

   
solrQuery.setQuery(..);


   
QueryResponse rsp = server.query(solrQuery);


But is there some tutorial that outlines the basic classes to use, etc?
I know there is a javadoc but it's not helpful if I don't know what to use.

Thanks a lot,
David

_
Help yourself to FREE treats served up daily at the Messenger Café. Stop by 
today.
http://www.cafemessenger.com/info/info_sweetstuff2.html?ocid=TXT_TAGLM_OctWLtagline




Re: Exception in SOLR when querying for fields of type string

2007-11-13 Thread Ryan McKinley


Can you post the full exception?



b) Do a query in the SOLR admin tool title_s: photo book



Do you have a space after the ':'?

 q=title_s: photo book

I expect that would fail (though null pointer is not a very nice error)

 q=title_s:"photo book"

should work fine:

 title_s:photo book

would match "photo" for field 'title_s' and "book" for the default field.


ryan


Re: Multiple uniqueKey fields

2007-11-13 Thread Walter Underwood
I had a similar problem with three sources of keys that have collisions
between the values. I prefix a single letter for each source.

movies: M12345
people: P12345

and so on.

wunder

On 11/13/07 12:37 PM, "Will Johnson" <[EMAIL PROTECTED]> wrote:

> key = sometimesUniqueField + "_" + otherTimesUniqueuField;
> 
> If one of them is always guaranteed to be unique you should be fine.
> 
> - will
> 
> -Original Message-
> From: Dave C. [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, November 13, 2007 2:07 PM
> To: solr-user@lucene.apache.org
> Subject: Multiple uniqueKey fields
> 
> Hello, I just started using Solr and have a question regarding unique keys.
> 
> Is it somehow possible to specify more than 1 unique key?
> The
> problem is, some of my indexed data has one particular unique field,
> while other data does not have this field at all and uses another field
> for unique-ness.
> Any suggestions on how to overcome this?
> 
> Thanks,
> David
> _
> Peek-a-boo FREE Tricks & Treats for You!
> http://www.reallivemoms.com?ocid=TXT_TAGHM&loc=us
> 



RE: Exception in SOLR when querying for fields of type string

2007-11-13 Thread Kasi Sankaralingam
It is not tokenized, it is a string field, so will it still match
"photo" for field 'title_s' and "book" for the default field?


Here is the exception stack trace:
HTTP Status 500 - null java.lang.NullPointerException at 
org.apache.solr.search.SolrQueryParser.getFieldQuery(SolrQueryParser.java:73) 
at org.apache.lucene.queryParser.QueryParser.Term(QueryParser.java:1092) at 
org.apache.lucene.queryParser.QueryParser.Clause(QueryParser.java:979) at 
org.apache.lucene.queryParser.QueryParser.Query(QueryParser.java:907) at 
org.apache.lucene.queryParser.QueryParser.TopLevelQuery(QueryParser.java:896) 
at org.apache.lucene.queryParser.QueryParser.parse(QueryParser.java:146) at 
org.apache.solr.search.QueryParsing.parseQuery(QueryParsing.java:94) at 
org.apache.solr.request.StandardRequestHandler.handleRequestBody(StandardRequestHandler.java:109)
 at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:77)
 at org.apache.solr.core.SolrCore.execute(SolrCore.java:658) at 
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:191) 
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:159)
 at 
org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:202)
 at 
org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
 at 
org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
 at 
org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178)
 at 
org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126) 
at 
org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105) 
at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:541) at 
org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:107)
 at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:148) 
at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:869) 
at 
org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:664)
 at 
org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:527)
 at 
org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:80)
 at 
org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:684)
 at java.lang.Thread.run(Thread.java:595)


-Original Message-
From: Ryan McKinley [mailto:[EMAIL PROTECTED]
Sent: Tuesday, November 13, 2007 2:26 PM
To: solr-user@lucene.apache.org
Subject: Re: Exception in SOLR when querying for fields of type string


Can you post the full exception?

>
> b) Do a query in the SOLR admin tool title_s: photo book
>

Do you have a space after the ':'?

  q=title_s: photo book

I expect that would fail (though null pointer is not a very nice error)

  q=title_s:"photo book"

should work fine:

  title_s:photo book

would match "photo" for field 'title_s' and "book" for the default field.


ryan


RE: Solr java tutorial

2007-11-13 Thread Dave C .
I'm really confused...is SolrJ coming in Solr 1.3 (which hasn't been released 
yet)?
It doesn't seem to be found in Solr 1.2 which I am using.

please enlighten..

- David


> Date: Tue, 13 Nov 2007 17:21:13 -0500
> From: [EMAIL PROTECTED]
> To: solr-user@lucene.apache.org
> Subject: Re: Solr java tutorial
> 
> 
> Not yet, but there should be!
> 
> Currently people learn it from looking at the source and tests.  I 
> started to add something to:
> 
>   http://wiki.apache.org/solr/Solrj
> 
> it (obviously) still needs work.
> 
> If you are using eclipse (or similar), after typing "solrQuery." you 
> should get a drop down of all the available options - i find this 
> invaluable for learning what is possible.
> 
> ryan
> 
> 
> 
> Dave C. wrote:
> > Hi everyone,
> > 
> > My next question is: is there a beginner's tutorial on how to 
> > programmatically query solr?
> > I have seen some users' do things like:
> > 
> > 
> > SolrServer server = getSolrServer();
> > 
> > SolrQuery
> > solrQuery = new SolrQuery();
> > 
> >
> > solrQuery.setQuery(..);
> > 
> >
> > QueryResponse rsp = server.query(solrQuery);
> > 
> > But is there some tutorial that outlines the basic classes to use, etc?
> > I know there is a javadoc but it's not helpful if I don't know what to use.
> > 
> > Thanks a lot,
> > David
> > 
> > _
> > Help yourself to FREE treats served up daily at the Messenger Café. Stop by 
> > today.
> > http://www.cafemessenger.com/info/info_sweetstuff2.html?ocid=TXT_TAGLM_OctWLtagline
> 

_
Windows Live Hotmail and Microsoft Office Outlook – together at last.  Get it 
now.
http://office.microsoft.com/en-us/outlook/HA102225181033.aspx?pid=CL100626971033

Re: Solr java tutorial

2007-11-13 Thread Ryan McKinley

Dave C. wrote:

I'm really confused...is SolrJ coming in Solr 1.3 (which hasn't been released 
yet)?
It doesn't seem to be found in Solr 1.2 which I am using.

please enlighten..



correct. solrj is in solr-1.3-dev -- to use it you need to use a nightly 
build.


It has not been officially released yet.  While it is quite stable, the 
interfaces are still subject to change until the final release.


ryan


RE: Solr java tutorial

2007-11-13 Thread Dave C .
Ah...
:(

Is there a timeline for the 1.3 release?

- david


> Date: Tue, 13 Nov 2007 18:33:01 -0500
> From: [EMAIL PROTECTED]
> To: solr-user@lucene.apache.org
> Subject: Re: Solr java tutorial
> 
> Dave C. wrote:
> > I'm really confused...is SolrJ coming in Solr 1.3 (which hasn't been 
> > released yet)?
> > It doesn't seem to be found in Solr 1.2 which I am using.
> > 
> > please enlighten..
> > 
> 
> correct. solrj is in solr-1.3-dev -- to use it you need to use a nightly 
> build.
> 
> It has not been officially released yet.  While it is quite stable, the 
> interfaces are still subject to change until the final release.
> 
> ryan

_
Boo! Scare away worms, viruses and so much more! Try Windows Live OneCare!
http://onecare.live.com/standard/en-us/purchase/trial.aspx?s_cid=wl_hotmailnews

RE: two solr instances - index and commit

2007-11-13 Thread Kasi Sankaralingam
This works, the only thing you need to be aware of is the NFS problem if you are
running in a distributed environment sharing a NFS partition.

a) Index and commit on instance (Typically partitioned as an index server)
b) Issue a commit on the search server (like a read only mode)

Things to watch out for, you will get stale NFS problem, I replaced lucene core
that is shipped with solr to the latest one and it works.

-Original Message-
From: Jae Joo [mailto:[EMAIL PROTECTED]
Sent: Tuesday, November 13, 2007 9:06 AM
To: solr-user
Subject: two solr instances - index and commit

Hi,

I have two solr instance running under different tomcat environment.
One solr instance is for indexing and would like to commit to the other solr
instance.

This is what I tried, but failed.
using post.sh (without commit), the docs are  indexed in solr-1 instance.
After indexed,
call commit command with the attribute of solr-2.

Can any help me?

Jae


Re: Faceting over limited result set

2007-11-13 Thread Pieter Berkel
On Nov 14, 2007 6:44 AM, Mike Klaas <[EMAIL PROTECTED]> wrote:
>
> An implementation might look like:
>
>  DocList superlist;
>  int facetDocLimit = params.getInt(DMP.FACET_DOCLIMIT, -1);
>  if(facetDocLimit > 0 && facetDocLimit != req.getLimit()) {
>superlist = s.getDocList(query, restrictions,
> SolrPluginUtils.getSort(req),
> req.getStart(), facetDocLimit,
> flags);
>results.docSet = SearcherUtils.getDocSetFromDocList
> (superlist, s);
>results.docList = superlist.subset(0, req.getLimit());
>  } else {
>
> Where getDocSetFromDocList() uses DocSetHitCollector to build a DocSet.
>
> To answer the performance question: There is a gain to be had when
> doing lots of faceting on huge indices, if N is low (say, 500-1000).
> One problem with the implementation above is that it stymies the
> query caching in SolrIndexSearcher (since the generated DocList is >
> the cache upper bound).
>
> -Mike

Thanks Mike, that looks like a good place to start.  While I really
can't think of any practical use for limiting the size of DocSet other
than simple faceting, the new search component architecture make it a
little more difficult to confine any implementation to only the facet
component (unless there is an efficient way to obtain a subset of a
DocSet, which there doesn't seem to be).  I'm also aware of the query
caching issues arising from SolrIndexSearcher however if N is
sufficiently low this (hopefully) shouldn't be too much of a problem.

I can't find either the SearcherUtils class nor any reference to a
getDocSetFromDocList() method in svn trunk, is this deprecated or
custom-build code?

-Piete


Re: Solr java tutorial

2007-11-13 Thread Mike Klaas
Not really--there have been a few threads on this topic recently.   
Perhaps in a couple months?


It may depend on the timing of the lucene release.

-MIke

On 13-Nov-07, at 3:41 PM, Dave C. wrote:


Ah...
:(

Is there a timeline for the 1.3 release?

- david



Date: Tue, 13 Nov 2007 18:33:01 -0500
From: [EMAIL PROTECTED]
To: solr-user@lucene.apache.org
Subject: Re: Solr java tutorial

Dave C. wrote:
I'm really confused...is SolrJ coming in Solr 1.3 (which hasn't  
been released yet)?

It doesn't seem to be found in Solr 1.2 which I am using.

please enlighten..



correct. solrj is in solr-1.3-dev -- to use it you need to use a  
nightly

build.

It has not been officially released yet.  While it is quite  
stable, the

interfaces are still subject to change until the final release.

ryan


_
Boo! Scare away worms, viruses and so much more! Try Windows Live  
OneCare!
http://onecare.live.com/standard/en-us/purchase/trial.aspx? 
s_cid=wl_hotmailnews




Re: Exception in SOLR when querying for fields of type string

2007-11-13 Thread Yonik Seeley
On Nov 13, 2007 6:23 PM, Kasi Sankaralingam <[EMAIL PROTECTED]> wrote:
> It is not tokenized, it is a string field, so will it still match
> "photo" for field 'title_s' and "book" for the default field?

Yes, because the query parser splits up things by whitespace before
analyzers are even applied.
Do you have a default field defined?

-Yonik


Embedded Solr

2007-11-13 Thread Dave C .
Hello again,

This is a horribly newbie question, but what exactly is meant by "embedded 
solr"?

Thanks a lot,
David

_
Boo! Scare away worms, viruses and so much more! Try Windows Live OneCare!
http://onecare.live.com/standard/en-us/purchase/trial.aspx?s_cid=wl_hotmailnews

Re: Embedded Solr

2007-11-13 Thread Ryan McKinley
Standard solr is a .war file that you install on your system and run 
within a servlet container (jetty, resin, tomcat, etc)


"embedded" solr refers to running solr without the servlet container.

ryan


Dave C. wrote:

Hello again,

This is a horribly newbie question, but what exactly is meant by "embedded 
solr"?

Thanks a lot,
David

_
Boo! Scare away worms, viruses and so much more! Try Windows Live OneCare!
http://onecare.live.com/standard/en-us/purchase/trial.aspx?s_cid=wl_hotmailnews




Re: Query and heap Size

2007-11-13 Thread Mike Klaas

On 12-Nov-07, at 6:02 AM, Jae Joo wrote:

In my system, the heap size (old generation) keeps growing up  
caused by

heavy traffic.
I have adjusted the size of young generation, but it does not work  
well.


Does anyone have any recommendation regarding this issue? - Solr
configuration and/or web.xml ...etc...


Perhaps check your cache statistics on the admin gui. Is it possible  
that you have set the capacity high and they are just filling up?


Another thing to look out for is if you tend to sort on many  
different fields, but rarely.


-Mike


Re: Faceting over limited result set

2007-11-13 Thread Mike Klaas

On 13-Nov-07, at 4:44 PM, Pieter Berkel wrote:


On Nov 14, 2007 6:44 AM, Mike Klaas <[EMAIL PROTECTED]> wrote:

Thanks Mike, that looks like a good place to start.  While I really
can't think of any practical use for limiting the size of DocSet other
than simple faceting, the new search component architecture make it a
little more difficult to confine any implementation to only the facet
component (unless there is an efficient way to obtain a subset of a
DocSet, which there doesn't seem to be).


DocSets (so far) are unordered so I don't see how that would work.


I'm also aware of the query
caching issues arising from SolrIndexSearcher however if N is
sufficiently low this (hopefully) shouldn't be too much of a problem.

I can't find either the SearcherUtils class nor any reference to a
getDocSetFromDocList() method in svn trunk, is this deprecated or
custom-build code?


Custom.  It is a handful of lines that just passes the docs from a  
DocIterator to DocSetHitCollector.


-Mike


solr - other document formats

2007-11-13 Thread Dwarak R
Hey All

I read an article on http://www.xml.com/lpt/a/1668

Its states that 

"As we've seen, the XML format used by Solr for indexing is quite simple. 
Extracting the relevant metadata to create these XML documents from the many 
formats floating around, however, is another story. Fortunately, Lucene users 
have the same problem and have been working on it for quite a while; the Lucene 
FAQ lists a number of references to parsers and filters which can be used to 
extract content and metadata from many common document formats. 
Solr won't index spreadsheets or other formats out of the box, but that is not 
its role: you should see Solr as the "search engine" component of a broader 
"search system," where extraction of content and metadata is handled by other 
components. This will help to keep your search system maintainable and 
testable, and it helps the Solr team focus on doing one thing well."

Parsing documents like pdf, ms word document, excel to xml will be done other 
component ? 

Somebody advise 

Regards

Dwarak R

This message is for the designated recipient only and may contain privileged, 
proprietary, or otherwise private information. If you have received it in 
error, please notify the sender&[EMAIL PROTECTED]  immediately and delete the 
original. Any other use of the email by you is prohibited.