Re: Can find solr jars in WEB-INF/lib directory

2009-03-04 Thread Erik Hatcher


On Mar 4, 2009, at 12:47 AM, The Flight Captain wrote:
Thanks for the DirectSolrConnection tip, that eliminated the need  
for parsing
the XML as a string, and all that code need to instantiate a  
container. I am
now Using SolrCore.getSolrCore(). This method is deprecated though,  
what is
the standard way of instaniating a SoreCore to pass it as a  
parameter to

DirectSolrConnection?


The DirectSolrConnection( String instanceDir, String dataDir, String  
loggingPath ) constructor has this code:


  CoreContainer cores = new CoreContainer(new  
SolrResourceLoader(instanceDir));
  SolrConfig solrConfig = new SolrConfig(instanceDir,  
SolrConfig.DEFAULT_CONF_FILE, null);
  CoreDescriptor dcore = new CoreDescriptor(cores, "",  
solrConfig.getResourceLoader().getInstanceDir());
  IndexSchema indexSchema = new IndexSchema(solrConfig,  
instanceDir+"/conf/schema.xml", null);
  core = new SolrCore( null, dataDir, solrConfig, indexSchema,  
dcore);


Erik



Re: Search with Date field

2009-03-04 Thread Marc Sturlese

Normally you will need to filer by dates... you can do it in two ways...
http://fq=date_field[NOW-7DAY TO NOW]
http://...fq=date_field[2007-01-01T21:21:21 TO 2009-01-01T13:13:13]

I strongly recomend you to use /DAY in the first example. The search will be
faster as it is omitting hour-min-sec.
http://fq=date_field[NOW/DAY-7DAY TO NOW/DAY]

Is this what you where asking for?



solruser wrote:
> 
> Can you reply with example of solr request?
> 
> Amit
> 
> On Mar 3, 2009, at 11:08 PM, dabboo  wrote:
> 
>>
>> Hi,
>>
>> Date search is not happening in solr. I am giving the field name and  
>> date
>> value as the request but it is not returning any value.
>>
>> Please suggest.
>>
>> Thanks,
>> Amit Garg
>> -- 
>> View this message in context:
>> http://www.nabble.com/Search-with-Date-field-tp22324779p22324779.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Search-with-Date-field-tp22324779p22326871.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Very long commit time.

2009-03-04 Thread Jérôme Etévé
Great,

  It went down to less than 10 secs now :)
What I don't really understand is that my autowarmCount were pretty
low ( like 128 ) and still the autowarming of the caches were very
slow.

Can you explain more why it can be that slow ?

Cheers !

Jerome.

On Tue, Mar 3, 2009 at 8:00 PM, Yonik Seeley  wrote:
> Looks like cache autowarming.
> If you have statically defined warming queries in solrconfig.xml, you
> could try setting autowarmCount=0 for all the caches.
>
> -Yonik
> http://www.lucidimagination.com
>
>
> On Tue, Mar 3, 2009 at 2:37 PM, Jérôme Etévé  wrote:
>> Dear solr fans,
>>
>>  I have a solr index of roughly 8M docs and I have here a little
>> problem when I commit some insertion into it.
>>
>>  The insert itself is very fast, but my commit takes 163 seconds.
>>
>>  Here's the solr trace the commit leaves:
>>
>>  INFO: start commit(optimize=false,waitFlush=false,waitSearcher=true)
>>  03-Mar-2009 20:20:35 org.apache.solr.search.SolrIndexSearcher 
>> INFO: Opening searc...@7de212f9 main
>> 03-Mar-2009 20:20:35 org.apache.solr.update.DirectUpdateHandler2 commit
>> INFO: end_commit_flush
>> 03-Mar-2009 20:20:35 org.apache.solr.search.SolrIndexSearcher warm
>> INFO: autowarming searc...@7de212f9 main from searc...@732d8b11 main
>>
>> filterCache{lookups=0,hits=0,hitratio=0.00,inserts=16,evictions=0,size=16,warmupTime=71641,cumulative_lookups=90,cumulative_hits=68,cumulative_hitratio=0.75,cumulative_inserts=22,cumulative_evictions=0}
>> 03-Mar-2009 20:21:52 org.apache.solr.search.SolrIndexSearcher warm
>> INFO: autowarming result for searc...@7de212f9 main
>>
>> filterCache{lookups=0,hits=0,hitratio=0.00,inserts=16,evictions=0,size=16,warmupTime=76905,cumulative_lookups=90,cumulative_hits=68,cumulative_hitratio=0.75,cumulative_inserts=22,cumulative_evictions=0}
>> 03-Mar-2009 20:21:52 org.apache.solr.search.SolrIndexSearcher warm
>> INFO: autowarming searc...@7de212f9 main from searc...@732d8b11 main
>>
>> queryResultCache{lookups=24,hits=24,hitratio=1.00,inserts=32,evictions=0,size=32,warmupTime=82406,cumulative_lookups=6310,cumulative_hits=268,cumulative_hitratio=0.04,cumulative_inserts=6041,cumulative_evictions=5522}
>> 03-Mar-2009 20:23:17 org.apache.solr.search.SolrIndexSearcher warm
>> INFO: autowarming result for searc...@7de212f9 main
>>
>> queryResultCache{lookups=0,hits=0,hitratio=0.00,inserts=32,evictions=0,size=32,warmupTime=85591,cumulative_lookups=6310,cumulative_hits=268,cumulative_hitratio=0.04,cumulative_inserts=6041,cumulative_evictions=5522}
>> 03-Mar-2009 20:23:17 org.apache.solr.search.SolrIndexSearcher warm
>> INFO: autowarming searc...@7de212f9 main from searc...@732d8b11 main
>>
>> documentCache{lookups=720,hits=710,hitratio=0.98,inserts=40,evictions=0,size=40,warmupTime=0,cumulative_lookups=415308,cumulative_hits=283661,cumulative_hitratio=0.68,cumulative_inserts=131647,cumulative_evictions=131105}
>> 03-Mar-2009 20:23:17 org.apache.solr.search.SolrIndexSearcher warm
>> INFO: autowarming result for searc...@7de212f9 main
>>
>> documentCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=415308,cumulative_hits=283661,cumulative_hitratio=0.68,cumulative_inserts=131647,cumulative_evictions=131105}
>> 03-Mar-2009 20:23:17 org.apache.solr.core.QuerySenderListener newSearcher
>> INFO: QuerySenderListener sending requests to searc...@7de212f9 main
>>
>> // Then the few warm up queries defined in solrconfig.xml
>>
>> INFO: Closing searc...@732d8b11 main
>>
>> filterCache{lookups=0,hits=0,hitratio=0.00,inserts=16,evictions=0,size=16,warmupTime=71641,cumulative_lookups=90,cumulative_hits=68,cumulative_hitratio=0.75,cumulative_inserts=22,cumulative_evictions=0}
>>
>> queryResultCache{lookups=24,hits=24,hitratio=1.00,inserts=32,evictions=0,size=32,warmupTime=82406,cumulative_lookups=6310,cumulative_hits=268,cumulative_hitratio=0.04,cumulative_inserts=6041,cumulative_evictions=5522}
>>
>> documentCache{lookups=720,hits=710,hitratio=0.98,inserts=40,evictions=0,size=40,warmupTime=0,cumulative_lookups=415308,cumulative_hits=283661,cumulative_hitratio=0.68,cumulative_inserts=131647,cumulative_evictions=131105}
>> 03-Mar-2009 20:23:18 org.apache.solr.update.processor.LogUpdateProcessor 
>> finish
>> INFO: {commit=} 0 163189
>> 03-Mar-2009 20:23:18 org.apache.solr.core.SolrCore execute
>> INFO: [jobs] webapp=/cjsolr path=/update/ params={} status=0 QTime=163189
>>
>>
>> I'm sure I'm doing something wrong. Does this 163 seconds commit time
>> have to do with the commit parameters :
>> (optimize=false,waitFlush=false,waitSearcher=true)  ??
>>
>> Thanks for any help.
>>
>> Cheers !!
>>
>> Jerome.
>>
>> --
>> Jerome Eteve.
>>
>> Chat with me live at http://www.eteve.net
>>
>> jer...@eteve.net
>>
>



-- 
Jerome Eteve.

Chat with me live at http://www.eteve.net

jer...@eteve.net


Re: datadir issue for multicore on latest trunk

2009-03-04 Thread Walter Ferrara
it also ignore dataDir directive in solr.xml, in fact adding:



doesn't change the behavior.

this seems a bug introduced somewhere after 2nd february
any clue?


On Tue, Mar 3, 2009 at 5:56 PM, Walter Ferrara  wrote:

> there is a strange behavior which seems to affect hudson today (March 3rd)
> build but not (for example) hudson February 2th build.
> Basically when I start the multicore enviroment, it just create datadir in
> the current path.
> To replicate:
> 1. download latest trunk
> 2. go to example directory
>
> $ ls
> README.txt  example-DIH  exampledocs  logs   solr  start.jar
> work
> etc exampleAnalysis  lib  multicore  start.bat  webapps
>
> $ java -Dsolr.solr.home=multicore -jar start.jar
> (then kill/sleep the process)
>
> $ ls
> README.txt  etc  exampledocs  multicore  start.jar
> core0   example-DIH  lib  solr  webapps
> core1   exampleAnalysis  logs start.bat  work
>
> you see core0 and core1 directory where they should not be :-); solr-1041
> doesn't fix in this case.
>
> ciao,
> Walter
>
>


Re: datadir issue for multicore on latest trunk

2009-03-04 Thread Noble Paul നോബിള്‍ नोब्ळ्
>
is not honored automatically. the  tag has to be present in
solrconfig.xml and it should use this value.

but you can specify it as follows


then it should be fine.

can you just paste the log messages as  solr starts
--Noble


On Wed, Mar 4, 2009 at 4:15 PM, Walter Ferrara  wrote:
> it also ignore dataDir directive in solr.xml, in fact adding:
>    
>        
>    
> doesn't change the behavior.
>
> this seems a bug introduced somewhere after 2nd february
> any clue?
>
>
> On Tue, Mar 3, 2009 at 5:56 PM, Walter Ferrara  wrote:
>
>> there is a strange behavior which seems to affect hudson today (March 3rd)
>> build but not (for example) hudson February 2th build.
>> Basically when I start the multicore enviroment, it just create datadir in
>> the current path.
>> To replicate:
>> 1. download latest trunk
>> 2. go to example directory
>>
>> $ ls
>> README.txt  example-DIH      exampledocs  logs       solr      start.jar
>> work
>> etc         exampleAnalysis  lib          multicore  start.bat  webapps
>>
>> $ java -Dsolr.solr.home=multicore -jar start.jar
>> (then kill/sleep the process)
>>
>> $ ls
>> README.txt  etc              exampledocs  multicore  start.jar
>> core0       example-DIH      lib          solr      webapps
>> core1       exampleAnalysis  logs         start.bat  work
>>
>> you see core0 and core1 directory where they should not be :-); solr-1041
>> doesn't fix in this case.
>>
>> ciao,
>> Walter
>>
>>
>



-- 
--Noble Paul


Custom Request Handler

2009-03-04 Thread dabboo

Hi,

Can someone please tell me if we can write our own Custom Request Handler
and Custom Solr Core?

I want to call execute method of custom solrcore class instead of SolrCore
class.

Actually, I have to modify getQParser method of SolrCore class, so that it
shud start returning my custom plugin.

Please suggest.

Thanks,
Amit Garg

-- 
View this message in context: 
http://www.nabble.com/Custom-Request-Handler-tp22328365p22328365.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: datadir issue for multicore on latest trunk

2009-03-04 Thread Walter Ferrara
using:
  


  
doesn't work either

here the output:

2009-03-04 12:50:54.890::INFO:  Logging to STDERR via
org.mortbay.log.StdErrLog
2009-03-04 12:50:54.968::INFO:  jetty-6.1.3
4-mar-2009 12.50.55 org.apache.solr.servlet.SolrDispatchFilter init
INFO: SolrDispatchFilter.init()
4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
locateInstanceDir
INFO: JNDI not configured for solr (NoInitialContextEx)
4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
locateInstanceDir
INFO: using system property solr.solr.home: multicore
4-mar-2009 12.50.55 org.apache.solr.core.CoreContainer$Initializer
initialize
INFO: looking for solr.xml:
d:\DEV\apache-solr-2009-03-03_08-06-53\example\multicore\solr.xml
4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
INFO: Solr home set to 'multicore/'
4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
createClassLoader
INFO: Reusing parent classloader
4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
INFO: Solr home set to 'multicore\core0/'
4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
createClassLoader
INFO: Reusing parent classloader
4-mar-2009 12.50.55 org.apache.solr.core.SolrConfig 
INFO: Loaded SolrConfig: solrconfig.xml
4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
INFO: Reading Solr Schema
4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
INFO: Schema name=example core zero
4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created string: org.apache.solr.schema.StrField
4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
INFO: default search field is name
4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
INFO: query parser default operator is OR
4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
INFO: unique key field: id
4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
INFO: [core0] Opening new SolrCore at multicore\core0/,
dataDir=core0//multicore/core0/
4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
INFO: JMX monitoring not detected for core: core0
4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
INFO: [core0] Searching for listeners: //listen...@event="firstSearcher"]
4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
INFO: [core0] Searching for listeners: //listen...@event="newSearcher"]
4-mar-2009 12.50.55 org.apache.solr.core.SolrCore initIndex
AVVERTENZA: [core0] Solr index directory 'core0\multicore\core0\index'
doesn't exist. Creating new index...
4-mar-2009 12.50.55 org.apache.solr.update.SolrIndexWriter getDirectory
AVVERTENZA: No lockType configured for core0//multicore/core0/index/
assuming 'simple'
4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created standard: org.apache.solr.handler.StandardRequestHandler
4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created /update: org.apache.solr.handler.XmlUpdateRequestHandler
4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
INFO: created /admin/: org.apache.solr.handler.admin.AdminHandlers
4-mar-2009 12.50.55 org.apache.solr.search.SolrIndexSearcher 
INFO: Opening searc...@1e57e8f main
4-mar-2009 12.50.55
org.apache.solr.update.DirectUpdateHandler2$CommitTracker 
INFO: AutoCommit: disabled
4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
INFO: Adding
component:org.apache.solr.handler.component.querycompon...@19a32e0
4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
INFO: Adding
component:org.apache.solr.handler.component.facetcompon...@8238f4
4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
INFO: Adding
component:org.apache.solr.handler.component.morelikethiscompon...@16925b0
4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
INFO: Adding
component:org.apache.solr.handler.component.highlightcompon...@297ffb
4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
INFO: Adding
component:org.apache.solr.handler.component.statscompon...@914f6a
4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
INFO: Adding  debug
component:org.apache.solr.handler.component.debugcompon...@1f4cbee
4-mar-2009 12.50.55 org.apache.solr.core.CoreContainer register
INFO: registering core: core0
4-mar-2009 12.50.55 org.apache.solr.core.SolrCore registerSearcher
INFO: [core0] Registered new searcher searc...@1e57e8f main
4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
INFO: Solr home set to 'multicore\core1/'
4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
createClassLoader
INFO: Reusing parent classloader
4-mar-2009 12.50.55 org.apache.solr.core.SolrConfig 
INFO: Loaded SolrConfig: solrconfig.xml
4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
INFO: Reading Solr Schema
4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
INFO: Schema name=example cor

Re: datadir issue for multicore on latest trunk

2009-03-04 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Wed, Mar 4, 2009 at 5:24 PM, Walter Ferrara  wrote:
> using:
>  
>    
>    
>  
> doesn't work either

dataDir="/multicore/core0" means the path is absolute.
where did it create?

>
> here the output:
>
> 2009-03-04 12:50:54.890::INFO:  Logging to STDERR via
> org.mortbay.log.StdErrLog
> 2009-03-04 12:50:54.968::INFO:  jetty-6.1.3
> 4-mar-2009 12.50.55 org.apache.solr.servlet.SolrDispatchFilter init
> INFO: SolrDispatchFilter.init()
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> locateInstanceDir
> INFO: JNDI not configured for solr (NoInitialContextEx)
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> locateInstanceDir
> INFO: using system property solr.solr.home: multicore
> 4-mar-2009 12.50.55 org.apache.solr.core.CoreContainer$Initializer
> initialize
> INFO: looking for solr.xml:
> d:\DEV\apache-solr-2009-03-03_08-06-53\example\multicore\solr.xml
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
> INFO: Solr home set to 'multicore/'
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> createClassLoader
> INFO: Reusing parent classloader
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
> INFO: Solr home set to 'multicore\core0/'
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> createClassLoader
> INFO: Reusing parent classloader
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrConfig 
> INFO: Loaded SolrConfig: solrconfig.xml
> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> INFO: Reading Solr Schema
> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> INFO: Schema name=example core zero
> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created string: org.apache.solr.schema.StrField
> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> INFO: default search field is name
> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> INFO: query parser default operator is OR
> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> INFO: unique key field: id
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
> INFO: [core0] Opening new SolrCore at multicore\core0/,
> dataDir=core0//multicore/core0/
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
> INFO: JMX monitoring not detected for core: core0
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
> INFO: [core0] Searching for listeners: //listen...@event="firstSearcher"]
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
> INFO: [core0] Searching for listeners: //listen...@event="newSearcher"]
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore initIndex
> AVVERTENZA: [core0] Solr index directory 'core0\multicore\core0\index'
> doesn't exist. Creating new index...
> 4-mar-2009 12.50.55 org.apache.solr.update.SolrIndexWriter getDirectory
> AVVERTENZA: No lockType configured for core0//multicore/core0/index/
> assuming 'simple'
> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created standard: org.apache.solr.handler.StandardRequestHandler
> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created /update: org.apache.solr.handler.XmlUpdateRequestHandler
> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
> INFO: created /admin/: org.apache.solr.handler.admin.AdminHandlers
> 4-mar-2009 12.50.55 org.apache.solr.search.SolrIndexSearcher 
> INFO: Opening searc...@1e57e8f main
> 4-mar-2009 12.50.55
> org.apache.solr.update.DirectUpdateHandler2$CommitTracker 
> INFO: AutoCommit: disabled
> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
> INFO: Adding
> component:org.apache.solr.handler.component.querycompon...@19a32e0
> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
> INFO: Adding
> component:org.apache.solr.handler.component.facetcompon...@8238f4
> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
> INFO: Adding
> component:org.apache.solr.handler.component.morelikethiscompon...@16925b0
> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
> INFO: Adding
> component:org.apache.solr.handler.component.highlightcompon...@297ffb
> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
> INFO: Adding
> component:org.apache.solr.handler.component.statscompon...@914f6a
> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
> INFO: Adding  debug
> component:org.apache.solr.handler.component.debugcompon...@1f4cbee
> 4-mar-2009 12.50.55 org.apache.solr.core.CoreContainer register
> INFO: registering core: core0
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore registerSearcher
> INFO: [core0] Registered new searcher searc...@1e57e8f main
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
> INFO: Solr home set to 'multicore\core1/'
> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> crea

Re: datadir issue for multicore on latest trunk

2009-03-04 Thread Noble Paul നോബിള്‍ नोब्ळ्
looks like a bug. must reopen the issue

On Wed, Mar 4, 2009 at 5:26 PM, Noble Paul നോബിള്‍  नोब्ळ्
 wrote:
> On Wed, Mar 4, 2009 at 5:24 PM, Walter Ferrara  wrote:
>> using:
>>  
>>    
>>    
>>  
>> doesn't work either
>
> dataDir="/multicore/core0" means the path is absolute.
> where did it create?
>
>>
>> here the output:
>>
>> 2009-03-04 12:50:54.890::INFO:  Logging to STDERR via
>> org.mortbay.log.StdErrLog
>> 2009-03-04 12:50:54.968::INFO:  jetty-6.1.3
>> 4-mar-2009 12.50.55 org.apache.solr.servlet.SolrDispatchFilter init
>> INFO: SolrDispatchFilter.init()
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
>> locateInstanceDir
>> INFO: JNDI not configured for solr (NoInitialContextEx)
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
>> locateInstanceDir
>> INFO: using system property solr.solr.home: multicore
>> 4-mar-2009 12.50.55 org.apache.solr.core.CoreContainer$Initializer
>> initialize
>> INFO: looking for solr.xml:
>> d:\DEV\apache-solr-2009-03-03_08-06-53\example\multicore\solr.xml
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
>> INFO: Solr home set to 'multicore/'
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
>> createClassLoader
>> INFO: Reusing parent classloader
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
>> INFO: Solr home set to 'multicore\core0/'
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
>> createClassLoader
>> INFO: Reusing parent classloader
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrConfig 
>> INFO: Loaded SolrConfig: solrconfig.xml
>> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> INFO: Reading Solr Schema
>> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> INFO: Schema name=example core zero
>> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created string: org.apache.solr.schema.StrField
>> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> INFO: default search field is name
>> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> INFO: query parser default operator is OR
>> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> INFO: unique key field: id
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
>> INFO: [core0] Opening new SolrCore at multicore\core0/,
>> dataDir=core0//multicore/core0/
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
>> INFO: JMX monitoring not detected for core: core0
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
>> INFO: [core0] Searching for listeners: //listen...@event="firstSearcher"]
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
>> INFO: [core0] Searching for listeners: //listen...@event="newSearcher"]
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore initIndex
>> AVVERTENZA: [core0] Solr index directory 'core0\multicore\core0\index'
>> doesn't exist. Creating new index...
>> 4-mar-2009 12.50.55 org.apache.solr.update.SolrIndexWriter getDirectory
>> AVVERTENZA: No lockType configured for core0//multicore/core0/index/
>> assuming 'simple'
>> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created standard: org.apache.solr.handler.StandardRequestHandler
>> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created /update: org.apache.solr.handler.XmlUpdateRequestHandler
>> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader load
>> INFO: created /admin/: org.apache.solr.handler.admin.AdminHandlers
>> 4-mar-2009 12.50.55 org.apache.solr.search.SolrIndexSearcher 
>> INFO: Opening searc...@1e57e8f main
>> 4-mar-2009 12.50.55
>> org.apache.solr.update.DirectUpdateHandler2$CommitTracker 
>> INFO: AutoCommit: disabled
>> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
>> INFO: Adding
>> component:org.apache.solr.handler.component.querycompon...@19a32e0
>> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
>> INFO: Adding
>> component:org.apache.solr.handler.component.facetcompon...@8238f4
>> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
>> INFO: Adding
>> component:org.apache.solr.handler.component.morelikethiscompon...@16925b0
>> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
>> INFO: Adding
>> component:org.apache.solr.handler.component.highlightcompon...@297ffb
>> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
>> INFO: Adding
>> component:org.apache.solr.handler.component.statscompon...@914f6a
>> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler inform
>> INFO: Adding  debug
>> component:org.apache.solr.handler.component.debugcompon...@1f4cbee
>> 4-mar-2009 12.50.55 org.apache.solr.core.CoreContainer register
>> INFO: registering core: core0
>> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore registerSearcher
>> INFO: [core0] Registere

Stemming in Solr

2009-03-04 Thread dabboo

Hi, 

I am trying to implement stemming in solr. If user searches for walk then
all the records which have walk, walking, walks, walked etc should display.

Please suggest.

Thanks,
Amit Garg
-- 
View this message in context: 
http://www.nabble.com/Stemming-in-Solr-tp22328850p22328850.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Stemming in Solr

2009-03-04 Thread Lukáš Vlček
Hi,
did you check Snowball stemmers (http://snowball.tartarus.org/)?

Regards,
Lukas

On Wed, Mar 4, 2009 at 1:12 PM, dabboo  wrote:

>
> Hi,
>
> I am trying to implement stemming in solr. If user searches for walk then
> all the records which have walk, walking, walks, walked etc should display.
>
> Please suggest.
>
> Thanks,
> Amit Garg
> --
> View this message in context:
> http://www.nabble.com/Stemming-in-Solr-tp22328850p22328850.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


-- 
http://blog.lukas-vlcek.com/


Re: Stemming in Solr

2009-03-04 Thread Lukáš Vlček
May be you can also check
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFiltersStemming is
discussed there...

On Wed, Mar 4, 2009 at 1:18 PM, Lukáš Vlček  wrote:

> Hi,
> did you check Snowball stemmers (http://snowball.tartarus.org/)?
>
> Regards,
> Lukas
>
>
> On Wed, Mar 4, 2009 at 1:12 PM, dabboo  wrote:
>
>>
>> Hi,
>>
>> I am trying to implement stemming in solr. If user searches for walk then
>> all the records which have walk, walking, walks, walked etc should
>> display.
>>
>> Please suggest.
>>
>> Thanks,
>> Amit Garg
>> --
>> View this message in context:
>> http://www.nabble.com/Stemming-in-Solr-tp22328850p22328850.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>>
>
>
> --
> http://blog.lukas-vlcek.com/
>



-- 
http://blog.lukas-vlcek.com/


Re: datadir issue for multicore on latest trunk

2009-03-04 Thread Walter Ferrara
tried with


but no luck, the dataDir parameter seems ignored, no matter what is written
there

On Wed, Mar 4, 2009 at 12:58 PM, Noble Paul നോബിള്‍ नोब्ळ् <
noble.p...@gmail.com> wrote:

> looks like a bug. must reopen the issue
>
> On Wed, Mar 4, 2009 at 5:26 PM, Noble Paul നോബിള്‍  नोब्ळ्
>  wrote:
> > On Wed, Mar 4, 2009 at 5:24 PM, Walter Ferrara 
> wrote:
> >> using:
> >>  
> >>
> >>
> >>  
> >> doesn't work either
> >
> > dataDir="/multicore/core0" means the path is absolute.
> > where did it create?
> >
> >>
> >> here the output:
> >>
> >> 2009-03-04 12:50:54.890::INFO:  Logging to STDERR via
> >> org.mortbay.log.StdErrLog
> >> 2009-03-04 12:50:54.968::INFO:  jetty-6.1.3
> >> 4-mar-2009 12.50.55 org.apache.solr.servlet.SolrDispatchFilter init
> >> INFO: SolrDispatchFilter.init()
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> >> locateInstanceDir
> >> INFO: JNDI not configured for solr (NoInitialContextEx)
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> >> locateInstanceDir
> >> INFO: using system property solr.solr.home: multicore
> >> 4-mar-2009 12.50.55 org.apache.solr.core.CoreContainer$Initializer
> >> initialize
> >> INFO: looking for solr.xml:
> >> d:\DEV\apache-solr-2009-03-03_08-06-53\example\multicore\solr.xml
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
> >> INFO: Solr home set to 'multicore/'
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> >> createClassLoader
> >> INFO: Reusing parent classloader
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
> >> INFO: Solr home set to 'multicore\core0/'
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
> >> createClassLoader
> >> INFO: Reusing parent classloader
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrConfig 
> >> INFO: Loaded SolrConfig: solrconfig.xml
> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> >> INFO: Reading Solr Schema
> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> >> INFO: Schema name=example core zero
> >> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader
> load
> >> INFO: created string: org.apache.solr.schema.StrField
> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> >> INFO: default search field is name
> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> >> INFO: query parser default operator is OR
> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
> >> INFO: unique key field: id
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
> >> INFO: [core0] Opening new SolrCore at multicore\core0/,
> >> dataDir=core0//multicore/core0/
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
> >> INFO: JMX monitoring not detected for core: core0
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
> >> INFO: [core0] Searching for listeners:
> //listen...@event="firstSearcher"]
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
> >> INFO: [core0] Searching for listeners: //listen...@event="newSearcher"]
> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore initIndex
> >> AVVERTENZA: [core0] Solr index directory 'core0\multicore\core0\index'
> >> doesn't exist. Creating new index...
> >> 4-mar-2009 12.50.55 org.apache.solr.update.SolrIndexWriter getDirectory
> >> AVVERTENZA: No lockType configured for core0//multicore/core0/index/
> >> assuming 'simple'
> >> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader
> load
> >> INFO: created standard: org.apache.solr.handler.StandardRequestHandler
> >> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader
> load
> >> INFO: created /update: org.apache.solr.handler.XmlUpdateRequestHandler
> >> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader
> load
> >> INFO: created /admin/: org.apache.solr.handler.admin.AdminHandlers
> >> 4-mar-2009 12.50.55 org.apache.solr.search.SolrIndexSearcher 
> >> INFO: Opening searc...@1e57e8f main
> >> 4-mar-2009 12.50.55
> >> org.apache.solr.update.DirectUpdateHandler2$CommitTracker 
> >> INFO: AutoCommit: disabled
> >> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler
> inform
> >> INFO: Adding
> >> component:org.apache.solr.handler.component.querycompon...@19a32e0
> >> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler
> inform
> >> INFO: Adding
> >> component:org.apache.solr.handler.component.facetcompon...@8238f4
> >> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler
> inform
> >> INFO: Adding
> >>
> component:org.apache.solr.handler.component.morelikethiscompon...@16925b0
> >> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler
> inform
> >> INFO: Adding
> >> component:org.apache.solr.handler.component.highlightcompon...@297ffb
> >> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler
> inform
> >> INFO: Adding
> >> component:org.apach

Re: Custom Request Handler

2009-03-04 Thread Grant Ingersoll
Can you explain the problem you are having?  Is there a bug in the  
SolrCore that you are working around?  Or, are you just trying to  
plugin your own Query Parser?  If that is the case, there is a way to  
do that already.  You might find http://wiki.apache.org/solr/ 
SolrPlugins helpful.


-Grant

On Mar 4, 2009, at 6:42 AM, dabboo wrote:



Hi,

Can someone please tell me if we can write our own Custom Request  
Handler

and Custom Solr Core?

I want to call execute method of custom solrcore class instead of  
SolrCore

class.

Actually, I have to modify getQParser method of SolrCore class, so  
that it

shud start returning my custom plugin.

Please suggest.

Thanks,
Amit Garg

--
View this message in context: 
http://www.nabble.com/Custom-Request-Handler-tp22328365p22328365.html
Sent from the Solr - User mailing list archive at Nabble.com.



--
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
using Solr/Lucene:

http://www.lucidimagination.com/search



Re: Very long commit time.

2009-03-04 Thread Yonik Seeley
On Wed, Mar 4, 2009 at 5:25 AM, Jérôme Etévé  wrote:
> Great,
>
>  It went down to less than 10 secs now :)
> What I don't really understand is that my autowarmCount were pretty
> low ( like 128 ) and still the autowarming of the caches were very
> slow.
>
> Can you explain more why it can be that slow ?

One possibility is a lack of physical memory available to the OS for
caching reads on both the old index and the new index.  This would
cause all of the queries to be slower if they ended up doing real disk
IO for each query/filter being warmed.

-Yonik
http://www.lucidimagination.com


Tomcat5 + Solr. Problems in deploying the Webapp

2009-03-04 Thread Sudharshan S
Hi all,
I am trying to setup a solr instance with Tomcat5 on a Fedora10
machine. Here is what I did,

1.) Copy the apache-solr-nightly.war to webapps/solr.war
2.) Set solr.solr.home in tomcat.conf
3.) Use the Manager interface of tomcat to deploy the webapp

But, while doing so, I get the following exceptions.

Mar 4, 2009 6:55:09 PM org.apache.catalina.core.StandardContext filterStart
SEVERE: Exception starting filter SolrRequestFilter
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.solr.core.SolrConfig
at 
org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:76)
at 
org.apache.catalina.core.ApplicationFilterConfig.getFilter(ApplicationFilterConfig.java:221)
at 
org.apache.catalina.core.ApplicationFilterConfig.setFilterDef(ApplicationFilterConfig.java:302)
at 
org.apache.catalina.core.ApplicationFilterConfig.(ApplicationFilterConfig.java:78)
at 
org.apache.catalina.core.StandardContext.filterStart(StandardContext.java:3635)
at 
org.apache.catalina.core.StandardContext.start(StandardContext.java:4222)
at 
org.apache.catalina.manager.ManagerServlet.start(ManagerServlet.java:1173)
at 
org.apache.catalina.manager.HTMLManagerServlet.start(HTMLManagerServlet.java:549)
at 
org.apache.catalina.manager.HTMLManagerServlet.doGet(HTMLManagerServlet.java:105)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:617)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
at 
org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:269)
at 
org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:188)
at 
org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
at 
org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:172)
at 
org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:525)
at 
org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
at 
org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:117)
at 
org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:108)
at 
org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:174)
at 
org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:875)
at 
org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:665)
at 
org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:528)
at 
org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:81)
at 
org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:689)
at java.lang.Thread.run(Thread.java:636)


What am I missing? If it matters I am running the nightly build from
March 3 2009.

Thanks and Regards
Sudharshan S
Blog : http://www.sudharsh.wordpress.com
IRC   : Sup3rkiddo @ Freenode, Gimpnet


How to search the database tables using solr.

2009-03-04 Thread Radha C.
Hi,
 
I am working in a software concern. We are having some R&D base work like
making use of solr search tool for searching our Oracle database
content. I read through the documnets provided in the site, there I found
the solr supports only the document search and we have to provide the
document in the xml with the whole content that we need to search. But Is
there any way to index our database content and search the indexed database
content.
 
Any help will be appriciated. Thanks.


Re: How to search the database tables using solr.

2009-03-04 Thread Shalin Shekhar Mangar
On Wed, Mar 4, 2009 at 7:32 PM, Radha C.  wrote:

> Hi,
>
> I am working in a software concern. We are having some R&D base work like
> making use of solr search tool for searching our Oracle database
> content. I read through the documnets provided in the site, there I found
> the solr supports only the document search and we have to provide the
> document in the xml with the whole content that we need to search. But Is
> there any way to index our database content and search the indexed database
> content.
>

Take a look at DataImportHandler.

http://wiki.apache.org/solr/DataImportHandler
-- 
Regards,
Shalin Shekhar Mangar.


MoreLikeThis filtering

2009-03-04 Thread Doug Steigerwald
Is it possible to filter similarities found by the MLT component/ 
handler?  Something like mlt.fq=site_id:86?


We have 32 cores in our Solr install, and some of those cores have up  
to 8 sites indexed in them.  Typically those cores will have one very  
large site with a few hundred thousand indexed documents, and lots of  
small sites with significantly less documents indexed.


We're looking to implement a MLT component for our sites but want the  
similar stories to be only for a specific site (not all sites in the  
core).


Is there a way to do something like this, or will we have to make mods  
(I'm not seeing anything jump out at me in the Solr 1.3.0 or Lucene  
2.4.0 code)?


/solr/dsteiger/mlt?q=story_id:188665+AND+site_id:86&mlt.fq=site_id:86

(We have all all of our other defaults set up in the handler config.)

Thanks.
---
Doug Steigerwald
Software Developer
McClatchy Interactive
dsteigerw...@mcclatchyinteractive.com



Re: MoreLikeThis filtering

2009-03-04 Thread Otis Gospodnetic

Doug,

does the good old 'fq' not work with MLT?  It should...


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 
> From: Doug Steigerwald 
> To: solr-user@lucene.apache.org
> Sent: Wednesday, March 4, 2009 9:20:40 AM
> Subject: MoreLikeThis filtering
> 
> Is it possible to filter similarities found by the MLT component/handler?  
> Something like mlt.fq=site_id:86?
> 
> We have 32 cores in our Solr install, and some of those cores have up to 8 
> sites 
> indexed in them.  Typically those cores will have one very large site with a 
> few 
> hundred thousand indexed documents, and lots of small sites with 
> significantly 
> less documents indexed.
> 
> We're looking to implement a MLT component for our sites but want the similar 
> stories to be only for a specific site (not all sites in the core).
> 
> Is there a way to do something like this, or will we have to make mods (I'm 
> not 
> seeing anything jump out at me in the Solr 1.3.0 or Lucene 2.4.0 code)?
> 
> /solr/dsteiger/mlt?q=story_id:188665+AND+site_id:86&mlt.fq=site_id:86
> 
> (We have all all of our other defaults set up in the handler config.)
> 
> Thanks.
> ---
> Doug Steigerwald
> Software Developer
> McClatchy Interactive
> dsteigerw...@mcclatchyinteractive.com



RE: How to search the database tables using solr.

2009-03-04 Thread Radha C.
Thanks Shalin,
 
We just stepped on solr. This information is very much useful for me. But
before that I want some clear details about where to start..
I want to test this in my local environment, so I need some basic
information about how to start using this ( database and solr ). Do you have
some information on this?
 
Thanks in advance.
 

  _  

From: Shalin Shekhar Mangar [mailto:shalinman...@gmail.com] 
Sent: Wednesday, March 04, 2009 7:48 PM
To: solr-user@lucene.apache.org; cra...@ceiindia.com
Subject: Re: How to search the database tables using solr.


On Wed, Mar 4, 2009 at 7:32 PM, Radha C.  wrote:


Hi,

I am working in a software concern. We are having some R&D base work like
making use of solr search tool for searching our Oracle database
content. I read through the documnets provided in the site, there I found
the solr supports only the document search and we have to provide the
document in the xml with the whole content that we need to search. But Is
there any way to index our database content and search the indexed database
content.



Take a look at DataImportHandler.

http://wiki.apache.org/solr/DataImportHandler
-- 
Regards,
Shalin Shekhar Mangar.



Re: MoreLikeThis filtering

2009-03-04 Thread Doug Steigerwald
'fq' seems to only work with finding the documents with your original  
query, not for filtering the similar documents.


Doug

On Mar 4, 2009, at 9:28 AM, Otis Gospodnetic wrote:



Doug,

does the good old 'fq' not work with MLT?  It should...


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 

From: Doug Steigerwald 
To: solr-user@lucene.apache.org
Sent: Wednesday, March 4, 2009 9:20:40 AM
Subject: MoreLikeThis filtering

Is it possible to filter similarities found by the MLT component/ 
handler?

Something like mlt.fq=site_id:86?

We have 32 cores in our Solr install, and some of those cores have  
up to 8 sites
indexed in them.  Typically those cores will have one very large  
site with a few
hundred thousand indexed documents, and lots of small sites with  
significantly

less documents indexed.

We're looking to implement a MLT component for our sites but want  
the similar

stories to be only for a specific site (not all sites in the core).

Is there a way to do something like this, or will we have to make  
mods (I'm not
seeing anything jump out at me in the Solr 1.3.0 or Lucene 2.4.0  
code)?


/solr/dsteiger/mlt?q=story_id:188665+AND+site_id:86&mlt.fq=site_id:86

(We have all all of our other defaults set up in the handler config.)

Thanks.
---
Doug Steigerwald
Software Developer
McClatchy Interactive
dsteigerw...@mcclatchyinteractive.com




Re: MoreLikeThis filtering

2009-03-04 Thread Doug Steigerwald
Hm.  I checked out a clean Solr 1.3.0 and indexed the example docs and  
set up a simple MLT handler the example queries on the Wiki work fine  
(fq can filter out docs).  Our build has a slight change to  
QueryComponent so another query isn't done when we use localsolr+field  
collapsing, but that change doesn't look like it would make a  
difference.  It just conditionally sets rb.setNeedDocSet() to true or  
false.


Will run some tests on a clean fresh build of Solr to see if it's our  
build.


Doug

On Mar 4, 2009, at 9:28 AM, Otis Gospodnetic wrote:



Doug,

does the good old 'fq' not work with MLT?  It should...


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 

From: Doug Steigerwald 
To: solr-user@lucene.apache.org
Sent: Wednesday, March 4, 2009 9:20:40 AM
Subject: MoreLikeThis filtering

Is it possible to filter similarities found by the MLT component/ 
handler?

Something like mlt.fq=site_id:86?

We have 32 cores in our Solr install, and some of those cores have  
up to 8 sites
indexed in them.  Typically those cores will have one very large  
site with a few
hundred thousand indexed documents, and lots of small sites with  
significantly

less documents indexed.

We're looking to implement a MLT component for our sites but want  
the similar

stories to be only for a specific site (not all sites in the core).

Is there a way to do something like this, or will we have to make  
mods (I'm not
seeing anything jump out at me in the Solr 1.3.0 or Lucene 2.4.0  
code)?


/solr/dsteiger/mlt?q=story_id:188665+AND+site_id:86&mlt.fq=site_id:86

(We have all all of our other defaults set up in the handler config.)

Thanks.
---
Doug Steigerwald
Software Developer
McClatchy Interactive
dsteigerw...@mcclatchyinteractive.com




Re: MoreLikeThis filtering

2009-03-04 Thread Doug Steigerwald
Sorry.  The examples on the wiki aren't working with the 'fq' to  
filter the similarities.  It just filters the actual queries.


http://localhost:8983/solr/mlt?q=id:SP2514N&mlt.fl=manu,cat&mlt.mindf=1&mlt.mintf=1&fq=popularity:6&mlt.displayTerms=details&mlt=true

The popularity of the doc found is 6, and trying to use 'fq=popularity: 
6' brings back similarities with a popularity other than 6.


Doug

On Mar 4, 2009, at 10:39 AM, Doug Steigerwald wrote:

Hm.  I checked out a clean Solr 1.3.0 and indexed the example docs  
and set up a simple MLT handler the example queries on the Wiki work  
fine (fq can filter out docs).  Our build has a slight change to  
QueryComponent so another query isn't done when we use localsolr 
+field collapsing, but that change doesn't look like it would make a  
difference.  It just conditionally sets rb.setNeedDocSet() to true  
or false.


Will run some tests on a clean fresh build of Solr to see if it's  
our build.


Doug

On Mar 4, 2009, at 9:28 AM, Otis Gospodnetic wrote:



Doug,

does the good old 'fq' not work with MLT?  It should...


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 

From: Doug Steigerwald 
To: solr-user@lucene.apache.org
Sent: Wednesday, March 4, 2009 9:20:40 AM
Subject: MoreLikeThis filtering

Is it possible to filter similarities found by the MLT component/ 
handler?

Something like mlt.fq=site_id:86?

We have 32 cores in our Solr install, and some of those cores have  
up to 8 sites
indexed in them.  Typically those cores will have one very large  
site with a few
hundred thousand indexed documents, and lots of small sites with  
significantly

less documents indexed.

We're looking to implement a MLT component for our sites but want  
the similar

stories to be only for a specific site (not all sites in the core).

Is there a way to do something like this, or will we have to make  
mods (I'm not
seeing anything jump out at me in the Solr 1.3.0 or Lucene 2.4.0  
code)?


/solr/dsteiger/mlt?q=story_id:188665+AND+site_id:86&mlt.fq=site_id: 
86


(We have all all of our other defaults set up in the handler  
config.)


Thanks.
---
Doug Steigerwald
Software Developer
McClatchy Interactive
dsteigerw...@mcclatchyinteractive.com




Re: Very long commit time.

2009-03-04 Thread Jérôme Etévé
On Wed, Mar 4, 2009 at 1:21 PM, Yonik Seeley  wrote:
> On Wed, Mar 4, 2009 at 5:25 AM, Jérôme Etévé  wrote:
>> Great,
>>
>>  It went down to less than 10 secs now :)
>> What I don't really understand is that my autowarmCount were pretty
>> low ( like 128 ) and still the autowarming of the caches were very
>> slow.
>>
>> Can you explain more why it can be that slow ?
>
> One possibility is a lack of physical memory available to the OS for
> caching reads on both the old index and the new index.  This would
> cause all of the queries to be slower if they ended up doing real disk
> IO for each query/filter being warmed.

Strange, we've got plenty of memory on this box and the swap is zero.
But well, I'm happy we went around the problem. What's your experience
with commits with ~10M docs ( and ~128 autowarming count in caches ) ?

Cheers.

Jerome.



-- 
Jerome Eteve.

Chat with me live at http://www.eteve.net

jer...@eteve.net


Re: indexing while optimizing

2009-03-04 Thread Laimonas Simutis
Upgraded to solr 1.3, and also moved it to a different machine (more
CPU the rest is the same). All the problems seem to have gone away...
Should have upgraded from the start.

On Fri, Feb 27, 2009 at 2:50 PM, Laimonas Simutis  wrote:
> Hey,
>
> my SOLR setup looks like the following:
>
> server running apache-tomcat with solr1.2, index size is about 1G (a
> bit more than 4 million documents).
>
> I have another machine that basically every minute or so sends some
> documents to be indexed. I have autocommit turned on with maxDocs:
> 5000, maxTime: 30ms.
>
> Also on the server cron job runs twice per day to optimize the index.
> And sometimes it happens that the index messages come at the time of
> optimize running. I know that running optimize on an index that is
> about to have documents added or deleted is not that useful, but it
> did help eliminate "too many file handles open" problem.
>
> Is it bad that I try to index when the optimize is running? I do see
> failures on the client side from time to time, but the messages get
> resent and indexed eventually.
>
> One recurring problem is that once per 36 hours or so SOLR server
> becomes really unresponsive, just spinning crazy on CPU and it is all
> in java (solr) process. When I try to shut down apache, apache goes
> down but the java process is left running. I am trying to pin point
> where the problem is, and wonder if my indexing-commit is not right.
> The box is solely dedicated for solr, so there is really nothing else
> running on it.
>
> Any pointers or observations appreciated.
>
> thanks,
>
> L
>


Re: How to search the database tables using solr.

2009-03-04 Thread Shalin Shekhar Mangar
On Wed, Mar 4, 2009 at 7:51 PM, Radha C.  wrote:

> Thanks Shalin,
>
> We just stepped on solr. This information is very much useful for me. But
> before that I want some clear details about where to start..
> I want to test this in my local environment, so I need some basic
> information about how to start using this ( database and solr ). Do you
> have
> some information on this?
>

I think the easiest way is to start using Solr is with the embedded jetty
container. Modify the example/conf/schema.xml file and add your own fields
etc. Read through the DataImportHandler wiki page and at the
example/example-DIH directory in the solr zip/tarball.

If you have a specific doubt/question, ask on the list.

-- 
Regards,
Shalin Shekhar Mangar.


Re: Tomcat5 + Solr. Problems in deploying the Webapp

2009-03-04 Thread Matt Mitchell
Hi,

Have you looked at this page: http://wiki.apache.org/solr/SolrTomcat

It almost sounds like you're deploying twice? Putting the solr.war in
webapps would be one way, and the other would be a context config file +
using the web manager. If you're using the config/context, then don't put
the solr.war in webapps, tomcat should do that for you after deploying with
the manager.

Matt

On Wed, Mar 4, 2009 at 8:55 AM, Sudharshan S  wrote:

> Hi all,
> I am trying to setup a solr instance with Tomcat5 on a Fedora10
> machine. Here is what I did,
>
> 1.) Copy the apache-solr-nightly.war to webapps/solr.war
> 2.) Set solr.solr.home in tomcat.conf
> 3.) Use the Manager interface of tomcat to deploy the webapp
>
> But, while doing so, I get the following exceptions.
> 
> Mar 4, 2009 6:55:09 PM org.apache.catalina.core.StandardContext filterStart
> SEVERE: Exception starting filter SolrRequestFilter
> java.lang.NoClassDefFoundError: Could not initialize class
> org.apache.solr.core.SolrConfig
>at
> org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:76)
>at
> org.apache.catalina.core.ApplicationFilterConfig.getFilter(ApplicationFilterConfig.java:221)
>at
> org.apache.catalina.core.ApplicationFilterConfig.setFilterDef(ApplicationFilterConfig.java:302)
>at
> org.apache.catalina.core.ApplicationFilterConfig.(ApplicationFilterConfig.java:78)
>at
> org.apache.catalina.core.StandardContext.filterStart(StandardContext.java:3635)
>at
> org.apache.catalina.core.StandardContext.start(StandardContext.java:4222)
>at
> org.apache.catalina.manager.ManagerServlet.start(ManagerServlet.java:1173)
>at
> org.apache.catalina.manager.HTMLManagerServlet.start(HTMLManagerServlet.java:549)
>at
> org.apache.catalina.manager.HTMLManagerServlet.doGet(HTMLManagerServlet.java:105)
>at javax.servlet.http.HttpServlet.service(HttpServlet.java:617)
>at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
>at
> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:269)
>at
> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:188)
>at
> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
>at
> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:172)
>at
> org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:525)
>at
> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
>at
> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:117)
>at
> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:108)
>at
> org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:174)
>at
> org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:875)
>at
> org.apache.coyote.http11.Http11BaseProtocol$Http11ConnectionHandler.processConnection(Http11BaseProtocol.java:665)
>at
> org.apache.tomcat.util.net.PoolTcpEndpoint.processSocket(PoolTcpEndpoint.java:528)
>at
> org.apache.tomcat.util.net.LeaderFollowerWorkerThread.runIt(LeaderFollowerWorkerThread.java:81)
>at
> org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:689)
>at java.lang.Thread.run(Thread.java:636)
> 
>
> What am I missing? If it matters I am running the nightly build from
> March 3 2009.
>
> Thanks and Regards
> Sudharshan S
> Blog : http://www.sudharsh.wordpress.com
> IRC   : Sup3rkiddo @ Freenode, Gimpnet
>


Re: Very long commit time.

2009-03-04 Thread Yonik Seeley
On Wed, Mar 4, 2009 at 11:04 AM, Jérôme Etévé  wrote:
> Strange, we've got plenty of memory on this box and the swap is zero.
> But well, I'm happy we went around the problem. What's your experience
> with commits with ~10M docs ( and ~128 autowarming count in caches ) ?

Probably depends somewhat on the nature of the filters and queries.
It does sound a little slow, so perhaps they could be optimized in some way.
If you have enough memory, the autowarming queries shouldn't be slower
than the normal queries, so you should be able to use your logs to
find the slow queries and try to figure out why they are slower.

-Yonik
http://www.lucidimagination.com


Re: Tomcat5 + Solr. Problems in deploying the Webapp

2009-03-04 Thread Sudharshan S
Hi Matt,

On Wed, Mar 4, 2009 at 9:52 PM, Matt Mitchell  wrote:
> Hi,
>
> Have you looked at this page: http://wiki.apache.org/solr/SolrTomcat

Yes. I mimicked the SimpleInstall method

>
> It almost sounds like you're deploying twice? Putting the solr.war in
> webapps would be one way, and the other would be a context config file +
> using the web manager. If you're using the config/context, then don't put
> the solr.war in webapps, tomcat should do that for you after deploying with
> the manager.

Pardon me, but my experience with Tomcat is only a few hours old. From
the Manager interface (manager/html) to be exact, I found out that the
solr instance was not running, Hence clicked 'start'. The logs showed
me the aforementioned exception. I guess using config/context is
optional? since reading from the archives and your post, it seems that
a config/context occurence and putting solr.war in webapps are
mutually exclusive.

If it helps, I have the following configuration option set in tomcat5.cont

JAVA_OPTS="$JAVA_OPTS
-Dcatalina.ext.dirs=$CATALINA_HOME/shared/lib:$CATALINA_HOME/common/lib
-Dsolr.solr.home=/home/sudharsh/Projects/solr/solr"

Regards
Sudharshan S
Blog : http://www.sudharsh.wordpress.com
IRC   : Sup3rkiddo @ Freenode, Gimpnet


Custom Field Type

2009-03-04 Thread Fouad Mardini
Hello,

I have a multivalued field in my schema of type text_ws, values are of the
form #int #int
I need to be able to query on the first and sort on the second, this does
not seem to be enabled out of the box
I looked around for a while and it seems the recommended approach is to
create a custom field type and implement this logic in the getSortField
method
But since the field is multivalued, I need to sort by the value i am
searching for, so i need access to the current query being executed.
Also, i can't seem to figure out the correct -classpath to give to javac for
it to find the packages needed to create the class file (my java is a bit
rusty to say the least)

Thanks,
Fouad


Re: Custom Field Type

2009-03-04 Thread Yonik Seeley
On Wed, Mar 4, 2009 at 12:24 PM, Fouad Mardini  wrote:
> I have a multivalued field in my schema of type text_ws, values are of the
> form #int #int
> I need to be able to query on the first and sort on the second, this does
> not seem to be enabled out of the box

Can you put the two numbers in separate fields for this purpose?
If you can't do it from the indexer, a schema with copyField in
conjunction with PatternTokenizerFactory could do it.

-Yonik
http://www.lucidimagination.com


Re: How to search the database tables using solr.

2009-03-04 Thread Glen Newton
Also take a look at LuSql:
 http://lab.cisti-icist.nrc-cnrc.gc.ca/cistilabswiki/index.php/LuSql

2009/3/4 Shalin Shekhar Mangar :
> On Wed, Mar 4, 2009 at 7:32 PM, Radha C.  wrote:
>
>> Hi,
>>
>> I am working in a software concern. We are having some R&D base work like
>> making use of solr search tool for searching our Oracle database
>> content. I read through the documnets provided in the site, there I found
>> the solr supports only the document search and we have to provide the
>> document in the xml with the whole content that we need to search. But Is
>> there any way to index our database content and search the indexed database
>> content.
>>
>
> Take a look at DataImportHandler.
>
> http://wiki.apache.org/solr/DataImportHandler
> --
> Regards,
> Shalin Shekhar Mangar.
>



-- 

-


Re: Multilanguage

2009-03-04 Thread Karl Wettin


17 feb 2009 kl. 21.26 skrev Grant Ingersoll:

I believe Karl Wettin submitted a Lucene patch for a Language  
guesser: http://issues.apache.org/jira/browse/LUCENE-826 but it is  
marked as won't fix.


The test case of LUCENE-1039 is a language classifier. I've use patch  
to detect languages of user queries (where I know the text contains  
text that is rather simple to classify as as specific language).



 karl


Re: MoreLikeThis filtering

2009-03-04 Thread Doug Steigerwald

Hah.  Sorry, I'm really out of it today.

The MoreLikeThisComponent doesn't seem to work for filtering using fq,  
but the MoreLikeThisHandler does.


Problem solved, we'll just use the handler instead of a component.

Doug

On Mar 4, 2009, at 11:02 AM, Doug Steigerwald wrote:

Sorry.  The examples on the wiki aren't working with the 'fq' to  
filter the similarities.  It just filters the actual queries.


http://localhost:8983/solr/mlt?q=id:SP2514N&mlt.fl=manu,cat&mlt.mindf=1&mlt.mintf=1&fq=popularity:6&mlt.displayTerms=details&mlt=true

The popularity of the doc found is 6, and trying to use  
'fq=popularity:6' brings back similarities with a popularity other  
than 6.


Doug

On Mar 4, 2009, at 10:39 AM, Doug Steigerwald wrote:

Hm.  I checked out a clean Solr 1.3.0 and indexed the example docs  
and set up a simple MLT handler the example queries on the Wiki  
work fine (fq can filter out docs).  Our build has a slight change  
to QueryComponent so another query isn't done when we use localsolr 
+field collapsing, but that change doesn't look like it would make  
a difference.  It just conditionally sets rb.setNeedDocSet() to  
true or false.


Will run some tests on a clean fresh build of Solr to see if it's  
our build.


Doug

On Mar 4, 2009, at 9:28 AM, Otis Gospodnetic wrote:



Doug,

does the good old 'fq' not work with MLT?  It should...


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 

From: Doug Steigerwald 
To: solr-user@lucene.apache.org
Sent: Wednesday, March 4, 2009 9:20:40 AM
Subject: MoreLikeThis filtering

Is it possible to filter similarities found by the MLT component/ 
handler?

Something like mlt.fq=site_id:86?

We have 32 cores in our Solr install, and some of those cores  
have up to 8 sites
indexed in them.  Typically those cores will have one very large  
site with a few
hundred thousand indexed documents, and lots of small sites with  
significantly

less documents indexed.

We're looking to implement a MLT component for our sites but want  
the similar

stories to be only for a specific site (not all sites in the core).

Is there a way to do something like this, or will we have to make  
mods (I'm not
seeing anything jump out at me in the Solr 1.3.0 or Lucene 2.4.0  
code)?


/solr/dsteiger/mlt?q=story_id:188665+AND+site_id: 
86&mlt.fq=site_id:86


(We have all all of our other defaults set up in the handler  
config.)


Thanks.
---
Doug Steigerwald
Software Developer
McClatchy Interactive
dsteigerw...@mcclatchyinteractive.com




Re: MoreLikeThis filtering

2009-03-04 Thread Andrew Ingram
I posted a while back with this problem and I've finally got it working 
using the following method:


in solrconfig.xml:

 
   
 id,title
 0
   
 


then when making the request, I do a normal search for the item with the 
necessary filter query (in my case discontinued:false) and set the 
qt=mlt to activate the mlt handler.


So in your case:

http://localhost:8983/solr/select?q=id:SP2514N&mlt.fl=manu,cat&mlt.mindf=1&mlt.mintf=1&fq=popularity:6&mlt.displayTerms=details&mlt=true&qt=mlt

I had to use qt because I'm using a library to access solr and it 
doesn't include support for alternative handlers at the path level.


Regards,
Andrew Ingram


Doug Steigerwald wrote:
Sorry.  The examples on the wiki aren't working with the 'fq' to 
filter the similarities.  It just filters the actual queries.


http://localhost:8983/solr/mlt?q=id:SP2514N&mlt.fl=manu,cat&mlt.mindf=1&mlt.mintf=1&fq=popularity:6&mlt.displayTerms=details&mlt=true 



The popularity of the doc found is 6, and trying to use 
'fq=popularity:6' brings back similarities with a popularity other 
than 6.


Doug

On Mar 4, 2009, at 10:39 AM, Doug Steigerwald wrote:

Hm.  I checked out a clean Solr 1.3.0 and indexed the example docs 
and set up a simple MLT handler the example queries on the Wiki work 
fine (fq can filter out docs).  Our build has a slight change to 
QueryComponent so another query isn't done when we use 
localsolr+field collapsing, but that change doesn't look like it 
would make a difference.  It just conditionally sets 
rb.setNeedDocSet() to true or false.


Will run some tests on a clean fresh build of Solr to see if it's our 
build.


Doug

On Mar 4, 2009, at 9:28 AM, Otis Gospodnetic wrote:



Doug,

does the good old 'fq' not work with MLT?  It should...


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 

From: Doug Steigerwald 
To: solr-user@lucene.apache.org
Sent: Wednesday, March 4, 2009 9:20:40 AM
Subject: MoreLikeThis filtering

Is it possible to filter similarities found by the MLT 
component/handler?

Something like mlt.fq=site_id:86?

We have 32 cores in our Solr install, and some of those cores have 
up to 8 sites
indexed in them.  Typically those cores will have one very large 
site with a few
hundred thousand indexed documents, and lots of small sites with 
significantly

less documents indexed.

We're looking to implement a MLT component for our sites but want 
the similar

stories to be only for a specific site (not all sites in the core).

Is there a way to do something like this, or will we have to make 
mods (I'm not
seeing anything jump out at me in the Solr 1.3.0 or Lucene 2.4.0 
code)?


/solr/dsteiger/mlt?q=story_id:188665+AND+site_id:86&mlt.fq=site_id:86

(We have all all of our other defaults set up in the handler config.)

Thanks.
---
Doug Steigerwald
Software Developer
McClatchy Interactive
dsteigerw...@mcclatchyinteractive.com







Re: DataImportHandler and delta-import question

2009-03-04 Thread Garafola Timothy
Thanks,

I set up a another test instance of solr and ran a full import within
the DIH Development Console.  I examined the query and found that
last_index_time is not getting set in the query.  Yet the value does
get updated after a full import completes (outside of the development
console).  Is there some place that I need to set the path to the
dataimport.properties file?

On Tue, Mar 3, 2009 at 8:03 PM, Noble Paul നോബിള്‍  नोब्ळ्
 wrote:
> I do not see anything wrong with this .It should have worked . Can you
> check that dataimport.properties is created (by DIH) in the conf
> directory? . check the content?
>
>
> are you sure that the query
>
> select DId from 2_Doc where ModifiedDate > '${dataimporter.last_index_time}'
>
> works with  a date format -MM-dd HH:mm:ss . This is the format
> which DIH sends the date in . If the format is wrong you may need to
> format it using a dateformat function.
>
> see here
>
> http://wiki.apache.org/solr/DataImportHandler#head-5675e913396a42eb7c6c5d3c894ada5dadbb62d7
>
>
>  The trunk DIH can work with Solr1.3 (you may need to put the DIH jar
> and slf4j). Can
> - Show quoted text -
> On Wed, Mar 4, 2009 at 3:53 AM, Garafola Timothy  
> wrote:
>> I'm using solr 1.3 and am trying to get a delta-import with the DIH.
>> Recently the wiki, http://wiki.apache.org/solr/DataImportHandler, was
>> updated explaining that delta import is a 1.4 feature now but it was
>> still possible get a delta using the full import example here,
>> http://wiki.apache.org/solr/DataImportHandlerFaq#fullimportdelta.  I
>> tried this but each time I run DIH, it reimports all rows and updates.
>>
>> Below is my data-config.xml.  I set rootEntity to false and issued
>> command=full-import&clean=false&optimize=false through DIH.  Am I
>> doing something wrong here or is the DataImportHandlerFaq incorrect?
>>
>> 
>>        > url="jdbc:mysql://pencil-somewhere.com:2/SomeDB" user="someUser"
>>  password="somePassword"/>
>>        
>>                >                        query = "select DId from 2_Doc where
>> ModifiedDate > '${dataimporter.last_index_time}'
>>                                      and DocType != 'Research Articles'">
>>                        > transformer="RegexTransformer"
>>                                query = "SELECT d.DId, d.SiteId,
>> d.DocTitle, d.DocURL, d.DocDesc,
>>                                        d.DocType, d.Tags, d.Source,
>> d.Last90DaysRFIsPercent,
>>                                        d.ModifiedDate, d.DocGuid, d.Author,
>>                                        i.Industry FROM 2_Doc d LEFT
>> OUTER JOIN tmp_DocIndustry i
>>                                        ON (d.DocId=i.DocId AND
>> d.SiteId=i.SiteId) where d.DocType != 'Research articles'
>>                                        and d.DId = '${item.DId}' and
>> d.ModifiedDate > '${dataimporter.last_index_time}'">
>>                                
>>                                
>>                                
>>                                > ="DocTitle"/>
>>                                
>>                                
>>                                > regex="^(.{0,800})\b.*$" sourceColName="DocDesc"/>
>>                                
>>                                > splitBy=";" sourceColName="Tags"/>
>>                                
>>                                > "Last90DaysRFIsPercent"   name ="Last90DaysRFIsPercent"/>
>>                                > ="ModifiedDate"/>
>>                                
>>                                
>>                                > ="Industry" sourceColName="Industry"/>
>>                        
>>                
>>        
>> 
>>
>> Thanks,
>> -Tim
>>
>
>
>
> --
> --Noble Paul
>



-- 
-Tim


Possible bug with sdouble?

2009-03-04 Thread Jonathan Ariel
Hi everyone!
So it seems like I ran into a bug with sdouble.
I have a document with two fields.
field1 is double






when I index I send for both of them the same value (7300) and when I'm
querying I'm getting the following

7300
7.3E7

What do you think? Why am I getting different notations?

BTW I'm using solr 1.2

Thanks!

Jonathan


Re: Possible bug with sdouble?

2009-03-04 Thread Otis Gospodnetic

I'll do the obvious one first. :)
Have you tried Solr 1.3 or, even better, 1.4-dev nightly?

 Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



- Original Message 
> From: Jonathan Ariel 
> To: solr-user@lucene.apache.org
> Sent: Wednesday, March 4, 2009 2:56:03 PM
> Subject: Possible bug with sdouble?
> 
> Hi everyone!
> So it seems like I ran into a bug with sdouble.
> I have a document with two fields.
> field1 is double
> 
> 
> stored="true" omitNorms="true"/>
> 
> indexed="true" stored="true" omitNorms="true"/>
> 
> 
> 
> when I index I send for both of them the same value (7300) and when I'm
> querying I'm getting the following
> 
> 7300
> 7.3E7
> 
> What do you think? Why am I getting different notations?
> 
> BTW I'm using solr 1.2
> 
> Thanks!
> 
> Jonathan



Re: Possible bug with sdouble?

2009-03-04 Thread Yonik Seeley
On Wed, Mar 4, 2009 at 2:56 PM, Jonathan Ariel  wrote:
> Hi everyone!
> So it seems like I ran into a bug with sdouble.
> I have a document with two fields.
> field1 is double
>
>  stored="true" omitNorms="true"/>
>  indexed="true" stored="true" omitNorms="true"/>
>
> when I index I send for both of them the same value (7300) and when I'm
> querying I'm getting the following
>
> 7300
> 7.3E7
>
> What do you think? Why am I getting different notations?

The straight "double" type indexes things as a string w/o doing any
sort of normalization.
So a value of 100.000 will be indexed and stored as 100.000.

"sdouble" needs to parse the float, and it gets more normalized as a byproduct.
If anything, one could consider lack of normalization a bug with
double, not sdouble.

-Yonik
http://www.lucidimagination.com


commit / new searcher delay?

2009-03-04 Thread Steve Conover
I'm doing some testing of a solr master/slave config and find that,
after syncing my slave, I need to sleep for about 400ms after commit
to "see" the new index state.  i.e. if I don't sleep, and I execute a
query, I get results that reflect the prior state of the index.

I suspect this has something to do with waiting for the searcher to
warm and switch over (?).  Though, I'm confused because when I print
out /solr/admin/registry.jsp, the hashcode of the Searcher changes
immediately (as the commit docs say, the commit operation blocks by
default until a new searcher is in place).  I've tried turning off all
caching, to no effect.

Anyone have any idea what could be going on here?  Ideally, 
would be an operation that blocks until the exact moment when the new
searcher is in place and is actually serving based on the new index
state...

-Steve


use () in the query string

2009-03-04 Thread Cheng Zhang
Hello,

In my case, my query of "id_s_i_s_nm:(om_B00114162K*)" returned nothing but 
query "id_s_i_s_nm:om_B00114162K*" returned the right result. 

What's the difference between using () or not.

Thanks a lot,
Kevin



Re: commit / new searcher delay?

2009-03-04 Thread Shalin Shekhar Mangar
On Thu, Mar 5, 2009 at 6:06 AM, Steve Conover  wrote:

> I'm doing some testing of a solr master/slave config and find that,
> after syncing my slave, I need to sleep for about 400ms after commit
> to "see" the new index state.  i.e. if I don't sleep, and I execute a
> query, I get results that reflect the prior state of the index.
>

How are you sending the commit? You should use commit with waitSearcher=true
and waitFlush=true so that it blocks until the new searcher becomes
available for querying.


-- 
Regards,
Shalin Shekhar Mangar.


Re: DataImportHandler and delta-import question

2009-03-04 Thread Noble Paul നോബിള്‍ नोब्ळ्
the dataimport.properties is created only after one successful import
.so it is available only from second import onwards. probably you can
create one manually and put it in the conf dir.

On Thu, Mar 5, 2009 at 12:52 AM, Garafola Timothy  wrote:
> Thanks,
>
> I set up a another test instance of solr and ran a full import within
> the DIH Development Console.  I examined the query and found that
> last_index_time is not getting set in the query.  Yet the value does
> get updated after a full import completes (outside of the development
> console).  Is there some place that I need to set the path to the
> dataimport.properties file?
>
> On Tue, Mar 3, 2009 at 8:03 PM, Noble Paul നോബിള്‍  नोब्ळ्
>  wrote:
>> I do not see anything wrong with this .It should have worked . Can you
>> check that dataimport.properties is created (by DIH) in the conf
>> directory? . check the content?
>>
>>
>> are you sure that the query
>>
>> select DId from 2_Doc where ModifiedDate > '${dataimporter.last_index_time}'
>>
>> works with  a date format -MM-dd HH:mm:ss . This is the format
>> which DIH sends the date in . If the format is wrong you may need to
>> format it using a dateformat function.
>>
>> see here
>>
>> http://wiki.apache.org/solr/DataImportHandler#head-5675e913396a42eb7c6c5d3c894ada5dadbb62d7
>>
>>
>>  The trunk DIH can work with Solr1.3 (you may need to put the DIH jar
>> and slf4j). Can
>> - Show quoted text -
>> On Wed, Mar 4, 2009 at 3:53 AM, Garafola Timothy  
>> wrote:
>>> I'm using solr 1.3 and am trying to get a delta-import with the DIH.
>>> Recently the wiki, http://wiki.apache.org/solr/DataImportHandler, was
>>> updated explaining that delta import is a 1.4 feature now but it was
>>> still possible get a delta using the full import example here,
>>> http://wiki.apache.org/solr/DataImportHandlerFaq#fullimportdelta.  I
>>> tried this but each time I run DIH, it reimports all rows and updates.
>>>
>>> Below is my data-config.xml.  I set rootEntity to false and issued
>>> command=full-import&clean=false&optimize=false through DIH.  Am I
>>> doing something wrong here or is the DataImportHandlerFaq incorrect?
>>>
>>> 
>>>        >> url="jdbc:mysql://pencil-somewhere.com:2/SomeDB" user="someUser"
>>>  password="somePassword"/>
>>>        
>>>                >>                        query = "select DId from 2_Doc where
>>> ModifiedDate > '${dataimporter.last_index_time}'
>>>                                      and DocType != 'Research Articles'">
>>>                        >> transformer="RegexTransformer"
>>>                                query = "SELECT d.DId, d.SiteId,
>>> d.DocTitle, d.DocURL, d.DocDesc,
>>>                                        d.DocType, d.Tags, d.Source,
>>> d.Last90DaysRFIsPercent,
>>>                                        d.ModifiedDate, d.DocGuid, d.Author,
>>>                                        i.Industry FROM 2_Doc d LEFT
>>> OUTER JOIN tmp_DocIndustry i
>>>                                        ON (d.DocId=i.DocId AND
>>> d.SiteId=i.SiteId) where d.DocType != 'Research articles'
>>>                                        and d.DId = '${item.DId}' and
>>> d.ModifiedDate > '${dataimporter.last_index_time}'">
>>>                                
>>>                                
>>>                                
>>>                                >> ="DocTitle"/>
>>>                                
>>>                                
>>>                                >> regex="^(.{0,800})\b.*$" sourceColName="DocDesc"/>
>>>                                
>>>                                >> splitBy=";" sourceColName="Tags"/>
>>>                                
>>>                                >> "Last90DaysRFIsPercent"   name ="Last90DaysRFIsPercent"/>
>>>                                >> ="ModifiedDate"/>
>>>                                
>>>                                
>>>                                >> ="Industry" sourceColName="Industry"/>
>>>                        
>>>                
>>>        
>>> 
>>>
>>> Thanks,
>>> -Tim
>>>
>>
>>
>>
>> --
>> --Noble Paul
>>
>
>
>
> --
> -Tim
>



-- 
--Noble Paul


Re: datadir issue for multicore on latest trunk

2009-03-04 Thread Noble Paul നോബിള്‍ नोब्ळ्
I have reopened the issue. We will fix it completely in a day or two.

On Wed, Mar 4, 2009 at 6:30 PM, Walter Ferrara  wrote:
> tried with
>    
>    
> but no luck, the dataDir parameter seems ignored, no matter what is written
> there
>
> On Wed, Mar 4, 2009 at 12:58 PM, Noble Paul നോബിള്‍ नोब्ळ् <
> noble.p...@gmail.com> wrote:
>
>> looks like a bug. must reopen the issue
>>
>> On Wed, Mar 4, 2009 at 5:26 PM, Noble Paul നോബിള്‍  नोब्ळ्
>>  wrote:
>> > On Wed, Mar 4, 2009 at 5:24 PM, Walter Ferrara 
>> wrote:
>> >> using:
>> >>  
>> >>    
>> >>    
>> >>  
>> >> doesn't work either
>> >
>> > dataDir="/multicore/core0" means the path is absolute.
>> > where did it create?
>> >
>> >>
>> >> here the output:
>> >>
>> >> 2009-03-04 12:50:54.890::INFO:  Logging to STDERR via
>> >> org.mortbay.log.StdErrLog
>> >> 2009-03-04 12:50:54.968::INFO:  jetty-6.1.3
>> >> 4-mar-2009 12.50.55 org.apache.solr.servlet.SolrDispatchFilter init
>> >> INFO: SolrDispatchFilter.init()
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
>> >> locateInstanceDir
>> >> INFO: JNDI not configured for solr (NoInitialContextEx)
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
>> >> locateInstanceDir
>> >> INFO: using system property solr.solr.home: multicore
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.CoreContainer$Initializer
>> >> initialize
>> >> INFO: looking for solr.xml:
>> >> d:\DEV\apache-solr-2009-03-03_08-06-53\example\multicore\solr.xml
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
>> >> INFO: Solr home set to 'multicore/'
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
>> >> createClassLoader
>> >> INFO: Reusing parent classloader
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader 
>> >> INFO: Solr home set to 'multicore\core0/'
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrResourceLoader
>> >> createClassLoader
>> >> INFO: Reusing parent classloader
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrConfig 
>> >> INFO: Loaded SolrConfig: solrconfig.xml
>> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> >> INFO: Reading Solr Schema
>> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> >> INFO: Schema name=example core zero
>> >> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader
>> load
>> >> INFO: created string: org.apache.solr.schema.StrField
>> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> >> INFO: default search field is name
>> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> >> INFO: query parser default operator is OR
>> >> 4-mar-2009 12.50.55 org.apache.solr.schema.IndexSchema readSchema
>> >> INFO: unique key field: id
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
>> >> INFO: [core0] Opening new SolrCore at multicore\core0/,
>> >> dataDir=core0//multicore/core0/
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore 
>> >> INFO: JMX monitoring not detected for core: core0
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
>> >> INFO: [core0] Searching for listeners:
>> //listen...@event="firstSearcher"]
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore parseListener
>> >> INFO: [core0] Searching for listeners: //listen...@event="newSearcher"]
>> >> 4-mar-2009 12.50.55 org.apache.solr.core.SolrCore initIndex
>> >> AVVERTENZA: [core0] Solr index directory 'core0\multicore\core0\index'
>> >> doesn't exist. Creating new index...
>> >> 4-mar-2009 12.50.55 org.apache.solr.update.SolrIndexWriter getDirectory
>> >> AVVERTENZA: No lockType configured for core0//multicore/core0/index/
>> >> assuming 'simple'
>> >> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader
>> load
>> >> INFO: created standard: org.apache.solr.handler.StandardRequestHandler
>> >> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader
>> load
>> >> INFO: created /update: org.apache.solr.handler.XmlUpdateRequestHandler
>> >> 4-mar-2009 12.50.55 org.apache.solr.util.plugin.AbstractPluginLoader
>> load
>> >> INFO: created /admin/: org.apache.solr.handler.admin.AdminHandlers
>> >> 4-mar-2009 12.50.55 org.apache.solr.search.SolrIndexSearcher 
>> >> INFO: Opening searc...@1e57e8f main
>> >> 4-mar-2009 12.50.55
>> >> org.apache.solr.update.DirectUpdateHandler2$CommitTracker 
>> >> INFO: AutoCommit: disabled
>> >> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler
>> inform
>> >> INFO: Adding
>> >> component:org.apache.solr.handler.component.querycompon...@19a32e0
>> >> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler
>> inform
>> >> INFO: Adding
>> >> component:org.apache.solr.handler.component.facetcompon...@8238f4
>> >> 4-mar-2009 12.50.55 org.apache.solr.handler.component.SearchHandler
>> inform
>> >> INFO: Adding
>> >>
>> component:org.apache.solr.handler.component.morelikethiscompon...@16925b0
>> >> 4-mar-2009 12.50.55 org.apache.solr.handler.compone

Column Specific Query with q parameter

2009-03-04 Thread dabboo

Hi,

I am implementing column specific query with q query parameter. for e.g.

?q=prdMainTitle_product_s:math & qt=dismaxrequest

The above query doesnt work while if I use the same query with q.alt
parameter, it works.

?q=&q.alt= prdMainTitle_product_s:math & qt=dismaxrequest

Please suggest, how to achieve this with q query.

Thanks,
Amit Garg
-- 
View this message in context: 
http://www.nabble.com/Column-Specific-Query-with-q-parameter-tp22345960p22345960.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Column Specific Query with q parameter

2009-03-04 Thread Matt Mitchell
The syntax for the q param when using dismax is different from standard.
Check this out:


http://wiki.apache.org/solr/DisMaxRequestHandler#head-df8184dddf870336839490ba276ea6ac566d0bdf

q.alt under dismax is parsed using the standard query parser though:


http://wiki.apache.org/solr/DisMaxRequestHandler#head-9d23a23915b7932490069d3262ef7f3625e398ff

Using dismax with that query... you could do it using the fq param:

  ?fq=prdMainTitle_product_s:math&qt=dismaxrequest&q.alt=*:*

But make sure you understand how the fq param works; how solr uses its
caching...


http://wiki.apache.org/solr/CommonQueryParameters#head-6522ef80f22d0e50d2f12ec487758577506d6002

Hope this helps,

Matt

On Thu, Mar 5, 2009 at 1:30 AM, dabboo  wrote:

>
> Hi,
>
> I am implementing column specific query with q query parameter. for e.g.
>
> ?q=prdMainTitle_product_s:math & qt=dismaxrequest
>
> The above query doesnt work while if I use the same query with q.alt
> parameter, it works.
>
> ?q=&q.alt= prdMainTitle_product_s:math & qt=dismaxrequest
>
> Please suggest, how to achieve this with q query.
>
> Thanks,
> Amit Garg
> --
> View this message in context:
> http://www.nabble.com/Column-Specific-Query-with-q-parameter-tp22345960p22345960.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


Any standard/specification for Search ??

2009-03-04 Thread souravm
Hi Guys,

Are you aware of any standard/specification (like JSR 168/286 for portals, CMIS 
for CMS) for Search engines ?

Is there any such specification people are working on currently ?

Regards,
Sourav

 CAUTION - Disclaimer *
This e-mail contains PRIVILEGED AND CONFIDENTIAL INFORMATION intended solely 
for the use of the addressee(s). If you are not the intended recipient, please 
notify the sender by e-mail and delete the original message. Further, you are 
not 
to copy, disclose, or distribute this e-mail or its contents to any other 
person and 
any such actions are unlawful. This e-mail may contain viruses. Infosys has 
taken 
every reasonable precaution to minimize this risk, but is not liable for any 
damage 
you may sustain as a result of any virus in this e-mail. You should carry out 
your 
own virus checks before opening the e-mail or attachment. Infosys reserves the 
right to monitor and review the content of all messages sent to or from this 
e-mail 
address. Messages sent to or from this e-mail address may be stored on the 
Infosys e-mail system.
***INFOSYS End of Disclaimer INFOSYS***


Re: Distributed Search

2009-03-04 Thread asdf qwerty
> : > > Ok, so it wouldn't be possible to have a smaller, faster authoritative
> : > > shard for near-real-time updates while keeping the entire dataset in a
> : > > second shard which is updates less frequently?
>
> I believe Otis's point is that many people use distributed search across
> shards where some are large and mostly static and one is small and
> frequently updated with new docs in order to get some performance
> advantages out of hte long cache lifes on the larger shard(s) ... but this
> typically works best when you only "add" new docs, and don't modify old
> ones (or only modify docs added very recently so they're always in the
> small shard) while the bigger shards are treated as "archives" that don't
> change.
>
> To be deterministic you can't have the same uniqueKey in multiple shards.

Hmm, partitioning by document has a lot of merit, but having this be
(configurably) deterministic would seem to enable some interesting
features, such as simple 'tagging' by partitioning by document fields.

For example, you could have a large essentially read-only index of
documents and a separate small index for tags.  To tag a document, you
would create (or update) a document in the tag index containing the
uniqueKey from the main index as well as a multivalued tag field, and
whenever you search, you fire off a distributed search across the two
shards, but pulling the fields from the main index (eg
/solr/select?fq=tag1&shards=main_index/path,tag_index/path&q=*:*).

My specific use case is a bit more involved, but if there were either
some way to deterministically pick the shard source *or* to
dynamically (additively) merge the multiple docs sharing the same
uniqueKey from separate shards, it would be quite helpful.  The later
would provide the general case functionality to have partial document
updates, except even more powerful.  However, I could get by with just
the former - using the main index for all scoring but being able to
augment documents for filtering.

I'm not a solr expert by any means, so if there is another recommended
way to achieve that functionality, I'd love some guidance.  Or, if
this is just a rare case, I guess it'd be time for me to roll up my
sleeves and hack up some solr code.  Making QueryComponent
configurably deterministic would suffice (eg a
"shard.primary=main_index/path" parameter, perhaps?  or even just
treating the shards parameter as an ordered list with the primary
first?).  Adding field merging would likely be... more involved
though.

Thanks in advance for any advice!
-pete


Re: How to search the database tables using solr.

2009-03-04 Thread Venu Mittal
Does anybody has any stats to share on how much time does DataImportHandler 
takes to index a given set of data ?

I am currently indexing 18 millions rows in 1.5 - 2 hours by sending xmls to 
solr. 




From: Shalin Shekhar Mangar 
To: solr-user@lucene.apache.org; cra...@ceiindia.com
Sent: Wednesday, March 4, 2009 8:15:07 AM
Subject: Re: How to search the database tables using solr.

On Wed, Mar 4, 2009 at 7:51 PM, Radha C.  wrote:

> Thanks Shalin,
>
> We just stepped on solr. This information is very much useful for me. But
> before that I want some clear details about where to start..
> I want to test this in my local environment, so I need some basic
> information about how to start using this ( database and solr ). Do you
> have
> some information on this?
>

I think the easiest way is to start using Solr is with the embedded jetty
container. Modify the example/conf/schema.xml file and add your own fields
etc. Read through the DataImportHandler wiki page and at the
example/example-DIH directory in the solr zip/tarball.

If you have a specific doubt/question, ask on the list.

-- 
Regards,
Shalin Shekhar Mangar.