Re: Is negative boost possible?

2009-08-19 Thread Marc Sturlese
:>the only way to "negative boost" is to "positively boost" the inverse... :> :> (*:* -field1:value_to_penalize)^10 This will do the job aswell as bq supports pure negative queries (at least in trunk): bq=-field1:value_to_penalize^10 http://wiki.apache.org/solr/SolrRelevancyFAQ#head-76e53d

Re: Replication over multi-core solr

2009-08-19 Thread Licinio Fernández Maurelo
Hi Vivek, currently we want to add cores dynamically when the active one reaches some capacity, can you give me some hints to achieve such this functionality? (Just wondering if you have used shell-scripting or you have code some 100% Java based solution) Thx 2009/8/19 Noble Paul നോബിള്‍ नोब्ळ्

Problems importing HTML content contained within XML document

2009-08-19 Thread venn hardy
Hello, I have just started trying out SOLR to index some XML documents that I receive. I am using the SOLR 1.3 and its HttpDataSource in conjunction with the XPathEntityProcessor. I am finding the data import really useful so far, but I am having a few problems when I try and import HTML c

Re: CorruptIndexException: Unknown format version

2009-08-19 Thread Licinio Fernández Maurelo
It looks like your solr lucene-core version doesn't match with the lucene version used to generate the index, as Yonik said, looks like there is a lucene library conflict. 2009/8/19 Chris Hostetter : > > : how can that happen, it is a new index, and it is already corrupt? > : > : Did anybody else

Re: Replication over multi-core solr

2009-08-19 Thread vivek sar
Licinio, Please open a separate thread - as it's a different issue - and I can respond there. -vivek 2009/8/19 Licinio Fernández Maurelo : > Hi Vivek, > currently we want to add cores dynamically when the active one reaches > some capacity, > can you give me some hints to achieve such this func

Adding cores dynamically

2009-08-19 Thread Licinio Fernández Maurelo
Hi there, currently we want to add cores dynamically when the active one reaches some capacity, can anyone give me some hints to achieve such this functionality? (Just wondering if you have used shell-scripting or you have code some 100% Java based solution) Thx -- Lici

Re: Replication over multi-core solr

2009-08-19 Thread Licinio Fernández Maurelo
Ok 2009/8/19 vivek sar : > Licinio, > >  Please open a separate thread - as it's a different issue - and I can > respond there. > > -vivek > > 2009/8/19 Licinio Fernández Maurelo : >> Hi Vivek, >> currently we want to add cores dynamically when the active one reaches >> some capacity, >> can you g

Re: Spanish Stemmer

2009-08-19 Thread Licinio Fernández Maurelo
Hi, take a look at this: Un saludo 2009/8/19 Robert Muir : > hi, it looks like you might just have a simple typo: > >   > > if you change it to language="Spanish"

Re: Strange error with shards

2009-08-19 Thread Licinio Fernández Maurelo
Looks like the index is corrupted, try restoring it 2009/8/18 ahammad : > > Hello, > > I have been using multicore/shards for the past 5 months or so with no > problems at all. I just added another core to my Solr server, but for some > reason I can never get the shards working when that specific

Re: Problems importing HTML content contained within XML document

2009-08-19 Thread Martijn v Groningen
Hi Venn, I think what is happening when the BODY element is being processed by xpath expressen (/document/category/BODY), is that it does not retrieve the text content from the P elements inside the body element. The expression will only retrieve text content that is directly a child of the BODY e

Re: Problems importing HTML content contained within XML document

2009-08-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
try this this should slurp al the tags under body On Wed, Aug 19, 2009 at 1:44 PM, venn hardy wrote: > > Hello, > > I have just started trying out SOLR to index some XML documents that I > receive. I am > using the SOLR 1.3 and its HttpDataSource in conjunction with the > XPathEntityProcessor.

Re: Problems importing HTML content contained within XML document

2009-08-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
sorry 2009/8/19 Noble Paul നോബിള്‍ नोब्ळ् : > try this > > > this should slurp al the tags under body > > On Wed, Aug 19, 2009 at 1:44 PM, venn hardy wrote: >> >> Hello, >> >> I have just started trying out SOLR to index some XML documents that I >> receive. I am >> using the SOLR 1.3 and its

Re: Relevant results with DisMaxRequestHandler

2009-08-19 Thread Vincent Pérès
Wow, it's like the 'mm' parameters is just appeared for the first time... Yes, I read the doc few times, but never understood that the documents who doesn't match any of the expressions will not be return... my apologize everything seems more clear now thanks to the min number parameter. Thank yo

Re: JVM Heap utilization & Memory leaks with Solr

2009-08-19 Thread Rahul R
Fuad, We have around 5 million documents and around 3700 fields. All documents will not have values for all the fields JRockit is not approved for use within my organization. But thanks for the info anyway. Regards Rahul On Tue, Aug 18, 2009 at 9:41 AM, Funtick wrote: > > BTW, you should re

Solr-773 (GEO Module) question

2009-08-19 Thread johan . sjoberg
Hi, we're glancing at the GEO search module known from the jira issue 773 (http://issues.apache.org/jira/browse/SOLR-773). It seems to us that the issue is still open and not yet included in the nightly builds. Is there a release plan for the nightly builds, and is this module considered core

Re: MultiCore Queries? are they possible

2009-08-19 Thread Shalin Shekhar Mangar
On Tue, Aug 18, 2009 at 5:47 PM, Ninad Raut wrote: > Hi, > Can we create a Join query between two indexes on two cores? Is this > possible in Solr? > I have a index which stores author profiles and other index which stores > content and a author id as a reference. Can I query as > select Content,A

Re: Strange error with shards

2009-08-19 Thread Shalin Shekhar Mangar
On Tue, Aug 18, 2009 at 9:01 PM, ahammad wrote: > HTTP Status 500 - null java.lang.NullPointerException at > > org.apache.solr.handler.component.QueryComponent.mergeIds(QueryComponent.java:437) > at > > The way I created this shard was to copy an existing one, erasing all the > data files/folders

Re: Passing a Cookie in SolrJ

2009-08-19 Thread Shalin Shekhar Mangar
On Tue, Aug 18, 2009 at 10:18 PM, Ramirez, Paul M (388J) < paul.m.rami...@jpl.nasa.gov> wrote: > Hi All, > > The project I am working on is using Solr and OpenSSO (Sun's single sign on > service). I need to write some sample code for our users that shows them how > to query Solr and I would just l

Re: How to boost fields with many terms against single-term?

2009-08-19 Thread Shalin Shekhar Mangar
On Wed, Aug 19, 2009 at 12:32 AM, Fuad Efendi wrote: > I don't want single-term docs such as "home" to appear in top for simple > search for a home; I need "home improvement made easy" in top... How to > implement it at query time? > If you always want "home improvement made easy" on top for "ho

Re: Strange error with shards

2009-08-19 Thread ahammad
Each core has a different database as a datasource, which means that they have different DB structures and fields. That is why the schemas are different. I figured out the cause of this problem. You were right, it was the uniqueKey field. All of my cores have that field set to "id" but for this n

Putting a something as first query result

2009-08-19 Thread Tobias Brennecke
Hello, I'm a bit new to solr and have the following problem, it's about events and venues. If a user types a name of a venue, then I'd like to return the exact match for the venue first and then the list of events taking place at this venue. Currently I have defined a document bound to a database

Re: Strange error with shards

2009-08-19 Thread Shalin Shekhar Mangar
On Wed, Aug 19, 2009 at 6:44 PM, ahammad wrote: > > Each core has a different database as a datasource, which means that they > have different DB structures and fields. That is why the schemas are > different. > > If all the shards should have the same schema, then what is the point of > > shar

Data Modeling

2009-08-19 Thread Vladimir Landman
Hi, I am trying to create a schema for Solr. Here is a relational model of what our data might look like: Inventory - Sku Price Weight Attributes --- AttributeName AttributeValue Applications -- Id (Auto-Incrementing) Sku VehicleYear Ve

RE: JVM Heap utilization & Memory leaks with Solr

2009-08-19 Thread Fuad Efendi
Hi Rahul, JRockit could be used at least in a test environment to monitor JVM (and troubleshoot SOLR, licensed for-free for developers!); they have even Eclipse plugin now, and it is licensed by Oracle (BEA)... But, of course, in large companies test environment is in hands of testers :) But...

multi words synonyms

2009-08-19 Thread Jae Joo
Hi, I would like to make the synonym for internal medicine to physician or doctor. but it is not working properly. Anyone help me? synonym.index.txt internal medicine => physician synonyms.query.txt physician, internal medicine => physician, doctor In the Analysis tool, I can see clearly that

Shutdown Solr

2009-08-19 Thread Miller, Michael P.
Does anyone know a graceful way to shutdown Solr? (other than killing the process with Ctrl-C)

Re: Shutdown Solr

2009-08-19 Thread Tobias Brennecke
it catches the kill signal and shuts down as it should, I guess :) because it writes stuff to the log after pressing ^c 2009/8/19 Miller, Michael P. > Does anyone know a graceful way to shutdown Solr? (other than killing > the process with Ctrl-C) >

Re: Data Modeling

2009-08-19 Thread Smiley, David W.
This is the sort of Solr fundamentals question my book (chapter 2) will help you with. Think about what your user interface is. What are users searching for? That is, what exactly comes back from search results? It's not clear from your description what your search scenario is. ~ David Smil

Re: Solr-773 (GEO Module) question

2009-08-19 Thread Ryan McKinley
On Aug 19, 2009, at 6:45 AM, johan.sjob...@findwise.se wrote: Hi, we're glancing at the GEO search module known from the jira issue 773 (http://issues.apache.org/jira/browse/SOLR-773). It seems to us that the issue is still open and not yet included in the nightly builds. correct Is

RE: Shutdown Solr

2009-08-19 Thread Fuad Efendi
catalina.sh stop But SolrServlet catches everything and forgets to implement destroy()! I am absolutely unsure about Ctrl-C and even have many concerns regarding catalina.sh stop... J2EE/JEE does not specify any support for threads outside than container-managed... I hope SolrServlet closes Lu

RE: Shutdown Solr

2009-08-19 Thread Fuad Efendi
SolrDispatchFilter has it: public void destroy() { if (cores != null) { cores.shutdown(); cores = null; } } It should gracefully shutdown all background threads (used by Lucene index-merge etc) Tomcat: catalina.sh stop, shutdown.sh, etc.; Ctrl-C is not graceful

RE: Shutdown Solr

2009-08-19 Thread Fuad Efendi
Most probably Ctrl-C is graceful for Tomcat, and kill -9 too... Tomcat is smart... I prefer "/etc/init.d/my_tomcat" wrapper around catalina.sh ("su tomcat", /var/lock etc...) - ok then, Graceful Shutdown depends on how you started Tomcat.

strange sorting results: each word in field is sorted

2009-08-19 Thread Paul Rosen
I'm trying to sort, but I am not always getting the correct results and I'm not sure where to start tracking down the problem. You can see the problem here (at least until it's fixed!): http://nines.performantsoftware.com/search/saved?user=paul&name=poem If you sort by Title/Ascending, you ge

Re: Shutdown Solr

2009-08-19 Thread Paul Tomblin
On Wed, Aug 19, 2009 at 2:43 PM, Fuad Efendi wrote: > Most probably Ctrl-C is graceful for Tomcat, and kill -9 too... Tomcat is > smart... I prefer "/etc/init.d/my_tomcat" wrapper around catalina.sh ("su > tomcat", /var/lock etc...) - ok then, Graceful Shutdown depends on how you > started Tomcat.

Re: strange sorting results: each word in field is sorted

2009-08-19 Thread Erik Hatcher
On Aug 19, 2009, at 2:45 PM, Paul Rosen wrote: You can see the problem here (at least until it's fixed!): http://nines.performantsoftware.com/search/saved?user=paul&name=poem Hi Paul - that project looks familiar! :) If you sort by Title/Ascending, you get partially sorted results, but it

RE: Shutdown Solr

2009-08-19 Thread Fuad Efendi
Thanks... "kill" should be / can be graceful; "kill -9" should kill immediately... no any hang, whole point... http://www.nabble.com/Is-kill--9-safe-or-not--td24866506.html -Original Message- From: ptomb...@gmail.com [mailto:ptomb...@gmail.com] On Behalf Of Paul Tomblin Sent: August-19

WordDelimiterFilter => MultiPhraseQuery?

2009-08-19 Thread jOhn
My issue is with the use of WordDelimiterFilter and how the QueryParser (Dismax) converts the query into a MultiPhraseQuery. This is on solr 1.3 / lucene 2.4.1. For example: 1. yuma -> 3:10 to Yuma 2. yUma -> no results For #2 it gets split into y + uma and becomes a MultiPhraseQuery requiring

Re: strange sorting results: each word in field is sorted

2009-08-19 Thread Paul Rosen
Erik Hatcher wrote: On Aug 19, 2009, at 2:45 PM, Paul Rosen wrote: You can see the problem here (at least until it's fixed!): http://nines.performantsoftware.com/search/saved?user=paul&name=poem Hi Paul - that project looks familiar! :) Hi Erik! I should hope so! And I've gone a year witho

FW: Data Modeling

2009-08-19 Thread Vladimir Landman
I hit reply and sent this to just David, but I think it should go to the whole list: Hi David, I want to do 2 kinds of things with Solr  Maybe 3 in the future 1. I want to use  it on our website so that a customer can filter down products by different attributes.  So suppose we have: Inve

Re: Adding cores dynamically

2009-08-19 Thread vivek sar
Lici, We're doing similar thing with multi-core - when a core reaches capacity (in our case 200 million records) we start a new core. We are doing this via web service call (Create web service), http://wiki.apache.org/solr/CoreAdmin This is all done in java code - before writing we check the

Re: strange sorting results: each word in field is sorted

2009-08-19 Thread Erik Hatcher
On Aug 19, 2009, at 3:50 PM, Paul Rosen wrote: I'm surprised you're not seeing an exception when trying to sort on title given this configuration. Sorting must be done on single valued indexed fields, that have at most a single term indexed per document. I recommend you use copyField to c

Re: Passing a Cookie in SolrJ

2009-08-19 Thread Lance Norskog
SolrJ uses the Apache Commons HTTP client. This describes the authentication system: http://hc.apache.org/httpclient-3.x/authentication.html *This has code to use authentication* https://issues.apa

Re: Shutdown Solr

2009-08-19 Thread Lance Norskog
In production systems I have done a three-stage technique. First, use the container's standard shutdown tool. Tomcat, JBoss, Jetty all have their own. Then, sleep for maybe 60 seconds. Then do kill, sleep more, then 'kill -9'. On Wed, Aug 19, 2009 at 12:21 PM, Fuad Efendi wrote: > Thanks... "kill

Re: DataImportHandler ignoring most rows

2009-08-19 Thread Lance Norskog
It usually helps to make a database view of your query, and then load the DIH from that view. There are cases where some query syntaxes are mangled on the way to the DB. 2009/8/18 Noble Paul നോബിള്‍ नोब्ळ् > this comment says that > 7 > > the query fetched only 7 rows. If possible open a tool

Re: DataImportHandler ignoring most rows

2009-08-19 Thread erikea...@yahoo.com
I switched to the ms driver an now all is well. Must be an incompatibility with the JSQLConnect driver. Sent from my iPhone On Aug 18, 2009, at 11:47 PM, Noble Paul നോബിള്‍ नो ब्ळ् wrote: > this comment says that > 7 > > the query fetched only 7 rows. If possible open a tool and just run

RE: Data Modeling

2009-08-19 Thread Smiley, David W.
It's getting clearer Vladimir. So fundamentally your users are searching for products (apparently auto parts) and the different attributes would become navigation filters. If this is right, then your initial schema (the first email) is a start, although it's a little ambigous to interpert it b

Re: dynamic changes to schema

2009-08-19 Thread Marco Westermann
Hi, thanks for your answers, I think I have to go more in deatail. we are talking about a shop-application which have products I want to search for. This products normally have the standard attributes like sku, a name, a price and so on. But the user can add attributes to the product. So for e

Re: dynamic changes to schema

2009-08-19 Thread Erik Hatcher
However, you can have a dynamic "*" field mapping that catches all field names that aren't already defined - though all of the fields will be the same field type. Erik On Aug 19, 2009, at 5:48 PM, Marco Westermann wrote: Hi, thanks for your answers, I think I have to go more in d

【solr DIH】A problem about solr delta-imports

2009-08-19 Thread huenzhao
Hi all, There is a problem when I use solr delta-imports to update the index. I have added the "last_modified" column in the table. After I use the "full-import" command to index the database data, the "dataimport.properties" file contains nothing, and when I use the "delta-import" command to upd

Re: 【solr DIH】A problem about solr delta-imports

2009-08-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
which version of solr are you using? .Solr1.3 had a bug with this. On Thu, Aug 20, 2009 at 9:42 AM, huenzhao wrote: > > Hi all, > > There is a problem when I use solr delta-imports to update the index. I have > added the "last_modified" column in the table. After I use the "full-import" > command

Re: 【solr DIH】A problem about solr delta-imports

2009-08-19 Thread huenzhao
The version is 1.3. After I used the full-import, the tomcat log show that the solr did not call the SolrWriter class. Do you know the solution of this bug? Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: > > which version of solr are you using? .Solr1.3 had a bug with this. > > On Thu, Aug 20, 2009 at

Is wildcard search not correctly analyzed at query?

2009-08-19 Thread Alexander Herzog
Hi all sorry for the long post We are switching from indexdata's zebra to solr for a new book archival/preservation project with multiple languages, so expect more questions soon (sorry for that) The features of solr are pretty cool and more or less overwhelming! But there is one thing I found a

Re: dynamic changes to schema

2009-08-19 Thread Constantijn Visinescu
There's that or you can just change the user entered "my_author" field into "my_author_customattribute" in code after the user has entered it and add a *_customattribute to your schema. you'd have to add the postfix in code also at querytime and off you go. Constantijn On Wed, Aug 19, 2009 at 11