Aliases for fields

2009-08-18 Thread Licinio Fernández Maurelo
Hello everybody, can i set an alias for a field? Something like : is there any jira issue related? Thx -- Lici

Re: Aliases for fields

2009-08-18 Thread Avlesh Singh
What could possibly be a use case for such a need? Cheers Avlesh 2009/8/18 Licinio Fernández Maurelo > Hello everybody, > > can i set an alias for a field? Something like : > > stored="true" multiValued="false" termVectors="false" > alias="source.date"/> > > is there any jira issue related? >

Re: Aliases for fields

2009-08-18 Thread Licinio Fernández Maurelo
Currently we are trying to unmarshall objets from the index (solr bean tags didn't fully acomplish this issue in our project due to model complexity). It will be nice to set an alias for some fields to match the pojo.property name. Don't know if there is an alternative (maybe copyfield?) to implem

Re: Aliases for fields

2009-08-18 Thread Avlesh Singh
> > solr bean tags didn't fully acomplish this issue in our project due to > model complexity > Did you try annotating your pojo in this manner? @Field("index_field_name) pojoPropertyName; It will be nice to set an alias for some fields to match the pojo.property > name. Don't know if there is an

Re: Questions about MLT

2009-08-18 Thread Avlesh Singh
Invalid question? Cheers Avlesh On Mon, Aug 17, 2009 at 10:05 PM, Avlesh Singh wrote: > I have an index of documents which contain these two fields: > termVectors="true" termPositions="true" termOffsets="true"/> > termVectors="true" termPositions="true" termOffsets="true"/> > > Using the MLT

Re: How can i get lucene index format version information?

2009-08-18 Thread Licinio Fernández Maurelo
Nobody knoes how can i get exactly this info : index format : -9 (UNKNOWN) Despite of knowing 2.9-dev 794238 - 2009-07-15 18:05:08 helps, i assume that it doesn't implies an index format change Am i wrong? El 11 de agosto de 2009 11:53, Licinio Fernández Maurelo escribió: > Thanks all for your

Re: Aliases for fields

2009-08-18 Thread Licinio Fernández Maurelo
Our purpose is to reuse the data stored in our indexes serving it to multiple format clients (xml, php, json) directly (no mapper tier wanted). As clients model entities names doesn't match index field names, we want to use alias in some way to adapt the response for the client. Taking a look a

Re: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Constantijn Visinescu
Ok ... sounds like something is screwed up somewhere(tm). Hard part is figuring out where :) My suggestion: Throw everything that's remotely related to the webapp you're trying to build off that server and make sure you get all of it. (no stray bits of solr confuration files leftover anywhere).

Re: Maximum number of values in a multi-valued field.

2009-08-18 Thread Constantijn Visinescu
Hi, This would also depend on the amount of documents you got in your solr index. 10k ? 100k? 1m? 10m? 100m? I'm by no means an expert on solr but i recently had a similar question and to get my answer i grabbed a new blank solr index, modiefied my schema.xml, reindexed all my data assuming the

Re: Aliases for fields

2009-08-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
2009/8/18 Licinio Fernández Maurelo : > Hello everybody, > > can i set an alias for a field? Something like : > > stored="true" multiValued="false" termVectors="false" > alias="source.date"/> > > is there any jira issue related? yes https://issues.apache.org/jira/browse/SOLR-1205 > > Thx > > -- >

Re: Aliases for fields

2009-08-18 Thread Fergus McMenemie
>What could possibly be a use case for such a need? > I would love to see such a feature. I have a multi core solr setup with each core having utterly different content. Each core has its own "custom search app" that exploits nuances specific to a particular data set. The fieldnames are chosen a

Distributed index

2009-08-18 Thread ToJira
Hi, I am very new to Solr and overall a newbie in software developing. I have a problem with cross-platform implementation. Basically I have a local index running on a windows server 2003 aided with a web service (asp.net) for the user queries. However, I need to add another index on a remote Lin

Re: Issue with Collection & Distribution

2009-08-18 Thread william pink
Hi, Sorry for the delayed response didn't even realise I had got a reply, those logs are from the slave and the both version of Solr are the same Solr Implementation Version: 1.2.0 - Yonik - 2007-06-02 17:35:12 It maybe worth upgrading them? Thank you for the assistance, Will On Thu, Aug 13, 2

Index health checking

2009-08-18 Thread Licinio Fernández Maurelo
As you suppose, i'm asking if currently solr implements this functionality or there is any related jira issue. A few days ago, our solr server suffered an unsafe power shutdown. After restoring, we found wrong behaviour (we got NullPointerException when aplying sort criteria in some queries) due t

Writing and using your own Query class in solr 1.4 (trunk)

2009-08-18 Thread Jérôme Etévé
Hi all, I have a custom search component which uses a query I wrote. Basically, this Query (called DocSetQuery) is a Query decorator that skips any document which is not in a given document set. My code used to work perfectly in solr 1.3 but in solr 1.4, it seems that my DocSetQuery has lost all

MultiCore Queries? are they possible

2009-08-18 Thread Ninad Raut
Hi, Can we create a Join query between two indexes on two cores? Is this possible in Solr? I have a index which stores author profiles and other index which stores content and a author id as a reference. Can I query as select Content,AuthorName from Core0,Core1 where core0.authorid = core1.authorid

Re: Writing and using your own Query class in solr 1.4 (trunk)

2009-08-18 Thread Mark Miller
You have run into some stuff that has been somewhat rolled back in Lucene. QueryWieght, and the methods it brought have been reverted. Shortly (when Solr trunk updates Lucene), Solr will go back to just createWeight and weight. The main change that will be left is that Weight will be an abstr

Re: Index health checking

2009-08-18 Thread Grant Ingersoll
See http://issues.apache.org/jira/browse/SOLR-566. Patches welcome. On Aug 18, 2009, at 7:46 AM, Licinio Fernández Maurelo wrote: As you suppose, i'm asking if currently solr implements this functionality or there is any related jira issue. A few days ago, our solr server suffered an unsafe p

Proximity Search

2009-08-18 Thread Ninad Raut
Hi, I want to count the words between two significant words like "shell" and "petroleum". Or want to write a query to find all the documents where the content has "shell" and "petroleum" in close proximity of less than 10 words between them. Can such quries be created in Solr? Regards, Ninad Raut.

Re: Issue with Collection & Distribution

2009-08-18 Thread Bill Au
I say it is worth upgrading since 1.2 is old. 1.4 is almost ready to be released. So you may want to wait a little while longer. There are many nice new features in 1.4. There are performance improvement too. In the mean time, you can just get the latest version of the scripts from SVN. Those

Re: Writing and using your own Query class in solr 1.4 (trunk)

2009-08-18 Thread Jérôme Etévé
Hi Mark, Thanks for clarifying this. So should I keep both sets of method implemented? I guess it won't hurt when solr trunk will use the updated version of lucene without those methods. What I don't get is that neither my createWeight or createQueryWeight methods seem to be called when I call r

Release Date Solr 1.4

2009-08-18 Thread Daniel Knapp
Hello Mailinglist, does anyone know the release date from Solr 1.4? Thanks for your reply. Regards, Daniel

Re: Writing and using your own Query class in solr 1.4 (trunk)

2009-08-18 Thread Mark Miller
I'm pretty sure one of them is called. In the version you have: public void search(Query query, HitCollector results) throws IOException { search(createQueryWeight(query), null, new HitCollectorWrapper(results)); } protected QueryWeight createQueryWeight(Query query) throws IOExceptio

Re: Release Date Solr 1.4

2009-08-18 Thread Mark Miller
Daniel Knapp wrote: Hello Mailinglist, does anyone know the release date from Solr 1.4? Thanks for your reply. Regards, Daniel The last note I saw said we hope to release 1.4 a week or so after Lucene 2.9 (though of course a week may not end up being enough). It will follow Lucene 2.9 tho

Re: Query not working as expected

2009-08-18 Thread Matt Schraeder
Awesome that works great. Thanks a lot! >>> markrmil...@gmail.com 8/17/2009 5:32:46 PM >>> Matt Schraeder wrote: > I'm attempting to write a query as follows: > > ($query^10) OR (NOT ($query)) which effectively would return everything, but > if it matches the first query it will get a higher sc

Re: Release Date Solr 1.4

2009-08-18 Thread Constantijn Visinescu
Last i heard the eta was aprox a month, but they won't release it untill it's ready. Check JIRA here for the list of open issues that need fixing before 1.4 https://issues.apache.org/jira/secure/IssueNavigator.jspa?sorter/field=updated&sorter/order=DESC Constantijn Visinescu On Tue, Aug 18, 2009

Re: Writing and using your own Query class in solr 1.4 (trunk)

2009-08-18 Thread Jérôme Etévé
That's right. I just had another decorator which was not adapted for the new API. My fault .. Thanks, Jerome. 2009/8/18 Mark Miller : > I'm pretty sure one of them is called. In the version you have: > > public void search(Query query, HitCollector results) > throws IOException { > search(c

Re: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Aaron Aberg
Constantijn, First of all, I want you to know how much I appreciate you not giving up on me. Second of all, your instructions were really great. I think that I am getting closer to solving this issue. I am STILL get that error but after a full tomcat reboot it picked up my solr.home environment va

Re: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Constantijn Visinescu
Am pretty sure solr.xml is if you want to define multiple solr cores for your application.So it makes sense that solr checks for that first, however if it doesn't find one it continues to start up with a single core. I KNOW it runs just fine for me without a solr.xml. The exception seems to be c

Re: Issue with Collection & Distribution

2009-08-18 Thread william pink
Many thanks Bill Yeah I did take a look at the features of 1.4 and they do look fantastic very much looking forward to them On Tue, Aug 18, 2009 at 1:42 PM, Bill Au wrote: > I say it is worth upgrading since 1.2 is old. 1.4 is almost ready to be > released. So you may want to wait a little wh

Re: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Marco Westermann
/spellings.txt /usr/share/solr/conf/admin-extra.html /usr/share/solr/conf/stopwords.txt /usr/share/solr/README.txt I'm pretty sure I should have a solr.xml somewhere for tomcat. What do you think? Thanks again for all the help, Aaron __ Hinweis von ESET NOD32 Antivirus, Signaturdatenban

Re: spellcheck component in 1.4 distributed

2009-08-18 Thread Ian Connor
Once it goes through Ruby the stack is not as neat but here is what the error comes through as: String_index_out_of_range_1 __javalangStringIndexOutOfBoundsException_String_index_out_of_range_1 __at_javalangAbstractStringBuilderreplaceAbstractStringBuilderjava797 __at_javalangStringBuilderreplaceS

Re: Proximity Search

2009-08-18 Thread Toby Cole
See the Lucene query parser syntax documentation: http://lucene.apache.org/java/2_3_2/queryparsersyntax.html#Proximity%20Searches basically... "shell petroleum"~10 should do the trick (if you're using a standard request handler, can't remember if dismax supports proximity). On 18 A

Is negative boost possible?

2009-08-18 Thread Larry He
Hi all, I am looking for a way to assign negative boost to a term in Solr query. Our use scenario is that we want to boost matching documents that are updated recently and penalize those that have not been updated for a long time. There are other terms in the query that would affect the scores as

Re: Proximity Search

2009-08-18 Thread Erik Hatcher
On Aug 18, 2009, at 8:28 AM, Ninad Raut wrote: Hi, I want to count the words between two significant words like "shell" and "petroleum". Or want to write a query to find all the documents where the content has "shell" and "petroleum" in close proximity of less than 10 words between them.

[ANNOUNCEMENT] Newly released book: Solr 1.4 Enterprise Search Server

2009-08-18 Thread Smiley, David W.
Fellow Solr users, I've finally finished the book "Solr 1.4 Enterprise Search Server" with my co-author Eric. We are proud to present the first book on Solr and hope you find it a valuable resource. You can find full details about the book and purchase it here: http://www.packtpub.com/solr-1

Solr 1.3 JNDI Datasource

2009-08-18 Thread brianeno
Hello, We have deployed Solr in our application within Weblogic and all is working well. The last piece I am struggling with is configuring the datasource for our data import handler to work with our Weblogic configured JNDI datasource. Can anyone lead me in the right direction how to configur

Can I search for a term in any field or a list of fields?

2009-08-18 Thread Paul Tomblin
I've got "text" and so if I do an unqualified search it only finds in the field text. If I want to search title, I can do "title:foo", but what if I want to find if the search term is in any field, or if it's in "text" or "title" or "concept" or "keywords"? I already tried "*:foo", but that throw

Re: Solr 1.3 JNDI Datasource

2009-08-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
DIH in Solr1.3 does not support JNDI datasource. Only 1.supports. On Tue, Aug 18, 2009 at 7:41 PM, brianeno wrote: > > Hello, >  We have deployed Solr in our application within Weblogic and all is > working well. The last piece I am struggling with is configuring the > datasource for our data imp

MoreLikeThis (MLT) in 1.4 distributed

2009-08-18 Thread mike anderson
I'm trying to get MLT working in 1.4 distributed mode. I was hoping the patch *SOLR-788 *would do the trick, but after applying the patch by hand to revision 737810 (it kept choking on component/MoreLikeThisComponent.java) I still get nothing. The URL I am using is this: http://localhost:8983/solr

Re: Can I search for a term in any field or a list of fields?

2009-08-18 Thread Marco Westermann
Hi Paul, I would say, you should use the copyField tag in the schema. eg: the text-field has to be difined as multivalued=true. When you now do an unqualified search, it will search every field, which is copied to the text-field. with best regards, Marco Westermann Paul Tomblin schrieb:

Re: Is negative boost possible?

2009-08-18 Thread Koji Sekiguchi
Hi, Use decimal figure less than 1, e.g. 0.5, to express less importance. Koji Larry He wrote: Hi all, I am looking for a way to assign negative boost to a term in Solr query. Our use scenario is that we want to boost matching documents that are updated recently and penalize those that have n

Re: Can I search for a term in any field or a list of fields?

2009-08-18 Thread Paul Tomblin
So if I want to make it so that the default search always searches three specific fields, I can make another field multi-valued that they are all copied into? On Tue, Aug 18, 2009 at 10:46 AM, Marco Westermann wrote: > I would say, you should use the copyField tag in the schema. eg: > > > > the t

Re: Can I search for a term in any field or a list of fields?

2009-08-18 Thread Toby Cole
I would consider using the dismax query handler. This allows you to send a list of keywords or phrases along with the fields to search over. e.g., you could use ?qt=dismax&q=foo&qf=title+text+keywords+concept More details here: http://wiki.apache.org/solr/DisMaxRequestHandler On 18 Aug 2009,

solrconfig.xml and ExtractingRequestHandler

2009-08-18 Thread Kevin Miller
I am using the 8/11/09 nightly build of Solr and have a couple of questions about the ExtractingRequestHandler in the solrconfig.xml file. 1. What is the purpose of 'startup="lazy"' in the requestHandler? 2. Is there a way to change the information in the requestHandler so that the text within a

Re: Can I search for a term in any field or a list of fields?

2009-08-18 Thread Marco Westermann
exactly! for example you could create a field called "all". And you copy your fields to it, which should be searched, when all fields are searched. then you have two possibilities: either you make this field the defaultSearchField for use of unqualified searches. or you qualify the field in th

dynamic changes to schema

2009-08-18 Thread Marco Westermann
Hi there, is there a possibility to change the solr-schema over php dynamically. The web-application I want to index at the moment has the feature to add fields to entitys and you can tell this fields that they are searchable. To realize this with solr the schema has to change when a searchabl

Re: solrconfig.xml and ExtractingRequestHandler

2009-08-18 Thread Mark Miller
Kevin Miller wrote: I am using the 8/11/09 nightly build of Solr and have a couple of questions about the ExtractingRequestHandler in the solrconfig.xml file. 1. What is the purpose of 'startup="lazy"' in the requestHandler? Makes it so the RequestHandler won't be inited until its actually a

Re: MoreLikeThis (MLT) in 1.4 distributed

2009-08-18 Thread Grant Ingersoll
Are there errors in the logs? -Grant On Aug 18, 2009, at 10:42 AM, mike anderson wrote: I'm trying to get MLT working in 1.4 distributed mode. I was hoping the patch *SOLR-788 *would do the trick, but after applying the patch by hand to revision 737810 (it kept choking on component/MoreLike

Strange error with shards

2009-08-18 Thread ahammad
Hello, I have been using multicore/shards for the past 5 months or so with no problems at all. I just added another core to my Solr server, but for some reason I can never get the shards working when that specific core is anywhere in the URL (either in the shards list or the base URL). HTTP Stat

Re: Can I search for a term in any field or a list of fields?

2009-08-18 Thread Paul Tomblin
On Tue, Aug 18, 2009 at 11:04 AM, Marco Westermann wrote: > exactly! for example you could create a field called "all". And you copy > your fields to it, which should be searched, when all fields are searched. > Awesome, that worked great. I made my "all" field 'stored="false" indexed="true"' and

Re: Release Date Solr 1.4

2009-08-18 Thread Michael
I think this link gets you the exact bug count: it's Constantijn's link, filtered to Unresolved Solr issues marked for fixing in 1.4: https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&&pid=12310230&fixfor=12313351&resolution=-1&sorter/field=issuekey&sorter/order=DESC

Re: MoreLikeThis (MLT) in 1.4 distributed

2009-08-18 Thread mike anderson
There doesn't appear to be any related errors in the log. I've included it below anyhow (there is a java.lang.NumberFormatException, i'm not sure what that is). thanks, mike for the query: http://localhost:8983/solr/select?q=%22theory%20of%20colorful%20graphs%22&mlt=true&mlt.fl=abstract&mlt.mindf=

Passing a Cookie in SolrJ

2009-08-18 Thread Ramirez, Paul M (388J)
Hi All, The project I am working on is using Solr and OpenSSO (Sun's single sign on service). I need to write some sample code for our users that shows them how to query Solr and I would just like to point them to the SolrJ documentation but I can't see an easy way to be able to pass a cookie w

Re: dynamic changes to schema

2009-08-18 Thread Constantijn Visinescu
use a dynamic field ? On Tue, Aug 18, 2009 at 5:09 PM, Marco Westermann wrote: > Hi there, > > is there a possibility to change the solr-schema over php dynamically. The > web-application I want to index at the moment has the feature to add fields > to entitys and you can tell this fields that t

Re: Release Date Solr 1.4

2009-08-18 Thread Chris Hostetter
: In-Reply-To: : Subject: Release Date Solr 1.4 : References: : <4a8a9c42.20...@gmail.com> : http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a

Re: Release Date Solr 1.4

2009-08-18 Thread Yonik Seeley
On Tue, Aug 18, 2009 at 9:02 AM, Mark Miller wrote: > The last note I saw said we hope to release 1.4 a week or so after Lucene > 2.9 (though of course a week may not end up being enough). Yep, I think this is still doable. -Yonik http://www.lucidimagination.com

RE: Passing a Cookie in SolrJ

2009-08-18 Thread Fuad Efendi
> some sample code for our users that shows them how to query Solr - I believe you don't have to use SolrJ to query Solr; SolrJ can query and parse XML response from server; if your clients can use raw URL as a query and raw XML (JSON etc.) as a response - you don't need SolrJ. To pass cookie wi

Can synonyms be defined in a multi-valued field or a database?

2009-08-18 Thread Kelly Taylor
I need the ability to remotely administer synonyms for each of my Solr standalone instances. It seems that my only option is that of uploading a file per instance, restarting the respective Solr instance(s), and then rebuilding my indexes. Can synonyms be defined in a multi-valued field or a data

RE: SOLR - extremely strange behavior! Documents disappeared...

2009-08-18 Thread Fuad Efendi
UPDATE: Crazy staff with SLES10 SP2 default installation/partitioning, LVM (Logical Volume Manager) shows 400Gb available, but... I lost 90% of index without even noticing that! Aug 16, 2009 8:04:32 PM org.apache.solr.common.SolrException log SEVERE: java.io.IOException: No space left on device

Re: Passing a Cookie in SolrJ

2009-08-18 Thread Chris Hostetter
: Subject: Passing a Cookie in SolrJ : In-Reply-To: <8efd35820908180833u4140682bjcfbf2816b1710...@mail.gmail.com> http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, inste

How to boost fields with many terms against single-term?

2009-08-18 Thread Fuad Efendi
I don't want single-term docs such as "home" to appear in top for simple search for a home; I need "home improvement made easy" in top... How to implement it at query time? Thanks!

Re: Is negative boost possible?

2009-08-18 Thread Chris Hostetter
: Use decimal figure less than 1, e.g. 0.5, to express less importance. but that's stil la positive boost ... it still increases the scores of documents that match. the only way to "negative boost" is to "positively boost" the inverse... (*:* -field1:value_to_penalize)^10 : > I am loo

Using Solr Cell to index a Word Document

2009-08-18 Thread Kevin Miller
I am using the Solr nightly build 8/11/09. I have set the text field in the solrconfig.xml file to be stored. I index an MS Word document and when I search for a word in the text of the document and it pulls up the xml format. The text field is showing the text of the document but there are a

Re: How to boost fields with many terms against single-term?

2009-08-18 Thread Bill Au
Lucene's default scoring formula gives shorter fields a higher score: http://lucene.apache.org/java/2_4_0/api/org/apache/lucene/search/Similarity.html Sounds like you want the opposite. You can write your own Similarity class overriding the lengthNorm() method: http://lucene.apache.org/java/2_4

Re: How can i get lucene index format version information?

2009-08-18 Thread Shalin Shekhar Mangar
2009/8/18 Licinio Fernández Maurelo > Nobody knoes how can i get exactly this info : index format : -9 (UNKNOWN) > I think Luke may be using an older version of Lucene which is not able to read the index created by Solr. > > Despite of knowing 2.9-dev 794238 - > 2009-07-15 18:05:08 helps, i as

Re: Maximum number of values in a multi-valued field.

2009-08-18 Thread Shalin Shekhar Mangar
On Tue, Aug 18, 2009 at 4:20 AM, Arv wrote: > > All, > We are considering some new changes to our Solr schema to better support > some new functionality for our application. To that extent, we want to add > an additional field that is multi-valued, but will contain a large number > of > values pe

Replication over multi-core solr

2009-08-18 Thread vivek sar
Hi, We use multi-core setup for Solr, where new cores are added dynamically to solr.xml. Only one core is active at a time. My question is how can the replication be done for multi-core - so every core is replicated on the slave? I went over the wiki, http://wiki.apache.org/solr/SolrReplication

Re: Distributed index

2009-08-18 Thread Shalin Shekhar Mangar
On Tue, Aug 18, 2009 at 3:49 PM, ToJira wrote: > > Hi, > > I am very new to Solr and overall a newbie in software developing. I have a > problem with cross-platform implementation. Basically I have a local index > running on a windows server 2003 aided with a web service (asp.net) for > the > use

Re: spellcheck component in 1.4 distributed

2009-08-18 Thread Shalin Shekhar Mangar
On Mon, Aug 17, 2009 at 8:32 PM, Ian Connor wrote: > Hi, > > Just a quick update to the list. Mike and I were able to apply it to 1.4 > and > it works. We have it loaded on a few production servers and there is an odd > "StringIndexOutOfBoundsException" error but most of the time it seems to > wo

Re: Using Solr Cell to index a Word Document

2009-08-18 Thread Mark Miller
Solr defers to Tika for this. Tika uses getParagraph text from the POI WordExtractor class: http://poi.apache.org/apidocs/org/apache/poi/hwpf/extractor/WordExtractor.html POI appears to be in limbo and I'm not seeing anything in WordExtractor that looks like it might help you. I'd inquire at

Re: [ANNOUNCEMENT] Newly released book: Solr 1.4 Enterprise Search Server

2009-08-18 Thread Shalin Shekhar Mangar
Fantastic! This is great news for Solr! Congratulations! You might want to post this to the general-lucene mailing list and the linkedin group too. On Tue, Aug 18, 2009 at 7:39 PM, Smiley, David W. wrote: > Fellow Solr users, > > I've finally finished the book "Solr 1.4 Enterprise Search Server

Faceting Performance Factors

2009-08-18 Thread CameronL
Our current search is faceting on a single integer field. The field is multi-valued. facet=true facet.mincount=1 facet.limit=-1 facet.field=fieldA The number of unique values in our index for fieldA is around 8000, and a typical query can return about 500 counts. A typical single document can ha

Re: dynamic changes to schema

2009-08-18 Thread Marco Westermann
es.de Handelsregister Köln HR B 30904 Ust.-Id.: DE199672015 Finanzamt Köln-Nord. UstID: nicht vergeben Aufsichtsratsvorsitzender: Michael Morgenstern Vorstand: Andrej Radonic, Peter Zander __ Hinweis von ESET NOD32 Antivirus, Signaturdatenbank-Version 4346 (20090818) __ E-Mail

RE: DataImportHandler - very slow delta import

2009-08-18 Thread Matthew Painter
Thanks; that confirms my observed behaviour. However, why would the delta query have to make a single db call per changed row? For simple delta queries like mine below, batching a chunk of rows at the time from the database seems quite doable. Or are there less-trivial situations where batching

DataImportHandler ignoring most rows

2009-08-18 Thread Erik Earle
Using: - apache-solr-1.3.0 - java 1.6 - tomcat 6 - sql server 2005 w/ JSQLConnect 4.0 driver I have a group table with 3007 rows. I have confirmed the key is unique with "select distinct id from group" and it returns 3007. When i re-index using http://host:port/solr/dataimport?command=full-im

Re: MoreLikeThis (MLT) in 1.4 distributed

2009-08-18 Thread mike anderson
Perhaps it was something about the way I applied the patch by hand, but after trying it again (on a later revision, maybe that was the trick), I got solr to acknowledge I was using MLT when also passing the shards parameter. However, unlike a query without shards, I get numFound=0 for all results:

RE: DataImportHandler - very slow delta import

2009-08-18 Thread Matthew Painter
I was wary of the potential maintenance issues and clutter involved with copying each entity block as suggested below (they're all large and there are around ten of them), so I just modifying the main full import query to be of the syntax: query="select x,y,z from table where modifiedtime > '$

Re: CorruptIndexException: Unknown format version

2009-08-18 Thread Chris Hostetter
: how can that happen, it is a new index, and it is already corrupt? : : Did anybody else something like this? "Unknown format version" doesn't mean your index is corrupt .. it means the version of LUcnee parsing the index doesn't recognize the index format version ... typically it means you a

Re: Relevant results with DisMaxRequestHandler

2009-08-18 Thread Chris Hostetter
: The 'qf' parameter used in the dismax seems to work with a 'AND' separator. : I have much more results without dixmax. Is there any way to keep the same : amount of document and process the 'qf' ? did you read any of the docs on dismax? http://wiki.apache.org/solr/DisMaxRequestHandler

Re: DataImportHandler ignoring most rows

2009-08-18 Thread Erik Earle
Upgraded to the tip from svn and still no love. - Original Message From: Erik Earle To: solr-user@lucene.apache.org Sent: Tuesday, August 18, 2009 3:16:47 PM Subject: DataImportHandler ignoring most rows Using: - apache-solr-1.3.0 - java 1.6 - tomcat 6 - sql server 2005 w/ JSQLConnec

Re: schema configuration with different kind of score report

2009-08-18 Thread Chris Hostetter
: Hence, some sort of different query will be applied, which I am unable to : ascertain. well that would be step one. before anyone can help you generate a "different kind of score report" you have to be able to describe the general algorithm you want for determining when there is a match and

Re: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Aaron Aberg
Marco might be right about the JRE thing. Here is my classpath entry when Tomcat starts up java.library.path: /usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0/jre/lib/i386/client:/usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0/jre/lib/i386:/usr/lib/jvm/java-1.6.0-openjdk-1.6.0.0/jre/../lib/i386:/usr/java/packages/l

Re: Faceting Performance Factors

2009-08-18 Thread Jason Rutherglen
Hi Cameron, You'll need to upgrade to Solr 1.4 as the 1.3 method of faceting is quite slow (i.e. intersecting bitsets). 1.4 uses UnInvertedField which caches the terms per doc and iterates/counts them. The 1.3 method is slow because for every term (i.e. unique field value) there needs to be a bits

RE: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Fuad Efendi
The only difference between JRE and JDK (from TOMCAT viewpoint) is absence of javac compiler for JSPs. But it will complain only if you try to use JSPs (via admin console). Have you tried to install SOLR on your localbox and play with samples described at many WIKI pages? -Original Message-

RE: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Fuad Efendi
I forgot to add: compiler is inside "tools.jar" in some cases if I am correct... doesn't matter really... try to access Tomcat default homepage before trying to use SOLR! The only difference between JRE and JDK (from TOMCAT viewpoint) is absence of javac compiler for JSPs. But it will

RE: [ANNOUNCEMENT] Newly released book: Solr 1.4 Enterprise Search Server

2009-08-18 Thread Fuad Efendi
Some very smart guys at Hadoop even posted some discount codes at WIKI, and it's even possible to buy in-advance not published yet chapters :) - everything changes extremely quick... Why did you keeep it in secret? Waiting for SOLR-4.1 :))) - do you still use outdated pre-1.4 "faceting" term in y

Spanish Stemmer

2009-08-18 Thread Darien Rosa
Hello, I am trying to configure Solr to index Spanish documents and I've found some problems with the Spanish stemmer. I have a basic install using Tomcat. I suspect that the Spanish stemmer isn't working very well. The site http://snowball.tartarus.org/algorithms/spanish/stemmer.html shows a s

Re: Spanish Stemmer

2009-08-18 Thread Robert Muir
hi, it looks like you might just have a simple typo: if you change it to language="Spanish" it should work. -- Robert Muir rcm...@gmail.com

Re: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Aaron Aberg
Tomcat is running fine. It's solr that is having the issue. I keep seeing people talk about this: -Dsolr.solr.home='/some/path' Should I be putting that somewhere? Or is that already taken care of when I edited the web.xml file in my solr.war file? On Tue, Aug 18, 2009 at 7:29 PM, Fuad Efendi wr

RE: [ANNOUNCEMENT] Newly released book: Solr 1.4 Enterprise Search Server

2009-08-18 Thread Smiley, David W.
Hi Faud. It's true I didn't publicize its release beforehand; I have no idea if it is normal to do so or not. I guess I'm a bit shy. I honestly have no clue what you're referring to as the successor to the "faceting" term. ~ David Smiley From: Fuad Efe

Re: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Chris Hostetter
: -Dsolr.solr.home='/some/path' : : Should I be putting that somewhere? Or is that already taken care of : when I edited the web.xml file in my solr.war file? No ... you do not need to set that system property if you already have it working because of modifications to the web.xml ... according

RE: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Fuad Efendi
>>-Dsolr.solr.home='/some/path' CORRECT: -Dsolr.data.dir=.. It should be in java startup parameters; for instance, JAVA_OPTS="-server -Zms32768M -Xmx32768M -Dsolr.data.dir=/some/path" inside catalina.sh as a first statement... According to the logs you posted probably mistake in s

Re: dynamic changes to schema

2009-08-18 Thread Constantijn Visinescu
t;>> >>> interSales AG Internet Commerce >>> Subbelrather Str. 247 >>> 50825 Köln >>> >>> Tel 02 21 - 27 90 50 >>> Fax 02 21 - 27 90 517 >>> Mail i...@intersales.de >>> Mail m...@intersales.de >>> Web www.intersales.d

RE: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-18 Thread Fuad Efendi
DO NOT RELY on your hosting provider. They use automated tools creating complete mess with approved for production on CentOS versions of Lucene, Servlet-API, java.util.* package, and etc; look at this: > Here is my classpath entry when Tomcat starts up > java.library.path: > /usr/lib/jvm/java-1.6.

Re: DataImportHandler ignoring most rows

2009-08-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
this comment says that 7 the query fetched only 7 rows. If possible open a tool and just run the same query and see how many rows are returned On Wed, Aug 19, 2009 at 3:46 AM, Erik Earle wrote: > Using: > - apache-solr-1.3.0 > - java 1.6 > - tomcat 6 > - sql server 2005 w/ JSQLConnect 4.0 driv

Re: Replication over multi-core solr

2009-08-18 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Wed, Aug 19, 2009 at 2:27 AM, vivek sar wrote: > Hi, > >  We use multi-core setup for Solr, where new cores are added > dynamically to solr.xml. Only one core is active at a time. My > question is how can the replication be done for multi-core - so every > core is replicated on the slave? repli