Re: SolrJ new javabin format

2010-10-19 Thread Shawn Heisey
On 10/19/2010 2:40 PM, Chris Hostetter wrote: The formats are not currently compatible. The first priority was to get the format fixed so it was using true UTF8 (instead of Java's bastardized modified UTF8) in a way that would generate a clear error if people attempted to use an older SolrJ to

Re: Lucene vs Solr

2010-10-19 Thread Pradeep Singh
Is that right? On Tue, Oct 19, 2010 at 11:08 PM, findbestopensource < findbestopensou...@gmail.com> wrote: > Hello all, > > I have posted an article Lucene vs Solr > http://www.findbestopensource.com/article-detail/lucene-vs-solr > > Please feel free to add your comments. > > Regards > Aditya > w

Re: Implementing Search Suggestion on Solr

2010-10-19 Thread Pablo Recio
Yeah, I know. Does anyone could tell me wich one is the good way? Regards, > What an interesting application :-) > > Dennis Gearon > > Signature Warning > > It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others’ mistakes, so yo

Not able to subscribe to ML

2010-10-19 Thread Abdullah Shaikh
Just a test mail to check if my mails are reaching the ML. I dont know, but my mails are failing to reach the ML with the following error : Delivery to the following recipient failed permanently: solr-user@lucene.apache.org Technical details of permanent failure: Google tried to deliver you

Multiple partial word searching with dismax handler

2010-10-19 Thread Chamnap Chhorn
Hi, I have some problem with combining the query with multiple parital-word searching in dismax handler. In order to make multiple partial word searching, I use EdgeNGramFilterFactory, and my query must be something like this: "name_ngram:sun name_ngram:hot" in q.alt combined with my search handle

Re: Negative filter using the "appends" element

2010-10-19 Thread Erick Erickson
I suspect, but don't know for sure, that you need to modify it to *:* - tag:test but I confess I'm not at all sure that it'll work in this context.. Best Erick On Tue, Oct 19, 2010 at 11:10 AM, Kevin Cunningham < kcunning...@telligent.com> wrote: > I'm using Solr 1.4 with the standard reque

Re: Documents and cores

2010-10-19 Thread Erick Erickson
This is something most everybody has to get over when transitioning from the DB world to Solr/Lucene. The schema describes the #possible# fields in the document. There is absolutely no requirement that #every# document in the index have all these fields in them (unless #you# define it so with . So

Re: boosting injection

2010-10-19 Thread Erick Erickson
The main disadvantage of index-time boosting is that you must reindex your corpus entirely if you want to alter the boost factors. And there's no very good way to anticipate what boost factors will give you the results you want I wonder if you could cheat and do some basic string processing on

Re: Spatial

2010-10-19 Thread Grant Ingersoll
On Oct 19, 2010, at 6:23 PM, Pradeep Singh wrote: > https://issues.apache.org/jira/browse/LUCENE-2519 > > If I change my code as per 2519 > > to have this - > > public double[] coords(double latitude, double longitude) { >double rlat = Math.toRadians(latitude); >double rlong = Math.to

Re: Solr PHP PECL Extension going to Stable Release - Wishing for Any New Features?

2010-10-19 Thread Israel Ekpo
Hi All, Just wanted to post an update on where we stand with all the requests for new features List of Features Requested In SOLR PECL Extension 1. Ability to Send Custom Requests to Custom URLS other than select, update, terms etc. 2. Ability to add files (pdf, office documents etc) 3. Windows

Re: query results file for trec_eval

2010-10-19 Thread Ahmet Arslan
> I am a student and I am trying to run evaluation for TREC > format document. I > have the judgments. I would like to have the output of my > queries for use > with trec_eval software. Can someone please point me how to > make Solr spit > out output in this format? Or at least point me to some > m

xi:include

2010-10-19 Thread Peter A. Kirk
Hi I am trying to use xi:include in my solrconfig.xml. For example: http://localhost/config/config.aspx"; /> This works fine, as long as config.aspx exists, and as long as it returns valid xml. Sometimes though, the config.aspx can fail, and return invalid xml. Then I get a problem, as Solr's

Spatial

2010-10-19 Thread Pradeep Singh
https://issues.apache.org/jira/browse/LUCENE-2519 If I change my code as per 2519 to have this - public double[] coords(double latitude, double longitude) { double rlat = Math.toRadians(latitude); double rlong = Math.toRadians(longitude); double nlat = rlong * Math.cos(rlat); re

Re: Negative filter using the "appends" element

2010-10-19 Thread Ahmet Arslan
> Does not work: >     >       name="fq">-tag:test >     Can you append &echoParams=all to your search url and verify that that fq=-tag:test included in response?

Re: Documents and Cores, take 2

2010-10-19 Thread Ken Stanley
Ron, In the past I've worked with SOLR for a product that required the ability to search - separately - for companies, people, business lists, and a combination of the previous three. In designing this in SOLR, I found that using a combination of explicit field definitions and dynamic fields ( htt

Re: How can i get collect stemmed query?

2010-10-19 Thread Ahmet Arslan
Oh you are constructing the string 'fly +body:away' in your StemFilter? Just to make sure, does this q=+body:(fly away) return your document? And analysis.jsp (at query time) displays 'fly +body:away' from the string 'flyaway'? I don't know why are you doing this but your stemfilter should return

Documents and Cores, take 2

2010-10-19 Thread Olson, Ron
Hi all- I have a newbie design question about documents, especially with SQL databases. I am trying to set up Solr to go against a database that, for example, has "items" and "people". The way I see it, and I don't know if this is right or not (thus the question), is that I see both as separate

Re: SolrJ new javabin format

2010-10-19 Thread Chris Hostetter
: The CHANGES.txt file in branch_3x says that the javabin format has changed in : Solr 3.1, so you need to update SolrJ as well as Solr. Is the SolrJ included : in 3.1 compatible with both 3.1 and 1.4.1? If not, that's going to make a : graceful upgrade of my replicated distributed installation

Re: Upgrade to Solr 1.4, very slow at start up when loading all cores

2010-10-19 Thread Chris Hostetter
: We will take this approach in our production environment but meanwhile I am : curious if this issue will be addressed: it seems the new/first searchers do : not really buy any performance benefits because it uses so much memory, : especially at core loading time. There's nothing inheriently wro

Re: Documents and cores

2010-10-19 Thread Chris Hostetter
: Subject: Documents and cores : References: <4cbd939c.3020...@atcult.it> : : In-Reply-To: http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh ema

Re: I need to indexing the first character of a field in another field

2010-10-19 Thread Renato Wesenauer
Hi guys, I read all suggestions and I did some tests, and finally, the indexing process is working. I did the extraction of initial character of three fields. Here are the functions: function extraiInicial(valor) { if (valor != "" && valor != null) { valor = valor.substring(0

Re: Dismax phrase boosts on multi-value fields

2010-10-19 Thread Jonathan Rochkind
You are correct. The query needs to match as a phrase. It doesn't need to match "everything". Note that if a value is: "long sentence with my blue rabbit in it", then query "my blue rabbit" will also match as a phrase, for phrase boosting or query purposes. Jonathan Jason Brown wrote: H

Dismax phrase boosts on multi-value fields

2010-10-19 Thread Jason Brown
Hi - I have a multi-value field, so say for example it consists of 'my black cat' 'my white dog' 'my blue rabbit' The field is whitespace parsed when put into the index. I have a phrase query boost configured on this field which I understand kicks in when my search term is found entirely in

Facet Use Case

2010-10-19 Thread Edgar Espina
Hi Guys, Let me describe you the use case for our search applications: a- The user enter to the search application to latest 20 document are displayed. b- A Tag cloud component is populate with the facet available from a. c- The user type something in the text box. d- The documents are tagged

does solr support posting gzipped content?

2010-10-19 Thread danomano
Hi folks, I was wondering if there is any native support for posting gzipped files to solr? i.e. I'm testing a project where we inject our log files into solr for indexing, these logs files are gzipped, and I figure it would take less network bandwith to inject gzipped files directl. is there a

Re: query results file for trec_eval

2010-10-19 Thread Ezequiel Calderara
I don't know anything about the TREC format document, but i think if you want text output, you can do it by using the http://wiki.apache.org/solr/XsltResponseWriter to transform the xml to a text... On Tue, Oct 19, 2010 at 12:29 PM, Valli Indraganti < valli.indraga...@gmail.com> wrote: > Hello! >

FW: Dismax phrase boosts on multi-value fields

2010-10-19 Thread Jason Brown
-Original Message- From: Jason Brown Sent: Tue 19/10/2010 13:45 To: d...@lucene.apache.org Subject: Dismax phrase boosts on multi-value fields Hi - I have a multi-value field, so say for example it consists of 'my black cat' 'my white dog' 'my blue rabbit' The field is whitespace p

RE: query results file for trec_eval

2010-10-19 Thread abhatna...@vantage.com
If I understand your use case correctly. You will have to write your own response writer. Only the below response writers are available . Query response writer Description XMLResponseWriter The most general-purpose response format outputs its results in XML, as demonstrated by the blogging a

query results file for trec_eval

2010-10-19 Thread Valli Indraganti
Hello! I am a student and I am trying to run evaluation for TREC format document. I have the judgments. I would like to have the output of my queries for use with trec_eval software. Can someone please point me how to make Solr spit out output in this format? Or at least point me to some material

Negative filter using the "appends" element

2010-10-19 Thread Kevin Cunningham
I'm using Solr 1.4 with the standard request handler and attempting to apply a negative fq for all requests via the "appends" elements but its not being applied. Is this an intended limitation? I looked in JIRA for an existing issue but nothing jumped out. Works fine: tag:test

Timeouts in distributed search using Solr + Zookeeper

2010-10-19 Thread Cinquini, Luca (3880)
Hi, we are looking at Solr+Zookeeper as the architecture for enabling federated searches among geographically distributed data centers. I wonder if anybody can comment on what is the status of enabling timeouts with respect to distributed searches in a Solr-Zookeeper environment. Specifi

Re: boosting injection

2010-10-19 Thread Andrea Gazzarini
Hi Ken, yes I'm a java developer so I think I should be able to do that but I was wondering if there's a way to solve my issue without coding. Problem is that I need to adjust this query in a short time and in addition I cannot justify (at this stage of the project) additional software art

Documents and cores

2010-10-19 Thread Olson, Ron
Hi all- I have a newbie design question about documents, especially with SQL databases. I am trying to set up Solr to go against a database that, for example, has "items" and "people". The way I see it, and I don't know if this is right or not (thus the question), is that I see both as separate

Re: **SPAM** Re: boosting injection

2010-10-19 Thread Ken Stanley
Andrea, Another approach, aside of Markus' suggestion, would be to create your own handler that could intercept the query and perform whatever necessary transformations that you need at query time. However, that would require having Java knowledge (which I make no assumption). Regards, Ken It l

Re: boosting injection

2010-10-19 Thread Andrea Gazzarini
Y-E-A-H! I think it's so! Markus, what are disadvantages of this boosting strategy? Thanks a lot Andrea Il 19/10/2010 16:25, Markus Jelsma ha scritto: Index-time boosting maybe? http://wiki.apache.org/solr/UpdateXmlMessages#Optional_attributes_for_.22field.22 On Tuesday, October 19, 2010 04

Re: **SPAM** Re: boosting injection

2010-10-19 Thread Markus Jelsma
Index-time boosting maybe? http://wiki.apache.org/solr/UpdateXmlMessages#Optional_attributes_for_.22field.22 On Tuesday, October 19, 2010 04:23:46 pm Andrea Gazzarini wrote: > Hi Ken, > thanks for your response...unfortunately it doesn't solve my problem. > > I cannot chnage the client behaviour

Re: boosting injection

2010-10-19 Thread Andrea Gazzarini
Hi Ken, thanks for your response...unfortunately it doesn't solve my problem. I cannot chnage the client behaviour so the query must be a query and not only the query terms. In this scenario, It would be great, for example, if I could declare the boost in the schema field definitionbut

Re: **SPAM** Re: boosting injection

2010-10-19 Thread Andrea Gazzarini
Hi Ken, thanks for your response...unfortunately it doesn't solve my problem. I cannot chnage the client behaviour so the query must be a query and not only the query terms. In this scenario, It would be great, for example, if I could declare the boost in the schema field definitionbut I th

Re: snapshot-4.0 and maven

2010-10-19 Thread Matt Mitchell
Hey thanks Tommy. To be more specific, I'm trying to use SolrJ in a clojure project. When I try to use SolrJ using what you showed me, I get errors saying lucene classes can't be found etc.. Is there a way to build everything SolrJ (snapshot-4.0) needs into one jar? Matt On Mon, Oct 18, 2010 at 1

Re: boosting injection

2010-10-19 Thread Ken Stanley
Andrea, Using the SOLR dismax query handler, you could set up queries like this to boost on fields of your choice. Basically, the q parameter would be the query terms (without the field definitions, and a qf (Query Fields) parameter that you use to define your boost(s): http://wiki.apache.org/solr

boosting injection

2010-10-19 Thread Andrea Gazzarini
Hi all, I have a client that is sending this query q=title:history AND author:joyce is it possible to "transform" at runtime this query in this way: q=title:history^10 AND author:joyce^5 ? Best regards, Andrea

Re: count(*) equivilent in Solr/Lucene

2010-10-19 Thread Grant Ingersoll
On Oct 19, 2010, at 2:09 AM, Dennis Gearon wrote: > I/my team will have to look at that and decode it,LOL! I get some of it. > > The database version returns 1 row, with the answer. > > What does this return and how fast is it on BIG indexes? rows=0 returns 0 rows, but the total count will be

Re: Commits on service after shutdown

2010-10-19 Thread Jan Høydahl / Cominvent
You never get full control of commits, as Solr will auto-commit anyway whenever the (configurable) input buffer is full. With the current architecture you cannot really trust adds or commits to 100% certainly be successful, because the server may have been restarted between an add and commit() w

Re: Uppercase and lowercase queries

2010-10-19 Thread PeterKerk
I now used textfield...and it works, so thanks! :) -- View this message in context: http://lucene.472066.n3.nabble.com/Uppercase-and-lowercase-queries-tp1731349p1731423.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Uppercase and lowercase queries

2010-10-19 Thread Markus Jelsma
Yes, and reindex. And i suggest not to use `string` as the name of the fieldType as it will confuse later. On Tuesday, October 19, 2010 12:25:53 pm Pradeep Singh wrote: > Use text field. > > On Tue, O

Re: Uppercase and lowercase queries

2010-10-19 Thread Markus Jelsma
Because you need to reindex. On Tuesday, October 19, 2010 12:19:53 pm PeterKerk wrote: > I want to query on cityname. This works when I query for example: > "Boston" > > But when I query "boston" it didnt show any results. In the database is > stored: "Boston". > > So I thought: I should change

Re: Uppercase and lowercase queries

2010-10-19 Thread Pradeep Singh
Use text field. On Tue, Oct 19, 2010 at 3:19 AM, PeterKerk wrote: > > I want to query on cityname. This works when I query for example: > "Boston" > > But when I query "boston" it didnt show any results. In the database is > stored: "Boston". > > So I thought: I should change the filter on this

Uppercase and lowercase queries

2010-10-19 Thread PeterKerk
I want to query on cityname. This works when I query for example: "Boston" But when I query "boston" it didnt show any results. In the database is stored: "Boston". So I thought: I should change the filter on this field to make everything lowercase. The field definition for city is: So I cha

Re: Removing Common Web Page Header and Footer from All Content Fetched by Nutch

2010-10-19 Thread Israel Ekpo
Thanks Otis and Markus for your input. I will check it out today. On Tue, Oct 19, 2010 at 4:45 AM, Markus Jelsma wrote: > Unfortunately, Nutch still uses Tika 0.7 in 1.2 and trunk. Nutch needs to > be > upgraded to Tika 0.8 (when it's released or just the current trunk). Also, > the > Boilerpipe

Re: Removing Common Web Page Header and Footer from All Content Fetched by Nutch

2010-10-19 Thread Markus Jelsma
Unfortunately, Nutch still uses Tika 0.7 in 1.2 and trunk. Nutch needs to be upgraded to Tika 0.8 (when it's released or just the current trunk). Also, the Boilerpipe API needs to be exposed through Nutch configuration, which extractor can be used, which parameters need to be set etc. Upgrading

RE: DIH - configure password in 1 place and store it in encrypted form?

2010-10-19 Thread Ephraim Ofir
You could include a common file with the JdbcDataSource (http://wiki.apache.org/solr/SolrConfigXml#XInclude) or add the password as a property in solr.xml in the container scope (http://wiki.apache.org/solr/CoreAdmin#Configuration) so it will be available to all cores. Personally, I use a single co

RE: DIH delta-import question

2010-10-19 Thread Ephraim Ofir
According to the DIH wiki, delta-import is only supported by sql (http://wiki.apache.org/solr/DataImportHandler#Using_delta-import_comman d-1) Ephraim Ofir -Original Message- From: Bernd Fehling [mailto:bernd.fehl...@uni-bielefeld.de] Sent: Friday, October 15, 2010 8:20 AM To: solr-user