Why my email always been rejected?

2012-03-19 Thread 怪侠
I send email to :solr-user@lucene.apache.org, but I always receive the rejected email. It can't send successful.

Re: SOLR 3.3 DIH and Java 1.6

2012-03-19 Thread Mikhail Khludnev
Hello, Have you tried jdk 6 from Oracle? On Tue, Mar 20, 2012 at 8:41 AM, randolf.julian < randolf.jul...@dominionenterprises.com> wrote: > I am trying to use the data import handler to update SOLR index with Oracle > data. In the SOLR schema, a dynamic field called PHOTO_* has been defined. > I

querying on shards

2012-03-19 Thread Ankita Patil
Hi, I wanted to know whether it is feasible to query on all the shards even if the query yields data only from a few shards n not all. Or is it better to mention those shards explicitly from which we get the data and only query on them. for example : I have 4 shards. Now I have a query which yiel

Re: Error while trying to load JSON

2012-03-19 Thread Chris Hostetter
: Ok, so my issue is that it must be a flat structure. Why isn't the JSON : parser able to deconstruct the object into a flatter structure for indexing? : Shouldn't it be able to take any valid JSON structure? Becuase no one has contributed code to do it? In general accepting any arbitrary JSON

Re: Any way to get reference to original request object from within Solr component?

2012-03-19 Thread Chris Hostetter
: I have a custom component which depends on the ordering of a : multi-valued parameter. Unfortunately it looks like the values do not : come back in the same order as they were put in the URL. Here is some : code to explain the behavior: ... : and I notice that the values are ordered d

url-encoded strings from Admin Query String search

2012-03-19 Thread Aaron Couch
Sorry if this is documented somewhere but I did searched a lot and couldn't find the answer to this. I'd like to be able to type in test queries and the /admin page but when I do the url string created is url-encoded so that the resulting string doesn't work. If I type: TEST&qf=t_FIELD_NAME in

SOLR automatic failover with Zookeeper

2012-03-19 Thread Ankit Bhatnagar
Hi folks, Is there a way to configure Solr master -failure using Zookeoper?  Any experiences... Ankit

Re: Faster Solr Indexing

2012-03-19 Thread Peyman Faratin
Hi Erick, Dimitry and Mikhail thank you all for your time. I tried all of the suggestions below and am happy to report that indexing speeds have improved. There were several confounding problems including - a bank of (~20) regexes that were poorly optimized and compiled at each indexing step -

Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-19 Thread Yonik Seeley
On Mon, Mar 19, 2012 at 5:48 PM, vybe3142 wrote: > Thanks for the response > > No, the file is plain text. > > All I'm trying to do is index plain ASCII text files via a remote reference > to their file paths. The XML update handler expects a specific format of XML. The json, CSV, javabin update

Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-19 Thread vybe3142
BTW, .. using the client I pasted, I get the same error even with the standard supplied executable SOLR jar. -- View this message in context: http://lucene.472066.n3.nabble.com/Is-there-a-way-for-SOLR-SOLRJ-to-index-files-directly-bypassing-HTTP-streaming-tp3833419p3840483.html Sent from the Solr

Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-19 Thread vybe3142
Thanks for the response No, the file is plain text. All I'm trying to do is index plain ASCII text files via a remote reference to their file paths. I guess what I need to do is specify the content type as text. I don't think a "content-type" param will help since this behavior is tied to the

Re: Performance Question

2012-03-19 Thread Bill Bell
The size of the index does matter practically speaking. Bill Bell Sent from mobile On Mar 19, 2012, at 11:41 AM, Mikhail Khludnev wrote: > Exactly. That's what I mean. > > On Mon, Mar 19, 2012 at 6:15 PM, Jamie Johnson wrote: > >> Mikhail, >> >> Thanks for the response. Just to be clear

Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-19 Thread Yonik Seeley
On Mon, Mar 19, 2012 at 4:38 PM, vybe3142 wrote: > Okay, I added the javabin handler snippet to the solrconfig.xml file > (actually shared across all cores).  I got further (the request made it past > tomcat and into SOLR) but  haven't quite succeeded yet. > > Server trace: > Mar 19, 2012 3:31:35

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-19 Thread rasser
I see your point. If I understand it correct it will however mean that i need to return 10(brands)x100(resultToShow) = 1000 docs to facilitate that all 100 results to show is of the same brand. Correnct? And tomorrow (or later) the customer will also want a facet on 5 new fields eg. "production y

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-19 Thread Erick Erickson
Well, it depends on how many groups we're talking here. Grouping returns the top N results _within_ each group. So if you have 10 brands and grouped on brand, you'd get back the top, say, 3 results for Ford, Volvo, Fiat, Saab, Audi.up to 10 groups. The number of groups you get back is governed

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-19 Thread rasser
Thanks for the suggestion Martijn. I did see that v. 4 had a group.facet parameter that seems to fit my needs. But since I'm new to Solr I'm not really comfortable using version 4 in production, let alone a nightly build. Any other ideas? - I cant imagine that i'm the first person ever trying to

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-19 Thread rasser
Thanks for your reply Erick, As far as I can see grouping on brand (group.field=brand) will only work if all rows are returned. Correct? - if this is so it will not fly since only the top results are returned (rows=100) Or am I missing something. Thanks -- View this message in context: http:

Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-19 Thread vybe3142
Okay, I added the javabin handler snippet to the solrconfig.xml file (actually shared across all cores). I got further (the request made it past tomcat and into SOLR) but haven't quite succeeded yet. Server trace: Mar 19, 2012 3:31:35 PM org.apache.solr.core.SolrCore execute INFO: [testcore1] we

Re: data/index/segments_u (No such file or directory)

2012-03-19 Thread Erick Erickson
Did you remove the entire contents of the prod index before copying from the dev box? This should work fine Best Erick On Mon, Mar 19, 2012 at 2:48 PM, wrote: > Hello, > > I have copied solr's data folder from dev linux box to prod one. When > starting solr I get this error in prod server.

Re: Solr Memory Usage Question

2012-03-19 Thread Erick Erickson
Faceting and sorting are the two biggest places people get into trouble. You've been asking questions about Solr Cloud, so I assume you're working on a trunk release. Note that most everything people know about memory consumption painfully gained over the years is...wrong on trunk. Or at least may

Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-19 Thread Erick Erickson
My guess is that this isn't defined in the solrconfig.xml file for your testcore1/conf.. If you modeled your testcore1 after the solrconfig.xml files in the example/multicore/core* directories, these are extremely simplified. You might try copying the one from example/solr/conf and removing s

Re: Is there a way for SOLR / SOLRJ to index files directly bypassing HTTP streaming?

2012-03-19 Thread vybe3142
Still No luck.Please help point out what I'm doing wrong. Neither the (commented out ) first approach (including the content with the request) nor the 2nd approach seem to work. Nothing seems to be acknowledged at the tomcat server either. I get the error: Starting SOLR doc indexing client 2 Exc

Solr Memory Usage Question

2012-03-19 Thread Jamie Johnson
I am not currently running into memory issues, but I was wondering if anyone could explain to me Solrs memory usage? What does Solr actually store in memory? What are some of the largest memory consumers (i.e. faceting, sorting, etc). Is the best way to start addressing questions like this to ju

data/index/segments_u (No such file or directory)

2012-03-19 Thread alxsss
Hello, I have copied solr's data folder from dev linux box to prod one. When starting solr I get this error in prod server. In dev solr starts sucessfully. Caused by: java.io.FileNotFoundException: /home/apache-solr-3.5.0/example/solr/data/index/segments_u (No such file or directory)

Re: Performance Question

2012-03-19 Thread Mikhail Khludnev
Exactly. That's what I mean. On Mon, Mar 19, 2012 at 6:15 PM, Jamie Johnson wrote: > Mikhail, > > Thanks for the response. Just to be clear you're saying that the size > of the index does not matter, it's more the size of the results? > > On Fri, Mar 16, 2012 at 2:43 PM, Mikhail Khludnev > wro

Re: Does the Solr provide hightlight token position in the field?

2012-03-19 Thread neosky
Thanks! I look at the api carefully before, but not very sure. So,it seems that the highlighter might not be helpful. I am considering alternative solution for this problem. I would like to what exactly want, for instance I got a candidate record from my query:RVCES(I implement a 5 gram index) MSA

Re: Boosting terms

2012-03-19 Thread Pawel Rog
Thanks a lot, I'll read it :) It seems to be helpfull On Sun, Mar 18, 2012 at 8:58 PM, Ahmet Arslan wrote: > >> Is there any possibility to boost >> terms during indexing? Searching >> that using google I found information that there is no such >> feature in >> Solr (we can only boost fields). Is

Re: Boosting near terms

2012-03-19 Thread Samuel García Martínez
I just read the edismax pf functionality and it covers all my needs. Many thanks :) 2012/3/19 Samuel García Martínez > Hi all, i have a question about boosting a given doc based on query terms > proximity. > > *query*: dog food*doc1*: "my dog eat food"*doc2*: "i have purchased the dog > food o

Boosting near terms

2012-03-19 Thread Samuel García Martínez
Hi all, i have a question about boosting a given doc based on query terms proximity. *query*: dog food*doc1*: "my dog eat food"*doc2*: "i have purchased the dog food on amazon paying with paypal" Current scoring tf/idf algorithm scores higher doc1 because length normalization. Is there any metho

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-19 Thread Martijn v Groningen
Hi Rasmus, You might want to use the group.facet parameter: http://wiki.apache.org/solr/FieldCollapsing#Request_Parameters I think that will give you the right facet counts with faceting. The parameter is not available in Solr 3.x, so you'll need to use a 4.0 nightly build. Martijn On 19 March

is the SolrJ call to add collection of documents a blocking function call ?

2012-03-19 Thread ramdev.wudali
Hi: I am trying to index a collection of SolrInputDocs to a Solr server. I was wondering if the call I make to add the documents (the add(Collection) call ) is a blocking function call ? I would also like to know if the add call is a call that would take longer for a larger collection of do

processing of merged tokens

2012-03-19 Thread Carlos Gonzalez-Cadenas
Hello, For our search system we'd like to be able to process merged tokens (sorry, I don't know what's the proper name for this), i.e. when a user enters a query like "hotelsin barcelona", we'd like to know that the user means "hotels in barcelona". At some point in the past we implemented this k

回复: Invalid version (expected 2, but 60) or the data in not in'javabin' format

2012-03-19 Thread 怪侠
I use the same version of solr and solrJ. I try version3.3.0 and version 3.5.0, but I get the same error. -- 原始邮件 -- 发件人: "Tomás Fernández L�"; 发送时间: 2012年3月19日(星期一) 晚上7:19 收件人: "solr-user"; 主题: Re: Invalid version (expected 2, but 60) or the data

Highlighting matched interesting terms in MoreLikeThisHandler...

2012-03-19 Thread Ravish Bhagdev
Hi All, I wonder if anyone else has had a requirement similar to this: I'm using MLT handler to return matching documents, matched on a specific field which works perfectly. But I want to be able to show which interesting terms matched for a given result set. If there was a way of listing these

Re: Performance Question

2012-03-19 Thread Jamie Johnson
Mikhail, Thanks for the response. Just to be clear you're saying that the size of the index does not matter, it's more the size of the results? On Fri, Mar 16, 2012 at 2:43 PM, Mikhail Khludnev wrote: > Hello, > > Frankly speaking the computational complexity of Lucene search depends from > siz

回复: Invalid version (expected 2, but 60) or the data in not in'javabin' format

2012-03-19 Thread 怪侠
I use the same version of server(solr) and client(solrJ). I try the version3.3.0 and version3.5.0, but get the same error. -- 原始邮件 -- 发件人: "Tomás Fernández L�"; 发送时间: 2012年3月19日(星期一) 晚上7:19 收件人: "solr-user"; 主题: Re: Invalid version (expected 2, but

Re: Re: maxClauseCount Exception

2012-03-19 Thread Erick Erickson
Don't highlight on it. As Chris pointed out, it's the_highlighting_ that builds this ginormous clause. Just something like _searching_ on text:* has short-circuits in the code that refrain from this expansion. Best Erick On Mon, Mar 19, 2012 at 8:42 AM, Darren Govoni wrote: > true. but how can

Re: SolrJ : Size of solr Results

2012-03-19 Thread nutchsolruser
Hey iorixxx, Thank you. -- View this message in context: http://lucene.472066.n3.nabble.com/SolrJ-Size-of-solr-Results-tp3838555p3838877.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: Re: maxClauseCount Exception

2012-03-19 Thread Darren Govoni
true. but how can you find documents containing that field without expanding 1000 clauses? --- Original Message --- On 3/19/2012 07:24 AM Erick Erickson wrote:bq: So all I want to do is a simple "all docs with something in this field, and to highlight the field" But that doesn't reall

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-19 Thread Erick Erickson
Groups and faceting are orthogonal and really have nothing to do with each other, so that might be where part of the problem lies. In your example, you can consider grouping by brand and count the *groups* returned, not elements within those groups. Then you're simply counting up the groups return

Re: maxClauseCount Exception

2012-03-19 Thread Erick Erickson
bq: So all I want to do is a simple "all docs with something in this field, and to highlight the field" But that doesn't really make sense to do at the Solr/Lucene level. All you're saying is that you want that field highlighted. Wouldn't it be much easier to just do this at the app level whenever

Re: Does the Solr provide hightlight token position in the field?

2012-03-19 Thread Ahmet Arslan
> Does the hightlight can provide the > exact position of the query > For instance: > MSAQLRKPTA*RVCES*CGRAEHWDDDLEAWQIARTDGTKQVGSPHCLHEWDINGNFNPVAMDD > I want to know the Position of "R" in the highlight token. > I want to do the secondary query based on the position, There is an unresolved jira

Re: SolrJ : Size of solr Results

2012-03-19 Thread Ahmet Arslan
> QueryResponse rsp = solrServer.query( query ); > SolrDocumentList docs = rsp.getResults(); > > here i am getting docs.size() is 100 how can i change the > size. or is there > any other way to iterate docs above 100. i am stucked here. By rows and start parameter. query.setStart(0); query.setRo

SolrJ : Size of solr Results

2012-03-19 Thread nutchsolruser
Hi All, QueryResponse rsp = solrServer.query( query ); SolrDocumentList docs = rsp.getResults(); here i am getting docs.size() is 100 how can i change the size. or is there any other way to iterate docs above 100. i am stucked here. Thanks, myauu -- View this message in context: http://lucene.4

To truncate or not to truncate (group.truncate vs. facet)

2012-03-19 Thread Rasmus Østergård
I have an index that contains variants of cars. In this small sample I have 2 car models (Audi and Volvo) and the Audi is available in black or white, whereas the Volvo is only available in black. On the search page I want to display products not variants - in this test case 2 products should b

Re: Problem witch adding classpath

2012-03-19 Thread ViruS
Hi, Thanks this works! I also yesterday try to make new instance of SOLR and I find bug in my configuration. When i copied files to SOLR_HOME/../dist/ and also i input in configuration directive to contrib they have some collision. My configuration now is SOLR_HOME//conf/solrconfig.xml:

To truncate or not to truncate (group.truncate vs. facet)

2012-03-19 Thread Rasmus Østergård
I have an index that contains variants of cars. In this small sample I have 2 car models (Audi and Volvo) and the Audi is available in black or white, whereas the Volvo is only available in black. On the search page I want to display products not variants - in this test case 2 products should b

Re: Invalid version (expected 2, but 60) or the data in not in 'javabin' format

2012-03-19 Thread Tomás Fernández Löbbe
This looks like you are using a SolrJ version different than the Solr server version you are using. Make sure that server and client are using the same Solr version. On Mon, Mar 19, 2012 at 8:02 AM, Markus Jelsma wrote: > You probably have a non-char codepoint hanging around somewhere. You can >

Re: Invalid version (expected 2, but 60) or the data in not in 'javabin' format

2012-03-19 Thread Markus Jelsma
You probably have a non-char codepoint hanging around somewhere. You can strip them away: http://unicode.org/cldr/utility/list-unicodeset.jsp?a=[:Noncharacter_Code_Point=True:] On Mon, 19 Mar 2012 10:33:35 +0800, "怪侠" <87863...@qq.com> wrote: Hi, all. I want to update the file's index. The fo