Invalid Date String Exception

2013-02-22 Thread Raja Kulasekaran
Hi, I got the exception "Invalid Date String" as I ran the crawl against webpages. Each one uses its own date format, and as developers we have no control over it. Instead of throwing an exception, it should convert the date into a Solr-based format. Can you suggest how I can overcom
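
A minimal sketch of one way to handle this before indexing, assuming the dates can be normalized on the client side. Solr date fields expect UTC timestamps of the form 1995-12-31T23:59:59Z. The format list and the function name to_solr_date are hypothetical, and treating values without a timezone as UTC is an assumption.

```python
from datetime import datetime, timezone

# Hypothetical list of formats seen on crawled pages; extend as needed.
CANDIDATE_FORMATS = [
    "%Y-%m-%dT%H:%M:%S%z",
    "%Y-%m-%d %H:%M:%S",
    "%Y-%m-%d",
    "%d/%m/%Y",
]


def to_solr_date(raw):
    """Return raw as a Solr date string in UTC, or None if no format matches."""
    text = raw.strip()
    for fmt in CANDIDATE_FORMATS:
        try:
            parsed = datetime.strptime(text, fmt)
        except ValueError:
            continue  # try the next candidate format
        if parsed.tzinfo is None:
            # Assumption: values without a timezone are already in UTC.
            parsed = parsed.replace(tzinfo=timezone.utc)
        return parsed.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    return None  # caller decides whether to skip the field or log the value
```

Returning None instead of raising lets the crawler drop or log an unparseable date without failing the whole document.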

Re: faceting and clustering on MLT via stream.body

2013-02-22 Thread Otis Gospodnetic
Hi Peyman, Right, it's exactly the opposite of what you want - faceting is done on all matches and carrot2 tends to be fed top N matches. Otis Solr & ElasticSearch Support http://sematext.com/ On Feb 22, 2013 12:34 PM, "Peyman Faratin" wrote: > Hi > > I would like to run an MLT search (in Solrj) of a

how to override pre and post tags when useFastVectorHighlighter is set to true

2013-02-22 Thread alxsss
Hello, I was unable to change the pre and post tags for highlighting when useFastVectorHighlighter is set to true. Changing the default tags in solrconfig.xml works for the standard highlighter, though. I searched the mailing list and the net with no success. I use solr-4.1.0. Thanks. Alex.

Re: Repartition solr cloud

2013-02-22 Thread Mark Miller
You could copy each shard to a single node and then use the merge index feature to merge them into one index and then start up a single Solr node on that. Use the same configs. - Mark On Feb 22, 2013, at 8:11 PM, Erol Akarsu wrote: > I have a solr cloud 7 nodes, each has 2 shards. > Now, I wo
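
A hedged sketch of what the merge step could look like, assuming the CoreAdmin mergeindexes action is the merge feature meant here (the message does not say which mechanism). The base URL, core name, and index paths are hypothetical. The helper only builds the request URL and sends nothing.

```python
from urllib.parse import urlencode


def merge_indexes_url(base_url, target_core, index_dirs):
    """Build a CoreAdmin mergeindexes URL that merges index_dirs into target_core."""
    params = [("action", "mergeindexes"), ("core", target_core)]
    # One indexDir parameter per copied shard index directory.
    params += [("indexDir", path) for path in index_dirs]
    return base_url.rstrip("/") + "/admin/cores?" + urlencode(params)


# Hypothetical host, core name, and paths for the copied shard indexes.
url = merge_indexes_url(
    "http://localhost:8983/solr",
    "merged",
    ["/data/shard1/index", "/data/shard2/index"],
)
print(url)
```

The target core has to exist before the request is issued, and the source indexes should not be receiving writes while the merge runs.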

Re: solrcloud data directory structure

2013-02-22 Thread Mark Miller
It just means at some point a replication was done that required flipping to a new directory. It's expected. Once you flip from the index directory to an index.<timestamp> directory, you never go back. - Mark On Feb 22, 2013, at 8:14 PM, Mingfeng Yang wrote: > I see the items under my solrcloud data di

solrcloud data directory structure

2013-02-22 Thread Mingfeng Yang
I see the items under my solrcloud data directory of "replica node" as drwxr-xr-x 2 solr solr 42 Feb 22 18:19 index drwxr-xr-x 2 solr solr 12288 Feb 23 01:00 index.20130222181947835 -rw-r--r-- 1 solr solr 78 Feb 22 18:25 index.properties -rw-r--r-- 1 solr solr 209 Feb 22 18:25 replication

RE: Is it possible to manually select a shard leader in a running SolrCloud?

2013-02-22 Thread Vaillancourt, Tim
Yeah, exactly - although there are workarounds, this is probably worth a feature request whether or not it is turned down. I have created these 2 new feature requests (after not being able to find duplicates through searching): 1) Please add support for manual leader election/promotion: https:

Re: Interesting issue with "special characters" in a string field value

2013-02-22 Thread Jack Park
Michael, I don't think you misunderstood. I will soon give a full response here, but am on the road at the moment. Many thanks Jack On Friday, February 22, 2013, Michael Della Bitta < michael.della.bi...@appinions.com> wrote: > My mistake, I misunderstood the problem. > > Michael Della Bitta > >

Re: Interesting issue with "special characters" in a string field value

2013-02-22 Thread Michael Della Bitta
My mistake, I misunderstood the problem. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Fri, Feb 22, 2013 at 3:55 PM, Chris Hostetter wrote: > > : If you'r

Re: Interesting issue with "special characters" in a string field value

2013-02-22 Thread Chris Hostetter
: If you're submitting documents as XML, you're always going to have to : escape meaningful XML characters going in. If you ask for them back as : XML, you should be prepared to unescape special XML characters as that still wouldn't explain the discrepancy he's claiming to see between the json &

Re: Interesting issue with "special characters" in a string field value

2013-02-22 Thread Michael Della Bitta
Hi Jack, If you're submitting documents as XML, you're always going to have to escape meaningful XML characters going in. If you ask for them back as XML, you should be prepared to unescape special XML characters as output. Same goes for JSON, etc. There's really no way around this... it's just a
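
The point about escaping can be illustrated with a small sketch using only the Python standard library. The sample field value is hypothetical. It shows that the same stored string looks different in XML and JSON responses, yet decodes back to the identical value in both cases.

```python
import json
from xml.sax.saxutils import escape, unescape


def xml_round_trip(value):
    """Escape a value for XML, then unescape it the way a client would."""
    encoded = escape(value)  # & < > become &amp; &lt; &gt;
    return encoded, unescape(encoded)


def json_round_trip(value):
    """Encode a value as a JSON string, then decode it again."""
    encoded = json.dumps(value)  # < and & need no escaping in JSON
    return encoded, json.loads(encoded)


sample = "AT&T <b>bold</b>"  # hypothetical field value
print(xml_round_trip(sample)[0])
print(json_round_trip(sample)[0])
```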

Interesting issue with "special characters" in a string field value

2013-02-22 Thread Jack Park
I have a multi-value stored field called "details" I've been deliberately sending it values like If I fetch a document with that field at the admin query console, using XML, I get: If I fetch with JSON, I get: "details": [ "" ], Even more curious, if I

Re: Is it possible to manually select a shard leader in a running SolrCloud?

2013-02-22 Thread Boogie Shafer
i think the use case here is more of a management one. -wanting to explicitly configure a specific node as leader (the reasons for this could vary) -wanting to gracefully/safely move a leader role from a specific node without going thru an actual election process (as was mentioned previously, why

Re: SolrCloud Master/Master

2013-02-22 Thread Daniel Collins
Yes, that's a good solution to this whole Cloud/DC issue (which seems to have cropped up several times), you have one of the ZK instances "external" to the cloud. You can lose any 1 machine, and the others are still ok. The next level would be a Cloud of 3 servers + 2 external ZKs, that would
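
The "lose any 1 machine" figure follows from ZooKeeper needing a strict majority of the ensemble to stay up. A small sketch of that arithmetic, with hypothetical function names:

```python
def has_quorum(ensemble_size, alive):
    """True if the live ZooKeeper nodes form a strict majority of the ensemble."""
    return alive > ensemble_size // 2


def tolerated_failures(ensemble_size):
    """Largest number of nodes that can fail while a majority remains."""
    return (ensemble_size - 1) // 2


for size in (3, 5):
    print(size, "nodes tolerate", tolerated_failures(size), "failure(s)")
```

An ensemble of 3 tolerates 1 failure and an ensemble of 5 tolerates 2. An even-sized ensemble tolerates no more failures than the next smaller odd size.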

Re: Solr Grouping and empty fields

2013-02-22 Thread Daniel Collins
We had something similar to be fair, a cluster information field which was unfortunately optional, so all the documents that didn't have this field set grouped together. It isn't Solr's fault, to be fair, we told it to group on the values of field Z, null is a valid value and lots of documents

RE: Is it possible to manually select a shard leader in a running SolrCloud?

2013-02-22 Thread Vaillancourt, Tim
Thanks Mark, Sounds good. We are still at the load test stage and will see how this goes. I imagine this is more of a concern in concept than in reality. :) Cheers, Tim -Original Message- From: Mark Miller [mailto:markrmil...@gmail.com] Sent: Thursday, February 21, 2013 7:52 PM To:

Re: Solr splitting my words

2013-02-22 Thread scallawa
Thanks for all your help. I was able to get my test environment in and tested with just the document that I was searching. I was able to figure out the analyzer and found that the index_synonyms file was used improperly and it caused solr to generate the wrong terms for the data it was given.

Re: How do I create two collections on the same cluster?

2013-02-22 Thread alxsss
Hi, What if you add a new collection to the solr.xml file? Alex. -Original Message- From: Shankar Sundararaju To: solr-user Sent: Thu, Feb 21, 2013 8:51 pm Subject: How do I create two collections on the same cluster? I am using Solr 4.1. I created collection1 consisting of 2 lea

Re: How do I create two collections on the same cluster?

2013-02-22 Thread Shankar Sundararaju
Thank you Shawn. Neither using the IP address nor maxShardsPerNode=1 has any effect. Let me try a few other things today and I'll get back to you. -Shankar On Thu, Feb 21, 2013 at 9:05 PM, Shawn Heisey wrote: > On 2/21/2013 9:50 PM, Shankar Sundararaju wrote: > >> I am using Solr 4.1. >> >> I creat

Re: SolrCloud as my primary data store

2013-02-22 Thread Shawn Heisey
On 2/22/2013 9:02 AM, jimtronic wrote: Yes, these are good points. I'm using solr to leverage user preference data and I need that data available real time. SQL just can't do the kind of things I'm able to do in solr, so I have to wait until the write (a user action, a user preference, etc) gets

faceting and clustering on MLT via stream.body

2013-02-22 Thread Peyman Faratin
Hi, I would like to run an MLT search (in Solrj) of a short piece of text delivered via the stream.body. This part works. What I would like to be able to do is 2 things: - faceting on some number (not ALL) of the results - cluster (using carrot2) all of the results Is this possible? I believe f

Re: semantic search questions

2013-02-22 Thread Jack Park
Hi Vinay, Perhaps you could say more about what you are looking for? What use cases, say. Did you see the book _Taming Text_? Thanks Jack On Fri, Feb 22, 2013 at 8:48 AM, Vinay B, wrote: > Hi, > > A few questions, some specific to UIMA, others more general. > 1. The SOLR/UIMA example employs 3r

Re: Solr Grouping and empty fields

2013-02-22 Thread Oussama Jilal
OK, I'm sorry if I did not explain my need well. I'll try to give a better explanation. What I have: Millions of documents that have a field X, another field Y and another field Z which is not required (so it can be empty in some documents and not in others). What I want to do: Search fo

Re: SolrCloud as my primary data store

2013-02-22 Thread Gora Mohanty
On 22 February 2013 21:32, jimtronic wrote: > Yes, these are good points. I'm using solr to leverage user preference data > and I need that data available real time. SQL just can't do the kind of > things I'm able to do in solr, so I have to wait until the write (a user > action, a user preference

Re: SolrCloud as my primary data store

2013-02-22 Thread jimtronic
Yes, these are good points. I'm using solr to leverage user preference data and I need that data available real time. SQL just can't do the kind of things I'm able to do in solr, so I have to wait until the write (a user action, a user preference, etc) gets to solr from the db anyway. I'm kind of

Re: Solr Grouping and empty fields

2013-02-22 Thread Jack Krupansky
What?!?! You want them grouped but not grouped together?? What on earth does that mean?! I mean, either they are included or they are not. All results will be in some group, so where exactly do you want these "not to be grouped together" documents to be grouped? In any case, please clarify what

Re: DIH deleting documents

2013-02-22 Thread Gora Mohanty
On 22 February 2013 12:11, cveres wrote: > I should also add that some of the books don't have chapters, so the query > won't succeed for these books. > But in this case I expected that the document won't be added at all .. > rather than first added then deleted (which I am now suspecting is the >

Re: Solr Grouping and empty fields

2013-02-22 Thread Oussama Jilal
Thank you Johannes, but I want the documents having the field empty to be included in the results, just not to be grouped together, and if I understood your solution correctly, it will simply remove those documents from the results (Note : The field values are very variable and unknown to me).

Re: SolrCloud Master/Master

2013-02-22 Thread Shawn Heisey
On 2/22/2013 4:57 AM, ipuskaric wrote: I need two Solr servers, both acting as 'real' Master. They need to be able to receive updates and to be used for querying, because when one server is down, the second has to be able to work alone... and when the second is up again it needs to synchronize with f

Re: Solr Grouping and empty fields

2013-02-22 Thread Johannes Rodenwald
Hi Oussama, If you have only a few distinct, unchanging values in the field that you group upon, you could implement a FilterQuery (query parameter "fq") and add it to the query, allowing all valid values, but not an empty field. For example: fq=my_grouping_string_field:( value_a OR value_b OR
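
A minimal sketch of building such a filter query on the client side, assuming the allowed values are simple tokens with no spaces or query-syntax characters (no escaping is done here). The field and value names come from the example above. The helper names are hypothetical.

```python
from urllib.parse import parse_qs, urlencode


def build_filter_query(field, allowed_values):
    """Return an fq value matching any of allowed_values in field."""
    return "{}:({})".format(field, " OR ".join(allowed_values))


def grouping_query_string(field, allowed_values, query="*:*"):
    """Return URL-encoded parameters for a grouped search limited by the fq."""
    params = [
        ("q", query),
        ("fq", build_filter_query(field, allowed_values)),
        ("group", "true"),
        ("group.field", field),
    ]
    return urlencode(params)


qs = grouping_query_string("my_grouping_string_field", ["value_a", "value_b"])
print(parse_qs(qs)["fq"][0])
```

As the follow-up in this thread notes, this drops documents with an empty field from the results entirely, so it only fits when those documents are not needed.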

Re: Slaves always replicate entire index & Index versions

2013-02-22 Thread Mark Miller
We are fixing this bug here: https://issues.apache.org/jira/browse/SOLR-4471 - Mark On Feb 22, 2013, at 7:07 AM, Artyom wrote: > I have the same problem. This bug appeared in 4.0 rarely, but 4.1 downloads > the full index every time. >

Re: solr issue

2013-02-22 Thread Gora Mohanty
On 22 February 2013 12:40, Pawankumar Rote wrote: > > Hi > > One of our clients has purchased a SOLR extension. We have integrated it with > Magento community edition. We are facing the issues below. > > 1] On click of search box pop up appears - it is ok. But the price displayed > against the product n

Re: can i install new SOLR 4.1 as slaver(3.3 Master)

2013-02-22 Thread Erick Erickson
Look at solr/CHANGES.txt, the upgrade from 3.6 to 4.0-ALPHA section. Best Erick On Thu, Feb 21, 2013 at 5:33 PM, michaelweica wrote: > thanks > > we do have 1 master , 5 slave servers. and we use slave as production > server. > we just update master index file when we have new contents > > n

RE: Edismax odd results

2013-02-22 Thread David Quarterman
Hi Erick, Funnily enough, I cracked it about 5 minutes before your email arrived! Problem was using WhiteSpaceTokenizer instead of Standard AND had the LowerCaseFilter after the PorterStemmingFilter. Getting them in the right order has solved all the problems and we get all our engineer boots,
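
Why the filter order matters can be shown with a toy sketch. The stemmer below is a deliberately simplified stand-in, not the Porter algorithm. Like Lucene's Porter stem filter, it assumes lowercase input, so it only recognizes a lowercase plural "s".

```python
def toy_stem(token):
    """Toy stand-in for a stemmer: strips a trailing lowercase 's' only."""
    if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
        return token[:-1]
    return token


def analyze(text, filters):
    """Split on whitespace, then apply each filter to every token in order."""
    tokens = text.split()
    for token_filter in filters:
        tokens = [token_filter(token) for token in tokens]
    return tokens


lower_then_stem = analyze("Engineer BOOTS", [str.lower, toy_stem])
stem_then_lower = analyze("Engineer BOOTS", [toy_stem, str.lower])
print(lower_then_stem)  # ['engineer', 'boot']
print(stem_then_lower)  # ['engineer', 'boots']
```

With the stemmer first, the uppercase token passes through unstemmed, so index and query terms can end up different for the same word.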

Re: 答复: solr shards

2013-02-22 Thread Erick Erickson
Basically a hash is performed (details I'm not entirely sure about) and the doc is sent to shard (hashed id)%(num shards). Really, though, I'm not sure there's a good reason to dig into the details other than curiosity. If you change the number of shards, you'll get into trouble. All that said
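
A sketch of the modulo scheme as described above, to show why changing the shard count causes trouble. CRC32 is used purely as a stand-in hash. The message does not say which hash Solr uses, so this is not Solr's actual routing code. The document ids are hypothetical.

```python
import zlib


def shard_for(doc_id, num_shards):
    """Pick a shard as hash(id) % num_shards, using CRC32 as a stand-in hash."""
    return zlib.crc32(doc_id.encode("utf-8")) % num_shards


doc_ids = ["doc-{}".format(i) for i in range(100)]  # hypothetical ids
before = {doc_id: shard_for(doc_id, 4) for doc_id in doc_ids}
after = {doc_id: shard_for(doc_id, 5) for doc_id in doc_ids}

# Documents whose shard changes when going from 4 to 5 shards.
moved = [doc_id for doc_id in doc_ids if before[doc_id] != after[doc_id]]
print(len(moved), "of", len(doc_ids), "documents would land on a different shard")
```

Any document in the moved list would be looked up or updated on a shard that does not hold it, which is why the index has to be rebuilt or redistributed when the shard count changes under this scheme.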

Re: Edismax odd results

2013-02-22 Thread Erick Erickson
OK, let's see the debug data for explainOther. One thing, though. Your analysis chain is apt to be surprising. The fact that you have 222 terms with the ":" says that you're probably not getting what I'd guess you want. That ':' is part of your token, and will not match "engineering", consider cha

Re: Slaves always replicate entire index & Index versions

2013-02-22 Thread Artyom
I have the same problem. This bug appeared in 4.0 rarely, but 4.1 downloads the full index every time. -- View this message in context: http://lucene.472066.n3.nabble.com/Slaves-always-replicate-entire-index-Index-versions-tp4041256p4042209.html Sent from the Solr - User mailing list archive at

Re: If we Open Source our platform, would it be interesting to you?

2013-02-22 Thread Marcelo Elias Del Valle
Jack, I didn't know this concept, but indeed, it seems like a good way to define our application, and also opens possibilities of improvement. Thanks for the link! Best regards, Marcelo. 2013/2/21 Jack Park > Marcelo > > In some sense, it sounds like you are aiming at building a topic

SolrCloud Master/Master

2013-02-22 Thread ipuskaric
I need two Solr servers, both acting as 'real' Master. They need to be able to receive updates and to be used for querying, because when one server is down, the second has to be able to work alone... and when the second is up again it needs to synchronize with the first one. As I understand the best way

Solr Grouping and empty fields

2013-02-22 Thread Oussama Jilal
Hi, I need to group some results in solr based on a field, but I don't want documents having that field empty to be grouped together, does anyone know how to achieve that ? -- Oussama Jilal

solr4.1.0 how to config field length

2013-02-22 Thread sely
I am using Solr 4.1.0 (using the original solrconfig.xml) + Lily 1.3 + CDH4. From solrconfig.xml, it says: maxFieldLength was removed in 4.0. To get similar behavior, include a LimitTokenCountFilterFactory in your fieldType definition. E.g. But it doesn't work whatever I set the value of m

Re: Slaves always replicate entire index & Index versions

2013-02-22 Thread raulgrande83
Amit Nithian wrote > For your issue above in your last post, is it possible that there was a > commit on the master in that slight window after solr checks for the > latest > generation of the master but before it downloads the actual files? How > frequent are the commits on your master? No, I don