Re: Need Help in migrating Solr version 1.4 to 4.3

2013-06-25 Thread Sandeep Gupta
Thanks for all the answers. Sure I am going to create new index again with Solr 4.3. Also in application development side, as I said that I am going to use HTTPSolrServer API and I found that we shouldn't create this object multiple times (as per the wiki document http://wiki.apache.org/solr/Solrj

Re: Is there a way to capture div tag by id?

2013-06-25 Thread Jack Krupansky
Sorry, but not only can you not capture that specific , but you cannot capture ANY . Really. For some mysterious reasoning, Tika silently eats HTML parsing events. Plenty of other HTML tags can be captured, but not . Both the Solr Wiki for Solr Cell and the new/Lucid Apache Solr Reference Gu

Re: Need Help in migrating Solr version 1.4 to 4.3

2013-06-25 Thread Sandeep Gupta
Thanks for all the answers. Sure I am going to create new index again with Solr 4.3. Also in application development side, as I said that I am going to use HTTPSolrServer API and I found that we shouldn't create this object multiple times (as per the wiki document http://wiki.apache.org/solr/Solrj

RE: Joins with SolrCloud

2013-06-25 Thread James Thomas
My understanding is the same that "{!join...}" does not work in SolrCloud (aka distributed search) based on: 1. https://issues.apache.org/jira/browse/LUCENE-3759 2. http://wiki.apache.org/solr/DistributedSearch --- see "Limitations" section which refers to the JIRA above -- James -Original

Re: Is it possible to searh Solr with a longer query string?

2013-06-25 Thread Kevin Osborn
If your query is arriving on the server correctly, but throwing an exception, adjust maxBooleanClauses in your solrconfig.xml. I'm not sure what the consequences are of making it too large, but we had to adjust it from the default of 1024 to 5000 in one implementation. Basically, each ID in your q

Re: Is it possible to searh Solr with a longer query string?

2013-06-25 Thread Jack Krupansky
Are you using Tomcat? See: http://wiki.apache.org/solr/SolrTomcat#Enabling_Longer_Query_Requests Enabling Longer Query Requests If you try to submit too long a GET query to Solr, then Tomcat will reject your HTTP request on the grounds that the HTTP header is too large; symptoms may include a

RE: Is it possible to searh Solr with a longer query string?

2013-06-25 Thread yang, gang
Hi, I'm using Solr server to develop a search service, and I encounter a problem when trying to input a longer query string: Here is the code: StringBuffer stringBuffer = new StringBuffer(); ... ... try{ //search Pubmed server( a NCBI server ), it returns a list of IDs.

Re: Joins with SolrCloud

2013-06-25 Thread Upayavira
I have never heard mention that joins support distributed search, so you cannot do a join against a sharded core. However, if from your example, innerCollection was replicated across all nodes, I would think that should work, because all that comes back from each server when a distributed search h

Re: Varnish

2013-06-25 Thread Learner
Check this link.. http://lucene.472066.n3.nabble.com/SolrJ-HTTP-caching-td490063.html -- View this message in context: http://lucene.472066.n3.nabble.com/Varnish-tp4072057p4073205.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Querying multiple collections in SolrCloud

2013-06-25 Thread Chris Toomey
Thanks Jack for the alternatives. The first is interesting but has the downside of requiring multiple queries to get the full matching docs. The second is interesting and very simple, but has the downside of not being modular and being difficult to configure field boosting when the collections ha

Joins with SolrCloud

2013-06-25 Thread Chris Toomey
What are the restrictions/limitations w.r.t. joins when using SolrCloud? Say I have a 3-node cluster and both my "outer" and "inner" collections are sharded 3 ways across the cluster. Could I do a query such as "select?q={!join+from=inner_id+fromIndex=innerCollection+to=outer_id}xx:foo&collection

Re: Querying multiple collections in SolrCloud

2013-06-25 Thread Jack Krupansky
One simple scenario to consider: N+1 collections - one collection per document type with detailed fields for that document type, and one common collection that indexes a subset of the fields. The main user query would be an edismax over the common fields in that "main" collection. You can then

Querying multiple collections in SolrCloud

2013-06-25 Thread Chris Toomey
Hi, I'm investigating using SolrCloud for querying documents of different but similar/related types, and have read through docs. on the wiki and done many searches in these archives, but still have some questions. Thanks in advance for your help. Setup: * Say that I have N distinct types of docum

Re: URL search and indexing

2013-06-25 Thread Jack Krupansky
Yeah, URL Classify does only do so much. That's why you need to combine multiple methods. As a fourth method, you could code up a short JavaScript "StatelessScriptUpdateProcessor" that did something like take a full domain name (such as output by URL Classify) and turn it into multiple values,

Result Grouping

2013-06-25 Thread Bryan Bende
I was reading this documentation on Result Grouping... http://docs.lucidworks.com/display/solr/Result+Grouping which says... sort - sortspec - Specifies how Solr sorts the groups relative to each other. For example, sort=popularity desc will cause the groups to be sorted according to the highest

Re: Common practice for free text field

2013-06-25 Thread Otis Gospodnetic
Hi, Look up edismax parser on the Wiki. The advantage of using it is that you can set different weight on different fields (qf param) and shingle query (pfXXX params). Otis -- Solr & ElasticSearch Support -- http://sematext.com/ Performance Monitoring -- http://sematext.com/spm On Tue, Jun 25

Re: Common practice for free text field

2013-06-25 Thread Manuel Le Normand
By field aliasing I meant something like: f.all_fields.qf=*_txt+*_s+*_int that would sum up to 100 fields On Wed, Jun 26, 2013 at 12:00 AM, Manuel Le Normand < manuel.lenorm...@gmail.com> wrote: > My schema contains about a hundred of fields of various types (int, > strings, plain text, emails).

Common practice for free text field

2013-06-25 Thread Manuel Le Normand
My schema contains about a hundred of fields of various types (int, strings, plain text, emails). I was concerned what is the common practice for searching free text over the index. Assuming there are not boosts related to field matching, these are the options I see: 1. Index and query a "all_f

Re: Name of the couple of popular app/web sites using solar as search engine

2013-06-25 Thread Otis Gospodnetic
How much time have you got? :) http://wiki.apache.org/solr/PublicServers Otis -- Solr & ElasticSearch Support -- http://sematext.com/ Performance Monitoring -- http://sematext.com/spm On Tue, Jun 25, 2013 at 2:45 PM, soumikghosh05 wrote: > Hi All, > > I am planing to use Solar as a search sol

Name of the couple of popular app/web sites using solar as search engine

2013-06-25 Thread soumikghosh05
Hi All, I am planing to use Solar as a search solution for the new application of my company. Can anyone give me couple of names of the popular web sites/application where Solar is being used as search solution. I know eclipse is using Solar. It will help he to convince people. Thanks in Advanc

Re: Name of the couple of popular app/web sites using solar as search engine

2013-06-25 Thread soumikghosh05
Thanks a lot. -- View this message in context: http://lucene.472066.n3.nabble.com/Name-of-the-couple-of-popular-app-web-sites-using-solar-as-search-engine-tp4073157p4073162.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: how to replicate Solr Cloud

2013-06-25 Thread Otis Gospodnetic
Uh, I remember that email, but can't recall where we did it will try to recall it some more and reply if I can manage to dig it out of my brain... Otis -- Solr & ElasticSearch Support -- http://sematext.com/ Performance Monitoring -- http://sematext.com/spm On Tue, Jun 25, 2013 at 2:24 PM,

Re: Name of the couple of popular app/web sites using solar as search engine

2013-06-25 Thread Learner
Check the list here.. http://wiki.apache.org/solr/PublicServers -- View this message in context: http://lucene.472066.n3.nabble.com/Name-of-the-couple-of-popular-app-web-sites-using-solar-as-search-engine-tp4073157p4073160.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: how to replicate Solr Cloud

2013-06-25 Thread Walter Underwood
Also, you have to track two sets of batches, failures, and retries. --wunder On Jun 25, 2013, at 11:30 AM, Kevin Osborn wrote: > Jason, > > My initial reluctance to indexing directly to both data centers is that we > are doing a lot of bulk loading through CSV handler. We never get just 1 > do

Re: how to replicate Solr Cloud

2013-06-25 Thread Kevin Osborn
Jason, My initial reluctance to indexing directly to both data centers is that we are doing a lot of bulk loading through CSV handler. We never get just 1 document at a time. It comes in large batch updates. And now we would have to send the batch updates twice. That is not to say that we won't g

Re: Solr indexer and Hadoop

2013-06-25 Thread Michael Della Bitta
zomghowcanihelp? :) Michael Della Bitta Applications Developer o: +1 646 532 3062 | c: +1 917 477 7906 appinions inc. “The Science of Influence Marketing” 18 East 41st Street New York, NY 10017 t: @appinions | g+: plus.google.com/appinions w: appinions.com <

Re: how to replicate Solr Cloud

2013-06-25 Thread Kevin Osborn
Otis, I did actually stumble upon this link. http://comments.gmane.org/gmane.comp.jakarta.lucene.solr.user/74870 This was from you. You were attempting to replicate data from SolrCloud to some other slaves for heavy-duty queries. You said that you accomplished this. Can you provide a few pointer

Re: how to replicate Solr Cloud

2013-06-25 Thread Jason Hellman
Kevin, I can imagine this working if you consider your second data center a pure slave relationship to your SolrCloud cluster. I haven't tried it, but I don't see why the solrconfig.xml can't identify as a master allowing you to call any of your cores in the cluster to replicate out. That bei

Re: Solr indexer and Hadoop

2013-06-25 Thread Erick Erickson
You might be interested in following: https://issues.apache.org/jira/browse/SOLR-4916 Best Erick On Tue, Jun 25, 2013 at 7:28 AM, Michael Della Bitta wrote: > Jack, > > Sorry, but I don't agree that it's that cut and dried. I've very > successfully worked with terabytes of data in Hadoop that wa

AW: Need Help in migrating Solr version 1.4 to 4.3

2013-06-25 Thread André Widhani
fwiw, I can confirm that Solr 4.x can definitely not read indexes created with 1.4. You'll get an exception like the following: Caused by: org.apache.lucene.index.IndexFormatTooOldException: Format version is not supported (resource: segment _16ofy in resource ChecksumIndexInput(MMapIndexInput

Re: Shard identification

2013-06-25 Thread Erick Erickson
Try sending requests to your shards with &distrib=false. See if the results agree with the SolrCloud graph or whether the docs you get back are inconsistent with the shard labels in the admin page. The &distrib=false bit keeps the query from going to other shards and will tell you if the current st

Re: Need Help in migrating Solr version 1.4 to 4.3

2013-06-25 Thread Erick Erickson
bq: I'm not sure if Solr 4.3 will be able to read Solr 1.4 indexes Solr/Lucene explicitly try to read _one_ major revision backwards. Solr 3.x should be able to read 1.4 indexes. Solr 4.x should be able to read Solr 3.x. No attempt is made to allow Solr 4.x to read Solr 1.4 indexes, so I wouldn't

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread Walter Underwood
We do not know whether Solr will work for you. The only way to find out is to build it and try. You already have a solution that works. Use that. wunder On Jun 25, 2013, at 10:28 AM, pradeep kumar wrote: > Solr is not a solution for my requirement? Please let me know > > Thanks > Pradeep > >

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread Walter Underwood
Sorry, thinking of "man" from Japanese, which is 10K. Using language-specific numbers in an international forum is not a good idea. wunder On Jun 25, 2013, at 10:22 AM, Shawn Heisey wrote: > On 6/25/2013 9:19 AM, Walter Underwood wrote: >> With only 10K records (lahks), a regular RDBMS should b

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread pradeep kumar
Solr is not a solution for my requirement? Please let me know Thanks Pradeep On Tue, Jun 25, 2013 at 10:52 PM, Shawn Heisey wrote: > On 6/25/2013 9:19 AM, Walter Underwood wrote: > >> With only 10K records (lahks), a regular RDBMS should be just fine. I >> don't see any need for Solr with a sm

Re: how to replicate Solr Cloud

2013-06-25 Thread Otis Gospodnetic
I think what is needed is a Leader that, while being a Leader for its own Slice in its local Cluster and Collection (I think I'm using all the latest terminology correctly here), is at the same time a Replica of its own Leader counterpart in the "Primary Cluster". Not currently possible, AFAIK. Or

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread Shawn Heisey
On 6/25/2013 9:19 AM, Walter Underwood wrote: With only 10K records (lahks), a regular RDBMS should be just fine. I don't see any need for Solr with a small dataset like that. Increase the caches sizes on your RDBMS so that all the tables fit in memory. Even with 10Kbytes per record, that is o

Is there a way to capture div tag by id?

2013-06-25 Thread eShard
let's say I have a div with id="myDiv" Is there a way to set up the solr upate/extract handler to capture just that particular div? -- View this message in context: http://lucene.472066.n3.nabble.com/Is-there-a-way-to-capture-div-tag-by-id-tp4073120.html Sent from the Solr - User mailing list a

how to replicate Solr Cloud

2013-06-25 Thread Kevin Osborn
We are going to have two datacenters, each with their own SolrCloud and ZooKeeper quorums. The end result will be that they should be replicas of each other. One method that has been mentioned is that we should add documents to each cluster separately. For various reasons, this may not be ideal fo

SOLR online reference document - WIKI

2013-06-25 Thread Learner
I just came across a wonderful online reference wiki for SOLR and thought of sharing it with the community.. https://cwiki.apache.org/confluence/display/solr/Apache+Solr+Reference+Guide -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-online-reference-document-WIKI-tp40

Re: [solr cloud] solr hangs when indexing large number of documents from multiple threads

2013-06-25 Thread Vinay Pothnis
Jason and Scott, Thanks for the replies and pointers! Yes, I will consider the 'maxDocs' value as well. How do i monitor the transaction logs during the interval between commits? Thanks Vinay On Mon, Jun 24, 2013 at 8:48 PM, Jason Hellman < jhell...@innoventsolutions.com> wrote: > Scott, > > M

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread pradeep kumar
Well.. Just FYI.. 10 lakhs in each normalized tables.. Query time to fetch If linked together won't be big? And data is growing. On 25 Jun 2013 20:49, "Walter Underwood" wrote: > With only 10K records (lahks), a regular RDBMS should be just fine. I > don't see any need for Solr with a small datas

RE: Several Machines Communication Failure

2013-06-25 Thread Ophir Michaeli
Solr Vesrion: 4.3 Solr Cloud Machine 1: running 2 shards - shard 1: java -Dbootstrap_confdir=./solr/collection1/conf -Dcollection.configName=myconf -DzkRun -DnumShards=2 -jar start.jar shard 2: java -Djetty.port=7574 -DzkHost=localhost:9983 -jar start.jar Machine 2: Running 2 replicas - Shard

Re: Solr 4.3 problems with embedded jetty from maven cargo

2013-06-25 Thread Alexandre Rafalovitch
On Tue, Jun 25, 2013 at 11:03 AM, Daniel Exner wrote: > I'm currently trying to build a adept my Solr Maven Project to version > 4.3 but it keeps complaining about missing SLF4j jars. Have you gone through the page describing logging issues: https://wiki.apache.org/solr/SolrLogging . It is been a

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread pradeep kumar
Thanks for your reply Otis, I think i have not explained clearly in my previous email. We are thinking of 2 options for our new reports/analytics/dashboard implementation. *1st option:* Is to have offline database with star schema which makes querying east for generating reports using any report

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread Walter Underwood
With only 10K records (lahks), a regular RDBMS should be just fine. I don't see any need for Solr with a small dataset like that. Increase the caches sizes on your RDBMS so that all the tables fit in memory. Even with 10Kbytes per record, that is only 100Mbytes of data. wunder On Jun 25, 2013,

Solr 4.3 problems with embedded jetty from maven cargo

2013-06-25 Thread Daniel Exner
Hi all, I'm currently trying to build a adept my Solr Maven Project to version 4.3 but it keeps complaining about missing SLF4j jars. The relevant part from my pom.xml looks like that: > > org.codehaus.cargo > cargo-maven2-plugin > 1.3

Re: Help with synonyms

2013-06-25 Thread Shawn Heisey
On 6/25/2013 8:25 AM, Peter Kirk wrote: > Thanks. I'm looking in to it. > > Somehow it appears that the first line in the synonyms file is not registered > as a synonym. Can this be correct, the first line is ignored? I will look into this later and file an issue if necessary, but if this is act

Re: URL search and indexing

2013-06-25 Thread Erik Hatcher
If you want to query by domain, then index the domain (or just the last piece of it). I'd suggest you somehow (either in your indexer code or via clever analysis tricks) peel off the last piece of the domain as its own string field so you get "com", "it", "edu", "gov", etc all as indexed values

RE: shardkey

2013-06-25 Thread Joshi, Shital
Thanks so much for answering! "it looks like you're doing time based sharding, and one would normally not use the compositeId router for that." What would be the recommend router or alternative if we wanted to do time-based sharding? We are using business date to build composite key (it's a St

Re: URL search and indexing

2013-06-25 Thread Flavio Pompermaier
Basically I have to design the solr document and I was thinking that actually users could be more interested in filtering by domain (*.it or *.com), however I cannot exclude more site-related queries (like ' http://lucene.apache.org/solr/*'). >From what I understood I should configure my schema.xml

Re: Solr indexer and Hadoop

2013-06-25 Thread Michael Della Bitta
Jack, Sorry, but I don't agree that it's that cut and dried. I've very successfully worked with terabytes of data in Hadoop that was stored on an Isilon mounted via NFS, for example. In cases like this, you're using MapReduce purely for it's execution model (which existed far before Hadoop and HDF

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread Otis Gospodnetic
Hi Pradeep, 5-6 hours between email and "Any help?" == "not enough patience" :) The advantage of something like Solr over RDBMS with star schema may be that it is easier to scale horizontally than MySQL, or at least that was the case I last looked at horizontal RDBMS partitioning. But if you are

RE: Help with synonyms

2013-06-25 Thread Peter Kirk
Thanks. I'm looking in to it. Somehow it appears that the first line in the synonyms file is not registered as a synonym. Can this be correct, the first line is ignored? /Peter -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: 24. juni 2013 15:22 To: solr

Re: Solr indexer and Hadoop

2013-06-25 Thread Jack Krupansky
??? Hadoop=HDFS If the data is not in Hadoop/HDFS, just use the normal Solr indexing tools, including SolrCell and Data Import Handler, and possibly ManifoldCF. -- Jack Krupansky -Original Message- From: engy.morsy Sent: Tuesday, June 25, 2013 8:10 AM To: solr-user@lucene.apache.or

Re: URL search and indexing

2013-06-25 Thread Jack Krupansky
As Jan indicates, your users could perform regular expression queries on a URL string field, but maybe you should tell us more about your use case and how your users really want to search. One technique is to copy the URL to a tokenized text field. Then, users can search for names and sub-sequ

Re: URL search and indexing

2013-06-25 Thread Flavio Pompermaier
I bought the book and looking at the example I still don't understand if it possible query all sub-urls of my URL. For example, if the URLClassifyProcessorFactory takes in input "url_s":" http://lucene.apache.org/solr/4_0_0/changes/Changes.html"; and makes some outputs like - "url_domain_s":"lucen

Re: Solr, Shards, multi cores and (reverse proxy)

2013-06-25 Thread Upayavira
Create a new RequestHandler config, say /distrib. Requests will be forwarded to /select, which doesn't have the shards parameter, and everything will be just fine. Upayavira On Tue, Jun 25, 2013, at 02:17 PM, medley wrote: > Thanks. > > It is working now and the QTime has been divided by 10. >

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread pradeep kumar
Any help? On 25 Jun 2013 13:35, "pradeep kumar" wrote: > Sure, > > First of all thanks a lot everyone for very quick reply. > > We have a Ordering system which has a lakhs of records so far in a > normalized RDBMS tables, say Order, Item, Details etc. We are planning to > have a offline database

Re: Solr, Shards, multi cores and (reverse proxy)

2013-06-25 Thread medley
Thanks. It is working now and the QTime has been divided by 10. I would like to put the shard parameters in the requesthandler. I have one solr-config.xml file by core. Is it possible to have a common solr-config.xml file and in that case, a common requesthandler ? Regards Medley -- View thi

Re: Solr indexer and Hadoop

2013-06-25 Thread Otis Gospodnetic
But note that MapReduce and HDFS are not the only way to go. For example, can you split your source data? If you can, you could do that, put them on N machines, and run indexer on all of them, each for some number of threads. Of course, your Solr(Cloud?) cluster better have enough servers/CPU cor

Re: URL search and indexing

2013-06-25 Thread Jack Krupansky
There are examples in my book: http://www.lulu.com/shop/jack-krupansky/solr-4x-deep-dive-early-access-release-1/ebook/product-21079719.html But... I still think you should use a tokenized text field as well - use all three: raw string, tokenized text, and URL classification fields. -- Jack Kru

RE: Solr indexer and Hadoop

2013-06-25 Thread James Thomas
>> The problem I am facing is how to read those data from hard disks which are >> not HDFS If you are planning to use a Map-Reduce job to do the indexing then the source data will definitely have to be on HDFS. The Map function can transform the source data to Solr documents and send them to So

Re: URL search and indexing

2013-06-25 Thread Flavio Pompermaier
That's sound exactly what I'm looking for! However I cannot find an example of how to use it..could you help me please? Moreover, about id field, isn't true that id field shouldn't be analyzed as suggested in http://wiki.apache.org/solr/UniqueKey#Text_field_in_the_document? On Tue, Jun 25, 2013 a

Re: URL search and indexing

2013-06-25 Thread Jan Høydahl
Sure you can query the url directly. Or if you choose you can split it up in multiple components, e.g. using http://lucene.apache.org/solr/4_3_0/solr-core/org/apache/solr/update/processor/URLClassifyProcessor.html -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. ju

Re: Several Machines Communication Failure

2013-06-25 Thread Jan Høydahl
Hi, We cannot help you based on this brief email. Please provide a much more detailed description. Version of Solr, SolrCloud or not. How exactly have you done this move? Relevant configuration snippets, relevant log snippets of what goes wrong... -- Jan Høydahl, search solution architect Comi

Re: URL search and indexing

2013-06-25 Thread Flavio Pompermaier
Sorry but maybe I miss something here..could I declare url as key field and query it too..? At the moment, my schema.xml looks like: ... url Is it ok? or should I add a "baseurl" field of some kind to be able to query all url coming from a certain domain (1st or 2nd leve

Re: Solr indexer and Hadoop

2013-06-25 Thread engy.morsy
Thank you Jack. So, I need to convert those nodes holding data to HDFS. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-indexer-and-Hadoop-tp4072951p4073013.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Pivot-Facets with ranges

2013-06-25 Thread Jack Krupansky
No, facet.pivot takes a comma-separated list of "fields", with no support for "ranges". But, you can have a combination of field and range facets without pivoting. -- Jack Krupansky -Original Message- From: Jakob Frank Sent: Tuesday, June 25, 2013 6:14 AM To: solr-user@lucene.apache.

Re: Pivot-Facets with ranges

2013-06-25 Thread Upayavira
You can only do this with some index time work. If you index the date field rounded to the various levels you need, then you can pivot facet on your rounded date. At present you will need to do this rounding in your own indexing code before it gets near to Solr. However, I have created some roundin

Re: Solr indexer and Hadoop

2013-06-25 Thread Jack Krupansky
Solr does not have any integrated Hadoop/HDFS crawling or indexing support today. Sorry. LucidWorks Search does have HDFS crawling support: http://docs.lucidworks.com/display/lweug/Using+the+High+Volume+HDFS+Crawler Cloudera Search has HDFS support as well. -- Jack Krupansky -Original Mes

Re: Book progress (Solr 4.x Deep Dive) - see my blog

2013-06-25 Thread Jack Krupansky
Please report any comments or issues to my email address or comment on my blog. Comments on the blog will benefit other readers, but the choice is yours. Thanks! -- Jack Krupansky -Original Message- From: Bernd Fehling Sent: Tuesday, June 25, 2013 2:06 AM To: solr-user@lucene.apache

Several Machines Communication Failure

2013-06-25 Thread Ophir Michaeli
Hi, I have a 2 Solr shards and 2 replicas running on the same machine ok. When I try to put each shard/replica on another machine (and set the ips accordingly) it fails, or work slowly, and fails sometimes. Any explanation for this behavior? Thanks

Re: URL search and indexing

2013-06-25 Thread Jan Høydahl
Probably a good match for the RegExp feature of Solr (given that your url is not tokenized) e.g. q=url:/.*\.it$/ -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. juni 2013 kl. 12:17 skrev Flavio Pompermaier : > Hi to everybody, > I'm quite new to Solr so maybe my q

URL search and indexing

2013-06-25 Thread Flavio Pompermaier
Hi to everybody, I'm quite new to Solr so maybe my question could be trivial for you.. In my use case I have to index stuff contained in some URL so i use url as key of my document and I treat it like a string. However I'd like to be able to query by domain name, like *.it or *. somesite.com, what

Pivot-Facets with ranges

2013-06-25 Thread Jakob Frank
Hi all, is it possible using SOLR 4.3 to combine pivot-facets with (date) range facets? Currently, what I get is sth. like date 2001-06-19T20:31:12Z 1 date 2001-06-20T09:40:35Z 1 cat public

Re: Updating solrconfig and schema.xml for solrcloud in multicore setup

2013-06-25 Thread Utkarsh Sengar
I believe I am hitting this bug: https://issues.apache.org/jira/browse/SOLR-4805 I am using solr 4.3.1 -Utkarsh On Tue, Jun 25, 2013 at 2:56 AM, Utkarsh Sengar wrote: > Yes, I have tried zkCli and it works. > But I also need to restart solr after the schema change right? > > I tried to reload

Re: Updating solrconfig and schema.xml for solrcloud in multicore setup

2013-06-25 Thread Utkarsh Sengar
Yes, I have tried zkCli and it works. But I also need to restart solr after the schema change right? I tried to reload the core, but I think there is an open bug where a core reload is successful but a shard goes down for that core. I just tried it out, i.e tried to reload a core after config chan

Re: Updating solrconfig and schema.xml for solrcloud in multicore setup

2013-06-25 Thread Jan Høydahl
Hi, As I understand, your initial bootstrap works ok (boostrap_conf). What you want help with is *changing* the config on a live system. That's when you are encouraged to use zkCli and don't mess with trying to let Solr bootstrap things - after all it's not a bootstrap anymore, it's a change. D

Re: Updating solrconfig and schema.xml for solrcloud in multicore setup

2013-06-25 Thread Utkarsh Sengar
But as when I launch a solr instance without "-Dbootstrap_conf=true", just once core is launched and I cannot see the other core. This behavior is the same as Mark's reply here: http://mail-archives.apache.org/mod_mbox/lucene-dev/201205.mbox/%3cbb7ad9bf-389b-4b94-8c1b-bbfc4028a...@gmail.com%3E -

Re: Solr Document inside the document

2013-06-25 Thread Jan Høydahl
Documents in Solr are flat - they contain a flat list of fields. Depending on your requirements for the "doc inside doc", there may be several workarounds: * field naming, i.e. "mysubdoc.title", "mysubdoc.author"... to flatten the document inside the main one * query time join as demonstrated i

Re: Updating solrconfig and schema.xml for solrcloud in multicore setup

2013-06-25 Thread Jan Høydahl
Hi, The -Dbootstrap_confdir option is really only meant for a first-time bootstrap for your development environment, not for serious use. Once you got your config into ZK you should modify the config directly in ZK. There are many tools (also 3rd party) for this. But your best choice is probabl

Re: String field does not yield partial match result using qf parameter

2013-06-25 Thread Jan Høydahl
fieldType "string" is not tokenized, so your observation is correct. You need to use a fieldType with analysis and tokenization to get the behavior you want. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 25. juni 2013 kl. 02:35 skrev "Mugoma Joseph O." : > > It loo

Re: Solr Document inside the document

2013-06-25 Thread Gora Mohanty
On 25 June 2013 14:02, Siva Prasad Janapati wrote: > > Hi, > > I have a requirement where i need to create document inside the document. [...] What do you mean by this? Create it inside which document? One being POSTed to Solr? Regards, Gora

Solr Document inside the document

2013-06-25 Thread Siva Prasad Janapati
Hi, I have a requirement where i need to create document inside the document. For example, ... ... ... . Is there any way to configure the document like this? Regards, Siva [http://smarttechies.wordpress.com/] http://smarttechies.wordpress.com/

Updating solrconfig and schema.xml for solrcloud in multicore setup

2013-06-25 Thread Utkarsh Sengar
Hello, I am trying to update schema.xml for a core in a multicore setup and this is what I do to update it: I have 3 nodes in my solr cluster. 1. Pick node1 and manually update schema.xml 2. Restart node1 with -Dbootstrap_conf=true java -Dsolr.solr.home=multicore -DnumShards=3 -Dbootstrap_conf=

Re: Can we use Solr to serve data to web analytics & Dashboard charts

2013-06-25 Thread pradeep kumar
Sure, First of all thanks a lot everyone for very quick reply. We have a Ordering system which has a lakhs of records so far in a normalized RDBMS tables, say Order, Item, Details etc. We are planning to have a offline database (star schema) and develop reports, data analytical charts with drill

Re: Shard identification

2013-06-25 Thread Shalin Shekhar Mangar
Firstly, using 1 zookeeper machine is not at all ideal. See http://wiki.apache.org/hadoop/ZooKeeper/FAQ#A7 I've never personally seen such an issue. Can you give screen shots of the cloud graph on each node? Use an image hosting service because the mailing list won't allow attachments. On Tue, Ju

Re: Need Help in migrating Solr version 1.4 to 4.3

2013-06-25 Thread Shalin Shekhar Mangar
You must carefully go through the upgrade instructions starting from 1.4 upto 4.3. In particular the instructions for 1.4 to 3.1 and from 3.1 to 4.0 should be given special attention. On Tue, Jun 25, 2013 at 11:43 AM, Sandeep Gupta wrote: > Hello All, > > We are planning to migrate solr 1.4 to So

Solr indexer and Hadoop

2013-06-25 Thread engy.morsy
Hi All, I have TB of data that need to be indexed. I am trying to use hadoop to index those TB. I am still newbie. I thought that the Map function will read data from hard disks and the reduce function will index them. The problem I am facing is how to read those data from hard disks which are n

RE: Shard identification

2013-06-25 Thread Ophir Michaeli
Thanks for the response. I use 4.3 and have this issue. -Original Message- From: Upayavira [mailto:u...@odoko.co.uk] Sent: Tuesday, June 18, 2013 12:57 PM To: solr-user@lucene.apache.org Subject: Re: Shard identification What version of Solr? I had something like this on 4.2.1. Upgragin

Re: Restarting SOLR will remove all cache?

2013-06-25 Thread Toke Eskildsen
On Tue, 2013-06-25 at 07:35 +0200, William Bell wrote: > It goes restart the MMap stuff though. You cannot be sure of that. It is not mandatory, but the system should share memory mapped files with the disk cache. https://en.wikipedia.org/wiki/Memory-mapped_file#Benefits - Toke Eskildsen, State a