Sorl Help!

2014-02-24 Thread Quốc Nguyễn
Dear sir, To Apache Solr support! wish you have a good day! I'm new in Solr, please help me to confirm bellow information : 1. "The URL must use the standard ports for HTTP (80) and HTTPS (443). The port is implied by the scheme, but may also be mentioned in the URL as long as the port is st

Sorl integrate with GAE

2014-02-24 Thread Quốc Nguyễn
Dear sir, To Apache Solr support! wish you have a good day! I'm new in Solr, please help me to confirm bellow information : 1. "The URL must use the standard ports for HTTP (80) and HTTPS (443). The port is implied by the scheme, but may also be mentioned in the URL as long as the port is st

start.jar config

2014-02-24 Thread Can Arel
Hi all, I have a server which uses Solr and for some reason the solr got terminated. When I restart it with java -jar start.jar it uses stdout as logger. Should I just redirect this with > to a file location or is there an idomatic Solr way this should be done? Thanks, Can

Re: start.jar config

2014-02-24 Thread manju16832003
Solr already writes the logs to a file 'solr.log'. Its located in the same folder as start.jar (logs/solr.log). I'm not sure if thats what you looking for :-). -- View this message in context: http://lucene.472066.n3.nabble.com/start-jar-config-tp4119201p4119203.html Sent from the Solr - User

Re: Converting solrdocument response into pojo

2014-02-24 Thread Navaa
Thank you Alexander for your reply. Here I am posting my schema definition But I am not able to resolve this issue please tell me where I am going wrong. -- View this message in context: http://lucene.472066.n3.nabble.com/Converting-solrdocument-response-into-pojo-tp4118743p4119

URLDataSource : Issue assigning single xpath field name to two solr fields

2014-02-24 Thread manju16832003
I'm not sure if I would be missing any configuration params here, however when I tried to assign an xpath field from URLDataSource (XML end point) to two fields defined in schema.xml. Here is my scenario, I have two fields *profile_display* and *profile_indexed* My assignment in DataImpotHandler

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Gora Mohanty
On 24 February 2014 12:51, Chandan khatua wrote: > Hi, > > > > We have raw binary data stored in database(not word,excel,xml etc files) in > BLOB. > > We are trying to index using TikaEntityProcessor but nothing seems to get > indexed. > > But the same configuration works when xml/word/excel files

Re: Luke 4.6.1 released

2014-02-24 Thread Dmitry Kan
Yes, indeed. Every release of luke is tested on the corresponding solr version's indexes. The indexes are created based on the exampledocs of the solr package. Dmitry On Mon, Feb 17, 2014 at 12:41 AM, Bill Bell wrote: > Yes it works with Solr > > Bill Bell > Sent from mobile > > > > On Feb 16,

Re: Sorl integrate with GAE

2014-02-24 Thread Gora Mohanty
On 24 February 2014 12:39, Quốc Nguyễn wrote: > Dear sir, > > To Apache Solr support! > wish you have a good day! > > I'm new in Solr, please help me to confirm bellow information : > > 1. "The URL must use the standard ports for HTTP (80) and HTTPS (443). > The port is implied by the scheme,

Re: URLDataSource : Issue assigning single xpath field name to two solr fields

2014-02-24 Thread Gora Mohanty
On 24 February 2014 14:45, manju16832003 wrote: > I'm not sure if I would be missing any configuration params here, however > when I tried to assign an xpath field from URLDataSource (XML end point) to > two fields defined in schema.xml. > > Here is my scenario, > I have two fields > *profile_disp

Re: DistributedSearch: Skipping STAGE_GET_FIELDS?

2014-02-24 Thread Shalin Shekhar Mangar
I opened SOLR-5768 https://issues.apache.org/jira/browse/SOLR-5768 On Mon, Feb 24, 2014 at 12:56 AM, Shalin Shekhar Mangar wrote: > Yes that should be simple. But regardless of the parameter, the > fl=id,score use-case should be optimized by default. I think I'll > commit the patch as-is and ope

RE: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Chandan khatua
Hi Gora ! Your concern was "What is the type of the column used to store the binary data in Oracle?" The column type is BLOB in DB. The column can also have rich text file. Regards, Chandan -Original Message- From: Gora Mohanty [mailto:g...@mimirtech.com] Sent: Monday, February 24, 20

Re: Wikipedia Data Cleaning at Solr

2014-02-24 Thread Furkan KAMACI
My input is that: {| style="text-align: left; width: 50%; table-layout: fixed;" border="0" |} Analysis is as follows: WT textraw_bytesstartendtypeflagsposition style[73 74 79 6c 65]3801 text[74 65 78 74]101402 align[61 6c 69 67 6e]152003 left[6c 65 66 74]222604 width[77 69 64 74 68]283305 50[35

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Raymond Wiker
I've done something like this; the key was to use a FieldStreamDataSource to read from the BLOB field. Something like then ... On Mon, Feb 24, 2014 at 11:04 AM, Chandan khatua wrote: > Hi Gora ! > > Your concern was "What is the t

RE: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Chandan khatua
Hi Raymond ! I've data-config.xml like bellow: This is looks like similar to your configuration. But when xml data are in BLOB in database, indexing is done. But, when binary data are in BLOB in database, in

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Raymond Wiker
Try replacing the inner entity with something like --- this assumes that you get the blob from a column named "MESSAGE" in the outer entity ("messages"). On Mon, Feb 24, 2014 at 11:51 AM, Chandan khatua wrote: > Hi Raymond ! > > I've data-config.xml like bellow: > > > > url="jdbc:o

RE: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Chandan khatua
I've tried as per your guide. But, no data are indexing. The output of Query screen looks like : 2158 http://www.w3.org/1999/xhtml";> 1460918369230258176 But, the indexed data should be displayed within tag. When xml message are stored in DB in BLOB type, then in

Re: URLDataSource : Issue assigning single xpath field name to two solr fields

2014-02-24 Thread Shalin Shekhar Mangar
The XPathEntityProcessor supports only one field mapping per xpath so using copyField is the only way. On Mon, Feb 24, 2014 at 2:45 PM, manju16832003 wrote: > I'm not sure if I would be missing any configuration params here, however > when I tried to assign an xpath field from URLDataSource (XML

Inconsistency between Leader and replica in solr cloud

2014-02-24 Thread abhijit das
We are currently using Solr Cloud Version 4.3, with the following set-up, a core with 2 shards - Shard1 and Shard2, each shard has replication factor 1. We have noticed that in one of the shards, the document differs between the leader and the replica. Though the doc exists in both the machines

Re: Fault Tolerant Technique of Solr Cloud

2014-02-24 Thread Vineet Mishra
Can you brief as how to make a direct call to Zookeeper instead of Cloud Collection(as currently I was querying the Cloud something like *"http://192.168.2.183:8900/solr/collection1/select?q=*:* "* ) from UI, now if I assume shard 8900 is dow

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Raymond Wiker
Try running the query for the outer entity ("messages") in an sql client, and verify that your blob column is called MESSAGE. On Mon, Feb 24, 2014 at 12:22 PM, Chandan khatua wrote: > I've tried as per your guide. But, no data are indexing. > The output of Query screen looks like : > > > 21

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Gora Mohanty
On 24 February 2014 15:34, Chandan khatua wrote: > Hi Gora ! > > Your concern was "What is the type of the column used to store the binary > data in Oracle?" > The column type is BLOB in DB. The column can also have rich text file. Um, your original message said that it does *not* contain richte

Re: Need feedback: Browsing and searching solr-user list emails

2014-02-24 Thread Dmitry Kan
Hello! Just few random points: 1. Interesting site. I'd say there are similar sites, but this one has cleaner interface. How does your site compare to this one, for example, in terms of feature set? http://qnalist.com/questions/4640870/luke-4-6-0-released At least, the user ranking seems to be

Reg queryResultCache...

2014-02-24 Thread Senthilnathan Vijayaraja
Below is the url which will hit the middle layer then middle layer will form the solr query and fire it. *listing?offset=0&sortparam=0&limit=20&q=Chennai~Tambaram~1~2,3~45~2500~800~2000~~24* Chennai-->city Tambaram-->locality 1-->blah 2,3-->blah 45~2500-->price_min and max 800~20

Re: Fault Tolerant Technique of Solr Cloud

2014-02-24 Thread Shalin Shekhar Mangar
Vineet, I'm assuming that you are executing your search from a Java Client. If so, just use CloudSolrServer present in the Solrj API and save yourself from all these troubles. If you are not using a Java client, then you need to put a few or all your servers behind a load balancer and invoke reques

Re: Reg queryResultCache...

2014-02-24 Thread Shalin Shekhar Mangar
Please provide the *Solr* queries that are being invoked by your middle layer along with the results you expect and the results you actually got from Solr with cache-enabled. On Mon, Feb 24, 2014 at 6:23 PM, Senthilnathan Vijayaraja wrote: > Below is the url which will hit the middle layer then m

Re: Inconsistency between Leader and replica in solr cloud

2014-02-24 Thread Yago Riveiro
This bug was fixed on Solr 4.6.1— /Yago Riveiro On Mon, Feb 24, 2014 at 11:56 AM, abhijit das wrote: > We are currently using Solr Cloud Version 4.3, with the following set-up, a > core with 2 shards - Shard1 and Shard2, each shard has replication factor 1. > We have noticed that in one of the

Re: Fault Tolerant Technique of Solr Cloud

2014-02-24 Thread Per Steffensen
On 24/02/14 13:04, Vineet Mishra wrote: Can you brief as how to make a direct call to Zookeeper instead of Cloud Collection(as currently I was querying the Cloud something like *"http://192.168.2.183:8900/solr/collection1/select?q=*:* "* )

RE: Inconsistency between Leader and replica in solr cloud

2014-02-24 Thread Markus Jelsma
Yes, that issue is fixed. We are on trunk and seeing it happen again. Kill some nodes when indexing, trigger OOM or reload the collection and you are in trouble again. -Original message- > From:Yago Riveiro > Sent: Monday 24th February 2014 14:54 > To: solr-user@lucene.apache.org > Sub

Re: start.jar config

2014-02-24 Thread Erick Erickson
Probably when originally started, whoever did it piped the output to dev/null. You can also change this permanently by altering the logging, see: https://wiki.apache.org/solr/SolrLogging Best, Erick On Mon, Feb 24, 2014 at 12:56 AM, manju16832003 wrote: > Solr already writes the logs to a file

Slow query time on stemmed fields

2014-02-24 Thread Jens Meiners
Hi, we've built an index (Solr 4.3), which contains approx. 1 Million docs and its size is around 20 GB (optimized). In our index we have one field which contains the tokenized words of indexed documents and a second field with the stemmed contents (SnowballFilter, German2). During our tests we'

Re: Slow query time on stemmed fields

2014-02-24 Thread Erick Erickson
This is really strange. You should have _fewer_ tokens in your stemmed field. Plus, the up-front processing to stem the field in the query shouldn't be noticeable. Let's see the query and results from &debug=all being added to the URL because something is completely strange here. Best, Erick On

Re: Slow query time on stemmed fields

2014-02-24 Thread Jack Krupansky
Maybe some heap/GC issue from using more of this 20 GB index. Maybe it was running at the edge and just one more field was too much for the heap. The "timing" section of the debug query response should shed a little light. -- Jack Krupansky -Original Message- From: Erick Erickson Sen

Re: DistributedSearch: Skipping STAGE_GET_FIELDS?

2014-02-24 Thread Gregg Donovan
Thank you Shalin and Yonik! Both SOLR-1880 and SOLR-5768 will be very helpful for our distributed search performance. On Mon, Feb 24, 2014 at 5:02 AM, Shalin Shekhar Mangar < shalinman...@gmail.co

Re: Converting solrdocument response into pojo

2014-02-24 Thread Alexandre Rafalovitch
On Mon, Feb 24, 2014 at 8:03 PM, Navaa wrote: > So, you are probably supplying an id and then also merging doctor_id and id fields together. Which gives you two fields values in ID. I would have expected Solr to complaint about it, but either way you have a design issue here. Regards, Alex.

RE: Solr4 performance

2014-02-24 Thread Joshi, Shital
Thanks. We found some evidence that this could be the issue. We're monitoring closely to confirm this. One question though: none of our nodes show more that 50% of physical memory used. So there is enough memory available for memory mapped files. Can this kind of pause still happen? -

Re: DistributedSearch: Skipping STAGE_GET_FIELDS?

2014-02-24 Thread Jeff Wartes
I¹ll second that thank-you, this is awesome. I asked about this issue in 2010, but when I didn¹t hear anything (and disappointingly didn¹t find SOLR-1880), we ended up rolling our own version of this functionality. I¹ve been laboriously migrating it every time we bump our Solr version ever since.

SolrCloud Startup

2014-02-24 Thread KNitin
Hi I have a 4 node solrcloud cluster with more than 50 collections with 4 shards each. Everytime I want to make a schema change, I upload configs to zookeeper and then restart all nodes. However the restart of every node is very slow and takes about 20-30 minutes per node. Is it recommended to m

Re: SolrCloud Startup

2014-02-24 Thread Jeff Wartes
There is a RELOAD collection command you might try: https://cwiki.apache.org/confluence/display/solr/Collections+API#Collection sAPI-api2 I think you¹ll find this a lot faster than restarting your whole JVM. On 2/24/14, 4:12 PM, "KNitin" wrote: >Hi > > I have a 4 node solrcloud cluster with

Re: Solr4 performance

2014-02-24 Thread Michael Della Bitta
I'm not sure how you're measuring free RAM. Maybe this will help: http://www.linuxatemyram.com/play.html Michael Della Bitta Applications Developer o: +1 646 532 3062 appinions inc. "The Science of Influence Marketing" 18 East 41st Street New York, NY 10017 t: @appinions

Fetching uniqueKey and other int quickly from documentCache?

2014-02-24 Thread Gregg Donovan
We fetch a large number of documents -- 1000+ -- for each search. Each request fetches only the uniqueKey or the uniqueKey plus one secondary integer key. Despite this, we find that we spent a sizable amount of time in SolrIndexSearcher#doc(int docId, Set fields). Time is spent fetching the two sto

Re: SolrCloud Startup

2014-02-24 Thread Shawn Heisey
> Hi > > I have a 4 node solrcloud cluster with more than 50 collections with 4 > shards each. Everytime I want to make a schema change, I upload configs to > zookeeper and then restart all nodes. However the restart of every node is > very slow and takes about 20-30 minutes per node. > > Is it re

Re: SolrCloud Startup

2014-02-24 Thread Otis Gospodnetic
Hi, Slow startup could it be your transaction logs are being replayed? Are they very big? Do you see lots of disk reading during those 20-30 minutes? Shawn was referring to http://wiki.apache.org/solr/SolrPerformanceProblems Otis -- Performance Monitoring * Log Analytics * Search Analytics

Re: SolrCloud Startup

2014-02-24 Thread Erick Erickson
What is your firstSearcher set to in solrconfig.xml? If you're doing something really crazy there that might be an issue. But I think Otis' suggestion is a lot more probable. What are your autocommits configured to? Best, Erick On Mon, Feb 24, 2014 at 7:41 PM, Shawn Heisey wrote: > > Hi > > >

Can not index document in solar

2014-02-24 Thread rachun
Dear all, Could you guys please help me? I just try to index document into solar it doesn't give me any error but it doesn't index document too but it used to work but not now please see.. #Solr Log WARN - 2014-02-25 11:30:35.675; org.apache.solr.handler.loader.XMLLoader; Unknown attribute id

Re: Can not index document in solar

2014-02-24 Thread Erick Erickson
Well, what was that last thing you changed? There's really not much here to go on, you have to provide more details about what you've tried, what evidence you have that the doc isn't indexed, etc. have you looked at your Solr admin screen to see if maxDoc has increased? Have you committed your ch

Re: Can not index document in solar

2014-02-24 Thread rachun
Thank you Eric, I figured out something that actually the document is indexed but it doesn't show on my api result because it missed some field. So I would like to delete this post how can I? Thank you very much, Chun. -- View this message in context: http://lucene.472066.n3.nabble.com/Can-n

Re: Can not index document in solar

2014-02-24 Thread Erick Erickson
No, can't delete posts. Having them around keeps a history for others as well, so that's an added benefit. Glad you figured it out! Erick On Mon, Feb 24, 2014 at 9:20 PM, rachun wrote: > Thank you Eric, > I figured out something that actually the document is indexed but it > doesn't > show on

RE: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Chandan khatua
I have verified that blob column is called MESSAGE. In my data-config file the field column named 'id' is indexed in solr. But the data(field column name="mxMsg") is not indexed. It comes empty with in quotes. The same configuration is working on xml data (stored BLOB type in DB), But not on bin

Re: Reg queryResultCache...

2014-02-24 Thread Senthilnathan Vijayaraja
Hi, select?br="2"+"3"&version=2&fl=id,level,name,city,amenities,$lanorm,$relscore,$bscore&q=*:*&fq={!lucene q.op=OR df=property_type v=$ptype}&ptype=1&fq={!lucene q.op=OR df=city v=$cit}&sort=$bscore desc,$relscore desc&cit=Chennai&relscore=product($banorm,15)&bscore=banorm($la,amenities,10)&la=8

Apache Solr Beginner's Guide

2014-02-24 Thread leon715
For more info : http://www.packtpub.com/apache-solr-beginners-guide/book -- View this message in context: http://lucene.472066.n3.nabble.com/Apache-Solr-Beginner-s-Guide-tp4119486.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Apache Solr Beginner's Guide

2014-02-24 Thread Alexandre Rafalovitch
Yes, and? There is a bunch of books on Solr, including a couple for the beginners. Packt in particular has obviously gone for the volume approach :-) If yo have a question about the book, you may want to send it to Packt or book author or other forums. Or rephrase it as a specific Solr questions t

Sort order in field collapsing (group.field=xxxx)

2014-02-24 Thread Sesha Sendhil Subramanian
Hey, I have a solr cloud setup with 2 shards. I am trying to use solr's field grouping feature My query looks like q=*:*&fq=field:value&group=true&group.field=otherValue The *ordering* of the groups *differs based on which shard the query is fired from* and it seems that docs located in that sam