problem in Velocity spell output

2013-01-14 Thread anurag.jain
if i am giving input antrag then it is showing following line. Did you mean {collationQuery=anurag,hits=1,misspellingsAndCorrections={aturag=anurag}}? i want output in format of Did you mean anurag ? how can i solve it please give me solution.. Thanks in advance Reply please urgent --

Regarding Copyfield

2013-01-14 Thread anurag.jain
hi in copy field i am not storing first_name last_name etc. but in dest = "text" it is showing first_name .. etc. in auto suggestion mode. my copy field are .. ... and field are .. ... and also i want to make own field like text named as "autosuggest" t

Re: core.SolrCore - java.io.FileNotFoundException

2013-01-14 Thread Jun Wang
The problem is occured recently again, this time exception is 2013-01-14 10:17:23,865 ERROR core.SolrCore - java.io.FileNotFoundException: /home/admin/index/core_p_shard4/index/_1ozb.fnm (No such file or directory) at java.io.RandomAccessFile.open(Native Method) at java.io.RandomAc

Re: Is there any way to check what index-time document boost value is?

2013-01-14 Thread Jack Krupansky
The "norm" function query gives you the combination of the index-time boost and length-normalization. And it's a low-resolution approximation at that. That's all that is stored, so that's as good as you can get. See: http://wiki.apache.org/solr/FunctionQuery#norm I believe that "norm" should s

Re: performing a boolean query (OR) with a large number of terms

2013-01-14 Thread Otis Gospodnetic
Hi, Also have a look at mm=0% acting as OR if you end up using dismax: http://wiki.apache.org/solr/ExtendedDisMax#mm_.28Minimum_.27Should.27_Match.29 Otis -- Solr & ElasticSearch Support http://sematext.com/ On Thu, Jan 10, 2013 at 7:47 AM, Erick Erickson wrote: > No, you're pretty much on

Re: I/O exception (java.net.SocketException) caught when processing request: Connection reset

2013-01-14 Thread Otis Gospodnetic
Hi, I suspect you might find some information about the cause on the server side, in your container's logs. I'd look there for the real source of the problem before trying to just reconnect from the client side. Otis -- Solr & ElasticSearch Support http://sematext.com/ On Mon, Jan 14, 2013

RE: SOlr 3.5 and sharding

2013-01-14 Thread Jean-Sebastien Vachon
Ok that was my first thought... thanks for the confirmation -Original Message- From: Michael Ryan [mailto:mr...@moreover.com] Sent: January-14-13 10:06 AM To: solr-user@lucene.apache.org Subject: RE: SOlr 3.5 and sharding If you have the same documents -- with the same uniqueKey -- acro

Re: incorrect solr update behavior

2013-01-14 Thread Gary Yngve
Of course, as soon as I post this, I discover this: https://issues.apache.org/jira/browse/SOLR-4134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13537900#comment-13538174 i'll give this patch a spin in the morning. (this is not an example of how to use a

Is there any way to check what index-time document boost value is?

2013-01-14 Thread Alexandre Rafalovitch
Hello, I have indexed a document with an assigned document-level boost factor. Is there any way to double-check that the boost factor is actually recorded/used? I tried debug.explain.structured but it does not seem to have it. Regards, Alex. Personal blog: http://blog.outerthoughts.com/ Link

Re: RSS tutorial that comes with the apache-solr not indexing

2013-01-14 Thread Steve Rowe
Yes, thanks Alex, I've fixed solr/example/example-DIH/solr/rss/conf/rss-data-config.xml On Jan 14, 2013, at 2:08 PM, Alexandre Rafalovitch wrote: > Is that something that needs to be updated in the example schema as well > then? > > Regards, > Alex. > > Personal blog: http://blog.outerthoug

Re: Need another way to boost relavence of recent content

2013-01-14 Thread Shawn Heisey
On 1/14/2013 2:56 PM, Markus Jelsma wrote: Depending on use case the functions max, min, scale and map can be used really well to regulate the output of recip. Check their docs and you'll surely work it out. Perhaps scale will work best for you. I need someone to sanity check my work here. H

I/O exception (java.net.SocketException) caught when processing request: Connection reset

2013-01-14 Thread Joe
I have a multi-threaded application in solrj 4. The threads (max 25) share one connection to HttpSolrServer. Each thread is running one query. This worked fine for a while, until it finally crashed with the following messages: Jan 12, 2013 12:52:15 PM org.apache.http.impl.client.DefaultRequestDir

RE: Need another way to boost relavence of recent content

2013-01-14 Thread Markus Jelsma
Hi, Depending on use case the functions max, min, scale and map can be used really well to regulate the output of recip. Check their docs and you'll surely work it out. Perhaps scale will work best for you. Cheers -Original message- > From:Shawn Heisey > Sent: Mon 14-Jan-2013 22:5

RE: Need another way to boost relavence of recent content

2013-01-14 Thread Markus Jelsma
Hi, Depending on use case the functions max, min, scale and map can be used really well to regulate the output of recip. Check their docs and you'll surely work it out. Perhaps scale will work best for you. Cheers -Original message- > From:Shawn Heisey > Sent: Mon 14-Jan-2013 22:50 >

Need another way to boost relavence of recent content

2013-01-14 Thread Shawn Heisey
I implemented the date boosting function outline here, placed into the boost parameter in the request handler: http://wiki.apache.org/solr/FunctionQuery#Date_Boosting Today it was reported to me that this boosting is producing terrible results. A close look at the description reveals that thi

SolrCloud :: Adding replica :: Sync-up issue

2013-01-14 Thread Mishkin, Ernest
Hello, I observed a rather weird issue with SolrCloud. Using Solr 4.0 GA code. Started with a 3-node Zookeeper ensemble (standalone) and a single Solr instance running single collection. numShards was set to 1 during collection creation (don't want sharding, just replication). Everything worke

Re: Index data from multiple tables into Solr

2013-01-14 Thread hassancrowdc
thanx, I got it. How Can i integrate solr with my website? so that i can use it for search? On Mon, Jan 14, 2013 at 4:04 PM, Lance Norskog-2 [via Lucene] < ml-node+s472066n4033291...@n3.nabble.com> wrote: > Try all of the links under the collection name in the lower left-hand > columns. There s

Re: POST query with non-ASCII to solr using httpclient wont work

2013-01-14 Thread Uwe Reh
Hi Jie, maybe there is a simple solution. When we used tomcat as servlet container for solr I notices similar problems. Even with the hints from the solr wiki about unicode and Tomcat, i wasn't able to fix this. So we switched back to Jetty, querys like "q=allfields2%3A能力" are reliable now.

Re: Index data from multiple tables into Solr

2013-01-14 Thread Lance Norskog
Try all of the links under the collection name in the lower left-hand columns. There several administration monitoring tools you may find useful. On 01/14/2013 11:45 AM, hassancrowdc wrote: ok stats are changing, so the data is indexed. But how can i do query with this data, or ow can i search

Re: Question about GC logging timestamps

2013-01-14 Thread Mikhail Khludnev
Shawn, you are welcome! http://www.oracle.com/technetwork/java/gc-tuning-5-138395.html The flag -XX:+PrintGCTimeStamps will additionally print a time stamp at the start of each collection. On Sun, Jan 6, 2013 at 6:54 AM, Michael Ryan wrote: > From my own experience, the timestamp seems to be

Re: Search across a specified number of boundaries

2013-01-14 Thread Mikhail Khludnev
Mike, When Lucene's Analyser indexes the text it adds positions into the index which are lately used by SpanQueries. Have you considered idea of position increment gap? e.g. the first sentence is indexed with words positions: 0,1,2,3,... the second sentence with 100,101,102,103,..., third 200,201,

Re: theory of sets

2013-01-14 Thread Mikhail Khludnev
My answer as usual - BlockJoin. index group as a parent document, and every membership as a child doc. In this case you somehow "denormalize" your items - every item will be indexed N times where N is a number of groups it belongs to. Potentially it can lead to duplication problem, but you haven't

Re: RSS tutorial that comes with the apache-solr not indexing

2013-01-14 Thread Gora Mohanty
On 15 January 2013 00:38, Alexandre Rafalovitch wrote: > Is that something that needs to be updated in the example schema as well > then? The example rss-data-config.xml references http://rss.slashdot.org/Slashdot/slashdot which seems to be broken at the moment, at least for me. This is the same

Re: POST query with non-ASCII to solr using httpclient wont work

2013-01-14 Thread Jie Sun
unfortunately solrj is not an option here... we will have to make a quick fix with a patch out in production. I am still unable to make the solr (3.5) take url encoded query. again passing non-urlencoded query string works with non-ASIIC (Chinese), but fails return anything when sending request wi

Velocity in Multicore

2013-01-14 Thread Ramirez, Paul M (388J)
Hi, I've been unable to get the velocity response writer to work in a multicore environment. Working from the examples that are distributed with Solr I simply started from the multicore example and added a hello.vm into core0/conf/velocity directory. I then updated the solrconfig.xml to add a n

Re: performance improvements on ip look up query

2013-01-14 Thread Mikhail Khludnev
Hello Lee, I suppose caching isn't efficient for this type of searches. I can propose a kind of trick. if you index your docs in order by (STARTIP,ENDIP) tuple, it should make intersection faster. However it's rather theoretical consideration than practical one. More real one is to encode ip rang

Re: RSS tutorial that comes with the apache-solr not indexing

2013-01-14 Thread bibhor
Hi Steve Thank you for your help. After I updated the rss-data-config to following, it worked. http://www.feedforall.com/sample.xml"; processor="XPathEntityProcessor" forEach="/rss/channel/item" transformer="DateFormatTransformer">

Re: configuring schema to match database

2013-01-14 Thread Gora Mohanty
On 14 January 2013 17:28, Jens Grivolla wrote: > On 01/14/2013 12:50 PM, Gora Mohanty wrote: [...] >> Doh! You are right, of course. Brainfart from my side. > > > Ok, I was starting to wonder if I was the one missing something. Re-reading > what I wrote I see I may have sounded a bit rude, that wa

Re: RSS tutorial that comes with the apache-solr not indexing

2013-01-14 Thread Steve Rowe
Hi bibhor, I looked at and , and their top-level structure is: ... This doesn't match your or your Steve On Jan 14, 2013, at 1:02 PM, bibhor wrote: > Hi, > I did try another RSS from here "http://www

Re: RSS tutorial that comes with the apache-solr not indexing

2013-01-14 Thread bibhor
Hi, I did try another RSS from here "http://www.feedforall.com/sample.xml"; but it also didnt work and came back with same message saying it indexed 0 documents. This is my data from rss-data-config.xml http://www.feedforall.com/sample.xml"; processor="XPathEnti

Re: Solr 4.0 SnapPuller version vs. generation issue

2013-01-14 Thread Mark Miller
I've fixed this - thanks Gregg. https://issues.apache.org/jira/browse/SOLR-4303 - Mark On Jan 10, 2013, at 5:41 PM, Mark Miller wrote: > Hmm…I don't recall that change. We use the force, so SolrCloud certainly does > not depend on it. > > It seems like it might be a mistake - some dev code t

Re: Schema Field Names i18n

2013-01-14 Thread Lance Norskog
Will a field have different names in different languages? There is no facility for 'aliases' for field name. Erick is right, this sounds like you need query and update components to implement this. Also, you might try using URL-encoding for the field names. This would save my sanity. On 01/10/

Re: RSS tutorial that comes with the apache-solr not indexing

2013-01-14 Thread Lance Norskog
This example may be out of date, if the RSS feeds from Slashdot have changed. If you know XML and XPaths, try this: Find an rss feed from somewhere that works. Compare the xpaths in it v.s. the xpaths in the DIH script. On 01/13/2013 07:38 PM, bibhor wrote: Hi I am trying to use the RSS tutori

RE: SOlr 3.5 and sharding

2013-01-14 Thread Michael Ryan
If you have the same documents -- with the same uniqueKey -- across multiple shards, the count will not be what you expect. You'll need to ensure that each document exists only on a single shard. -Michael -Original Message- From: Jean-Sebastien Vachon [mailto:jean-sebastien.vac...@wante

How to use shardId

2013-01-14 Thread starbuck
Hi all, I am trying to realize a solr cloud cluster with 2 collections and 4 shards each with 2 replicates hosted by 4 solr instances. If shardNum parm is set to 4 and all solr instances are started after each other it seems to work fine. What I wanted to do now is removing shardNum from JAVA_OPT

Re: regex and highlighter component: highlight and return individual fragments inside a snippet

2013-01-14 Thread Dmitry Kan
it seems, hl.snippets does what I'm after, also discussed here: http://search-lucene.com/m/yxCOc1X0uY42/highlighting+multiple+occurrences&subj=highlighting+multiple+occurrences http://wiki.apache.org/solr/*Highlighting*Parameters#hl.snippets

Re: Calculate a sum.

2013-01-14 Thread stockii
Mikhail Khludnev wrote > You can spend some heap for uninverting the index and > utilize wiki.apache.org/solr/StatsComponent what do you mean with this? Edward Garrett wrote > how many documents are you working with? ~90 million documents ... -- View this message in context: http://lucene

Re: theory of sets

2013-01-14 Thread Alexandre Rafalovitch
Does this have to be in Solr? Given the pre-computed nature of the sub-series, maybe you can encode both series name and series sort order in a separate structure designed for it. Something like Neo4J comes to mind: http://www.neo4j.org/ . Or, this could be a good question for StackOverflow on wh

Re: SolrCloud sort inconsistency

2013-01-14 Thread Erick Erickson
P.S. of course your sorting won't reflect documents that haven't been committed yet. So if you straighten out the params your lists should be in order, but the documents returned may change depending on whether your indexing process adds docs between calls Erick On Mon, Jan 14, 2013 at 7:25

Re: SolrCloud sort inconsistency

2013-01-14 Thread Erick Erickson
Unless it's a cut-n-paste error, you don't have an & in front of the sort parameter so you're not sorting at all. You should have a "sort" section in your response where the params are echoed, something like: "params":{ "sort":"id desc", "fl":"id", "cache":"False", "q":

Re: How to manage solr cloud collections-sharding?

2013-01-14 Thread Erick Erickson
I can at least answer part of this see inline. On Sun, Jan 13, 2013 at 11:44 AM, adfel70 wrote: > Hi, > I know a few question on this issue have already been posted, but I dint > find full answers in any of those posts. > > I'm using solr-4.0.0 > [EOE] I'd _really_ start working with a nigh

Re: configuring schema to match database

2013-01-14 Thread Jens Grivolla
On 01/14/2013 12:50 PM, Gora Mohanty wrote: On 14 January 2013 16:59, Jens Grivolla wrote: [...] Then please show me the query to find users that are fluent in spanish and english. Bonus points if you manage to not retrieve the same user several times. (Hint, your schema stores only one languag

Re: Calculate a sum.

2013-01-14 Thread Edward Garrett
i've had perfectly fine performance with StatsComponent, but have only tested with 50,000 documents. for example i have field syllables and numeric field syllables_count. then i sum the syllable count for any search query. how many documents are you working with? On Mon, Jan 14, 2013 at 10:54 AM,

Re: configuring schema to match database

2013-01-14 Thread Gora Mohanty
On 14 January 2013 16:59, Jens Grivolla wrote: [...] > Then please show me the query to find users that are fluent in spanish and > english. Bonus points if you manage to not retrieve the same user several > times. (Hint, your schema stores only one language skill per row). Doh! You are right, of

Re: configuring schema to match database

2013-01-14 Thread Jens Grivolla
On 01/11/2013 06:14 PM, Gora Mohanty wrote: On 11 January 2013 22:30, Jens Grivolla wrote: [...] Actually, that is what you would get when doing a join in an RDBMS, the cross-product of your tables. This is NOT AT ALL what you typically do in Solr. Best start the other way around, think of So

Re: theory of sets

2013-01-14 Thread Uwe Reh
Am 08.01.2013 10:26, schrieb Uwe Reh: OK, OK, I will try it again with dynamic fields. NO! dynamic fields are nice, but not for my problem. :-( I got more than *52* new fields. I was wrong, the impact on searching is really reasonable. But have you ever used the Admin's Schema Browser wit

Re: Calculate a sum.

2013-01-14 Thread Mikhail Khludnev
Stored fields are famous for its' slowness as well as they requires two io operation per doc. You can spend some heap for uninverting the index and utilize wiki.apache.org/solr/StatsComponent Let us know whether it works for you. 14.01.2013 13:14 пользователь "stockii" написал: > hello. > > My pr

RE: CoreAdmin STATUS performance

2013-01-14 Thread Shahar Davidson
Hi Stefan, I have opened issue SOLR-4302 and attached the suggested patch. Regards, Shahar. -Original Message- From: Stefan Matheis [mailto:matheis.ste...@gmail.com] Sent: Sunday, January 13, 2013 3:11 PM To: solr-user@lucene.apache.org Subject: Re: CoreAdmin STATUS performance Shahar

Re: Calculate a sum.

2013-01-14 Thread stockii
Hey, thx for your reply. i forgot to say. StatsComponent doesnt work with our application. too slow and buggy. but i test with this component with version 1.4 ... maybe some bugfixes in 4.0 ? this is the reason for calculating the sum on client side and some pages. but sometimes its too much for

Re: Calculate a sum.

2013-01-14 Thread Rafał Kuć
Hello! Fetching all the documents, especially for a query that returns many documents can be a pain. However there is a StatsComponent (http://wiki.apache.org/solr/StatsComponent) in Solr, however your field would have to be numeric and indexed. -- Regards, Rafał Kuć Sematext :: http://semat

Calculate a sum.

2013-01-14 Thread stockii
hello. My problem is, that i need to calculate a sum of amounts. this amount is in my index (stored="true"). my php script get all values with paging. but if a request takes too long, jetty is killing this process and i get a "broken pipe". Which is the best/fastest way to get the values of many