Re: How to define my data in schema.xml

2013-06-17 Thread Mysurf Mail
Thanks for your reply. I have tried the simplest approach and it works absolutely fantastic. Huge table - 0s to result. two problems as I described earlier, and that is what I try to solve: 1. I create a flat table just for solar. This requires maintenance and develop. Can I run solr over my regul

RE: SolrJ Howto get local params from QueryResponse

2013-06-17 Thread Holger Rieß
You're right, Jack. I use the local parameters not only to the exclusion of the filter, but also as a kind of marker. I'll try it with QueryResponse.getHeader(). Thank you. Holger Rieß > -Original Message- > From: Jack Krupansky [mailto:j...@basetechnology.com] > Sent: Monday, June 17, 2

Re: Solr data files

2013-06-17 Thread Mysurf Mail
Thanks., On Mon, Jun 17, 2013 at 10:42 PM, Alexandre Rafalovitch wrote: > The index files are under the the collection's directory in the > subdirectory called 'data'. Right next to the directory called 'conf' > where your schema.xml and solrconfig.xml live. > > If the Solr is not running, you c

Re: SOLR Cloud - Disable Transaction Logs

2013-06-17 Thread Shalin Shekhar Mangar
Yes, but at what cost? You are thinking of replacing disk IO with even more slower network IO. The transaction log is a append-only log -- it is not pretty cheap especially so if you compare it with the indexing process. Plus your write request/sec will drop a lot once you start doing synchronous r

Re: what does a zero score mean?

2013-06-17 Thread Gora Mohanty
On 18 June 2013 10:49, Joe Zhang wrote: > I issued a simple query ("apple") to my collection and got 201 documents > back, all of which are scored 0. What does this mean? --- The documents do > contain the query words. My guess is that the float-valued score is getting converted to an integer. Yo

what does a zero score mean?

2013-06-17 Thread Joe Zhang
I issued a simple query ("apple") to my collection and got 201 documents back, all of which are scored 0. What does this mean? --- The documents do contain the query words.

Re: How to define my data in schema.xml

2013-06-17 Thread Gora Mohanty
On 18 June 2013 01:10, Mysurf Mail wrote: > Thanks for your quick reply. Here are some notes: > > 1. Consider that all tables in my example have two columns: Name & > Description which I would like to index and search. > 2. I have no other reason to create flat table other than for solar. So I > w

Re: mm (Minimum 'Should' Match)

2013-06-17 Thread Jack Krupansky
No, although the Query Parsers chapter of the book has a little more description for "mm". I'm hoping I'll have a draft of the book published on Friday. The LucidWorks doc for minMatch in their query parser is here: http://docs.lucidworks.com/display/lweug/Minimum+Match+for+Simple+Queries -

Re: mm (Minimum 'Should' Match)

2013-06-17 Thread anand_solr
Thank Jack. Do you have any link to documents where I can refer for more? I saw one of the link mentioning something similar that I can extend? Do you think this will help? http://everydaydeveloper.blogspot.com/2013/03/minimum-match-per-index-field-solr.html?m=1 On Mon, Jun 17, 2013 at 10:59

Upgrading from 3.6.1 to 4.3.0 and Custom collector

2013-06-17 Thread Peyman Faratin
Hi I am migrating from Lucene 3.6.1 to 4.3.0. I am however not sure how to migrate my custom collector below. this page http://lucene.apache.org/core/4_3_0/MIGRATE.html gives some hints but the instructions are incomplete and looking at the source examples of custom collectors make me want to

Re: mm (Minimum 'Should' Match)

2013-06-17 Thread Jack Krupansky
The "mm" parameter only applies to the top level query, not nested queries. At the top level you have: (...) AND (...) And it's an AND, not OR. The LucidWorks Search query parser does support minMatch at any level, such as: (...)~2 AND (...)~2 -- Jack Krupansky -Original Message-

Re: Avoiding OOM fatal crash

2013-06-17 Thread Roman Chyla
I think you can modify the response writer and stream results instead of building them first and then sending in one go. I am using this technique to dump millions of docs in json format - but in your case you may have to figure out how to dump during streaming if you don't want to save data to dis

mm (Minimum 'Should' Match)

2013-06-17 Thread anand_solr
I am not sure if this is supported out of box in Solr. Search by giving multiple facet fields and query containing set of values for each facet field & minimum should match parameter for each facets. The result should be document contains facets with minimum match. Eg: solr documents of type ele

Re: yet another optimize question

2013-06-17 Thread Otis Gospodnetic
Hi Robi, This goes against the original problem of getting OOMEs, but it looks like each of your Solr caches could be a little bigger if you want to eliminate evictions, with the query results one possibly not being worth keeping if you can't get the hit % up enough. Otis -- Solr & ElasticSearch

Re: yet another optimize question

2013-06-17 Thread Otis Gospodnetic
Yes, in one of the example solrconfig.xml files this is right above the merge factor definition. Otis -- Solr & ElasticSearch Support -- http://sematext.com/ On Mon, Jun 17, 2013 at 8:00 PM, Petersen, Robert wrote: > Hi Upayavira, > > You might have gotten it. Yes we noticed maxdocs was way

Re: Avoiding OOM fatal crash

2013-06-17 Thread Mark Miller
There is a java cmd line arg that lets you run a command on OOM - I'd configure it to log and kill -9 Solr. Then use runit or something to supervice Solr - so that if it's killed, it just restarts. I think that is the best way to deal with OOM's. Other than that, you have to write a middle laye

RE: yet another optimize question

2013-06-17 Thread Petersen, Robert
Hi Upayavira, You might have gotten it. Yes we noticed maxdocs was way bigger than numdocs. There were a lot of files ending in '.del' in the index folder also. We started on 1.3 also. I don't currently have any solr config settings for MergePolicy at all. Am I going to want to put someth

Re: Start custom Java component on Solr start?

2013-06-17 Thread Chris Hostetter
: What is the best thing in Solr to hook into that would allow me to : start (and keep running) a custom piece of code when Solr starts? Say : I want to have something that pulls data from an external queue from : within Solr and indexes it into Solr and I want it start and stop : together with t

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Yago Riveiro
I do all the indexing through a HTTP POST, with replicationFactor=1 no problem, if is higher deadlock problems can appear A stack trace like this http://lucene.472066.n3.nabble.com/updating-docs-in-solr-cloud-hangs-td4067388.html#a4067862 is that I get -- Yago Riveiro Sent with Sparrow (http:

Re: dynamic field

2013-06-17 Thread Rafał Kuć
Hello! Dynamic field is just a regular field from Lucene point of view, so its content will be treated just like the content of other fields. The difference in on the Solr level. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - ElasticSearch > How is daynamic field in

dynamic field

2013-06-17 Thread Mingfeng Yang
How is daynamic field in solr implemented? Does it get saved into the same Document as other regular fields in lucene index? Ming-

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Mark Miller
If it actually happens with replicationFactor=1, it doesn't likely have anything to do with the update handler issue I'm referring to. In some cases like these, people have better luck with Jetty than Tomcat - we test it much more. For instance, it's setup to help avoid search side distributed d

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Rishi Easwaran
Update!! This happens with replicationFactor=1 Just for kicks I created a collection with a 24 shards, replicationfactor=1 cluster on my exisiting benchmark env. Same behaviour, SOLR cloud just hangs. Nothing in the logs, top/heap/cpu most metrics looks fine. Only indication seems to be netstat

Re: Parallel queries on a single core

2013-06-17 Thread Shawn Heisey
On 6/17/2013 2:32 PM, Manuel Le Normand wrote: Hello all, Assuming I have a single shard with a single core, how do run multi-threaded queries on Solr 4.x? To run multithreaded queries, just send them at the same time, as you mention below. Solr will run them in parallel, within the limits of

Re: Avoiding OOM fatal crash

2013-06-17 Thread Manuel Le Normand
Unfortunately my organisation's too big to control or teach every employee what are the limits, as well as they can vary (many facets - how much is ok?, asking for too many fields in proportion of too many rows etc) Don't you think it is preferable to "commit" the maxBufferSize in the JVM heap for

Re: Solr cloud: zkHost in solr.xml gets wiped out

2013-06-17 Thread Al Wold
Hi Erick, I tried out your changes from the branch_4x branch. It looks good in terms of preserving the zkHost, but I'm running into an exception because it isn't persisting the instanceDir attribute on the element. I've got a few other things I need to take care of, but as soon as I have time

Re: Parallel queries on a single core

2013-06-17 Thread Manuel Le Normand
Yes, that answers the first part of my question, thanks. So saying N (equally heavy) queries agains N CPUs would run simultaneously, right? Previous posting suggest high qps rate can be solved perfomance-wise by having high replicationFactor. But what's the benefit (performance wise) compared to

Re: Parallel queries on a single core

2013-06-17 Thread Otis Gospodnetic
If I understand your question correctly - what happens with Solr and N parallel queries is not much different from what happens with N processes running in the OS - they all get a slice of the CPU time to do their work. Not sure if that answers your question...? Otis -- Solr & ElasticSearch Suppo

Re: Start custom Java component on Solr start?

2013-06-17 Thread Otis Gospodnetic
Hi, Hm, right, although once Solr stops being a webapp, this won't work any more... Otis -- Solr & ElasticSearch Support http://sematext.com/ On Mon, Jun 17, 2013 at 4:14 PM, Al Wold wrote: > I've used a servlet context listener before and it works pretty well. You > just have a write a sm

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Yago Riveiro
I can confirm that the deadlock happen with only 2 replicas by shard. I need shutdown one node that host a replica of the shard to recover the indexation capability. -- Yago Riveiro Sent with Sparrow (http://www.sparrowmailapp.com/?sig) On Monday, June 17, 2013 at 6:44 PM, Rishi Easwaran wrot

Re: Solr large boolean filter

2013-06-17 Thread Otis Gospodnetic
Btw. ElasticSearch has a nice feature here. Not sure what it's called, but I call it "named filter". http://www.elasticsearch.org/blog/terms-filter-lookup/ Maybe that's what OP was after? Otis -- Solr & ElasticSearch Support http://sematext.com/ On Mon, Jun 17, 2013 at 4:59 PM, Alexandre R

Re: Avoiding OOM fatal crash

2013-06-17 Thread Walter Underwood
Make them aware of what is required. Solr is not designed to return huge requests. If you need to do this, you will need to run the JVM with a big enough heap to build the request. You are getting OOM because the JVM does not have enough memory to build a response with 100K documents. wunder

Spread the word - Opening at AOL Mail Team in Dulles VA

2013-06-17 Thread Rishi Easwaran
Hi All, With the economy the way it is and many folks still looking. Figured this is a good place as any to publish this. Just today, we got an opening for mid-senior level Software Engineer in our team. Experience with SOLR is a big+. Feel free to have a look at this position. http://www.link

Re: Avoiding OOM fatal crash

2013-06-17 Thread Walter Underwood
Don't request 100K docs in a single query. Fetch them in smaller batches. wunder On Jun 17, 2013, at 1:44 PM, Manuel Le Normand wrote: > Hello again, > > After a heavy query on my index (returning 100K docs in a single query) my > JVM heap's floods and I get an JAVA OOM exception, and then that

Re: SOLR Cloud - Disable Transaction Logs

2013-06-17 Thread Rishi Easwaran
Shalin, Just some thoughts. Near Real time replication- don't we use solrCmdDistributor, which send requests immediately to replicas with a clonedRequest, as an option can't we achieve something similar form CloudSolrserver in Solrj instead of leader doing it. As long as 2 nodes receive write

Re: Solr large boolean filter

2013-06-17 Thread Alexandre Rafalovitch
On Mon, Jun 17, 2013 at 12:35 PM, Igor Kustov wrote: > So I'm using query like > http://127.0.0.1:8080/solr/select?q=*:*&fq={!mqparser}id:%281%202%203%29 If the IDs are purely numeric, I wonder if the better way is to send a bitset. So, bit 1 is on if ID:1 is included, bit 2000 is on if ID:2000 i

Re: Avoiding OOM fatal crash

2013-06-17 Thread Manuel Le Normand
One of my users requested it, they are less aware of what's allowed and I don't want apriori blocking them for long specific request (there are other params that might end up OOMing me). I thought of timeAllowed restriction, but also this solution cannot guarantee during this delay I would not get

Avoiding OOM fatal crash

2013-06-17 Thread Manuel Le Normand
Hello again, After a heavy query on my index (returning 100K docs in a single query) my JVM heap's floods and I get an JAVA OOM exception, and then that my GCcannot collect anything (GC overhead limit exceeded) as these memory chunks are not disposable. I want to afford queries like this, my conc

column of linked table can not be displayed

2013-06-17 Thread Jenny Huang
I am importing data from two database tables into solr. The main table is called 'gene'. The other table is called 'taxon'. The two tables are connected through 'taxon' column in 'gene' table and 'taxon_oid' column in 'taxon' table. In another word, 'gene.taxon = taxon.taxon_oid'. I want the '

Parallel queries on a single core

2013-06-17 Thread Manuel Le Normand
Hello all, Assuming I have a single shard with a single core, how do run multi-threaded queries on Solr 4.x? Specifically, if one user sends a heavy query (legitimate wildcard query for 10 sec), what happens to all other users quering during this period? If the repsonse is that simultaneous queri

Re: Start custom Java component on Solr start?

2013-06-17 Thread Shalin Shekhar Mangar
I assume you don't want it per-core. Custom CoreAdminHandler maybe? On Tue, Jun 18, 2013 at 1:33 AM, Otis Gospodnetic < otis.gospodne...@gmail.com> wrote: > Hi, > > What is the best thing in Solr to hook into that would allow me to > start (and keep running) a custom piece of code when Solr star

Re: Start custom Java component on Solr start?

2013-06-17 Thread Al Wold
I've used a servlet context listener before and it works pretty well. You just have a write a small Java class to receive the event when the app is started, then add it to web.xml. I don't think there's much good official documentation, but this blog post outlines it pretty simply: http://www.

Start custom Java component on Solr start?

2013-06-17 Thread Otis Gospodnetic
Hi, What is the best thing in Solr to hook into that would allow me to start (and keep running) a custom piece of code when Solr starts? Say I want to have something that pulls data from an external queue from within Solr and indexes it into Solr and I want it start and stop together with the Sol

Re: shardkey

2013-06-17 Thread Shalin Shekhar Mangar
No, there is no way to do that right now. I think you'd be better off using custom sharding because you can't really control that two shardKeys must go to two different shards. We can only guarantee that docs with the same shardKey will goto the same shard. On Mon, Jun 17, 2013 at 9:47 PM, Joshi,

Re: SOLR Cloud - Disable Transaction Logs

2013-06-17 Thread Shalin Shekhar Mangar
It is also necessary for near real-time replication, peer sync and recovery. On Tue, Jun 18, 2013 at 1:04 AM, Rishi Easwaran wrote: > Hi, > > Is there a way to disable transaction logs in SOLR cloud. As far as I can > tell no. > Just curious why do we need transaction logs, seems like an I/O int

Re: Solr data files

2013-06-17 Thread Alexandre Rafalovitch
The index files are under the the collection's directory in the subdirectory called 'data'. Right next to the directory called 'conf' where your schema.xml and solrconfig.xml live. If the Solr is not running, you can delete that directory to clear the index content. I don't think you can do that w

Re: How to define my data in schema.xml

2013-06-17 Thread Mysurf Mail
Thanks for your quick reply. Here are some notes: 1. Consider that all tables in my example have two columns: Name & Description which I would like to index and search. 2. I have no other reason to create flat table other than for solar. So I would like to see if I can avoid it. 3. If in my exampl

SOLR Cloud - Disable Transaction Logs

2013-06-17 Thread Rishi Easwaran
Hi, Is there a way to disable transaction logs in SOLR cloud. As far as I can tell no. Just curious why do we need transaction logs, seems like an I/O intensive operation. As long as I have replicatonFactor >1, if a node (leader) goes down, the replica can take over and maintain a durable state

Solr data files

2013-06-17 Thread Mysurf Mail
Where are the core data files located? Can I just delete folder/files in order to quick clean the core/indexes? Thanks

Re: yet another optimize question

2013-06-17 Thread Upayavira
The key figures are numdocs vs maxdocs. Maxdocs-numdocs is the number of deleted docs in your index. This is a 3.6 system you say. But has it been upgraded? I've seen folks who've upgraded from 1.4 or 3.0/3.1 over time, keeping the old config. The consequence of this is that they don't get the rig

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Rishi Easwaran
FYI..you can ignore http4ClientExpiryService thread in the stack dump. Its a dummy executor service, i created to test out something, unrelated to this issue. -Original Message- From: Rishi Easwaran To: solr-user Sent: Mon, Jun 17, 2013 2:54 pm Subject: Re: Solr Cloud Hangs c

Re: SolrJ Howto get local params from QueryResponse

2013-06-17 Thread Jack Krupansky
The "LocalParams" are just the prefix on the query parameters (e.g., "facet.field") themselves - what you sent on the original query. I mean, you constructed those original parameters in your app code, right? You can also call QueryResponse#getHeader and then locate the original query paramet

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Rishi Easwaran
Mark, I got a few stack dumps of the instance that was stuck ssdtest-d03:8011 http://apaste.info/cofK http://apaste.info/sv4M http://apaste.info/cxUf I can get dumps of others if needed. Thanks, Rishi. -Original Message- From: Mark Miller To: solr-user Sent: Mon, Jun 17, 201

Re: New operator.

2013-06-17 Thread Roman Chyla
Hello Yanis, We are probably using something similar - eg. 'functional operators' - eg. edismax() to treat everything inside the bracket as an argument for edismax, or pos() to search for authors based on their position. And invenio() which is exactly what you describe, to get results from externa

Re: Solr large boolean filter

2013-06-17 Thread Mikhail Khludnev
nonono, mate! I warn you before by 'Mind term ecoding due to field type!' you need to obtain schema from request, then access fieldtype and convert external string representation into (might be) tricky encoded bytes by readableToIndexed() see FieldType.getFieldQuery() btw, it's a really frequent

Re: New operator.

2013-06-17 Thread Yanis Kakamaikis
Hi all, thanks for your reply. I want to be able to ask a combined query, a normal solr querym but one of the query fields should get it's answer not from within the solr engine, but from an external engine. the rest should work normaly with the ability to do more tasks on the answer like faceti

RE: yet another optimize question

2013-06-17 Thread Petersen, Robert
Hi Otis, Right I didn't restart the JVMs except on the one slave where I was experimenting with using G1GC on the 1.7.0_21 JRE. Also some time ago I made all our caches small enough to keep us from getting OOMs while still having a good hit rate.Our index has about 50 fields which are mos

Refresh implicit core properties after a SWAP

2013-06-17 Thread aus...@3bx.org
I noticed that Shawn mentioned ( https://issues.apache.org/jira/browse/SOLR-4732) that “when you rename or swap cores, the solr.core.name property does NOT get updated until you restart Solr”. I’m wondering if there’s any way possible to update this property other than restarting the entire Solr a

Re: Is there a way to encrypt username and pass in the solr config file

2013-06-17 Thread Gora Mohanty
On 17 June 2013 21:41, Mysurf Mail wrote: > Hi, > I want to encrypt (rsa maybe?) my user name/pass in solr . > Cant leave a simple plain text on the server. > What is the recomended way? I don't think that there is a way to encrypt this information at the moment. The recommended way would be to

Re: How to define my data in schema.xml

2013-06-17 Thread Gora Mohanty
On 17 June 2013 21:39, Mysurf Mail wrote: > Hi, > I have created a flat table from my DB and defined a solr core on it. > It works excellent so far. > > My problem is that my table has two hierarchies. So when flatted it is too > big. What do you mean by "too big"? Have you actually tried indexin

Re: Solr Cloud Hangs consistently .

2013-06-17 Thread Mark Miller
Could you give a simple stack trace dump as well? It's likely the distributed update deadlock that has been reported a few times now - I think usually with a replication factor greater than 2, but I can't be sure. The deadlock involves sending docs concurrently to replicas and I wouldn't have e

Solr Cloud Hangs consistently .

2013-06-17 Thread Rishi Easwaran
Hi All, I am trying to benchmark SOLR Cloud and it consistently hangs. Nothing in the logs, no stack trace, no errors, no warnings, just seems stuck. A little bit about my set up. I have 3 benchmark hosts, each with 96GB RAM, 24 CPU's and 1TB SSD. Each host is configured to have 8 SOLR cloud

Re: Filtered Query in Solr

2013-06-17 Thread Upayavira
you have likely indexed it as a text/analysed field, not as a string field. Your usage suggests that "MUSIC ALBUM" should be a single term, thus you should index it as a string field. Upayavira On Mon, Jun 17, 2013, at 05:21 PM, Prathik Puthran wrote: > "MUSIC ALBUM" is the value of one of the fi

Re: Sorting by field is slow

2013-06-17 Thread Shane Perry
Turns out it was a case of an oversite. My warming queries weren't setting the sort order and as a result don't successfully complete. After setting the sort order things appear to be responding quickly. Thanks for the help. On Mon, Jun 17, 2013 at 9:45 AM, Shane Perry wrote: > Using 4.3.1-SN

Infinite Solr's node recovery loop after ungraceful shutdown of majority of nodes in a cluster

2013-06-17 Thread serhiy.ivanov
Hi Solr Community, We're currently experimenting with test SolrCloud setup and doing some weird failover test scenarios to check how system reacts. Basically, I do have 3 nodes in my Solr Cloud. Cloud is using external ZooKeeper ensemble with 3 nodes. ZooKeeper seems to be working pretty predicta

Re: Solr large boolean filter

2013-06-17 Thread Igor Kustov
Menawhile I'm currently trying to write custom QParser which will use FieldCacheTermsFilter So I'm using query like http://127.0.0.1:8080/solr/select?q=*:*&fq={!mqparser}id:%281%202%203%29 And I couldn't make it work - I just couldn't find a proper constructor and also not sure that i'm filterin

Re: Filtered Query in Solr

2013-06-17 Thread Prathik Puthran
"MUSIC ALBUM" is the value of one of the field (asset_type) in the indexed document. On Mon, Jun 17, 2013 at 9:06 PM, Jack Krupansky wrote: > What does the actual indexed data look like? Maybe "ALBUM" doesn't > immediately follow "MUSIC", at least in that particular field. Or, maybe > you added

RE: shardkey

2013-06-17 Thread Joshi, Shital
Thanks for the links. It was very useful. Is there a way to use implicit router WITH numShards parameter? We have 5 shards and business day (Monday-Friday) is our shardkey. We want to be able to say Monday -> shard1, Tuesday -> shard2. -Original Message- From: Joel Bernstein [mai

Is there a way to encrypt username and pass in the solr config file

2013-06-17 Thread Mysurf Mail
Hi, I want to encrypt (rsa maybe?) my user name/pass in solr . Cant leave a simple plain text on the server. What is the recomended way? Thanks.

Re: out of memory during indexing do to large incoming queue

2013-06-17 Thread Shawn Heisey
On 6/17/2013 4:32 AM, Yoni Amir wrote: I was wondering about your recommendation to use facet.method=enum? Can you explain what is the trade-off here? I understand that I gain a benefit by using less memory, but what with I lose? Is it speed? The problem with facet.method=fc (the default) and

How to define my data in schema.xml

2013-06-17 Thread Mysurf Mail
Hi, I have created a flat table from my DB and defined a solr core on it. It works excellent so far. My problem is that my table has two hierarchies. So when flatted it is too big. Lets consider the following example scenario My Tables are School Students (1:n with school) Teachers(1:n with scho

Re: Sorting by field is slow

2013-06-17 Thread Shane Perry
Using 4.3.1-SNAPSHOT I have identified where the issue is occurring. For a query in the format (it returns one document, sorted by field4) +(field0:UUID0) -field1:string0 +field2:string1 +field3:text0 +field4:"text1" with the field types the method FieldC

Re: Filtered Query in Solr

2013-06-17 Thread Jack Krupansky
What does the actual indexed data look like? Maybe "ALBUM" doesn't immediately follow "MUSIC", at least in that particular field. Or, maybe you added "MUSIC" and "ALBUM" as two separate values for that field and Solr then implicitly added the +100 position gap between them. -- Jack Krupansky

Re: Need assistance in defining solr to process user generated query text

2013-06-17 Thread Mysurf Mail
I have one fact table with a lot of string columns and a few GUIDs just for retreival (Not for search) On Mon, Jun 17, 2013 at 6:01 PM, Jack Krupansky wrote: > It sounds like you have your text indexed in a "string" field (why the > wildcards are needed), or that maybe you are using the "keywor

Re: Filtered Query in Solr

2013-06-17 Thread Prathik Puthran
Can you please explain why the 2nd one works? On Mon, Jun 17, 2013 at 8:49 PM, Prathik Puthran < prathik.puthra...@gmail.com> wrote: > The first one i.e. fq=asset_type:"MUSIC ALBUM" doesen't work. > > However the 2nd one works > fq=asset_type:(+MUSIC +ALBUM) > > Thanks for the response. > > Re

Re: Filtered Query in Solr

2013-06-17 Thread Prathik Puthran
The first one i.e. fq=asset_type:"MUSIC ALBUM" doesen't work. However the 2nd one works fq=asset_type:(+MUSIC +ALBUM) Thanks for the response. Regards, Prathik On Mon, Jun 17, 2013 at 8:41 PM, Upayavira wrote: > Your fq query is: > > fq=asset_type:MUSIC ALBUM > > This is actually interpreted

Re: Filtered Query in Solr

2013-06-17 Thread Upayavira
Your fq query is: fq=asset_type:MUSIC ALBUM This is actually interpreted as: fq=asset_type:MUSIC text:ALBUM You probably want: fq=asset_type:"MUSIC ALBUM" or fq=asset_type:(+MUSIC +ALBUM) or even: fq:{!term f=asset_type}MUSIC ALBUM Upayavira On Mon, Jun 17, 2013, at 03:57 PM, Prathik Puthran w

Any way to have the suggest component be filter query aware?

2013-06-17 Thread Brendan Grainger
Hi All, I expect the answer is no, but just to be sure I am wondering if there is any way to make the suggest component (http://wiki.apache.org/solr/Suggester) filter query aware, i.e. I'd like to have suggestions for a given context, so say if I were searching in the book lucene in action suggest

Re: Need assistance in defining solr to process user generated query text

2013-06-17 Thread Jack Krupansky
It sounds like you have your text indexed in a "string" field (why the wildcards are needed), or that maybe you are using the "keyword" tokenizer rather than the standard tokenizer. What is your default or query fields for dismax/edismax? And what are the field types for those fields? -- Jac

Filtered Query in Solr

2013-06-17 Thread Prathik Puthran
Hi, I am making a select request to solr with with 'fq=asset_type:MUSIC ALBUM' (see query 1 below) as one of the GET parameter. This request does not return any results. However when I send the select request with the parameter 'asset_type=MUSIC ALBUM'(see query 2 below) I get the results. Does t

SolrJ Howto get local params from QueryResponse

2013-06-17 Thread Holger Rieß
Hi, how can I get local params like '{!ex=dyn,cls} AAA001001_0_1.1.1_ss' from QueryResponse? I've tagged filter queries and facet fields with different tags (p.e.'dyn','cls'). I can see the tags in the QueryResponse XML facet.field section: {!ex=dyn}AAA001001_0_1.1.1_ss ...

Need assistance in defining solr to process user generated query text

2013-06-17 Thread Mysurf Mail
Hi, I have been reading solr wiki pages and configured solr successfully over my flat table. I have a few question though regarding the querying and parsing of user generated text. 1. I have understood through this page that I want to use dismax. Through thi

Re: sort=geodist() asc

2013-06-17 Thread Smiley, David W.
Bill, I added this comment: https://issues.apache.org/jira/browse/SOLR-2345?focusedCommentId=13685627&p age=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#commen t-13685627 On 6/17/13 1:50 AM, "William Bell" wrote: >This simple feature of "sort=geodist() asc" is very powerful s

How to get SolrJ-serialization / binary-size statistics ?

2013-06-17 Thread Ralf Heyde
Hi Everybody, The as is situation: We have an application (on Server 1), which fires many (up to 20) Solr-Queries (on Server 2) to produce the result. Since we have network latency for transport and serialization, we will shift the Query-Part to Server 2. The idea behind is, that the complete

Re: Slow Highlighter Performance Even Using FastVectorHighlighter

2013-06-17 Thread Floyd Wu
Hi Michael, How do I configure posthighlighter with my solr 4.2 box? Please kindly point me. Many thanks. 2013/6/15 下午10:48 於 "Michael McCandless" 寫道: > You could also try the new[ish] PostingsHighlighter: > > http://blog.mikemccandless.com/2012/12/a-new-lucene-highlighter-is-born.html > > Mike M

503 - server is shutting down error

2013-06-17 Thread gururaj kosuru
Hi, I am trying to run solr 4.3 on a standalone system using tomcat 6 and I am facing a ExceptionInInitializerError. I am attaching the trace of the log for details. I also get a log4j error(no such file or directory), but I think it is not related to this. The error logs says: [main] ER

Re: AW: Best way to match umlauts

2013-06-17 Thread Jack Krupansky
And this is a key advantage of using the mapping char filter rather than the simple ASCII folding token filter - you can easily go in and modify the mappings for application/domain/environment-specific character mappings such as these. -- Jack Krupansky -Original Message- From: André

Re: Solr large boolean filter

2013-06-17 Thread Jack Krupansky
That would have been one of my top guesses. Take a look at LucidWorks Search and how they have a built-in role-based document access control component. They call the feature "Search Filters": http://docs.lucidworks.com/display/help/Search+Filters+for+Access+Control -- Jack Krupansky -Ori

Re: Searching for cache stats

2013-06-17 Thread Stefan Matheis
JohnThe UI is using /solr/collection1/admin/mbeans?stats=true to get those values, does this help?- Stefan On Monday, June 17, 2013 at 2:32 PM, John Nielsen wrote: Hi,

Re: Solr Server Add causes java.net.SocketException: No buffer space available

2013-06-17 Thread Snubbel
I did try something else. I did add a list of SolrInputDocuments containing 500 Documents at a time. Now it works. Adding every Document seems to be too much after about 1 Documents (even with commits after every 500 Documents). But this is only in Solr 4.3, in 4.0 this was possible without a

Searching for cache stats

2013-06-17 Thread John Nielsen
Hi, I am looking for an automated way of getting cache stats from Solr. Specificly what I am looking for are the cumulative evictions for each cache type for each core: http://screencast.com/t/IrD0VItfVduk An example of how I would like to be able

Re: Solr Server Add causes java.net.SocketException: No buffer space available

2013-06-17 Thread Snubbel
Hello, I did set the autoCommit to 5 Minutes and removed all commit Statements but one, because, you see, my test case is as follows: I need a hugh number of documents in Solr. Then I want to update them with AtomicUpdate and, for comparison, the "classical" way, like we did before Solr 4.3. So,

RE: PostingsHighlighter and analysis

2013-06-17 Thread Markus Jelsma
Hi, Any intelligent suggestions for this issue? Thanks, Markus -Original message- > From:Trey Hyde > Sent: Mon 11-Mar-2013 21:44 > To: solr-user@lucene.apache.org > Subject: PostingsHighlighter and analysis > > debug=timing has told me for a very long time that 99% of my query time f

Re: Solr large boolean filter

2013-06-17 Thread Igor Kustov
> Where do the long list of IDs come from? I'm indexing database, so the id list is security access control list. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-large-boolean-filter-tp4070747p4070964.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: out of memory during indexing do to large incoming queue

2013-06-17 Thread Yoni Amir
Thanks Shawn, This was very helpful. Indeed I had some terminology problem regarding the segment merging. In any case, I tweaked those parameters that you recommended and it helped a lot. I was wondering about your recommendation to use facet.method=enum? Can you explain what is the trade-off h

Re: Solr Server Add causes java.net.SocketException: No buffer space available

2013-06-17 Thread Erick Erickson
Wild shot in the dark here, but try taking the solrServer.commit() out and rely on the autocommit parameters in solrconfig.xml. And configure autocommit to commit, say, every 5 minutes and do _not_ configure the "numDocs" bit for autocommit. If you do that and don't have this problem, we can talk

Re: sort=geodist() asc

2013-06-17 Thread Erick Erickson
Hmmm, could you simply store a single-valued point field to use for sorting etc? It seems like the problem here is partly the same as for multiValued fields in general, which one should be used? Best Erick On Mon, Jun 17, 2013 at 1:50 AM, William Bell wrote: > This simple feature of "sort=geodis

Re: Different scores for exact and non-exact matching

2013-06-17 Thread Upayavira
q="This is a sample string"^10 (This is a sample string)^5 fuzzy:(This is a sample string) You'd have to define the 'fuzzy' field as an EdgeNGram field, such that 'sample' gets indexed as: s sa sam samp sampl sample Obviously, that'll take more space in your index, but I believe it would g

Highlighting Response

2013-06-17 Thread Furkan KAMACI
Here is my highlight handler: dismax explicit 0.01 content^0.5 anchor^1.0 title^1.2 content^0.5 anchor^1.5 title^1.2 site^1.5 url 100 true *:* title url content 0 title 0 url regex Return value is as follows: + + + response has just url list of results. highlighting has title,

AW: Best way to match umlauts

2013-06-17 Thread André Widhani
We configure both baseletter conversion (removing accents and umlauts) and alternate spelling through the mapping file. For baseletter conversion and mostly german content we transform all accents that are not used in german language (like french é, è, ê etc.) to their baseletter. We do not do

  1   2   >