Facet with large number of unigram entries

2012-12-04 Thread Andreas Niekler
Dear List, I have an index with 2,000,000 articles. All those texts get tokenized while indexing. On this data I run a faceted query like this (to receive associated words): select?q=a_spell:{some word}&facet.method=enum&facet=true&facet.field=Paragraph&facet.limit=10&facet.prefix={some pre
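
For readers who want to reproduce the request above, here is a minimal sketch of assembling the same parameters programmatically. The parameter names (`q`, `facet`, `facet.method`, `facet.field`, `facet.limit`, `facet.prefix`) and the field names `a_spell` and `Paragraph` come from the post; the function name `build_facet_url`, the base URL, and the sample word and prefix are assumptions made only for illustration.

```python
from urllib.parse import urlencode, parse_qs, urlsplit


def build_facet_url(base_url, word, prefix, limit=10):
    """Build a select URL with the facet parameters quoted in the post.

    base_url, word and prefix are hypothetical inputs; only the
    parameter names and field names are taken from the message.
    """
    params = {
        "q": f"a_spell:{word}",
        "facet": "true",
        "facet.method": "enum",
        "facet.field": "Paragraph",
        "facet.limit": str(limit),
        "facet.prefix": prefix,
    }
    # urlencode percent-escapes reserved characters such as ':'.
    return f"{base_url}/select?{urlencode(params)}"


# Example with an assumed local Solr address.
url = build_facet_url("http://localhost:8983/solr", "house", "ho")
```

Building the query string with `urlencode` rather than by hand avoids escaping mistakes when the search word contains spaces or reserved characters.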

Re: Backing up SolR 4.0

2012-12-04 Thread Andy D'Arcy Jewell
On 03/12/12 18:04, Shawn Heisey wrote: Serious production Solr installs require at least two copies of your index. Failures *will* happen, and sometimes they'll be the kind of failures that will take down an entire machine. You can plan for some failures -- redundant power supply and RAID a

Re: Whole Phrase search in Solr

2012-12-04 Thread NickA
Hello Jack, You are the man! Indeed, this was the problem. We tried several combinations and we thought that we did that too, but somehow we failed to see that your proposal was working! Don't know why, maybe we had something else changed in parallel, don't know. So, THANK YOU, you have been a g

replication of files when index is stable/static (SOLR-1304?)

2012-12-04 Thread Fredrik Rødland
I have a static index with config files that change frequently. Until now I've distributed these files manually to all Solr hosts in my current setup, but I'm wondering if I can get Solr to do this using config replication. Searching Google, I've come across https://issues.apache.org/jira/brow

SQL DIH - Can I have some guidance please?

2012-12-04 Thread Spadez
Hi. I am having a bit of trouble figuring out the DIH for SQL files. I have asked around a few different places but haven't got any replies so I was hoping you could help me. *I have a database schema like this:* CREATE TABLE company ( id SERIAL PRIMARY KEY, name varchar(60) NOT NULL );

Re: How to change Solr UI

2012-12-04 Thread Erik Hatcher
It's a shame wt=velocity gets a bad rap because /update isn't out of the box strict with the HTTP/RESTful scene. A delete should be a DELETE of some sort. There are 3rd party standalone apps. There was even a standalone ruby app (flare) that was once upon a time in Solr's svn, but really the S

Re: SOLR4 cluster - strange CPU spike on slave

2012-12-04 Thread John Nielsen
Success! I tried adding -XX:+UseConcMarkSweepGC to java to make it GC earlier. We haven't seen any spikes since. I'm cautiously optimistic though and will be monitoring the servers for a week or so before declaring final victory. The post about mmapdirectory is really interesting. We switched to

Two databases merge into SOLR - How to keep unique ID?

2012-12-04 Thread Spadez
I have two databases (unfortunately they do have to be separate) which get imported into Solr. Each database has a primary key for each item but I am concerned that when it comes to importing the two into SOLR there will be more than one item with the same ID (one from each DB). Therefore, in ord
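
One common way to avoid the collision described above is to prefix each primary key with a label for the database it came from before using it as the Solr unique key. The sketch below illustrates that idea only; it is not necessarily what the thread's replies recommend, and the function name `make_solr_id` and the source labels `db1` and `db2` are hypothetical.

```python
def make_solr_id(source, primary_key):
    """Combine a source label with a database primary key.

    'source' is a hypothetical label chosen per database; the
    resulting string would be used as the document's unique key.
    """
    return f"{source}-{primary_key}"


# The same primary key from two databases yields two distinct IDs.
id_a = make_solr_id("db1", 42)
id_b = make_solr_id("db2", 42)
```

The same concatenation could equally be done in the SQL query that feeds the import, as long as the label is stable for each database.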

Re: SQL DIH - Can I have some guidance please?

2012-12-04 Thread Gora Mohanty
On 04/12/2012, Spadez wrote: > Hi. > > I am having a bit of trouble figuring out the DIH for SQL files. I have > asked around a few different places but haven't got any replies so I was > hoping you could help me. > > *I have a database schema like this:* > > CREATE TABLE company ( > id SERIAL

Re: Two databases merge into SOLR - How to keep unique ID?

2012-12-04 Thread Gora Mohanty
On 04/12/2012, Spadez wrote: > I have two databases (unfortunately they do have to be separate) which get > imported into Solr. > > Each database has a primary key for each item but I am concerned that when > it comes to importing the two into SOLR there will be more than one item > with the same

Sorting by multi-valued field

2012-12-04 Thread Thomas Heigl
Hey all! In our system users can create recurring events and search for events starting on or after a given date. Searching and filtering of events works perfectly, but users expect the result set to be ordered by the next start time. For each event, we index a multi-valued date field containing

Cannot run Solr4 from Intellij Idea

2012-12-04 Thread Artyom
After 2 days I have figured out how to open Solr 4 in IntelliJ IDEA 11.1.4 on Tomcat 7. IntelliJ IDEA finds webapp/web/WEB-INF/web.xml and offers to make a facet from it and adds this facet to the "parent" module, from which an artifact can be created. The problem is that Solr cannot run properly.

Re: SQL DIH - Can I have some guidance please?

2012-12-04 Thread Spadez
Thank you so much for your help. Based on the same schema in my first post and your help I created this, have I implemented it correctly based on your suggestion? I tried to comment it:

How to SWAP cores (or collections) with SolrCloud (SOLR-3866)

2012-12-04 Thread Andre Bois-Crettez
Hello, With solr-4.0.0, the useful SWAP command, which lets a main core serve searches while a temp core is re-indexed from scratch, no longer works on SolrCloud, as was discussed here: Solr Swap Function doesn't work when using Solr Clo

Re: How to change Solr UI

2012-12-04 Thread Jack Krupansky
"let's also be clear always that Solr is meant to be behind the firewall" Absolutely, but we are NOT doing that when we provide the Velocity-based /browse UI. Erik, your email example sounds reasonable, so if you want to substitute something like that for the /browse handler, fine. As you poi

Re: SQL DIH - Can I have some guidance please?

2012-12-04 Thread Gora Mohanty
On 04/12/2012, Spadez wrote: > Thank you so much for your help. Based on the same schema in my first post > and your help I created this, have I implemented it correctly based on your > suggestion? I tried to comment it: Looks almost correct. You only need two levels of nesting, and can use prope

Re: SQL DIH - Can I have some guidance please?

2012-12-04 Thread Spadez
Thank you so much for the help, I really appreciate it.

Re: How to change Solr UI

2012-12-04 Thread Erik Hatcher
On Dec 4, 2012, at 08:21 , Jack Krupansky wrote: > "let's also be clear always that Solr is meant to be behind the firewall" > > Absolutely, but we are NOT doing that when we provide the Velocity-based > /browse UI. > Erik, your email example sounds reasonable, so if you want to substitute > s

Re: How to change Solr UI

2012-12-04 Thread Upayavira
I have been mulling on this. The browse UI is getting a little out of date, and has interesting 'features' such as only showing a map for a document if the document has a 'name' field, which makes no real sense at all. Apart from renovating the UI of browse, or possibly replacing it with something

Re: Range Queries performing differently on SortableIntField vs TrieField of type integer

2012-12-04 Thread Upayavira
One small question - did you re-index in-between? The index structure will be different for each. Upayavira On Tue, Dec 4, 2012, at 02:30 PM, Aaron Daubman wrote: > Greetings, > > I'm finally updating an old instance and in testing, discovered that > using > the recommended TrieField instead of

Re: How to change Solr UI

2012-12-04 Thread Jack Krupansky
Or, maybe integrate /browse with the Solr Admin UI and give it a graphic treatment that screams that it is a development tool and not designed to be a model for an app UI. And, I still think it would be good to include SOME example of a prototype app UI with Solr, to drive home the point of "h

Re: Problems while Searching Plural Form of Verb

2012-12-04 Thread Jack Krupansky
Use a stemmer, such as the English Plural-only stemmer, EnglishMinimalStemFilterFactory. See: http://lucene.apache.org/core/4_0_0/analyzers-common/org/apache/lucene/analysis/en/EnglishMinimalStemFilterFactory.html and http://lucene.apache.org/core/4_0_0/analyzers-common/org/apache/lucene/analysis

[Solrj 4.0] How use JOIN

2012-12-04 Thread Roman Slavík
Hi, I can't find any good example of how to use the Join function with the SolrJ 4.0 API. Let's have this example data: 1 Thomas 40 2 John 17 1 And code: String stringQuery = "(name:Thomas) AND (age:40)"; SolrQuery query = new SolrQuery(); query.setQuery

Re: Range Queries performing differently on SortableIntField vs TrieField of type integer

2012-12-04 Thread Aaron Daubman
Hi Upayavira, One small question - did you re-index in-between? The index structure > will be different for each. > Yes, the Solr 1.4.1 (working) instance was built using the original schema and that solr version. The Solr 3.6.1 (not working) instance was re-built using the new schema and Solr 3.

Re: Range Queries performing differently on SortableIntField vs TrieField of type integer

2012-12-04 Thread Aaron Daubman
I forgot a possibly important piece... Given the different Solr versions, the schema version (and its related different defaults) is also a change: Solr 1.4.1 Has: Solr 3.6.1 Has: > Solr 1.4.1 Relevant Schema Parts - Working as desired: > > >

Re: Cannot run Solr4 from Intellij Idea

2012-12-04 Thread Aaron Daubman
Interestingly, I have run into this same (or very similar) issue when attempting to run embedded Solr. All of the solr.* classes that were recently moved to Lucene would not work with the solr.* shorthand - I had to replace them with the full classpath. As you found, these shorthands in the same s

Re: Luke and SOLR search giving different results

2012-12-04 Thread Erol Akarsu
Thanks Shawn and Jack, I changed solrconfig to set the default query field (qf) to the field content. It works fine now. Erol Akarsu On Mon, Dec 3, 2012 at 5:03 PM, Shawn Heisey wrote: > On 12/3/2012 1:44 PM, Erol Akarsu wrote: > >> I tried as search query not "baş" but "features:baş" in field "q" in

Re: Backing up SolR 4.0

2012-12-04 Thread Shawn Heisey
On 12/4/2012 1:55 AM, Andy D'Arcy Jewell wrote: Is there an easy way to tell (say from a shell script) when "all commits and merges [are] complete"? One important bit of information I just thought of: A default Solr 4 config uses a new directory implementation called NRTCachingDirectory, whic

Re: How to change Solr UI

2012-12-04 Thread Upayavira
That's an interesting take. I agree that Solr needs *something* for folks to use. It is unfortunate that Solr actually has a functioning HTTP infrastructure, because it then makes less sense to build an alternative one up. E.g. How about: http://localhost:8983/solr <- admin UI http://localhost:

Re: How to change Solr UI

2012-12-04 Thread Erik Hatcher
And basically that's what I had in mind with Prism here: Prism's very lightweight, uses Velocity (or not, any Ruby templating technology available), and is entirely separate from Solr. Before that there was Flare: https://github.com/erikhatcher/solr-r

Re: SolrCloud : impossible to create a new collection

2012-12-04 Thread Mark Miller
On Dec 4, 2012, at 5:57 AM, LEFEBVRE Guillaume wrote: > Hello, > > I have a SolrCloud environment with 2 collections running perfectly. > > I would like to create a new collection using : > http://localhost:8080/solr/admin/collections?action=CREATE&name=mycollection&numShards=1&numReplicas=

Re: Range Queries performing differently on SortableIntField vs TrieField of type integer

2012-12-04 Thread Chris Hostetter
: q=*:*&fq=+i_yearStartSort:{* TO 1995}&fq=+i_yearStopSort:{* TO *} ... : Unfortunately, under 3.6.1 with class="solr.TrieField" type="integer", this : query is returning docs that have neither an i_yearStopSort nor a : i_yearStartSort value. H... I can't seem to reproduce this. Here

Re: Range Queries performing differently on SortableIntField vs TrieField of type integer

2012-12-04 Thread Jack Krupansky
Could you show us some input data, both WITH an i_yearStopSort value and WITHOUT the value? I tried a quick test using the stock Solr 3.6.1 example schema and a dynamic integer field and the filter query did in fact filter out all documents that did not have a value in that field: http://
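
The behaviour this thread expects from the two filter queries can be modelled in a few lines of plain Python. This is only a sketch of the intended semantics (curly braces denote exclusive bounds, and a document with no value in a field should not match a range on that field); it is not Solr code. The field names `i_yearStartSort` and `i_yearStopSort` come from the thread, while the sample documents and function names are made up for illustration.

```python
def matches_exclusive_upper(doc, field, upper):
    """Model fq=field:{* TO upper}: open lower bound, exclusive upper bound."""
    value = doc.get(field)
    # A document without a value in the field cannot match the range.
    return value is not None and value < upper


def matches_any_value(doc, field):
    """Model fq=field:{* TO *}: matches only documents that have a value."""
    return doc.get(field) is not None


def apply_filters(docs):
    """Apply both filters from the thread; both must hold for a match."""
    return [
        d for d in docs
        if matches_exclusive_upper(d, "i_yearStartSort", 1995)
        and matches_any_value(d, "i_yearStopSort")
    ]


# Hypothetical sample documents.
sample_docs = [
    {"id": "a", "i_yearStartSort": 1990, "i_yearStopSort": 2000},
    {"id": "b", "i_yearStartSort": 1995, "i_yearStopSort": 2001},
    {"id": "c", "i_yearStartSort": 1980},
    {"id": "d"},
]
matching_ids = [d["id"] for d in apply_filters(sample_docs)]
```

Under this model only document `a` matches: `b` fails the exclusive upper bound, and `c` and `d` lack one or both fields.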

SolrCell takes InputStream

2012-12-04 Thread Bing Hua
Hi, While using ContentStreamUpdateRequest up = new ContentStreamUpdateRequest("/update/extract"); The two ways of adding a file are up.addFile(File) up.addContentStream(ContentStream) However my raw files are stored on some remote storage devices. I am able to get an InputStream object for the

Creating a collection without bootstrap

2012-12-04 Thread Walter Underwood
I seem to be missing a step or some kind of ordering in creating a new collection without using bootstrap upload. I have these steps: * zookeeper upconfig (pretty sure this is first) * Collection API create collection * zookeeper linkconfig I'm working from this page: http://wiki.apache.org/solr

Re: Loading DictionaryCompoundWordTokenFilterFactory as shared object across all cores

2012-12-04 Thread Chris Hostetter
: Do we have any ways where we can load : DictionaryCompoundWordTokenFilterFactory only once and shared across all : the cores?. I don't think so, but there are tricks you can use in a custom plugin variant depending on your use cases, as well as a really easy solution if the schema's for all of

Re: Sorting by multi-valued field

2012-12-04 Thread Chris Hostetter
: perfectly, but users expect the result set to be ordered by the next start : time. ... : Is there a more elegant way to do this in Solr? A function query or : subquery maybe? I thought about it for quite a while and couldn't come up : with a viable solution. I think you could concievabl

Re: Sorting by multi-valued field

2012-12-04 Thread Upayavira
But it would be a lot harder than either splitting them out into separate docs, or writing code to re-index docs when one of their 'next-event' dates passes, with a new single valued 'next-event' field. Less efficient, but easier to write/manage. Upayavira On Tue, Dec 4, 2012, at 07:35 PM, Chris
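
The single-valued 'next-event' approach suggested above could be computed on the indexing side roughly as follows. This is a minimal sketch under assumptions: the function name `next_event` and the sample dates are hypothetical, and "next" is taken to mean the earliest start time on or after the reference time, matching the "on or after a given date" wording of the original question.

```python
from datetime import datetime


def next_event(start_times, now):
    """Return the earliest start time on or after 'now', or None.

    The result would be stored in a single-valued field and the
    document re-indexed once that date has passed.
    """
    upcoming = [t for t in start_times if t >= now]
    return min(upcoming) if upcoming else None


# Hypothetical recurring event with three occurrences.
occurrences = [
    datetime(2012, 12, 1, 9, 0),
    datetime(2012, 12, 8, 9, 0),
    datetime(2012, 12, 15, 9, 0),
]
upcoming_start = next_event(occurrences, datetime(2012, 12, 4, 0, 0))
```

Sorting on such a single-valued field is straightforward, at the cost of re-indexing each document whenever its stored date falls into the past.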

Re: Sorting by multi-valued field

2012-12-04 Thread Chris Hostetter
: But it would be a lot harder than either splitting them out into : separate docs, or writing code to re-index docs when one of their : 'next-event' dates passes, with a new single valued 'next-event' field. : Less efficient, but easier to write/manage. Don't get me wrong -- if you can determine

Re: How to change Solr UI

2012-12-04 Thread Upayavira
But there's value in having something packaged within Solr itself, for demo purposes. That would, I suspect, make it Java (like it or not!). And that would probably not make it very state-of-the-art, unless it used jQuery with a very lightweight Java portion, which would be possible. Upayavira On

Re: Creating a collection without bootstrap

2012-12-04 Thread Walter Underwood
Here is one problem. On the SolrCloud wiki page, it says "link collection sets to collections", but I'm pretty sure that should read "config set". Also "config set" (or "conf set") is never defined. wunder On Dec 4, 2012, at 11:07 AM, Walter Underwood wrote: > I seem to be missing a step or so

Solr 4 : Optimize very slow

2012-12-04 Thread Sandeep Mestry
Hi All, I have recently migrated from solr 1.4 to solr 4 and have done the basic changes required for solr 4 in solrconfig.xml and schema.xml. I have also rebuilt the index set for solr 4. We run optimize every morning at 4 am and we keep the index updates off during this process. Previously, with

Re: Replication error and Shard Inconsistencies..

2012-12-04 Thread Mark Miller
Hey Annette, Are you using Solr 4.0 final? A version of 4x or 5x? Do you have the logs for when the replica tried to catch up to the leader? Stopping and starting the node is actually a fine thing to do. Perhaps you can try it again and capture the logs. If a node is not listed as live but is

Re: SOLR4 cluster - strange CPU spike on slave

2012-12-04 Thread Mark Miller
On Dec 4, 2012, at 2:25 AM, John Nielsen wrote: > The post about mmapdirectory is really interesting. We switched to using > that from NRTCachingDirectory and am monitoring performance as well. > Initially performance doesn't look stellar, but i suspect that we lack > memory in the server to rea

Re: Solr 4 : Optimize very slow

2012-12-04 Thread Otis Gospodnetic
Hi, You should search the ML archives for : optimize wunder Erick Otis :) Is WAS really AWS? If so, if these are new EC2 instances you are unfortunately unable to do a fair apples to apples comparison. Have you tried a different set of instances? Otis -- Performance Monitoring - http://sematext.

Getting deleted documents during DIH full-import

2012-12-04 Thread Shawn Heisey
I am doing a DIH full import on a very recent checkout from branch_4x. Something I've recently done differently is enabling autocommit. I am seeing that there are deleted documents in some of the indexes. See "Development Build Indexes" at the bottom of the following screenshot. When the im

Re: Getting deleted documents during DIH full-import

2012-12-04 Thread Shawn Heisey
On 12/4/2012 5:33 PM, Shawn Heisey wrote: I am doing a DIH full import on a very recent checkout from branch_4x. Something I've recently done differently is enabling autocommit. I am seeing that there are deleted documents in some of the indexes. See "Development Build Indexes" at the bottom

Re: How to SWAP cores (or collections) with SolrCloud (SOLR-3866)

2012-12-04 Thread Mark Miller
On Dec 4, 2012, at 4:57 AM, Andre Bois-Crettez wrote: > * what can we do to help progress on SOLR-3866 ? Maybe use case > scenarios, detailing desired behavior ? Constrains on what cores or > collections are allowed to SWAP, ie. same config, same doc->shard > assignments ? Yes please - if you

SolrCloud stops handling collection CREATE/DELETE (but responds HTTP 200)

2012-12-04 Thread Brett Hoerner
Hi, I have a Cloud setup of 4 machines. I bootstrapped them with 1 collection, which I called "default" and haven't used since. I'm using an external ZK ensemble that was completely empty before I started this cloud. Once I had all 4 nodes in the cloud I used the collection API to create the real

RE: Solr 4 : Optimize very slow

2012-12-04 Thread Michael Ryan
When I upgraded from 3.2 to 3.6, I found that an optimize - all other variables being the same - took about twice as long. Eventually I was able to track this down to the new default of MMapDirectory. By changing back to NIOFSDirectory, I was able to get the optimize time back down to what it fo

Re: Loading DictionaryCompoundWordTokenFilterFactory as shared object across all cores

2012-12-04 Thread geetha anjali
We are using the same schema. We did try using shareSchema=true in solr.xml; at indexing time it works fine and loads a single time. But at query time, it loads multiple times, at core level. On Wed, Dec 5, 2012 at 1:00 AM, Chris Hostetter wrote: > : Do we have any ways where we can load

Re: Solr 4 : Optimize very slow

2012-12-04 Thread Upayavira
I tried that search, without success :-( I suspect what Otis was trying to say was to question why you are optimising. Optimise was necessary under 1.4, but with newer Solr, the new TieredMergePolicy does a much better job of handling background merging, reducing the need for optimize. Try just no

Maximum number of cores

2012-12-04 Thread S_Chawla
Hi, I am using Solr 4.0 and have created 10 cores. I want to know the maximum number of cores that can be created in Solr.

Re: Solr 4 : Optimize very slow

2012-12-04 Thread Walter Underwood
It was not necessary under 1.4. It has never been necessary. It was not necessary in Ultraseek Server in 1996, using the same merging model. In some cases, it can be a good idea. Since you are continuously updating, this is not one of those cases. wunder On Dec 4, 2012, at 9:29 PM, Upayavira w

how to assign dedicated server for indexing and add more shard in SolrCloud

2012-12-04 Thread Jason
I'm using master and slave servers for scaling. The master is dedicated to indexing and the slave to searching. Now I'm planning to move to SolrCloud. It has a leader and replicas. The leader acts like the master and the replicas act like slaves. Is that right? So, I'm wondering two things. First, how can I assign dedi

Adding filter in solr suggester component.

2012-12-04 Thread sagarzond
Hi, We are using the Solr (version 3.6) suggester component for autocomplete. We indexed the Solr core column (which we want as the autocomplete result) and it is giving me correct autocomplete results. Now I want to add a filter on the suggester's indexed data.