Re: Any filter to map mutiple tokens into one ?

2012-10-12 Thread Konrad Lötzsch
You can build shingles and then use the synonym filter. In this case you will have to think about all the tokens that you don't need after the shingle filter. On 12.10.2012 01:35, T. Kuro Kurosaka wrote: I am looking for a way to fold a particular sequence of tokens into one token. Concret
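
A minimal sketch of the shingle-then-synonym idea from the reply above, written in plain Python rather than with the Lucene/Solr filter classes. The function names, the sample tokens, and the mapping "new york" -> "new_york" are assumptions made for illustration only; they are not from the thread.

```python
def shingles(tokens, max_size=2):
    """Return unigrams plus word n-grams (up to max_size), grouped by start position."""
    out = []
    for i in range(len(tokens)):
        for n in range(1, max_size + 1):
            if i + n <= len(tokens):
                out.append(" ".join(tokens[i:i + n]))
    return out


def shingle_then_map(tokens, synonyms, max_size=2):
    """Apply a synonym map to every shingle; unmapped shingles pass through unchanged."""
    return [synonyms.get(s, s) for s in shingles(tokens, max_size)]


# Hypothetical synonym map: fold the two-token sequence into one token.
SYNONYMS = {"new york": "new_york"}

result = shingle_then_map(["visit", "new", "york"], SYNONYMS)
print(result)
```

The output still contains the shingles that matched no synonym ("visit new") and the original single tokens ("new", "york"). Those are the leftover tokens the reply says you will have to think about after the shingle filter.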

[ANNOUNCE] Apache Solr 4.0 released.

2012-10-12 Thread Robert Muir
October 12 2012, Apache Solr™ 4.0 available. The Lucene PMC is pleased to announce the release of Apache Solr 4.0. Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted se

find a way to solr netbeans

2012-10-12 Thread Iwan Hanjoyo
Hi list, Does anyone know how to integrate Solr with NetBeans? The reasons I want to have Solr in NetBeans: + to avoid the long classpath configuration in the environment variables + avoid complicated steps (especially when starting and restarting the glassfish server), + help with debugging t

Re: Reloading ExternalFileField blocks Solr

2012-10-12 Thread Mikhail Khludnev
Martin, I found a slide deck quite relevant to what you are asking about. http://www.slideshare.net/lucenerevolution/potter-timothy-boosting-documents-in-solr On Tue, Oct 9, 2012 at 7:57 AM, Otis Gospodnetic wrote: > Hi Martin, > > Perhaps you could make a small change in Solr to add "don't reload E

Re: add shard to index

2012-10-12 Thread Radim Kolar
On 11.10.2012 1:12, Upayavira wrote: That is what is being discussed already. The thing is, at present, Solr requires an even distribution of documents across shards, so you can't just add another shard, assign it to a hash range, and be done with it. You can use shard size as part of scori

Re: SolrJ, optimize, maxSegments

2012-10-12 Thread Erick Erickson
Hmmm, I dug around in the code and found this bit: * Forces merging of all segments that have deleted * documents. The actual merges to be executed are * determined by the {@link MergePolicy}. For example, * the default {@link TieredMergePolicy} will only * pick a segment if t

Re: Custom html headers/footers to solr admin console

2012-10-12 Thread Erick Erickson
Well, I'm certainly not all that up on how that all works, I was mostly trying to make sure you really needed to, and you do. But this capability, though rarely requested, seems harmless, so if you wanted to create a patch that allows this but doesn't put anything in the header/footer (or maybe

performance of group.ngroups=true

2012-10-12 Thread Rikke Willer
Hi, I was wondering if there are any plans to work on this issue: https://issues.apache.org/jira/browse/SOLR-2963 ? And possibly any thoughts on how difficult it will be to resolve? Thanks, Rikke

Re: Can I rely on correct handling of interrupted status of threads?

2012-10-12 Thread Robert Krüger
On Tue, Oct 2, 2012 at 11:48 AM, Robert Krüger wrote: > Hi, > > I'm using Solr 3.6.1 in an application embedded directly, i.e. via > EmbeddedSolrServer, not over an HTTP connection, which works > perfectly. Our application uses Thread.interrupt() for canceling > long-running tasks (e.g. through Fu

Re: add shard to index

2012-10-12 Thread Otis Gospodnetic
Hi, Can you share more please? Have you tried this? How well did it work for you? Thanks, Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html On Fri, Oct 12, 2012 at 7:17 AM, Radim Kolar wrote: > Dne 11.10.201

Re: Search in specific website

2012-10-12 Thread Otis Gospodnetic
Hi Tolga, You'll get more help on the Nutch mailing list. I don't know the schema Nutch uses for Solr off the top of my head, so I can't tell you if maybe it uses "site" for a field or "host" or "url" or "domain" or ... Otis -- Search Analytics - http://sematext.com/search-analytics/index.html P

Re: [ANNOUNCE] Apache Solr 4.0 released.

2012-10-12 Thread Péter Király
I would like to thank you all who participated in this! Thank you very much! Péter 2012/10/12 Robert Muir : > October 12 2012, Apache Solr™ 4.0 available. > The Lucene PMC is pleased to announce the release of Apache Solr 4.0. > > Solr is the popular, blazing fast, open source NoSQL search platf

Re: Any filter to map mutiple tokens into one ?

2012-10-12 Thread T. Kuro Kurosaka
On 10/11/12 4:47 PM, Jack Krupansky wrote: The ":" which normally separates a field name from a term (or quoted string or parenthesized sub-query) is "parsed" by the query parser before analysis gets called, and "*:*" is recognized before analysis as well. So, any attempt to recreate "*:*" in a

Re: PointType doc reindex issue

2012-10-12 Thread Ravi Solr
Thank you very much Hoss, I knew I was doing something stupid. I will change the dynamic fields to stored="false" and check it out. Thanks Ravi Kiran Bhaskar On Wed, Oct 10, 2012 at 3:02 PM, Chris Hostetter wrote: > : I have a weird problem, Whenever I read the doc from solr and > : the

Re: Any filter to map mutiple tokens into one ?

2012-10-12 Thread Jack Krupansky
Okay, let's back up. First, hold off mixing in your proposed solution until after we understand the actual, original problem: 1. What is your field and field type (with analyzer details)? 2. What is your query parser (defType)? 3. What is your query request URL? 4. What is the parsed query (add

Re: Shutting down Solr in Cygwin on Wndows

2012-10-12 Thread Jack Krupansky
That’s "used to see several". -- Jack Krupansky -Original Message- From: Jack Krupansky Sent: Friday, October 12, 2012 12:54 PM To: solr-user@lucene.apache.org Subject: Shutting down Solr in Cygwin on Wndows I used to several several “shutdown” messages when I hit ^C while running So

SolrCloud with PHP

2012-10-12 Thread Shaddy Zeineddine
Hello, I have some questions about the SolrCloud. Can I take full advantage of the Cloud with the PECL Solr client? It was last updated for Solr 3.1 http://pecl.php.net/package/solr Is Jetty the recommended servlet for the Cloud? The documentation about configuring, optimizing, and accessing

Re: SolrCloud with PHP

2012-10-12 Thread Mark Miller
On 10/12/2012 01:42 PM, Shaddy Zeineddine wrote: Hello, I have some questions about the SolrCloud. Can I take full advtange of the Cloud with the PECL Solr client? It was last updated for Solr 3.1 http://pecl.php.net/package/solr I don't know for sure, I don't know that client. If it's HTTP

Re: SolrCloud with PHP

2012-10-12 Thread Mark Miller
On 10/12/2012 01:42 PM, Shaddy Zeineddine wrote: > Hello, > > I have some questions about the SolrCloud. > > Can I take full advtange of the Cloud with the PECL Solr client? It was last > updated for Solr 3.1 http://pecl.php.net/package/solr I don't know for sure, I don't know that client. If it

Solr Cloud and Hadoop

2012-10-12 Thread Rui Vaz
Hello, Solr Cloud and Hadoop are new to me. And I am figuring out an architecture to do a distributed indexing/searching system in a cluster. Integrating them is an option. I would like to know if Hadoop + Solr is still a good option to build a big index in a cluster, using HDFS and MapReduce

Re: multi-core sharing synonym map

2012-10-12 Thread simon
I definitely haven't tried this ;=) but perhaps you could create your own XXXSynonymFilterFactory as a subclass of SynonymFilterFactory, which would allow you to share the synonym map across all cores - though I think there would need to be a nasty global variable to hold a reference to it... -S

Re: Solr Cloud and Hadoop

2012-10-12 Thread Otis Gospodnetic
Hello Rui, If your data to be indexed is in HDFS, using MapReduce to parallelize indexing is still a good idea. Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html On Fri, Oct 12, 2012 at 2:35 PM, Rui Vaz wrote:

Re: Any filter to map mutiple tokens into one ?

2012-10-12 Thread T. Kuro Kurosaka
Jack, It goes like this: http://myhost:8983/solr/select?indent=on&version=2.2&q=*%3A*&fq=&start=0&rows=10&fl=*%2Cscore&qt=&wt=&debugQuery=on and edismax is the default query parser in solrconfig.xml. There is a field named text_jpn that uses a Tokenizer that we developed as a product, which we
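
For readers following the thread, the request URL quoted above can be decoded into its individual parameters. This is a small sketch using only the Python standard library; the helper name `solr_params` is an assumption for illustration, and the URL is copied from the message.

```python
from urllib.parse import urlsplit, parse_qs

# Request URL as quoted in the message above.
URL = ("http://myhost:8983/solr/select?indent=on&version=2.2&q=*%3A*&fq="
       "&start=0&rows=10&fl=*%2Cscore&qt=&wt=&debugQuery=on")


def solr_params(url):
    """Return the query parameters as a dict, keeping parameters with empty values."""
    query = urlsplit(url).query
    parsed = parse_qs(query, keep_blank_values=True)
    # Each parameter appears once in this URL, so take the first value.
    return {name: values[0] for name, values in parsed.items()}


params = solr_params(URL)
for name, value in params.items():
    print(f"{name} = {value!r}")
```

Decoded, `q` is the match-all query `*:*`, `fl` is `*,score`, and `fq`, `qt`, and `wt` are sent empty.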

RE: multi-core sharing synonym map

2012-10-12 Thread Phil Hoy
Yes I was thinking the same thing, although I was hoping there was a more elegant mechanism exposed by the solr infrastructure code to handle the shared map, aside from just using a global that is. Phil -Original Message- From: simon [mailto:mtnes...@gmail.com] Sent: 12 October 2012 1

Re: SolrCloud with PHP

2012-10-12 Thread Shaddy Zeineddine
What I'd like to see added to the SolrCloud wiki page: - The wiki page states that you can send your request to any server, but what if that server goes down? Doesn't there need to be an aliased IP address pointing to an active server? Or, is there client side support like MongoDB replica set

Re: SolrJ, optimize, maxSegments

2012-10-12 Thread Shawn Heisey
On 10/12/2012 6:04 AM, Erick Erickson wrote: Hmmm, I dug around in the code and found this bit: * Forces merging of all segments that have deleted * documents. The actual merges to be executed are * determined by the {@link MergePolicy}. For example, * the default {@link Tie

Re: Solr Cloud and Hadoop

2012-10-12 Thread Timothy Potter
Hi Rui, If you're going to shard and/or replicate your index, then be sure to take a look at CloudSolrServer in the SolrJ client library. CloudSolrServer is an extension to SolrServer that works with Zookeeper to understand the shards and replicas in a Solr cluster. Using CloudSolrServer, there is

Re: SolrCloud with PHP

2012-10-12 Thread Mark Miller
bq. but somehow else might no better. * But someone else might know better* - brain is a bit scrambled today. I'll try and address your questions on the wiki. - Mark On 10/12/2012 03:32 PM, Shaddy Zeineddine wrote: What I'd like to see added to the SolrCloud wiki page: - The wiki page states

Re: Solr Cloud and Hadoop

2012-10-12 Thread Jack Krupansky
You may also want to take a look at the DataStax Enterprise product which combines Cassandra, Solr, and Hadoop. See: http://www.datastax.com/products/enterprise -- Jack Krupansky -Original Message- From: Rui Vaz Sent: Friday, October 12, 2012 2:35 PM To: solr-user@lucene.apache.org Subj

How to import a part of index from main Solr server(based on a query) to another Solr server and then do incremental import at intervals later(the updated index)?

2012-10-12 Thread jefferyyuan
I have a main Solr server (solr1) which stores indexes of all docs, and want to implement the following function: 1. First make a full import of my docs updated/created recently (last 1 or 2 weeks) from solr1. 2. Make delta imports at intervals to copy the changes to my docs from solr1 to solr2. - doc ma

Re: Any filter to map mutiple tokens into one ?

2012-10-12 Thread Jack Krupansky
I don't have a Solr 3.5 to check, but SOLR-3261, which was fixed in Solr 3.6, may be your culprit. See: https://issues.apache.org/jira/browse/SOLR-3261 So, try Solr 3.6 or 3.6.1 or 4.0 to see if your issue goes away. -- Jack Krupansky -Original Message- From: T. Kuro Kurosaka Sent: F

RE: anyone have any clues about this exception

2012-10-12 Thread Petersen, Robert
Hi Erick, After reading the discussion you guys were having about renaming optimize to forceMerge I realized I was guilty of over-optimizing like you guys were worried about! We have about 15 million docs indexed now and we spin about 50-300 adds per second 24/7, most of them being updates to

Re: Solr Cloud and Hadoop

2012-10-12 Thread Rui Vaz
Thank you very much for your replies, Yes Otis one possibility is to copy my data to HDFS and then apply a Map function to create the intermediate indexes across the cluster using SOLR java library in HDFS. I have some doubts concerning this solution: 1 - The int

Re: multi-core sharing synonym map

2012-10-12 Thread Erick Erickson
There are a lot of sub-tasks having to do with lots of cores here: http://wiki.apache.org/solr/LotsOfCores I don't see a reference to this particular issue, but it sure seems like this could be a reasonable thing to add to the list. By extension, all of the files that can be specified (stopwords, q

Re: SolrJ, optimize, maxSegments

2012-10-12 Thread Erick Erickson
Sounds reasonable although I admit I haven't looked deeply. Erick On Fri, Oct 12, 2012 at 3:41 PM, Shawn Heisey wrote: > On 10/12/2012 6:04 AM, Erick Erickson wrote: >> >> Hmmm, I dug around in the code and found this bit: >> * Forces merging of all segments that have deleted >> *

Re: How to import a part of index from main Solr server(based on a query) to another Solr server and then do incremental import at intervals later(the updated index)?

2012-10-12 Thread Erick Erickson
Hmmm, not quite what you asked, but would it work to just replicate from Solr1 to Solr2 when you want to synch? You can trigger this via http, see: http://wiki.apache.org/solr/SolrReplication#HTTP_API If you're talking about individual documents, then the answer is no. There's nothing as far as I
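
A sketch of how the replication trigger mentioned above could be addressed over HTTP. It only builds the URL and performs no network call. The host names `solr1` and `solr2`, the port, and the helper name are assumptions for illustration; `command=fetchindex` and the optional `masterUrl` parameter are taken from the SolrReplication wiki page linked in the message, so check that page for your Solr version before relying on them.

```python
from urllib.parse import urlencode


def fetchindex_url(slave_base, master_url=None):
    """Build the URL that asks a slave core to pull the index from its master."""
    params = {"command": "fetchindex"}
    if master_url:
        # Optional override of the master configured in solrconfig.xml.
        params["masterUrl"] = master_url
    return f"{slave_base.rstrip('/')}/replication?{urlencode(params)}"


# Hypothetical hosts: solr2 pulls from solr1.
plain = fetchindex_url("http://solr2:8983/solr")
explicit = fetchindex_url("http://solr2:8983/solr/",
                          "http://solr1:8983/solr/replication")
print(plain)
print(explicit)
```

Requesting such a URL from a scheduler at the desired interval would be one way to sync on demand. As the reply notes, this replicates the whole index rather than a subset of documents selected by a query.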

Re: Using

2012-10-12 Thread Erick Erickson
I've been building 4.x regularly. Have you tried "ant clean-jars"? Best Erick On Fri, Oct 12, 2012 at 6:32 PM, P Williams wrote: > Hi, > > Has anyone tried using name="solr-test-framework" rev="4.0.0" conf="test->default"/> with Apache > IVY in their project? > > rev 3.6.1 works but any of the

Re: anyone have any clues about this exception

2012-10-12 Thread Erick Erickson
Right. If I've multiplied right, you're essentially replacing your entire index every day given the rate you're adding documents. Have a look at MergePolicy, here are a couple of references: http://juanggrande.wordpress.com/2011/02/07/merge-policy-internals/ https://lucene.apache.org/core/old_vers

Re: anyone have any clues about this exception

2012-10-12 Thread Walter Underwood
If you are updating all the time, don't forceMerge at all, unless you want to put the overhead of big merges at a known time. Otherwise, leave it alone. wunder On Oct 12, 2012, at 3:56 PM, Erick Erickson wrote: > Right. If I've multiplied right, you're essentially replacing your entire > index

Re: which api to use to manage solr ?

2012-10-12 Thread Otis Gospodnetic
Good evening, SolrJ lives in the same house as Solr itself, so... Otis -- Performance Monitoring - http://sematext.com/spm On Oct 12, 2012 5:39 PM, "autregalaxie" wrote: > Good morning everybody, > > I'm a new user of Solr, i have to develop new interface to manage Solr. I > have found severel

Re: which api to use to manage solr ?

2012-10-12 Thread Lance Norskog
SolrJ is in Java, RSolr and ruby-solr are for Ruby, etc. These are for low-level programming. There is a WordPress plugin for Solr, Django, Magento e-commerce, and some other apps. Blacklight is a content manager for libraries. What do you want to do with Solr? On Fri, Oct 12, 2012 at 4:45 PM,

Re: Using

2012-10-12 Thread Lance Norskog
After that, remove your ivy repository (home/.ivy2) and try again. And rename your Maven repository just to avoid anything. I have had weird problems with connectivity to different Ivy repositories. I use a VPN service that pops out in different countries (blackVPN) and some countries worked and