You can build shingles and then use the synonym filter. in this case you
will have to think about all these token that you don't need after the
shingle filter.
Am 12.10.2012 01:35, schrieb T. Kuro Kurosaka:
I am looking for a way to fold a particular sequence of tokens into
one token.
Concret
October 12 2012, Apache Solr™ 4.0 available.
The Lucene PMC is pleased to announce the release of Apache Solr 4.0.
Solr is the popular, blazing fast, open source NoSQL search platform
from the Apache Lucene project. Its major features include powerful
full-text search, hit highlighting, faceted se
Hi list,
Any one know the how-to integration solr with netbeans?
The reasons I want to have solr in netbeans:
+ to avoid the long classpath configuration in the environment variables
+ avoid complicated steps (especially when starting and restarting the
glassfish server),
+ help with debugging t
Martin,
I found slide quite relevant to what are you asking about.
http://www.slideshare.net/lucenerevolution/potter-timothy-boosting-documents-in-solr
On Tue, Oct 9, 2012 at 7:57 AM, Otis Gospodnetic wrote:
> Hi Martin,
>
> Perhaps you could make a small change in Solr to add "don't reload E
Dne 11.10.2012 1:12, Upayavira napsal(a):
That is what is being discussed already. The thing is, at present, Solr
requires an even distribution of documents across shards, so you can't
just add another shard, assign it to a hash range, and be done with it.
You can use shard size as part of scori
Hmmm, I dug around in the code and found this bit:
* Forces merging of all segments that have deleted
* documents. The actual merges to be executed are
* determined by the {@link MergePolicy}. For example,
* the default {@link TieredMergePolicy} will only
* pick a segment if t
Well, I'm certainly not all that up on how that all works, I was mostly
trying to make sure you really needed to, and you do
But this capability, though rarely requested, seems harmless, so if
you wanted to create a patch that allows this but doesn't put
anything in the header/footer (or maybe
Hi,
I was wondering if there are any plans to work on this issue:
https://issues.apache.org/jira/browse/SOLR-2963 ?
And possibly any thoughts on how difficult it will be to resolve?
Thanks,
Rikke
On Tue, Oct 2, 2012 at 11:48 AM, Robert Krüger wrote:
> Hi,
>
> I'm using Solr 3.6.1 in an application embedded directly, i.e. via
> EmbeddedSolrServer, not over an HTTP connection, which works
> perfectly. Our application uses Thread.interrupt() for canceling
> long-running tasks (e.g. through Fu
Hi,
Can you share more please? Have you tried this? How well did it work for you?
Thanks,
Otis
--
Search Analytics - http://sematext.com/search-analytics/index.html
Performance Monitoring - http://sematext.com/spm/index.html
On Fri, Oct 12, 2012 at 7:17 AM, Radim Kolar wrote:
> Dne 11.10.201
Hi Tolga,
You'll get more help on the Nutch mailing list. I don't know the
schema Nutch uses for Solr off the top of my head, so I can't tell you
if maybe it uses "site" for a field or "host" or "url" or "domain" or
...
Otis
--
Search Analytics - http://sematext.com/search-analytics/index.html
P
I would like to thank you all who participated in this!
Thank you very much!
Péter
2012/10/12 Robert Muir :
> October 12 2012, Apache Solr™ 4.0 available.
> The Lucene PMC is pleased to announce the release of Apache Solr 4.0.
>
> Solr is the popular, blazing fast, open source NoSQL search platf
On 10/11/12 4:47 PM, Jack Krupansky wrote:
The ":" which normally separates a field name from a term (or quoted
string or parenthesized sub-query) is "parsed" by the query parser
before analysis gets called, and "*:*" is recognized before analysis
as well. So, any attempt to recreate "*:*" in a
Thank you very much Hoss, I knew I was doing something stupid. I will
change the dynamic fields to stored="false" and check it out.
Thanks
Ravi Kiran Bhaskar
On Wed, Oct 10, 2012 at 3:02 PM, Chris Hostetter
wrote:
> : I have a weird problem, Whenever I read the doc from solr and
> : the
Okay, let's back up. First, hold off mixing in your proposed solution until
after we understand the actual, original problem:
1. What is your field and field type (with analyzer details)?
2. What is your query parser (defType)?
3. What is your query request URL?
4. What is the parsed query (add
That’s "used to see several".
-- Jack Krupansky
-Original Message-
From: Jack Krupansky
Sent: Friday, October 12, 2012 12:54 PM
To: solr-user@lucene.apache.org
Subject: Shutting down Solr in Cygwin on Wndows
I used to several several “shutdown” messages when I hit ^C while running
So
Hello,
I have some questions about the SolrCloud.
Can I take full advtange of the Cloud with the PECL Solr client? It was
last updated for Solr 3.1 http://pecl.php.net/package/solr
Is Jetty the recommended servlet for the Cloud?
The documentation about configuring, optimizing, and accessing
On 10/12/2012 01:42 PM, Shaddy Zeineddine wrote:
Hello,
I have some questions about the SolrCloud.
Can I take full advtange of the Cloud with the PECL Solr client? It
was last updated for Solr 3.1 http://pecl.php.net/package/solr
I don't know for sure, I don't know that client. If it's HTTP
On 10/12/2012 01:42 PM, Shaddy Zeineddine wrote:
> Hello,
>
> I have some questions about the SolrCloud.
>
> Can I take full advtange of the Cloud with the PECL Solr client? It was last
> updated for Solr 3.1 http://pecl.php.net/package/solr
I don't know for sure, I don't know that client. If it
Hello,
Solr Cloud and Hadoop are new to me. And I am figuring out an
architecture to do a
distributed indexing/searching system in a cluster. Integrating them is an
option.
I would like to know if Hadoop + Solr is still a good option to build the a
big index in a cluster,
using HDFS and MapReduce
I definitely haven't tried this ;=) but perhaps you could create your own
XXXSynonymFilterFactory as a subclass of SynonymFilterFactory, which
would allow you to share the synonym map across all cores - though I think
there would need to be a nasty global variable to hold a reference to it...
-S
Hello Rui,
If your data to be indexed is in HDFS, using MapReduce to parallelize
indexing is still a good idea.
Otis
--
Search Analytics - http://sematext.com/search-analytics/index.html
Performance Monitoring - http://sematext.com/spm/index.html
On Fri, Oct 12, 2012 at 2:35 PM, Rui Vaz wrote:
Jack,
It goes like this:
http://myhost:8983/solr/select?indent=on&version=2.2&q=*%3A*&fq=&start=0&rows=10&fl=*%2Cscore&qt=&wt=&debugQuery=on
and edismax is the default query parser in solrconfig.xml.
There is a field named text_jpn that uses a Tokenizer that we developed
as a product, which we
Yes I was thinking the same thing, although I was hoping there was a more
elegant mechanism exposed by the solr infrastructure code to handle the shared
map, aside from just using a global that is.
Phil
-Original Message-
From: simon [mailto:mtnes...@gmail.com]
Sent: 12 October 2012 1
What I'd like to see added to the SolrCloud wiki page:
- The wiki page states that you can send your request to any server, but
what if that server goes down? Doesn't there need to be an aliased IP
address pointing to an active server? Or, is there client side support
like MongoDB replica set
On 10/12/2012 6:04 AM, Erick Erickson wrote:
Hmmm, I dug around in the code and found this bit:
* Forces merging of all segments that have deleted
* documents. The actual merges to be executed are
* determined by the {@link MergePolicy}. For example,
* the default {@link Tie
Hi Rui,
If you're going to shard and/or replicate your index, then be sure to take
a look at CloudSolrServer in the SolrJ client library. CloudSolrServer is
an extension to SolrServer that works with Zookeeper to understand the
shards and replicas in a Solr cluster. Using CloudSolrServer, there is
bq. but somehow else might no better.
* But someone else might know better* - brain is a bit scrambled today.
I'll try and address your questions on the wiki.
- Mark
On 10/12/2012 03:32 PM, Shaddy Zeineddine wrote:
What I'd like to see added to the SolrCloud wiki page:
- The wiki page states
You may also want take a look at the DataStax Enterprise product which
combines Cassandra, Solr, and Hadoop.
See:
http://www.datastax.com/products/enterprise
-- Jack Krupansky
-Original Message-
From: Rui Vaz
Sent: Friday, October 12, 2012 2:35 PM
To: solr-user@lucene.apache.org
Subj
I have a main solr server(solr1) which stores indexes of all docs, and want
to implement the following function:
1. First make a full import of my doc updated/created recently(last 1 or 2
weeks) from solr1.
2. Make delta import at intervals to copy the change of my doc from solr1 to
solr2. - doc ma
I don't have a Solr 3.5 to check, but SOLR-3261, which was fixed in Solr 3.6
may be your culprit.
See:
https://issues.apache.org/jira/browse/SOLR-3261
So, try SOlr 3.6 or 3.6.1 or 4.0 to see if your issue goes away.
-- Jack Krupansky
-Original Message-
From: T. Kuro Kurosaka
Sent: F
Hi Erick,
After reading the discussion you guys were having about renaming optimize to
forceMerge I realized I was guilty of over-optimizing like you guys were
worried about! We have about 15 million docs indexed now and we spin about
50-300 adds per second 24/7, most of them being updates to
Thank you very much for your replies,
Yes Otis one possibility is to copy my data do HDFS and then apply a Map
function
to create the intermediate indexes across the cluster using SOLR java
library in HDFS.
I have some doubts concerning this solution:
1 - The int
There are a lot of sub-tasks having to do with lots of cores here:
http://wiki.apache.org/solr/LotsOfCores
I don't see a reference to this particular issue, but it sure seems
like this could be a reasonable thing to add to the list. By extension,
all of the files that can be specified (stopwords, q
Sounds reasonable although I admit I haven't looked deeply.
Erick
On Fri, Oct 12, 2012 at 3:41 PM, Shawn Heisey wrote:
> On 10/12/2012 6:04 AM, Erick Erickson wrote:
>>
>> Hmmm, I dug around in the code and found this bit:
>> * Forces merging of all segments that have deleted
>> *
Hmmm, not quite what you asked, but would it work to just
replicate from Solr1 to Solr2 when you want to synch? You
can trigger this via http, see:
http://wiki.apache.org/solr/SolrReplication#HTTP_API
If you're talking about individual documents, then the answer
is no. There's nothing as far as I
I've been building 4.x regularly. Have you tried "ant clean-jars"?
Best
Erick
On Fri, Oct 12, 2012 at 6:32 PM, P Williams
wrote:
> Hi,
>
> Has anyone tried using name="solr-test-framework" rev="4.0.0" conf="test->default"/> with Apache
> IVY in their project?
>
> rev 3.6.1 works but any of the
Right. If I've multiplied right, you're essentially replacing your entire index
every day given the rate you're adding documents.
Have a look at MergePolicy, here are a couple of references:
http://juanggrande.wordpress.com/2011/02/07/merge-policy-internals/
https://lucene.apache.org/core/old_vers
If you are updating all the time, don't forceMerge at all, unless you want to
put the overhead of big merges at a known time. Otherwise, leave it alone.
wunder
On Oct 12, 2012, at 3:56 PM, Erick Erickson wrote:
> Right. If I've multiplied right, you're essentially replacing your entire
> index
Good evening,
SolrJ lives in the same house as Solr itself, so...
Otis
--
Performance Monitoring - http://sematext.com/spm
On Oct 12, 2012 5:39 PM, "autregalaxie" wrote:
> Good morning everybody,
>
> I'm a new user of Solr, i have to develop new interface to manage Solr. I
> have found severel
SolrJ is in Java, RSolr and ruby-solr are for ruby, etc. These are for
low-level programming.
There is a Wordpress plugin for Solr, Django, Magento e-commerce, and
some other apps. Blacklight is an content manager for libraries.
What do you want to do with Solr?
On Fri, Oct 12, 2012 at 4:45 PM,
After that, remove your ivy repository (home/.ivy2) and try again. And
rename your Maven repository just to avoid anything.
I have had weird problems with connectivity to different Ivy
repositories. I use a VPN service that pops out in different countries
(blackVPN) and some countries worked and
42 matches
Mail list logo