I am trying to filter Russian stopwords but have not been successful.
I am using the following schema entry -
.
..
Interestingly, Russian synonyms are working fine. English and Russian
synonyms get searched
Yonik Seeley wrote:
We'd love some feedback on how it works to
ensure that it actually is a win for the majority and should be the
default.
I just did a quick test using Solr nightly 2008-11-30. I have an index
of about 2.9 million bibliographic records, size: 16G. I tested faceting
author na
Hi,
I think I've discovered a bug with the JSONResponseWriter: starting
from the following query -
http://127.0.0.1:8080/solr-urbamet/select?q=(tout:1)&rows=0&sort=TITRE+desc&facet=true&facet.query=SUJET:b*&facet.field=SUJET&facet.prefix=b&facet.limit=1&facet.missing=true&wt=json&json.nl=arrarr
Hello,
I want to index a field with an array of arrays, is that possible in Solr?
I.e., I have one multi-valued field with persons and would like one
multi-valued field with their employers, but sometimes there is more than
one employer per person and therefore it would have been good to use a
multi-va
Hello,
Is there any way for Solr to follow links stored in my database and index
the content of these files and HTTP-resources?
Thanks in advance! // Joel
Till Kinstler wrote:
Hi,
> I just did a quick test using Solr nightly 2008-11-30. I have an index
> of about 2.9 million bibliographic records, size: 16G. I tested faceting
> author names, each index document may contain multiple author names, so
> au
Wow -- thanks for all the help!! With everyone's help, I did end up
in a *much* better place:
private static boolean solrCoreExists(String coreName, String solrRootUrl)
        throws IOException, SolrServerException
{
    CommonsHttpSolrServer adminServer = new CommonsHttpSolrServer(
Hi Yonik,
May I ask in which class(es) this improvement was made? I've been using the
DocSet, DocList, BitDocSet, HashDocSet from Solr from a few years ago with a
Lucene based app. to do faceting.
Thanks,
Peter
On Mon, Nov 24, 2008 at 11:12 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote:
> A new f
In the dataimport.properties file, there is the timestamp:
#Thu Dec 04 15:36:22 EST 2008
last_index_time=2008-12-04 15\:36\:20
I am using Oracle (10g) and would like to know which timestamp format
I have to use in Oracle.
Thanks,
Jae
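A minimal sketch, not taken from the thread, of how the two formats line up. It assumes the value is read with `java.util.Properties` (which drops the backslashes before the colons), that it follows the Java pattern `yyyy-MM-dd HH:mm:ss`, and that it is handed to Oracle through `TO_DATE` with the mask `YYYY-MM-DD HH24:MI:SS`. The class and method names are hypothetical.

```java
import java.io.IOException;
import java.io.StringReader;
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Properties;

// Hypothetical helper, not part of Solr or the DataImportHandler.
class LastIndexTime {
    // Java pattern matching the value stored in dataimport.properties.
    static final String JAVA_PATTERN = "yyyy-MM-dd HH:mm:ss";
    // Equivalent Oracle format mask (24-hour clock).
    static final String ORACLE_MASK = "YYYY-MM-DD HH24:MI:SS";

    // Reads last_index_time from properties text; Properties.load
    // removes the backslashes that escape the colons.
    static String read(String propertiesText) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(propertiesText));
        } catch (IOException e) {
            throw new IllegalArgumentException("unreadable properties text", e);
        }
        return props.getProperty("last_index_time");
    }

    // Checks that the value parses, then wraps it in a TO_DATE expression.
    static String toOracleExpression(String value) {
        SimpleDateFormat format = new SimpleDateFormat(JAVA_PATTERN);
        format.setLenient(false);
        try {
            format.parse(value);
        } catch (ParseException e) {
            throw new IllegalArgumentException("unexpected timestamp: " + value, e);
        }
        return "TO_DATE('" + value + "', '" + ORACLE_MASK + "')";
    }

    public static void main(String[] args) {
        String text = "#Thu Dec 04 15:36:22 EST 2008\n"
                    + "last_index_time=2008-12-04 15\\:36\\:20\n";
        System.out.println(toOracleExpression(read(text)));
    }
}
```

The resulting expression could be compared against a DATE column in a delta query. Whether the column needs a different mask depends on the schema.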
Thanks for the report Grégoire, it definitely looks like a bug.
Would you mind opening a JIRA issue for this?
-Yonik
On Fri, Dec 5, 2008 at 6:26 AM, Grégoire Neuville
<[EMAIL PROTECTED]> wrote:
> Hi,
>
> I think I've discovered a bug with the JSONResponseWriter: starting
> from the following que
I have had the same experience.
What is the CPU in the Solaris box? It does not depend on the operating
system (Linux or Solaris). It depends on the CPU (Intel or SPARC).
I don't know why, but based on my performance tests, a SPARC machine requires
MORE memory for Java applications.
Jae
On Thu, Dec 4,
Jon,
What do you mean by off a "Zone"? Please clarify.
-Raghu
-Original Message-
From: Jon Baer [mailto:[EMAIL PROTECTED]
Sent: Thursday, December 04, 2008 9:56 PM
To: solr-user@lucene.apache.org
Subject: Re: Solr on Solaris
Just curious, is this off a "zone" by any chance?
- Jon
On
Hi Jae,
It's an Intel-based CPU.
-Raghu
-Original Message-
From: Jae Joo [mailto:[EMAIL PROTECTED]
Sent: Friday, December 05, 2008 9:53 AM
To: solr-user@lucene.apache.org
Subject: Re: Solr on Solaris
I have had the same experience.
What is the CPU in the Solaris box? It does not depend on th
very similar situation to those already reported. 2.9M bibliographic
records, with authors being the (previous) bottleneck, and the one
we're starting to test with the new algorithm.
so far, no load tests, but just in single requests i'm seeing the same
improvements...phenomenal improvements, btw
Peter,
It is the UnInvertedField class. See also:
https://issues.apache.org/jira/browse/SOLR-475
Peter Keegan wrote:
Hi Yonik,
May I ask in which class(es) this improvement was made? I've been using the
DocSet, DocList, BitDocSet, HashDocSet from Solr from a few years ago with a
Lucene based app.
Hi Tushar,
On 12/05/2008 at 5:18 AM, tushar kapoor wrote:
> I am trying to filter Russian stopwords but have not been
> successful with that.
[...]
> words="stopwords.txt"/>
>ignoreCase="true" expand="false"/>
[...]
> Interestingly, Russian synonyms are work
Are you running Solr in a container? More specifically, I've had a few
issues w/ zones in the past and Solr (I believe there are some
networking issues w/ older Solaris versions) ...
They are basically where you can slice ("virtualize") your resources
and divide a box up into something similar
On Fri, Dec 5, 2008 at 5:09 AM, ashokc <[EMAIL PROTECTED]> wrote:
>
> The SOLR wiki says
>
> >>3. Make sure both indexes you want to merge are closed.
>
> What exactly does 'closed' mean?
I think that would mean that the IndexReader and IndexWriter on that index
are closed.
1. Do I need to stop
I guess you are trying to pass it in the SQL query. Try it as it is.
If Oracle does not take it, you can format the date according to what
Oracle likes
http://wiki.apache.org/solr/DataImportHandler#head-5675e913396a42eb7c6c5d3c894ada5dadbb62d7
On Fri, Dec 5, 2008 at 8:09 PM, Jae Joo <[EMAIL PROTE
Look at http://wiki.apache.org/solr/DataImportHandler
You may use an outer entity with SqlEntityProcessor and an inner
entity with XPathEntityProcessor
On Fri, Dec 5, 2008 at 5:35 PM, Joel Karlsson <[EMAIL PROTECTED]> wrote:
> Hello,
>
> Is there any way for Solr to follow links stored in my dat
On Thu, Dec 4, 2008 at 6:39 PM, ashokc <[EMAIL PROTECTED]> wrote:
>
> The SOLR wiki says
>
>>>3. Make sure both indexes you want to merge are closed.
>
> What exactly does 'closed' mean?
If you do a commit, and then prevent updates, the index should be
closed (no open IndexWriter).
> 1. Do I need
Thanks for the help, Yonik & Shalin. It really makes it easy for me if I do not
have to stop/start the SOLR app during the merge operations.
The reason I have to do this many times a day, is that I am implementing a
simple-minded entity-extraction procedure for the content I am indexing. I
have a u
Jon,
We are running under Tomcat. Thanks for the link; I will check it out.
-Raghu
-Original Message-
From: Jon Baer [mailto:[EMAIL PROTECTED]
Sent: Friday, December 05, 2008 10:57 AM
To: solr-user@lucene.apache.org
Subject: Re: Solr on Solaris
Are you running Solr in a container? More sp
Was this one ever addressed? I'm seeing it in some small percentage of the
documents that I index in 1.4-dev 708596M. I don't see a corresponding JIRA
issue.
James Brady-3 wrote:
>
> Hi,
> I'm seeing a problem mentioned in Solr-42, Highlighting problems with
> HTMLStripWhitespaceTokenizerFa
You're out of memory :).
For each instance of an application server, you can technically only
allocate about 1024 MB to the JVM; to take advantage of the memory you
need to run multiple instances of the application server.
Are you using RAMDirectory with SOLR?
On Thu, Dec 4, 2008 at 10:40 PM, Kashyap, Ra
I am using solrj to query solr and the QueryResponse.getResults() returns a
SolrDocumentList. There is a SolrDocument in the list with the results I
want. The problem is that I want to view these results as XML. How can I get
the SolrDocument to give me XML?
Thanks in advance.
-Dan
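A minimal sketch, not taken from the thread, of serializing a document's fields by hand. It deliberately avoids SolrJ: a plain `Map<String, Object>` stands in for the fields of a `SolrDocument`, and the `<doc>`/`<field>` element names are an assumption for illustration, not Solr's own response format. The class and method names are hypothetical.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper; a plain Map stands in for the fields of a SolrDocument.
class DocumentXml {
    // Escapes the characters that are special in XML text and attribute values.
    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;")
                .replace(">", "&gt;").replace("\"", "&quot;");
    }

    // Appends one <field name="...">value</field> element.
    static void appendField(StringBuilder out, String name, Object value) {
        out.append("<field name=\"").append(escape(name)).append("\">")
           .append(escape(String.valueOf(value))).append("</field>");
    }

    // Renders all fields as one <doc> element; collections become repeated fields.
    static String toXml(Map<String, Object> fields) {
        StringBuilder out = new StringBuilder("<doc>");
        for (Map.Entry<String, Object> e : fields.entrySet()) {
            if (e.getValue() instanceof Collection) {
                for (Object v : (Collection<?>) e.getValue()) {
                    appendField(out, e.getKey(), v);
                }
            } else {
                appendField(out, e.getKey(), e.getValue());
            }
        }
        return out.append("</doc>").toString();
    }

    public static void main(String[] args) {
        Map<String, Object> fields = new LinkedHashMap<String, Object>();
        fields.put("id", "42");
        fields.put("author", Arrays.asList("A & B", "C"));
        System.out.println(toXml(fields));
    }
}
```

String concatenation keeps the sketch short. For anything beyond simple field values, a real XML writer such as `javax.xml.stream.XMLStreamWriter` would be the safer choice.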
--
Vie
Our application processes RSS feeds. Its search activity is heavily
concentrated on the most recent 24 hours, with modest searching across
the past few days, and rare (but important) searching across months or
more. So we create a Solr core for each day, and then search the
appropriate se
When you say "application server", do you mean Tomcat?
If yes, I have allocated >8GB of heap to Tomcat and it uses it all with no
problem (64-bit Intel/64-bit Java).
-glen
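One way to check how much heap a given JVM actually received is to ask it. This is a minimal sketch, not taken from the thread. The class name is hypothetical, and the `sun.arch.data.model` property is HotSpot-specific, so it may be absent on other JVMs.

```java
// Hypothetical diagnostic, not part of Solr or Tomcat.
class HeapInfo {
    // Maximum heap the JVM will attempt to use, in megabytes.
    static long maxHeapMegabytes() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        System.out.println("max heap (MB): " + maxHeapMegabytes());
        System.out.println("os.arch: " + System.getProperty("os.arch"));
        // May print "null" on JVMs that do not define this property.
        System.out.println("data model: " + System.getProperty("sun.arch.data.model"));
    }
}
```

Running it with the same `-Xmx` setting as the application server shows whether the requested heap size was honoured.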
2008/12/5 Jeryl Cook <[EMAIL PROTECTED]>:
> You're out of memory :).
>
> each instance of an application server you can techn
On Dec 4, 2008, at 8:19 PM, Jonathan Ariel wrote:
Hi! I'm wondering what Solr is really doing with the exact word vs. the
stemmed word.
So for example I have 2 documents.
The first one has in the title the word "convertible"
The second one has "convert"
When Solr stems the titles, both will be
I'd somehow pass through Solr's XML response, or perhaps consider
using Solr's XSLT response writer to convert to the format you want.
I don't have the magic incantation handy, but it should be possible to
make a request through SolrJ and get the raw response string back in
whatever format
I've seen some strange results in the last few days of testing, but this one
flies in the face of everything I've read on this forum: Reducing
filterCache size has increased performance.
I have posted my setup here:
http://www.nabble.com/Throughput-Optimization-td20335132.html.
My original fil
On 5-Dec-08, at 2:24 PM, wojtekpia wrote:
I've seen some strange results in the last few days of testing, but this one
flies in the face of everything I've read on this forum: Reducing
filterCache size has increased performance.
This isn't really unexpected behaviour. The problem with a
On Fri, Dec 5, 2008 at 5:24 PM, Erik Hatcher <[EMAIL PROTECTED]> wrote:
> I'd somehow pass through Solr's XML response, or perhaps consider using
> Solr's XSLT response writer to convert to the format you want. I don't have
> the magic incantation handy, but it should be possible to make a request
On Fri, Dec 5, 2008 at 5:24 PM, wojtekpia <[EMAIL PROTECTED]> wrote:
>
> I've seen some strange results in the last few days of testing, but this one
> flies in the face of everything I've read on this forum: Reducing
> filterCache size has increased performance.
>
> I have posted my setup here:
>
Reducing the amount of memory given to java slowed down Solr at first, then
quickly caused the garbage collector to behave badly (same issue as I
referenced above).
I am using the concurrent cache for all my caches.
--
View this message in context:
http://www.nabble.com/Smaller-filterCache-giv
: So i'm basically looking for design pattern/best practice for that scenario
: based on people's experience.
I've taken two approaches in the past...
1) encode the "id" and the "label" in the field value; facet on it;
require clients to know how to decode. This works really well for simple
t
On Fri, Dec 5, 2008 at 5:40 AM, Laurence Rowe <[EMAIL PROTECTED]> wrote:
> 2008/12/4 Shalin Shekhar Mangar <[EMAIL PROTECTED]>:
>
>
> I think we have a slight misunderstanding here. Because there are many
> CMS processes it is possible that the same document will be updated
> concurrently (from di