Does this help?
http://wiki.apache.org/solr/DataImportHandlerFaq#I.27m_using_DataImportHandler_with_a_MySQL_database._My_table_is_huge_and_DataImportHandler_is_going_out_of_memory._Why_does_DataImportHandler_bring_everything_to_memory.3F
On Wed, Oct 28, 2009 at 12:38 AM, William Pierce wrote:
>
Hi,
I am not changing any URL while querying because the custom query
parser plugin is linked with the default request handler.You may have
a look at my first mail for xml snippets which is included in
solrconfig.xml .
Yeah.. I found the line .. "INFO: created queryParserPluginOne:
plugi
-- Forwarded message --
From: Nick Lothian
Date: Wed, Oct 28, 2009 at 11:37 AM
Subject: Full Text Search: Solr on Cassandra
To: "cassandra-u...@incubator.apache.org"
Just in case anyone here is interested, I've managed to get Solr
working on Cassandra using Jake Luciani's Lucand
Hi all,
I just try to migrate from solr-2009-09-09.zip to today's nightly build.
I found that the *_i dynamic field's type remained int but its definition
became
the older Solr schema lacks the definition of int, so I suppose, that its class
is
solr.IntField. I modified the schema in two p
> Have two cores with some common fields in their schemas. I want to perform
> a MLT query on one core and get results from the other schema. Both cores
> have same type of id.
>
>Having the same "type of id" in two different cores is of no good for a MLT
>handler (which in-fact operates on one c
On Tue, Oct 27, 2009 at 3:41 PM, Yonik Seeley
wrote:
>> is there a reason why StreamingUpdateSolrServer does not accept a
>> connection manager argument as CommonsHttpSolrServer does?
>
> I don't see any reason why it shouldn't... can you open a JIRA issue
> and optionally provide a patch?
great
2009/10/28 Király Péter :
> I just try to migrate from solr-2009-09-09.zip to today's nightly build.
> I found that the *_i dynamic field's type remained int but its definition
> became
> omitNorms="true" positionIncrementGap="0"/>
>
> the older Solr schema lacks the definition of int, so I suppos
Great suggestion, I took a look and it seems pretty useful. As a follow up
question, did you do anything to disable Solr caching for certain tests?
-mike
On Tue, Oct 27, 2009 at 8:14 PM, Joshua Tuberville <
joshuatubervi...@eharmony.com> wrote:
> Mike,
>
> For response times I would also look at
I am all for new stuff.
It would be nice to see a working example of ajax-solr before killing
completely solrjs from trunk... at least it was working .. ajax-solr has no how
to, nor any working example..
http://github.com/evolvingweb/ajax-solr
Well why not just remove the javascript folder t
Actually, it wasn't quite working. It also replicated a lot of stuff
that's in ajax solr, which is being more actively developed. Hence the
removal.
On Oct 28, 2009, at 10:16 AM, Antonio Eggberg wrote:
I am all for new stuff.
It would be nice to see a working example of ajax-solr before
I fully understand it was not working probably in production or on other data
sets. But it did serve a purpose for me.. i.e. show a demo to anyone out of my
box.. and I update my local repo with trunk all the time..
I could do ant reuters-start using my laptop and it would work.. my point is
re
Out of curiosity... Why not just make/find a quick example using ajax-
solr and use git to update from them?
On Oct 28, 2009, at 10:29 AM, Antonio Eggberg wrote:
I fully understand it was not working probably in production or on
other data sets. But it did serve a purpose for me.. i.e. show a
Hi Solr masters,
I just wanted to confirm what i read is the best and optimal way of doing
spell check correction.
i am using index based spell check.
For this as the doc says
Create a field ex. my_text with a fieldType my_text_fl which has all the
filters defined.
Copy all the othe fields data
From: "Yonik Seeley"
To:
Sent: Wednesday, October 28, 2009 2:11 PM
Subject: Re: ERROR:SCHEMA-INDEX-MISMATCH
I imagine that your field impl was SortableIntField, not IntField, but
checking your old schema will tell you for sure.
Yes, you are right, I double checked it.
Thanks!
Peter
>
> My thought now is I cannot use MLT and instead must do a query to B using
> the fields from core A ID as query params. Is there big difference in what
> will be returned as results using query instead of MLT?
>
Yes, there is definitely a difference between the results from a MLT handler
and any
Mea maxima culpa,
I had foolishly set the option omitTermFreqAndPositions="false" in an
attempt to save space.
It works when this is set to 'true'.
However, even when it's set to 'false' , the highlighting of a field
continues to work even if the search doesn't.
Does the highlighter use a differ
Hi All,
We have 8 solr shards, index is ~ 90M documents 190GB. :)
4 of the shards have acceptable commit time - 30-60 seconds. The other 4
have drifted over the last couple months to but up around 2-3 minutes. This
is killing our write throughput as you can imagine.
I've included a log dump
Hi, here's two thing that can slow down commits:
1) Autowarming the caches.
2) The Java old generation object garbage collection.
You can try:
- Turning autowarming off (set autowarmCount="0" in the caches configuration)
- If you use the sun jvm, use -XX:+UseConcMarkSweepGC to get a less
blocki
Thanks Jerome,
1. I have shut off autowarming by setting params to 0.
2. My JVM Settings: -Xmx1200m -Xms1200m -XX:-UseGCOverheadLimit
-XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=50
3. I am using autocommits - every 6 ms. But the commit blocks all the
master request threadpool
Thanks for the reply --
>> In your multi-core set-up, I don't think you are anywhere close to using
>> MLT. The arrangement looks more like a search query.
Does that mean that you cannot do a 'MLT' query from one core result
to get MLT from another (even if there is some common schema between)?
>
> Does that mean that you cannot do a 'MLT' query from one core result to get
> MLT from another (even if there is some common schema between)?
>
You can always run MLT handlers on a core. Each MLT handler takes certain
parameters based on which "similar" results are fetched. You would need to
pa
I have a nested entity on a jdbc data import handler that is causing
an SQL error because the second key is either NULL (blank when
generating the sql) or non-zero INT.
The query is in the following form:
transformer="TemplateTransformer" query="SELECT * FROM table1 ">
Shouldn't this work too?
SELECT * FROM table2 WHERE IS NOT NULL
${table1.somethin_like_a_foreign_key} AND
${table1.somethin_like_a_foreign_key} > 0 AND id =
${table1.somethin_like_a_foreign_key}
Cheers
Avlesh
On Wed, Oct 28, 2009 at 11:03 PM, Jonathan Hendler <
jonathan.hend...@gmail.com> wrote:
No - the SQL will fail to validate because at runtime it will look like
SELECT * FROM table2 WHERE
IS NOT NULL table1.somethin_like_a_foreign_key
AND table1.somethin_like_a_foreign_key > 0
AND id =
Note the "id = "
On Oct 28, 2009, at 1:38 PM, Avlesh Singh wrote:
Shouldn't this work too?
Assuming this to be MySQL, will this work -
SELECT * FROM table2 WHERE id =
IF(ISNULL(${table1.somethin_like_a_foreign_key}), 0,
${table1.somethin_like_a_foreign_key});
Cheers
Avlesh
On Wed, Oct 28, 2009 at 11:12 PM, Jonathan Hendler <
jonathan.hend...@gmail.com> wrote:
> No - the SQL will fail
curious...is it possible to have faceted results ordered by score?
im having a problem where im faceting on a field while searching for
the same word twice, for example:
im searching for "the the" on a tokenized field and faceting by the
untokenized version, faceting returns records with "the the
Hi,
If I give a query that matches a single document, and facet on a particular
field, I get a list of all the terms in that field which appear in that
document.
(I also get some with a count of zero, I don't really understand where they
come from... ?)
Is it possible with faceting, or a simila
Thanks - that solution still causes an error.
But it helped me think of an SQL solution like so :
CONVERT ( '${table1.somethin_like_a_foreign_key}' , UNSIGNED INTEGER )
Convert the integer or NULL to a string, then back again. (ugly but it
works)
On Oct 28, 2009, at 1:48 PM, Avlesh Singh
Awesome! Its not a bad solution at all.
Cheers
Avlesh
On Wed, Oct 28, 2009 at 11:33 PM, Jonathan Hendler <
jonathan.hend...@gmail.com> wrote:
> Thanks - that solution still causes an error.
>
> But it helped me think of an SQL solution like so :
> CONVERT ( '${table1.somethin_like_a_foreign_key}
>
> curious...is it possible to have faceted results ordered by score?
>
First, I am not sure what that means. Score of what? Documents? If yes, how
do you think the same should influence faceting?
Second, there are only two ways you can sort facet values on a field. More
here - http://wiki.apache.
For facets -
http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount
For terms - http://wiki.apache.org/solr/TermsComponent
Helps?
Cheers
Avlesh
On Wed, Oct 28, 2009 at 11:32 PM, Andrew Clegg wrote:
>
> Hi,
>
> If I give a query that matches a single document, and facet on a particular
Isn't the TermVectorComponent more for one document at a time, and the
TermsComponent for the whole index?
Actually -- having done some digging... What I'm really after is the most
informative terms in a given document, which should take into account global
document frequency as well as term fre
Why can't we use single entity with single SELECT ... LEFT OUTER JOIN ...?
> -Original Message-
> From: Jonathan Hendler [mailto:jonathan.hend...@gmail.com]
> Sent: October-28-09 1:33 PM
> To: solr-user@lucene.apache.org
> Subject: Simple problem with a nested entity and it's SQL
>
> I
THanks - That's a good question.
I thought of using one single SQL statement - but the nested entity's
query is actually quite complex (unlike the example).
So it'd be possible, but more readable as a separate query.
Further, MySQL has some limitations also about temporary tables and
seems l
Hi Hoss,
Thanks for the clarification again.
Now I can see where the problem resides. My client application was
formatting date fields using SimpleDateFormat and as you said, it assumes
host timezone configuration.
: your dateFormat object doesn't know that the 'Z' at the end of the string
you a
This is an issue we experienced a while back. We once again tried to load a
custom class as a plugin jar from the lib directory and began experiencing
severe memory problems again. The code in our jar wasn't being used at
all...the class was only referenced in the schema. I find it strange that
Here's the "all" code snippets -
.
.
all
.
.
It sounds from what you say that I'm going to need to change the field type to
"edgytext". Which won't achieve the result I want, viz. the current "all" plus
the edgytext. Any way to achieve this?
Thanks!
bern
(I am super new to solr, sorry if this is an easy one)
Hi, I want to support an A-Z type view of my data.
I have a DataImportHandler that uses sql (my query is complex, but the
part that matters is:
SELECT f.id, f.title, LEFT(f.title,1) as firstLetterTitle FROM Foo f
I can create this index
Hi Joel, I had a similar issue the other day; in my case the solution turned
out to be that the letters were stopwords. Don't know if this is your answer,
but worth checking.
Bern
-Original Message-
From: Joel Nylund [mailto:jnyl...@yahoo.com]
Sent: Thursday, 29 October 2009 9:17 AM
To:
Thanks Bern, now that you mention it they are in there, I assume if I
remove them it will work, but I probably dont want to do that right?
Is there a way for this particular query to ignore stopwords
thanks
Joel
On Oct 28, 2009, at 6:20 PM, Bernadette Houghton wrote:
Hi Joel, I had a similar
2009/10/28 gabriele renzi :
> On Tue, Oct 27, 2009 at 3:41 PM, Yonik Seeley
> wrote:
>
>>> is there a reason why StreamingUpdateSolrServer does not accept a
>>> connection manager argument as CommonsHttpSolrServer does?
>>
>> I don't see any reason why it shouldn't... can you open a JIRA issue
>>
I think that is not a problem, because your are only storing one
character per field. There are other text field types that do not have
the stop word filter, so give your first letter field that field type.
In this way stopword filter analyser is only disabled for searches on
the first letter field
Well I tried removing those 2 letters from stopwords, didnt seem to
help, I also tried changing the field type to "text_ws", didnt seem to
work. Any other ideas?
thanks
Joel
On Oct 28, 2009, at 6:42 PM, Martijn v Groningen wrote:
I think that is not a problem, because your are only storing
Hi Joel, I'm a relative beginner to solr myself. I think the "s" and "t" are
probably in the stopwords list because there will be a lot of them resulting
from analysing of words such as "don't" and "person's". Whether that's (hey,
another example!) an issue for you will probably depend on what a
Joel, did you restart tomcat? Need to restart each time you change schema.xml.
bern
-Original Message-
From: Joel Nylund [mailto:jnyl...@yahoo.com]
Sent: Thursday, 29 October 2009 10:21 AM
To: solr-user@lucene.apache.org
Subject: Re: weird problem with letters S and T
Well I tried removi
With the new Solr 1.4 release nearing release, Lucid Imagination has put
together a technical white paper with an in-depth review of “What’s New
in Solr 1.4.” You can download the white paper at
http://www.lucidimagination.com/whitepaper/whats-new-in-solr-1-4?sc=AP.
--
- Mark
http://www.lucidima
Or just reload the app pool. No need to restart the whole server
On 28 Oct 2009, at 23:23, "Bernadette Houghton"
wrote:
> Joel, did you restart tomcat? Need to restart each time you change
> schema.xml.
> bern
>
> -Original Message-
> From: Joel Nylund [mailto:jnyl...@yahoo.com]
> Sen
It is easier to put SQL select statements in a view, and just use that
view from the DIH configuration file.
On Tue, Oct 27, 2009 at 12:30 PM, Andrew Clegg wrote:
>
>
> Heh, eventually I decided
>
> "where 4 > node_depth"
>
> was the most pleasing (if slightly WTF-ish) way of writing it...
>
> Ch
>
> Any ideas, are S and T special chars in query for solr?
>
Nope, they are NOT. My guess is that
- You are using a "text" type field for firstLetterTitle which has the
stopword filter applied to it.
- Your "stopwords.txt" file contains the characters "s" and "t" because
of which the
>
> It sounds from what you say that I'm going to need to change the field type
> to "edgytext". Which won't achieve the result I want, viz. the current "all"
> plus the edgytext. Any way to achieve this?
>
I guess there is a mismatch of expectations here. A field can be analyzed in
only ONE way. I
On Wed, Oct 28, 2009 at 6:38 PM, gabriele renzi wrote:
> While I'm at it, FindBugs has a few issues with the solr code base (a
> few hundreds, but most seem false positives) but it does have a few
> concurrency related issues that appear reasonable to me, is it of any
> use if I submit patches for
I have an index of about 3 million documents, and specific list of document
ids that belong in that 3 million (somewhere around 20-50 documents on
average). With my filtered list of documents I want to be able to get
TF-IDF scores calculated based on only that small subset, instead of the
scores
peelman wrote:
>
> I have an index of about 3 million documents, and specific list of
> document ids that belong in that 3 million (somewhere around 20-50
> documents on average). With my filtered list of documents I want to be
> able to get TF-IDF scores or run a MoreLikeThis query against ON
On Wed, 28 Oct 2009 19:20:37 -0400
Joel Nylund wrote:
> Well I tried removing those 2 letters from stopwords, didnt seem to
> help, I also tried changing the field type to "text_ws", didnt seem to
> work. Any other ideas?
Hi Joel,
if your stop word filter was applied on index, you will have
54 matches
Mail list logo