Re: Java heap space

2006-05-01 Thread Marcus Stratmann

Yonik Seeley wrote:

Yes, on a delete operation. I'm not doing any commits until the end of
all delete operations.

I assume this is a delete-by-id and not a delete-by-query?  They work
very differently.


Yes, all queries are delete-by-id.



If you are first deleting so you can re-add a newer version of the
document, you don't need too... overwriting older documents based on
the uniqueKeyField is something Solr does for you!


Yes, I know. But the articles in our (sql-)database get new IDs when 
they are changed so they need to be deleted an re-inserted into the index.




Is it possible to use a profiler to see where all the memory is going?
It sounds like you may have uncovered a memory leak somewhere.


I'm not that experienced concerning Java, but maybe if you give me some 
advice I'm glad if I can help. So far I had a quick look at JMP once but 
that's all.

Don't hesitate to write me a PM on that subject.



Also what OS, what JVM, what appserver are you using?

OS: Linux (Debian GNU/Linux i686)
JVM: Java HotSpot(TM) Server VM (build 1.5.0_06-b05, mixed mode) of 
Sun's JDK 5.0.
Currently I'm using the Jetty installation from the solr nightly builds 
for test purposes.


Marcus


Ruby bindings/SolRuby?

2006-05-01 Thread Brian Lucas
Is anyone working on Ruby bindings?  I noticed a page called "SolRuby" on
the wiki - what is this?

 

I would like to make Solr work like a model in Rails if possible, but don't
want to reinvent the wheel.

 

Thanks, Brian



Re: Java heap space

2006-05-01 Thread Chris Hostetter

: > If you are first deleting so you can re-add a newer version of the
: > document, you don't need too... overwriting older documents based on
: > the uniqueKeyField is something Solr does for you!
:
: Yes, I know. But the articles in our (sql-)database get new IDs when
: they are changed so they need to be deleted an re-inserted into the index.

this is off the subject of the heap space issue ... but if the id changes,
then maybe it shouldn't be the uniqueId of your index? .. your code must
have someone of recognizing that article B with id 222 is a changed
version of article A with id 111 (otherwise how would you know to delete
111 when you insert 222?) ..whatever that mechanism is, perhaps it should
determine your uniqueKey?


-Hoss



Re: Java heap space

2006-05-01 Thread Marcus Stratmann
Chris Hostetter wrote:
> this is off the subject of the heap space issue ... but if the id changes,
> then maybe it shouldn't be the uniqueId of your index? .. your code must
> have someone of recognizing that article B with id 222 is a changed
> version of article A with id 111 (otherwise how would you know to delete
> 111 when you insert 222?) ..whatever that mechanism is, perhaps it should
> determine your uniqueKey?

No, there is no "key" or something that reveals a relation between new
article B and old article A. After B is inserted and A is deleted, all
of A's existence is gone and we do not even know that B is A's
"successor". Changes are simply kept in a table which tells the system
which IDs to delete and which new (or changed) articles to insert,
automatically giving them new IDs. I know this may not be (or at least
sound) perfect and it is not the way things are handled normally. But
this works fine for our needs. We gather information about changes to
our data during the day and apply them on a nightly update (which, I
know, does not imply that IDs have to change).
So, yes, I'm sure I got the right uniqueKey. ;-)

Marcus





Adding xml to SolrQueryResponse

2006-05-01 Thread Mike Austin

Is there a way to add attributes besides name to an xml node returned from
SolrQueryResponse? I've looked at the SolrQueryResponse.add and it looks
like a NamedList is my only option.  I know that I can get by with nodes
that have only the name attribute but it would make life a little easier to
throw some more attributes on a node.

Thanks,
Mike Austin


Re: Adding xml to SolrQueryResponse

2006-05-01 Thread Yonik Seeley

On 5/1/06, Mike Austin <[EMAIL PROTECTED]> wrote:

Is there a way to add attributes besides name to an xml node returned from
SolrQueryResponse? I've looked at the SolrQueryResponse.add and it looks
like a NamedList is my only option.  I know that I can get by with nodes
that have only the name attribute but it would make life a little easier to
throw some more attributes on a node.


There isn't currently, and making things too extensible will get you
back to unconstrained XML again (might as well use DOM or something).

Hmmm, but one could put random XML in the response by making an XML
type that you would get if you put a DOM object in the NamedList for
the response.


  // normal Solr result 

 

Re: Adding xml to SolrQueryResponse

2006-05-01 Thread Chris Hostetter
: Is there a way to add attributes besides name to an xml node returned from
: SolrQueryResponse? I've looked at the SolrQueryResponse.add and it looks
: like a NamedList is my only option.  I know that I can get by with nodes
: that have only the name attribute but it would make life a little easier to
: throw some more attributes on a node.

FYI: The orriginal thinking behind the current XML format was that
itshould only contain concepts that were easy to map to the "primitive"
types found in most type safe langauges: int/float/date/string/etc...
 exists to provide a simple mechanism to support both: Ordered lists
of heterogenously typed objects; and Un-Ordered name/value pairs  ... all
depending on what you prefer.

Allowing arbitrarry attributes would complicate that simple mapping of tag
to primitive object -- Which doesn't mean I'm opposed to the idea, I
acctually think it would be nice if there was a simple way to add a DOM
sub-trees directly to the response -- but I'd advocate not having any core
Solr code use this ... mainly because one of the "Near Term" goals is to
get an XML DTD/Schema that validates all legal responses from any (real or
theoretical) RequestHandler.  Allowing arbitrary XL attributes would
illiminate SolrCore's ability to ensure that the XML responses it
generates validate with that DTD/Schema -- so if we did add support for
it, we'd want to treat it special in some way.

-Hoss