Re: Experience with indexing billions of documents?

2010-04-13 Thread Thomas Koch
ince it's more advanced. Best regards, Thomas Koch, http://www.koch.ro

Re: Experience with indexing billions of documents?

2010-04-12 Thread Thomas Koch
Hi, could I interest you in this project? http://github.com/thkoch2001/lucehbase The aim is to store the index directly in HBase, a database system modelled after Google's Bigtable to store data in the range of terabytes or petabytes. Best regards, Thomas Koch Lance Norskog: > The 2B li

Re: deploying nightly updates to slaves

2010-04-12 Thread Thomas Koch
plications during the day. Is there by any chance the possibility that you'd rather store your data in HBase than in MySQL? I'm working on a project right now to store SOLR/Lucene indices directly in HBase too. I'll be at the webtuesday tomorrow. Maybe I could give an introduction to Hadoop/HBase at a future webtuesday? Best regards, Thomas Koch, http://www.koch.ro

[ANN] Eclipse GIT plugin beta version released

2010-03-31 Thread Thomas Koch
http://www.infoq.com/news/2010/03/egit-released http://aniszczyk.org/2010/03/22/the-start-of-an-adventure-egitjgit-0-7-1/ Maybe, one day, some apache / hadoop projects will use GIT... :-) (Yes, I know git.apache.org.) Best regards, Thomas Koch, http://www.koch.ro

"Overwriting" cores with the same core name

2010-02-11 Thread Thomas Koch
or datadirs that are older than the newest one and all these can be picked up for submission to katta. Now there remain two questions: - When the old core is closed, will there be an implicit commit? - How can I be sure that no more work is in progress on an old core datadir? Thanks, Thomas Koch, http://www.koch.ro

continuously creating index packages for katta with solr

2010-02-11 Thread Thomas Koch
let SOLR start with an empty index again. Does anybody have an idea how this could be achieved? Thanks a lot, Thomas Koch, http://www.koch.ro

highlighting and external storage

2009-12-22 Thread Thomas Koch
s IO intensive. This would give me the additional benefit that I could selectively delete the fulltext of older articles when running out of disk space while keeping the URL of the document in the index. Do you know whether something like this would be possible? Best regards, Thomas Koch, http://www.koch.ro

Multiple default search fields or catchall field?

2009-12-08 Thread Thomas Koch
lds to get highlighting in every case? - Isn't it a big waste of hard disk space to store the content twice? Thanks for any help, Thomas Koch, http://www.koch.ro

Re: Limit of a one-server-SOLR-installation

2009-10-05 Thread Thomas Koch
Hi Gasol Wu, thanks for your reply. I tried to make the config and syslog shorter and more readable. solrconfig.xml (shortened): false 15 1500 2147483647 1 1000 1 false 10 1000 2147483647 1 t

Limit of a one-server-SOLR-installation

2009-10-05 Thread Thomas Koch
idering a setup like/with Katta? Thanks for your insights, Thomas Koch, http://www.koch.ro

eternal optimize interrupted

2009-08-04 Thread Thomas Koch
know what to do? How can I find out what fraction of the index is optimized, and how many nights it will take to finish? Best regards, Thomas Koch, http://www.koch.ro