Re: Tagging using SOLR

2007-09-07 Thread Erik Hatcher
On Sep 7, 2007, at 3:09 AM, Doss wrote: Thanks for the guidelines, but basically our idea is to build a system like http://del.icio.us/tag/, is it possible to take counts of similar words from a solr indexed field? How do you define similar words? Here's a tag cloud for a single user in C
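
One common way to get per-term counts out of an indexed field is Solr's faceting. The sketch below only builds the request URL and sends nothing. The host, the Solr path, and the field name `tags` are assumptions for illustration, not details taken from this thread.

```python
from urllib.parse import urlencode


def tag_count_url(base_url, field, limit=50):
    """Build a Solr select URL that asks for facet counts on one field."""
    params = [
        ("q", "*:*"),            # match every document
        ("rows", 0),             # only the counts are wanted, not the documents
        ("facet", "true"),       # switch faceting on
        ("facet.field", field),  # field whose terms are counted
        ("facet.limit", limit),  # cap on the number of terms returned
    ]
    return base_url.rstrip("/") + "/select?" + urlencode(params)


# Hypothetical host and field name.
url = tag_count_url("http://localhost:8983/solr", "tags")
```

The counts come back per indexed term, so how "similar words" are grouped depends on how the field is analyzed at index time.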

Re: Replication broken.. no helpful errors?

2007-09-07 Thread Bill Au
As I had pointed out in my first reply to this thread, you had a directory named temp-snapshot.20070816120113 in your data directory on the slave. Snapinstaller was mistakenly treating that as the latest snapshot and was installing that every time it was called. Snapinstaller didn't trigger a co

Re: Distribution Information?

2007-09-07 Thread Bill Au
In that case, definitely take a look at SOLR-333: http://issues.apache.org/jira/browse/SOLR-333 On the master there should be a logs/clients directory. Do you have any files in there? Bill On 9/6/07, Matthew Runo <[EMAIL PROTECTED]> wrote: > > Well, I do get... > > Distribution Info > Master Se

Re: Indexing very large files.

2007-09-07 Thread Brian Carmalt
Lance Norskog wrote: Now I'm curious: what is the use case for documents this large? Thanks, Lance Norskog It is a rand use case, but could become relevant for us. I was told to explore the possibilities, and that's what I'm doing. :) Since I haven't heard any suggestions as to how to

Re: Indexing very large files.

2007-09-07 Thread Walter Underwood
Legal discovery can have requirements like this. --wunder On 9/7/07 4:47 AM, "Brian Carmalt" <[EMAIL PROTECTED]> wrote: > Lance Norskog wrote: >> Now I'm curious: what is the use case for documents this large? >> >> Thanks, >> >> Lance Norskog >> >> >> > It is a rand use case, but could

Dilbert (off-topic)

2007-09-07 Thread Jeff Rodenburg
It may be off-topic, but it's Friday and I thought all the Java coders would appreciate today's Dilbert. (I'm not primarily a Java dev, but I know the feeling) http://www.dilbert.com/comics/dilbert/archive/dilbert-20070907.html cheers, jeff r.

Re: Tagging using SOLR

2007-09-07 Thread Doss
Dear Thorsten, Erik, Thanks for the guidelines, but basically our idea is to build a system like http://del.icio.us/tag/, is it possible to take counts of similar words from a solr indexed field? Thanks, Mohandoss On 9/6/07, Erik Hatcher <[EMAIL PROTECTED]> wrote: > > > On Sep 6, 2007, at 3:29

Lucene/Solr "OnTheRoad"

2007-09-07 Thread Erik Hatcher
I just added brief mentions of some upcoming Lucene/Solr-related events to this page: http://wiki.apache.org/lucene-java/OnTheRoad Below is some self-promotion of an upcoming class I have agreed to teach. It's uncomfortable to send this sort of thing out, but if I don't then you might n

Re: Distribution Information?

2007-09-07 Thread Matthew Runo
Actually I don't have the clients directory... [EMAIL PROTECTED]: .../logs]$ pwd /opt/solr/logs [EMAIL PROTECTED]: .../logs]$ ls rsyncd-enabled rsyncd.log rsyncd.pid snapcleaner.log snapshooter.log snapshot.current.search2 snapshot.status.search2 [EMAIL PROTECTED]: .../logs]$ It does l

RE: Solr and KStem

2007-09-07 Thread Wagner,Harry
I've implemented a Solr plug-in that wraps KStem for Solr use. KStem is considered to be more appropriate for library usage since it is much less aggressive than Porter (i.e., searches for organization do NOT match on organ!). If there is any interest in feeding this back into Solr I would be happ

Return 2 fields per facet.. name and id, for example?

2007-09-07 Thread Matthew Runo
Hello! I've found something which is either already in SOLR, or should be (as I can see it being very helpful). I couldn't figure out how to do it though.. Let's say I'm trying to print out a page of products, and I want to provide a list of brands to filter by. It would be great if in my

Re: Return 2 fields per facet.. name and id, for example?

2007-09-07 Thread Yonik Seeley
On 9/7/07, Matthew Runo <[EMAIL PROTECTED]> wrote: > I've found something which is either already in SOLR, or should be > (as I can see it being very helpful). I couldn't figure out how to do > it though.. > > Let's say I'm trying to print out a page of products, and I want to > provide a list of br

Re: Solr and KStem

2007-09-07 Thread Yonik Seeley
On 9/7/07, Wagner,Harry <[EMAIL PROTECTED]> wrote: > I've implemented a Solr plug-in that wraps KStem for Solr use. KStem is > considered to be more appropriate for library usage since it is much > less aggressive than Porter (i.e., searches for organization do NOT > match on organ!). If there is

Re: Return 2 fields per facet.. name and id, for example?

2007-09-07 Thread Matthew Runo
Ahh... sneaky. I'll probably do the combined-name#id method. ++ | Matthew Runo | Zappos Development | [EMAIL PROTECTED] | 702-943-7833 ++ On Sep 7, 2007, at 12:38 PM, Yonik Seeley
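
A minimal sketch of the client side of the combined name#id approach, assuming the facet field holds values of the form `name#id` and that `#` is the separator. The brand names, ids, and counts are made up for illustration, and both function names are hypothetical.

```python
def split_facet_value(value, sep="#"):
    """Split a combined 'name#id' facet value into (name, id).

    The split happens at the last separator, so a name that itself
    contains the separator is kept whole.
    """
    name, found, ident = value.rpartition(sep)
    if not found:
        raise ValueError(f"no {sep!r} separator in {value!r}")
    return name, ident


def brands_from_facets(counts):
    """Turn {'name#id': count} facet counts into display-ready records."""
    brands = []
    for value, count in counts.items():
        name, ident = split_facet_value(value)
        brands.append({"name": name, "id": ident, "count": count})
    return brands


# Made-up facet counts keyed by combined value.
brands = brands_from_facets({"Acme#42": 17, "C# Shoes#7": 3})
```

Splitting at the last separator is a design choice: it assumes ids never contain `#`, while names may.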

Re: Solr and KStem

2007-09-07 Thread Walter Underwood
Even if KStem isn't ASL, we could include the plug-in code with notes about how to get the stemmer. Or, the Solr plug-in could be contributed to the group that manages the KStem distribution: http://ciir.cs.umass.edu/cgi-bin/downloads/downloads.cgi wunder On 9/7/07 12:59 PM, "Yonik Seeley" <[E

Re: Indexing very large files.

2007-09-07 Thread Mike Klaas
On 7-Sep-07, at 4:47 AM, Brian Carmalt wrote: Lance Norskog wrote: Now I'm curious: what is the use case for documents this large? It is a rand use case, but could become relevant for us. I was told to explore the possibilities, and that's what I'm doing. :) Since I haven't heard any

FW: Space costs of dynamic fields?

2007-09-07 Thread Lance Norskog
Are there any extra costs for dynamic vs. static fields? That is, if I have the same dynamic field in 95% of my documents, should I just make it static and empty in the other 5%? Will query speed change? Which choice will use more space? Otherwise, the only downside of dynamic fields is that y

org.apache.lucene.util.English missing

2007-09-07 Thread Lance Norskog
Hi folks- The Lucene Spellchecker unit test expects a Java class org.apache.lucene.util.English. I can't find it in the source trees on svn.apache.org. Can someone please mail it to me? Thanks, Lance Norskog

FW: Minor mistake on the Wiki

2007-09-07 Thread Lance Norskog
In the page http://wiki.apache.org/solr/UpdateXmlMessages We find: Optional attributes on "doc" * boost = - default is 1.0 (See Lucene docs for definition of boost.) * NOTE: make sure norms are enabled (omitNorms="false" in the schema.xml) for any fi

adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Lance Norskog
Hi- It appears that DirectUpdateHandler2.java does not actually implement the parameters that control whether to override existing documents. Should I use DirectUpdateHandler instead? Apparently DUH is slower than DUH2, but DUH implements these parameters. (We do so many overwrites that switchin

Re: FW: Minor mistake on the Wiki

2007-09-07 Thread Yonik Seeley
On 9/7/07, Lance Norskog <[EMAIL PROTECTED]> wrote: > In the page http://wiki.apache.org/solr/UpdateXmlMessages > > We find: > > Optional attributes on "doc" > > * boost = - default is 1.0 (See Lucene docs for > definition of boost.) > * NOTE: make sure

Re: org.apache.lucene.util.English missing

2007-09-07 Thread Otis Gospodnetic
Really? Weird. It's here: /home/otis/dev/repos/lucene/java/trunk [EMAIL PROTECTED] trunk]$ ff English.java ./src/test/org/apache/lucene/util/English.java Note that this is Lucene and that it's src/test. Otis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Simpy -- http://www.simpy.c

Re: adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Yonik Seeley
On 9/7/07, Lance Norskog <[EMAIL PROTECTED]> wrote: > It appears that DirectUpdateHandler2.java does not actually implement the > parameters that control whether to override existing documents. It's been proposed that most of these be deprecated anyway and replaced with a simple overwrite=true/fal

Re: Solr and KStem

2007-09-07 Thread Otis Gospodnetic
Look for KStem in Lucene JIRA. Many years ago something KStem related was contributed, and there was a discussion about licenses then. Otis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Simpy -- http://www.simpy.com/ - Tag - Search - Share - Original Message Fro

RE: adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Lance Norskog
No, I'm just doing standard overwriting. It just took a little digging to be able to do it :) To gild the lily, it would be efficient in our case to add a boolean flag to each record saying whether to overwrite this record. This would make each record read-only or read-write. But I think this is an

Re: adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Yonik Seeley
On 9/7/07, Lance Norskog <[EMAIL PROTECTED]> wrote: > No, I'm just doing standard overwriting. It just took a little digging to be > able to do it :) Overwriting is the default... you shouldn't have to specify anything extra when indexing the document. -Yonik

Re: Distribution Information?

2007-09-07 Thread Bill Au
I just double checked distribution.jsp. The directory where it looks for status files is hard coded to logs/clients. So for now master_status_dir in your solr/conf/scripts.conf has to be set to that so the scripts will put the status files there. It looks like they are currently in your logs dire
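
A small sketch for checking that setting, assuming scripts.conf is a plain list of `key=value` lines. The helper names and the sample contents are hypothetical, and only `master_status_dir` and `logs/clients` come from the message above.

```python
def read_scripts_conf(text):
    """Parse key=value lines, skipping blank lines and # comments."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        settings[key.strip()] = value.strip().strip("\"'")
    return settings


def status_dir_matches(settings, expected="logs/clients"):
    """True when master_status_dir ends with the directory the page reads."""
    configured = settings.get("master_status_dir", "").rstrip("/")
    return configured.endswith(expected)


# Hypothetical file contents.
sample = "# distribution settings\nmaster_status_dir=logs/clients\n"
ok = status_dir_matches(read_scripts_conf(sample))
```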

Re: adding without overriding dups - DirectUpdateHandler2.java does not implement?

2007-09-07 Thread Mike Klaas
On 7-Sep-07, at 1:35 PM, Lance Norskog wrote: Hi- It appears that DirectUpdateHandler2.java does not actually implement the parameters that control whether to override existing documents. Should I use No? allowDups=true overwritePending=false overwriteCommitted=false should result in add
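
As an illustration of those three parameters, the sketch below builds an XML update message with them set as attributes on the `<add>` element. Where the attributes go is an assumption about the update XML format of that Solr era, and the field names and values are made up.

```python
import xml.etree.ElementTree as ET


def build_add_message(fields, allow_dups=True):
    """Build an <add> message; allow_dups=True asks Solr not to overwrite."""
    overwrite = "false" if allow_dups else "true"
    add = ET.Element("add", {
        "allowDups": "true" if allow_dups else "false",
        "overwritePending": overwrite,    # documents not yet committed
        "overwriteCommitted": overwrite,  # documents already committed
    })
    doc = ET.SubElement(add, "doc")
    for name, value in fields.items():
        field = ET.SubElement(doc, "field", {"name": name})
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")


# Hypothetical document fields.
message = build_add_message({"id": "doc-1", "title": "example"})
```

Building the message with an XML library rather than string concatenation keeps field values correctly escaped.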

New user question: How to show all stored fields in a result

2007-09-07 Thread melkink
Hello Solr Folks, I'm a new solr user and I'm running into a frustrating problem. I'm sure it's a simple solution I just don't have the experience with solr to know the correct way around it. I currently have approximately 600 documents stored and indexed in solr. Each document has some level

Re: Distribution Information?

2007-09-07 Thread Matthew Runo
OK. I made the change, but it seemed not to pick up the files. When I changed distributiondump.jsp to say... File masterdir = new File("/opt/solr/logs/clients"); it worked. Thank you for your help! ++ | Matthew Runo | Zappos Development