Re: how to reset the index in solr

2009-04-29 Thread Geetha
Thanks a lot Erik. I would try it and let me know Erik Hatcher wrote: On Apr 29, 2009, at 12:19 AM, Geetha wrote: I need a function (through solr ruby) for ruby that will allow us to clear everything require 'solr' solr = Solr::Connection.new("http://localhost:8983/solr";) solr.delete_by_qu

Re: Problem adding unicoded docs to Solr through SolrJ

2009-04-29 Thread ahmed baseet
Thanks a lot for your quick and detailed response. I got the point. But as I've mentioned earlier I've a string of rawtext[default encoding] that needs to be encoded in utf-8, so I tried something stupid but working though. I first converted the whole string to byte array and then used that byte a

Re: Term highlighting with MoreLikeThisHandler?

2009-04-29 Thread Walter Underwood
Think about this for a moment. When you use MoreLikeThis, the query is a document. How do you highlight a document in another document? wunder On 4/29/09 9:21 PM, "Matt Weber" wrote: > Any luck on this? I am experiencing the same issue. Highlighting > works fine on all other request handlers,

Re: Term highlighting with MoreLikeThisHandler?

2009-04-29 Thread Matt Weber
Any luck on this? I am experiencing the same issue. Highlighting works fine on all other request handlers, but breaks when I use the MoreLikeThisHandler. Thanks, Matt Weber On Apr 28, 2009, at 5:29 AM, Eric Sabourin wrote: Yes... at least I think so. the highlighting works correctly

Re: Unable to import data from database

2009-04-29 Thread Noble Paul നോബിള്‍ नोब्ळ्
I guess this can go in the FAQ section of DIH On Wed, Apr 29, 2009 at 9:47 PM, Erick Erickson wrote: > Thanks for letting us all know the resolution, that may save some other > poor soul from frustration > > Best > Erick > > On Wed, Apr 29, 2009 at 9:31 AM, Ci-man wrote: > >> >> Found the pr

understanding facets and tokens

2009-04-29 Thread Simon Stanlake
Hi, Trying to debug a faceting performance problem. I've pretty much given up but was hoping someone could shed some light on my problems. My index has 80 million documents, all of which are small - one 1000 char text field and a bunch of 30-50 char fields. Got 24G ram allocated to the jvm on a

RE: Multiple Queries

2009-04-29 Thread Ankush Goyal
Hey Guys, Have a novice type question, regarding how to create a query by ORing multiple terms. Currently, the query we are creating is a boosting query using following code: BoostingQuery boosQuery = new BoostingQuery(getHotelIdFilterQuery(hotelIdStr),baseQuery,2.0f); Wherein, getHotelIdFilt

Re: Performance and number of search results

2009-04-29 Thread Walter Underwood
Some part of the server-side work is linear in the number of hits. It has to look up field values for each one of those hits, and that is linear. At some level, you've got one lookup for each term in the query and one lookup for each hit. If you have a handful of terms and a 1000 hits, the time is

Facet counts for common terms of the searched field

2009-04-29 Thread Raju444us
I have a requirement. If I search for text field let's say "metal:glass" what i want is to get the facet counts for all the terms related to "glass" in my search results. window(100) since a window can be glass. plastic(10) plastic is a material just like glass Iron(10) Paper(15) Can I use MLT

Re: Date faceting - howto improve performance

2009-04-29 Thread Shalin Shekhar Mangar
Some basic documentation is in the example schema.xml. Ask away if you have specific questions. On Thu, Apr 30, 2009 at 1:00 AM, Marcus Herou wrote: > Aha! > > Hmm , googling wont help me I see. any hints of usages ? > > /M > > > On Tue, Apr 28, 2009 at 12:29 AM, Shalin Shekhar Mangar < > shalinm

Distributed Search - only get ids

2009-04-29 Thread Joe Pollard
Solr 1.3: If I am only getting back the document ids from a distributed search (e.g., uniqueid is 'id' and the fl parameter only contains 'id'), there seems to be some room for optimization in the current code path: 1) On each shard, grab top N sorted document ids & sort fields) 2) M

Re: Date faceting - howto improve performance

2009-04-29 Thread Marcus Herou
Aha! Hmm , googling wont help me I see. any hints of usages ? /M On Tue, Apr 28, 2009 at 12:29 AM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > Sorry, I'm late in this thread. > > Did you try using Trie fields (new in 1.4)? The regular date faceting won't > work out-of-the-box for

Re: spellcheck.collate causes StringIndexOutOfBoundsException during startup.

2009-04-29 Thread Eric Sabourin
Koji - I've remove them from my solrconfig.xml and that solved the problem. Thanks for your help! On Tue, Apr 28, 2009 at 12:25 PM, Koji Sekiguchi wrote: > I see you are using firstSearcher/newSearcher event listener on your > startup and cause the problem. > If you don't need them, commented ou

Re: limit on query size?

2009-04-29 Thread Shalin Shekhar Mangar
On Wed, Apr 29, 2009 at 10:42 PM, Harsch, Timothy J. (ARC-SC)[PEROT SYSTEMS] wrote: > Is there a limit on the size ( in bytes ) of a query you send to Solr? > > Either through HTTP URL request or through SolrJ? > The limit is whatever you have configured (or the default) in your servlet contain

limit on query size?

2009-04-29 Thread Harsch, Timothy J. (ARC-SC)[PEROT SYSTEMS]
Is there a limit on the size ( in bytes ) of a query you send to Solr? Either through HTTP URL request or through SolrJ? What is the behavior if a limit is reached?

Highlighting using XML instead of strings?

2009-04-29 Thread Michael Ludwig
http://wiki.apache.org/solr/HighlightingParameters I can specify the strings to highlight matched text with using "hl.simple.pre" and "hl.simple.post", for example and . The result looks like this: Eumel NDR Ländermagazine However, what if as the result of favouring XML over strings,

Re: Unable to import data from database

2009-04-29 Thread Erick Erickson
Thanks for letting us all know the resolution, that may save some other poor soul from frustration Best Erick On Wed, Apr 29, 2009 at 9:31 AM, Ci-man wrote: > > Found the problem. > It is with Microsoft jdbc drivers (jdbc 2.0). > > With the latest download Microsoft provides two .jar files:

Re: Authenticated Indexing Not working

2009-04-29 Thread Allahbaksh Asadullah
Hi, I followed the procedure given at http://blog.comtaste.com/2009/02/securing_your_solr_server_on_t.html Regards, Allahbaksh On 4/28/09, Shalin Shekhar Mangar wrote: > On Sun, Apr 26, 2009 at 11:04 AM, Allahbaksh Asadullah < > allahbaks...@gmail.com> wrote: > >> HI Otis, >> I am using HTTPClien

Re: stress tests to DIH and deduplication patch

2009-04-29 Thread Shalin Shekhar Mangar
On Wed, Apr 29, 2009 at 7:44 PM, Marc Sturlese wrote: > > Hey there, I am doing some stress tests indexing with DIH. > I am indexing a mysql DB with 140 rows aprox. I am using also the > DeDuplication patch. > I am using tomcat with JVM limit of -Xms2000M -Xmx2000M > I have indexed 3 times usi

Re: function query scoring

2009-04-29 Thread Shalin Shekhar Mangar
On Wed, Apr 29, 2009 at 9:15 PM, Umar Shah wrote: > can anyone explan the behavior of function query if there are other > terms in the query > > it seems the value of the function query and actual match score are > interfering in some manner. what should be expected? > > Yes, if you include a ter

Re: function query scoring

2009-04-29 Thread Umar Shah
On Wed, Apr 29, 2009 at 8:34 PM, Andrey Klochkov wrote: > On Wed, Apr 29, 2009 at 6:44 PM, Umar Shah wrote: > >> On Wed, Apr 29, 2009 at 7:16 PM, Andrey Klochkov >> wrote: >> > Hi! >> > >> > Base on docs in the wiki I thought that the following query should return >> > constant score "5" for all

Re: function query scoring

2009-04-29 Thread Andrey Klochkov
On Wed, Apr 29, 2009 at 6:44 PM, Umar Shah wrote: > On Wed, Apr 29, 2009 at 7:16 PM, Andrey Klochkov > wrote: > > Hi! > > > > Base on docs in the wiki I thought that the following query should return > > constant score "5" for all "socks" in the index: > > > > http://localhost:8080/solr/select?q

Re: function query scoring

2009-04-29 Thread Umar Shah
On Wed, Apr 29, 2009 at 7:16 PM, Andrey Klochkov wrote: > Hi! > > Base on docs in the wiki I thought that the following query should return > constant score "5" for all "socks" in the index: > > http://localhost:8080/solr/select?q=name:socks _val_:5&fl=name,score the intended query should looklik

Re: ExtractingRequestHandler and SolrRequestHandler issue

2009-04-29 Thread francisco treacy
Well, problem seems to be with > java -Dsolr.solr.home="/my/path/to/solr" -jar start.jar Everything runs fine if I copy my xmls to the original conf directory of the example (example/solr/conf) and I execute like > java -jar start.jar Some wrong path to libs somewhere - who knows. Couldn't find

stress tests to DIH and deduplication patch

2009-04-29 Thread Marc Sturlese
Hey there, I am doing some stress tests indexing with DIH. I am indexing a mysql DB with 140 rows aprox. I am using also the DeDuplication patch. I am using tomcat with JVM limit of -Xms2000M -Xmx2000M I have indexed 3 times using full-import command without restarting tomcat or reloading the

Re: /replication?command=isReplicating

2009-04-29 Thread Noble Paul നോബിള്‍ नोब्ळ्
nope . details is the only command which can give you this info On Wed, Apr 29, 2009 at 7:10 PM, sunnyfr wrote: > > Hi, > > Just to know if there is a quick way to get the information without hiting > replication?command=details > like =isReplicating > > Thanks, > -- > View this message in contex

Re: Advice on custom DIH or other solutions: LuSql

2009-04-29 Thread Glen Newton
The next version of LuSql[1] supports solutions for this kind of issue: reading from JDBC (which may include a long and compex query) and then writing the results to a single (flattened) JDBC table that can subsequently be the source table for Solr. This might be helpful for your particular issue.

function query scoring

2009-04-29 Thread Andrey Klochkov
Hi! Base on docs in the wiki I thought that the following query should return constant score "5" for all "socks" in the index: http://localhost:8080/solr/select?q=name:socks _val_:5&fl=name,score But in fact it finds all the products in the index and it seems that "socks" products have higher sc

/replication?command=isReplicating

2009-04-29 Thread sunnyfr
Hi, Just to know if there is a quick way to get the information without hiting replication?command=details like =isReplicating Thanks, -- View this message in context: http://www.nabble.com/-replication-command%3DisReplicating-tp23295869p23295869.html Sent from the Solr - User mailing list a

Re: Unable to import data from database

2009-04-29 Thread Ci-man
Found the problem. It is with Microsoft jdbc drivers (jdbc 2.0). With the latest download Microsoft provides two .jar files: sqljdbc.jar sqljdbc4.jar I had copied both into the lib directory. By doing so it used the older drivers (sqljdbc.jar) which do not work with jvm1.6. You get this kind of

Re: field type for serialized code?

2009-04-29 Thread Matt Mitchell
Sorry, should of mentioned how I was serializing. In Ruby, I'm using Marshal.dump. When loading back into ruby via Marshal.load, I get an error related to the Marshaled version. I'm starting to play with JSON too. Matt On Wed, Apr 29, 2009 at 6:42 AM, Erik Hatcher wrote: > Are you using REXML?

Re: Faceting - grouping results

2009-04-29 Thread Koji Sekiguchi
I'm not sure this is what you are looking for, but you may try to use fq parameter? &q=*:*&fq=xxx:A&rows=10 for "at most 10 docs with xxx=A". http://wiki.apache.org/solr/CommonQueryParameters#head-6522ef80f22d0e50d2f12ec487758577506d6002 Koji Branca Marco wrote: Hi, I have a question about fa

Re: Problem adding unicoded docs to Solr through SolrJ

2009-04-29 Thread Michael Ludwig
ahmed baseet schrieb: public void postToSolrUsingSolrj(String rawText, String pageId) { doc.addField("features", rawText ); In the above the param rawText is just the html stripped off of all its tags, js, css etc and pageId is the Url for that page. When I'm using this for Eng

Getting junk characters while indexing

2009-04-29 Thread Koushik Mitra
Hi, We are trying to index a .doc file. However, after indexing the dot( . ) and apostrophe( ' ) present in the file, getting converted to junk values. How to resolve the issue? Thanks, Koushik CAUTION - Disclaimer * This e-mail contains PRIVILEGED AND CONFIDEN

Re: how to reset the index in solr

2009-04-29 Thread Erik Hatcher
On Apr 29, 2009, at 12:19 AM, Geetha wrote: I need a function (through solr ruby) for ruby that will allow us to clear everything require 'solr' solr = Solr::Connection.new("http://localhost:8983/solr";) solr.delete_by_query('*:*') solr.commit Erik

Re: Unable to import data from database

2009-04-29 Thread Ci-man
Thanks. I found the interactive debugger . solr/admin/dataimport.jsp and I am seeing exceptions in Java that I can dig into -- View this message in context: http://www.nabble.com/Unable-to-import-data-from-database-tp23283852p23295859.html Sent from the Solr - User mailing list archive at Nabb

Re: how to reset the index in solr

2009-04-29 Thread Erik Hatcher
On Apr 28, 2009, at 11:33 PM, Geetha wrote: Thank you Erik.. Should I write the below code in rake task /lib/tasks/solr.rake? There's a start to some Solr Rake tasks in solr-ruby's lib/solr/ solrtasks.rb. I am newbie to ruby. Welcome! It's a fun fun world to be in :) Erik

Re: Performance and number of search results

2009-04-29 Thread Michael Ludwig
Wouter Samaey schrieb: Can someone please comment on the performance impact of the number of search results? Is there a big difference between querying for 1 result, 10, 20 or even 100 ? Probably not, but YMMV, as the question is very general. Consider that for fast queries the HTTP round tri

Re: UTF8 compatibility

2009-04-29 Thread Shalin Shekhar Mangar
On Wed, Apr 29, 2009 at 12:45 PM, Muhammed Sameer wrote: > > So I tried to run the test_utf8.sh script and got the following output > {code} > Solr server is up. > HTTP GET is accepting UTF-8 > HTTP POST is accepting UTF-8 > HTTP POST defaults to UTF-8 > ERROR: HTTP GET is not accepting UTF-8 beyo

Re: UTF8 compatibility

2009-04-29 Thread Michael Ludwig
Muhammed Sameer schrieb: We run post.jar periodically ie after every 15mins to commit the changes, Is this approach correct ? Sounds reasonable to me. SimplePostTool: WARNING: Make sure your XML documents are encoded in UTF-8, other encodings are not currently supported That's just to remi

Problem adding unicoded docs to Solr through SolrJ

2009-04-29 Thread ahmed baseet
Hi All, I'm trying to automate the process of posting xml s to Solr using Solrj. Essentially I'm extracting the text from a given Url, then creating a solrDoc and posting the same using the following function, public void postToSolrUsingSolrj(String rawText, String pageId) { String url = "

Re: Addition of new field to Solr schema.xml not getting reflected properly

2009-04-29 Thread ahmed baseet
I added some new documents, and for these docs I can use the new field, right? Though to reflect the changes for all docs I need to delete the old index and build a new one. As I mentioned earlier after a couple of restarts its worked. Still don't know whats the issue. :-) Thanks, Ahmed. On Wed,

Re: Addition of new field to Solr schema.xml not getting reflected properly

2009-04-29 Thread Erik Hatcher
Did you reindex your documents after making changes and restarting? The types of changes you're making require reindexing. Erik On Apr 29, 2009, at 2:13 AM, ahmed baseet wrote: Hi All, I'm trying to add a new field to Solr, so I stopped the tomcat[I'm working on Windows] using the

Re: Advice on custom DIH or other solutions

2009-04-29 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Wed, Apr 29, 2009 at 3:24 PM, Wouter Samaey wrote: > Hi there, > > I'm currently in the process of learning more about Solr, and how I > can implement it into my project. > > Since my database is very large and complex, I'm looking into the way > of keeping my documents current in Solr. I have

Re: field type for serialized code?

2009-04-29 Thread Erik Hatcher
Are you using REXML? Or libxml? I'm assuming this is from a Solr/ Ruby (RSolr?) API call to add the document. Erik On Apr 28, 2009, at 9:12 PM, Matt Mitchell wrote: Hi, I'm attempting to serialize a simple ruby object into a solr.StrField - but it seems that what I'm getting bac

Advice on custom DIH or other solutions

2009-04-29 Thread Wouter Samaey
Hi there, I'm currently in the process of learning more about Solr, and how I can implement it into my project. Since my database is very large and complex, I'm looking into the way of keeping my documents current in Solr. I have read the pages about DIH, and find it usefull, but I may need more

Re: boost qf weight between 0 and 10

2009-04-29 Thread sunnyfr
How can I get the weight of a field and use it in bf ?? thanks a lot sunnyfr wrote: > > Hi Hoss, > thanks for this answser, and is there a way to get the weight of a field ? > like that and use it in the bf? queryWeight > > > 0.14232224 = (MATCH) weight(text:chien^0.2 in 9412049), produc

Performance and number of search results

2009-04-29 Thread Wouter Samaey
Hello, Can someone please comment on the performance impact of the number of search results? Is there a big difference between querying for 1 result, 10, 20 or even 100 ? Thanks in advance Wouter Samaey

UTF8 compatibility

2009-04-29 Thread Muhammed Sameer
Salaam, I have a question, its in two parts actually and are related We run post.jar periodically ie after every 15mins to commit the changes, Is this approach correct ? When I run this I get the following message {code} SimplePostTool: version 1.2 SimplePostTool: WARNING: Make sure your XML d