Re: How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread Noble Paul നോബിള്‍ नोब्ळ्
the Solr distro contains all the jar files. you can take either the latest release (1.3) or a nightly On Tue, Apr 28, 2009 at 11:34 AM, ahmed baseet wrote: > As far as I know, Maven is a build/mgmt tool for java projects quite similar > to Ant, right? No I'm not using this , then I think I don't

Re: How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread ahmed baseet
As far as I know, Maven is a build/mgmt tool for java projects quite similar to Ant, right? No I'm not using this , then I think I don't need to worry about those pom files. But I'm still not able to figure out the error with classpath/jar files I mentioned in my previous mails. Shall I try gettin

Re: half width katakana

2009-04-27 Thread Ashish P
After this should I be using same cjkAnalyzer or use charFilter?? Thanks, Ashish Koji Sekiguchi-2 wrote: > > Ashish P wrote: >> I want to convert half width katakana to full width katakana. I tried >> using >> cjk analyzer but not working. >> Does cjkAnalyzer do it or is there any other way?? >

Re: half width katakana

2009-04-27 Thread Koji Sekiguchi
Ashish P wrote: I want to convert half width katakana to full width katakana. I tried using cjk analyzer but not working. Does cjkAnalyzer do it or is there any other way?? CharFilter which comes with trunk/Solr 1.4 just covers this type of problem. If you are using Solr 1.3, try the patch a

Re: MacOS "Failed to initialize DataSource:db"+ DataimportHandler ???

2009-04-27 Thread Noble Paul നോബിള്‍ नोब्ळ्
apparently you do not have the driver in the path. drop your driver jar into ${solr.home}/lib On Tue, Apr 28, 2009 at 4:42 AM, gateway0 wrote: > > Hi, > > sure: > " > message Severe errors in solr configuration. Check your log files for more > detailed information on what may be wrong. If you wan

Re: DataImportHandler Questions-Load data in parallel and temp tables

2009-04-27 Thread Noble Paul നോബിള്‍ नोब्ळ्
there is an issue already to write to the index in a separate thread. https://issues.apache.org/jira/browse/SOLR-1089 On Tue, Apr 28, 2009 at 4:15 AM, Shalin Shekhar Mangar wrote: > On Tue, Apr 28, 2009 at 3:43 AM, Amit Nithian wrote: > >> All, >> I have a few questions regarding the data impor

half width katakana

2009-04-27 Thread Ashish P
I want to convert half width katakana to full width katakana. I tried using cjk analyzer but not working. Does cjkAnalyzer do it or is there any other way?? -- View this message in context: http://www.nabble.com/half-width-katakana-tp23270186p23270186.html Sent from the Solr - User mailing list

Re: Fwd: Question about MoreLikeThis

2009-04-27 Thread Otis Gospodnetic
Hello, Well, if you want documents similar to a specific document, then just make sure the query ("q") matches that one document. You can do that by using the uniqueKey field in the query, e.g. q=id:123 . Then you will get documents similar to that one document that matched your id:123 query

Re: SOLRizing advice?

2009-04-27 Thread Otis Gospodnetic
My turn to help, Paul. There is no such page on the Solr Wiki, but I agree with Paul, this can really be a quick and painless migration for typical Lucene/Solr setups. This is roughly how I'd do things: - I'd set up Solr - I'd create the schema.xml mimicking the fields in the existing Lucene

Re: Term highlighting with MoreLikeThisHandler?

2009-04-27 Thread Otis Gospodnetic
Eric, Have you tried using MLT with parameters described on http://wiki.apache.org/solr/HighlightingParameters ? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Eric Sabourin > To: solr-user@lucene.apache.org > Sent: Monday, April 27, 2

Re: Solr 1.4 Release Date

2009-04-27 Thread Otis Gospodnetic
Gurjot, please see http://wiki.apache.org/solr/Solr1.4 - we are currently 33 JIRA issues away. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Gurjot Singh > To: solr-user@lucene.apache.org > Sent: Monday, April 27, 2009 12:45:32 PM > Su

highlighting html content

2009-04-27 Thread Matt Mitchell
Hi, I've been looking around but can't seem to find any clear instruction on how to do this... I'm storing html content and would like to enable highlighting on the html content. The problem is that the search can sometimes match html element names or attributes, and when the highlighter adds the

Re: facet results in order of rank

2009-04-27 Thread Gene Campbell
Thanks for the reply Your thoughts are what I initially was thinking. But, given some more consideration, I imagined a system that would take all the docs that would be returned for a given facet, and get an average score based on their scores from the original search that produced the facets. T

Re: MacOS "Failed to initialize DataSource:db"+ DataimportHandler ???

2009-04-27 Thread gateway0
Hi, sure: " message Severe errors in solr configuration. Check your log files for more detailed information on what may be wrong. If you want solr to continue after configuration errors, change: false in null - org.apache.solr.common.Sol

Re: Phonetic analysis with the spell-check component?

2009-04-27 Thread Shalin Shekhar Mangar
On Sun, Apr 26, 2009 at 11:55 PM, David Smiley @MITRE.org wrote: > > It appears to me that the spell-check component can't build a dictionary > based on phonetic similarity (i.e. using a Phonetic analysis filter). > Sure, > you can go ahead and configure the spell check component to use a field

Re: Authenticated Indexing Not working

2009-04-27 Thread Shalin Shekhar Mangar
On Sun, Apr 26, 2009 at 11:04 AM, Allahbaksh Asadullah < allahbaks...@gmail.com> wrote: > HI Otis, > I am using HTTPClient for authentication. When I use the server with > Authentication for searching it works fine. But when I use it for > indexing it throws error. > What is the error? Is it thro

Re: Get the field value that caused the result

2009-04-27 Thread Shalin Shekhar Mangar
On Sat, Apr 25, 2009 at 8:25 PM, Wouter Samaey wrote: > > I'm looking into a way to determine the value of a field that caused > the result to be returned. > Can highlighting help here? It returns the snipped from the document which matched the query. http://wiki.apache.org/solr/HighlightingPara

Re: facet results in order of rank

2009-04-27 Thread Shalin Shekhar Mangar
On Fri, Apr 24, 2009 at 12:25 PM, ristretto.rb wrote: > Hello, > > Is it possible to order the facet results on some ranking score? > I've had a look at the facet.sort param, > ( > http://wiki.apache.org/solr/SimpleFacetParameters#head-569f93fb24ec41b061e37c702203c99d8853d5f1 > ) > but that seems

Re: Solr test anyone?

2009-04-27 Thread Shalin Shekhar Mangar
Yes, look at AbstractSolrTestCase which is the base class of almost all Solr tests. http://svn.apache.org/repos/asf/lucene/solr/trunk/src/java/org/apache/solr/util/AbstractSolrTestCase.java On Mon, Apr 27, 2009 at 6:38 PM, Eric Pugh wrote: > Look into the test code that Solr uses, there is a lot

Re: DataImportHandler Questions-Load data in parallel and temp tables

2009-04-27 Thread Shalin Shekhar Mangar
On Tue, Apr 28, 2009 at 3:43 AM, Amit Nithian wrote: > All, > I have a few questions regarding the data import handler. We have some > pretty gnarly SQL queries to load our indices and our current loader > implementation is extremely fragile. I am looking to migrate over to the > DIH; however, I

Re: offline solr indexing

2009-04-27 Thread Shalin Shekhar Mangar
On Tue, Apr 28, 2009 at 12:38 AM, Charles Federspiel < charles.federsp...@gmail.com> wrote: > Solr Users, > Our app servers are setup on read-only filesystems. Is there a way > to perform indexing from the command line, then copy the index files to the > app-server and use Solr to perform search

Re: Date faceting - howto improve performance

2009-04-27 Thread Shalin Shekhar Mangar
Sorry, I'm late in this thread. Did you try using Trie fields (new in 1.4)? The regular date faceting won't work out-of-the-box for trie fields I think. But you could use facet.query to achieve the same effect. On my simple benchmarks I've found trie fields to give a huge improvement in range sear

Re: MacOS "Failed to initialize DataSource:db"+ DataimportHandler ???

2009-04-27 Thread Shalin Shekhar Mangar
On Tue, Apr 28, 2009 at 1:18 AM, gateway0 wrote: > > > Everything works fine except for the dataimporthandler in solr, I get this > error message: > "org.apache.solr.handler.dataimport.DataImportHandlerException: Failed to > initialize DataSource: mydb Processing Document # at > > org.apache.solr

Re: adding plug-in after search is done

2009-04-27 Thread Shalin Shekhar Mangar
On Tue, Apr 28, 2009 at 12:04 AM, siping liu wrote: > > trying to manipulate search result (like further filtering out unwanted), > and ordering the results differently. Where is the suitable place for doing > it? I've been using QueryResponseWriter but that doesn't seem to be the > right place.

Re: Configuration of format and type index with solr

2009-04-27 Thread Shalin Shekhar Mangar
On Mon, Apr 27, 2009 at 10:40 PM, hpn1975 nasc wrote: > > 1- Guarantee that my searcher (solr) ALWAYS search in my index in *memory > * (use RAMDirectory). Not to use cache. It is possible to disable all caches. But it is not possible to use RAMDirectory right now. This is in progress. https

DataImportHandler Questions-Load data in parallel and temp tables

2009-04-27 Thread Amit Nithian
All, I have a few questions regarding the data import handler. We have some pretty gnarly SQL queries to load our indices and our current loader implementation is extremely fragile. I am looking to migrate over to the DIH; however, I am looking to use SolrJ + EmbeddedSolr + some custom stuff to rem

Re: Date faceting - howto improve performance

2009-04-27 Thread Marcus Herou
Yes that's exactly what I meant. Think adding "new" fields to a separate index and use ParallelReader at query time would be something to investigate at SOLR level. I think I can spend some time creating a patch for this if you think it is a good idea and if you think it would be merged into the r

Re: offline solr indexing

2009-04-27 Thread Amit Nithian
Not sure if this helps but could you make this a solr server that is not accessible by any other means (except internal), perform your index build using the dataimporthandler and use Solr's replication mechanisms to move the indices across? You can issue the HTTP request to rebuild the index from t

MacOS "Failed to initialize DataSource:db"+ DataimportHandler ???

2009-04-27 Thread gateway0
Hi, I want to transfer my solr project to mac os leopard so I installed xampp for mac and Tomcat 6.0.18. Everything works fine except for the dataimporthandler in solr, I get this error message: "org.apache.solr.handler.dataimport.DataImportHandlerException: Failed to initialize DataSource: mydb

offline solr indexing

2009-04-27 Thread Charles Federspiel
Solr Users, Our app servers are setup on read-only filesystems. Is there a way to perform indexing from the command line, then copy the index files to the app-server and use Solr to perform search from inside the servlet container? If the Solr implementation is bound to http requests, can Solr pe

Re: fail to create or find snapshoot

2009-04-27 Thread Jian Han Guo
Actually, I found the snapshot in the directory where solr was lauched. Is this done on purpose? shouldn't it be in the data directory? Thanks, Jianhan On Mon, Apr 27, 2009 at 11:43 AM, Jian Han Guo wrote: > Hi, > > According to Solr's wiki page http://wiki.apache.org/solr/SolrReplication, >

fail to create or find snapshoot

2009-04-27 Thread Jian Han Guo
Hi, According to Solr's wiki page http://wiki.apache.org/solr/SolrReplication, if I send the following request to master, a snapshoot will be created http://master_host:port/solr/replication?command=snapshoot But after I did it, nothing see

adding plug-in after search is done

2009-04-27 Thread siping liu
trying to manipulate search result (like further filtering out unwanted), and ordering the results differently. Where is the suitable place for doing it? I've been using QueryResponseWriter but that doesn't seem to be the right place. thanks. ___

Re: Solr Performance bottleneck

2009-04-27 Thread Walter Underwood
This isn't a new problem, NFS was 100X slower than local disk for me with Solr 1.1. Backing up indexes is very tricky. You need to do it while the are not being updated, or you'll get a corrupt copy. If your indexes aren't large, you are probably better off backing up the source documents and buil

Re: Solr Performance bottleneck

2009-04-27 Thread Jon Bodner
As a follow-up note, we solved our problem by moving the indexes to local store and upgrading to Solr 1.4. I did a thread dump against our 1.3 Solr instance and it was spending lots of time blocking on index section loading. The NIO implementation in 1.4 solved that problem and copying to local

Re: How to index the contents from SVN repository

2009-04-27 Thread Ryan McKinley
I would suggest looking at Apache commons VFS and using the solrj API: http://commons.apache.org/vfs/ With SVN, you may be able to use the webdav provider. ryan On Apr 26, 2009, at 4:08 AM, Ashish P wrote: Is there any way to index contents of SVN rep in Solr ?? -- View this message in co

Re: ExtractingRequestHandler and SolrRequestHandler issue

2009-04-27 Thread francisco treacy
Thanks for your answers. Still no success. >> These need to be in your Solr home lib, not example/lib. I sometimes get >> confused on this one, too, forgetting that I need to go down a few more >> directories. The example/lib directory is where the Jetty stuff lives, >> example/solr/lib is the l

Configuration of format and type index with solr

2009-04-27 Thread hpn1975 nasc
Hi, I work with Lucne there is some years and I use some advanced resources of the library as different formats of index and types of persistency. Now I would like use Solr. Is possible to configure these resources using solr ? My doubt is about of possibility of configurate in solr this four

Solr 1.4 Release Date

2009-04-27 Thread Gurjot Singh
Hi, I am curious to know when is the scheduled/tentative release date of Solr 1.4. Thanks, Gurjot

RE: How to index the contents from SVN repository

2009-04-27 Thread Steven A Rowe
Hi Ashish, The excellent SVN/CVS repo browser ViewVC has tools to record SVN/CVS commit metadata in a database - seeing how they do it may give you some hints. The INSTALL file gives pointers to the relevant tools (look for the "SQL CHECKIN DATABASE" section): http://

Re: Date faceting - howto improve performance

2009-04-27 Thread Ning Li
You mean doc A and doc B will become one doc after adding index 2 to index 1? I don't think this is currently supported either at Lucene level or at Solr level. If index 1 has m docs and index 2 has n docs, index 1 will have m+n docs after adding index 2 to index 1. Documents themselves are not mod

Term highlighting with MoreLikeThisHandler?

2009-04-27 Thread Eric Sabourin
I submit a query to the MoreLikeThisHandler to find documents similar to a specified document. This works and I've configured my request handler to also return the interesting terms. Is it possible to have MLT return to me highlight snippets in the similar documents it returns? I mean generate hl

Re: boost qf weight between 0 and 10

2009-04-27 Thread sunnyfr
Hi Hoss, thanks for this answser, and is there a way to get the weight of a field ? like that and use it in the bf? queryWeight 0.14232224 = (MATCH) weight(text:chien^0.2 in 9412049), product of: 0.0813888 = queryWeight(text:chien^0.2), product of: 0.2 = boost 6.5946517 = idf

Re: How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread Shalin Shekhar Mangar
On Mon, Apr 27, 2009 at 6:27 PM, ahmed baseet wrote: > Can anyone help me selecting the proper pom.xml file out of the bunch of > *-pom.xml.templates available. > Ahmed, are you using Maven? If not, then you do not need these pom files. If you are using Maven, then you need to add a dependency to

Re: Solr test anyone?

2009-04-27 Thread Eric Pugh
Look into the test code that Solr uses, there is a lot of good stuff on how to do testing. http://svn.apache.org/repos/asf/lucene/solr/trunk/src/test/ . Eric On Apr 27, 2009, at 6:25 AM, tarjei wrote: Hi, I'm looking for ways to test that my indexing methods work correctly with my Solr sc

Re: How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread ahmed baseet
Can anyone help me selecting the proper pom.xml file out of the bunch of *-pom.xml.templates available. I got the following when searched for pom.xml files, solr-common-csv-pom.xml solr-lucene-analyzers-pom.xml solr-lucene-contrib-pom.xml solr-lucene-*-pom.xml [ a lot of solr-lucene-... pom files a

Re: How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread Shalin Shekhar Mangar
On Mon, Apr 27, 2009 at 4:53 PM, ahmed baseet wrote: > > To be precise it gives me the following error, > .cannot find symbol: > symbol : class CommonsHttpSolrServer > > I rechecked to make sure that "commons-httpclient-3.1.jar" is in the class > path. Can someone please point me what is

Re: How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread ahmed baseet
Hi, After going through the solrj wiki I found that we've to set some dependencies in pom.xml for using Solrj, which I haven't done yet. So I googled to know how to do that but no help. I searched the solr directory and found a bunch of *-pom.template files [like solr-core-pom.xml, solr-solrj-pom.x

Re: Solr index

2009-04-27 Thread aidahaj
Thanks a lot, I have made a look in these classes. But what I exactly want to do is to detect if a Document(in the index of solr)has changed when I recrawl a site with Nutch. Not to block deduplication, but to detect if a Document has changed and extract changes in a file without writing them over

Re: How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread ahmed baseet
Shalin, thanks for your quick response. Actually I'm trying to pull plaintext from html pages and trying to make xml files for each page. I went through the SolrJ webpage and found that the we've to add all the field and its contents anyway, right? but yes it makes adding/updating etc quite easier

Solr test anyone?

2009-04-27 Thread tarjei
Hi, I'm looking for ways to test that my indexing methods work correctly with my Solr schema. Therefore I'm wondering if someone has created a test setup where they start a Solr instance and then add some documents to the instance - as a Junit/testng test - preferably with a working Maven depe

Re: How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread Shalin Shekhar Mangar
On Mon, Apr 27, 2009 at 3:30 PM, ahmed baseet wrote: > Hi All, > I'm trying to post some files to Solr server. I've done this using the > post.jar files for posting xml files residing on my local disk[I tried > posting all those xml files from example directory]. Now I'm trying to > generate xml f

How to post in-memory[not residing on local disks] Xml files to Solr server for indexing?

2009-04-27 Thread ahmed baseet
Hi All, I'm trying to post some files to Solr server. I've done this using the post.jar files for posting xml files residing on my local disk[I tried posting all those xml files from example directory]. Now I'm trying to generate xml files on the fly, with required text to be indexed included there