Re: Do nested entities have a representation in Solr indexes?

2012-02-22 Thread Mikhail Khludnev
Hello Mike, Solr is too flat yet. Work is in progress https://issues.apache.org/jira/browse/SOLR-3076 Good introduction is in Michael's blog http://blog.mikemccandless.com/2012/01/searching-relational-content-with.htmlbut it's only about Lucene Queries. Colleague of my blogged about the same probl

Re: Development inside or outside of Solr?

2012-02-22 Thread bing
Hi, Erick, The example is impressive. Thank you. For the first, we decide not to do that, as Tika extraction is time-consuming part in indexing large files, and the dual call make the situation worse. For the second, for now, we choose Dspace to connect to DB, and discovery(solr) as the index

Re: SnapPull failed :org.apache.solr.common.SolrException: Error opening new searcher

2012-02-22 Thread eks dev
thanks Mark, I will give it a go and report back... On Thu, Feb 23, 2012 at 1:31 AM, Mark Miller wrote: > Looks like an issue around replication IndexWriter reboot, soft commits and > hard commits. > > I think I've got a workaround for it: > > Index: solr/core/src/java/org/apache/solr/handler/Sn

Re: need to support bi-directional synonyms

2012-02-22 Thread Bernd Fehling
Use sprayer, washer http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory Regards Bernd Am 23.02.2012 07:03, schrieb remi tassing: Same question here... On Wednesday, February 22, 2012, geeky2 wrote: hello all, i need to support the following: if the user

problem with parsering (using Tika) on remote glassfish

2012-02-22 Thread ola nowak
Hi all! I'm using Tika parser to index my files into Solr. I created my own parser (which extends XMLParser). It uses my own mimetype. I created a jar file which inside looks like this: src |-main |-some_packages |-MyParser.java |resources |-META-INF |-services

Re: Development inside or outside of Solr?

2012-02-22 Thread bing
Hi, François Schiettecatte Thank you for the reply all the same, but I choose to stick on Solr (wrapped with Tika language API) and do changes outside Solr. Best Regards, Bing -- View this message in context: http://lucene.472066.n3.nabble.com/Development-inside-or-outside-of-Solr-tp3759680

Re: default fq in dismax request handler being overridden

2012-02-22 Thread dboychuck
Think I answered my own question... I need to use an appends list -- View this message in context: http://lucene.472066.n3.nabble.com/default-fq-in-dismax-request-handler-being-overridden-tp3768735p3768817.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: need to support bi-directional synonyms

2012-02-22 Thread remi tassing
Same question here... On Wednesday, February 22, 2012, geeky2 wrote: > hello all, > > i need to support the following: > > if the user enters "sprayer" in the desc field - then they get results for > BOTH "sprayer" and "washer". > > and in the other direction > > if the user enters "washer" in th

default fq in dismax request handler being overridden

2012-02-22 Thread dboychuck
I have a dismax request handler with a default fq parameter. explicit 0.01 sku^9.0 upc^9.1 searchKeyword^1.9 series^2.8 productTitle^1.2 productID^9.0 manufacturer^4.0 masterFinish^1.5 theme^1.1 categoryName^2.0 finish^1.4 searchKeyword^2.1 text^0.2 productTitle^1.5 manufacturer^4.0 finish^1.

Re: result present in Solr 1.4, but missing in Solr 3.5, dismax only

2012-02-22 Thread Robert Muir
On Wed, Feb 22, 2012 at 7:35 PM, Naomi Dushay wrote: > Jonathan has brought it to my attention that BOTH of my failing searches > happen to have 8 terms, and one of the terms is repeated: > >  "The Beatles as musicians : Revolver through the Anthology" >  "Color-blindness [print/digital]; its dan

Re: Recovering from database connection resets in DataimportHandler

2012-02-22 Thread Erick Erickson
It *just happens* that I wrote a blog on this very topic, see: http://www.lucidimagination.com/blog/2012/02/14/indexing-with-solrj/ That code contains two rather different methods, one that indexes based on a SQL database and one based on indexing random files with client-side Tika. Best Erick O

Re: result present in Solr 1.4, but missing in Solr 3.5, dismax only

2012-02-22 Thread Naomi Dushay
Jonathan has brought it to my attention that BOTH of my failing searches happen to have 8 terms, and one of the terms is repeated: "The Beatles as musicians : Revolver through the Anthology" "Color-blindness [print/digital]; its dangers and its detection" but this is a PHRASE search. In cas

Re: distributed deletes working?

2012-02-22 Thread Jamie Johnson
Perhaps if you could give me the steps you're using to test I can find an error in what I'm doing. On Wed, Feb 22, 2012 at 9:24 PM, Mark Miller wrote: > Yonik did fix an issue around peer sync and deletes a few days ago - long > chance that was involved? > > Otherwise, neither Sami nor I have r

Re: distributed deletes working?

2012-02-22 Thread Mark Miller
Yonik did fix an issue around peer sync and deletes a few days ago - long chance that was involved? Otherwise, neither Sami nor I have replicated these results so far. On Feb 22, 2012, at 8:56 PM, Jamie Johnson wrote: > I know everyone is busy, but I was wondering if anyone had found > anything

Is there a way to write a DataImportHandler deltaQuery that compares contents still to be imported to contents in the index?

2012-02-22 Thread Mike O'Leary
I am working on indexing the contents of a database that I don't have permission to alter. In particular, the DataImportHandler examples that show how to specify a deltaQuery attribute value show database tables that have a last_modified column, and it compares these values with last_index_time

Re: distributed deletes working?

2012-02-22 Thread Jamie Johnson
I know everyone is busy, but I was wondering if anyone had found anything with this? Any suggestions on what I could be doing wrong would be greatly appreciated. On Fri, Feb 17, 2012 at 4:08 PM, Mark Miller wrote: > > On Feb 17, 2012, at 3:56 PM, Jamie Johnson wrote: > >> id field is a UUID. > >

RE: Recovering from database connection resets in DataimportHandler

2012-02-22 Thread Mike O'Leary
Could you point me to the most non-intimidating introduction to SolrJ that you know of? I have a passing familiarity with Javascript and, with few exceptions, I haven't developing software that has a graphical user interface of any kind in about 25 years. I like the idea of having finer control

Do nested entities have a representation in Solr indexes?

2012-02-22 Thread Mike O'Leary
The data-config.xml file that I have for indexing database contents has nested entity nodes within a document node, and each of the entities contains field nodes. Lucene indexes consist of documents that contain fields. What about entities? If you change the way entities are structured in a data

Re: Solr Highlighting not working with PayloadTermQueries

2012-02-22 Thread Koji Sekiguchi
(12/02/22 7:53), Nitin Arora wrote: Hi, I'm using SOLR and Lucene in my application for search. I'm facing an issue of highlighting using FastVectorHighlighter not working when I use PayloadTermQueries as clauses of a BooleanQuery. After Debugging I found that In DefaultSolrHighlighter.Java, f

Re: SnapPull failed :org.apache.solr.common.SolrException: Error opening new searcher

2012-02-22 Thread Mark Miller
Looks like an issue around replication IndexWriter reboot, soft commits and hard commits. I think I've got a workaround for it: Index: solr/core/src/java/org/apache/solr/handler/SnapPuller.java === --- solr/core/src/java/org/apache/

Re: result present in Solr 1.4, but missing in Solr 3.5, dismax only

2012-02-22 Thread Naomi Dushay
Jonathan, I have the same problem without the colon - I tested that, but didn't mention it. mm can't be the issue either: in Solr 3.5, if I remove one of the occurrences of "the" (doesn't matter which), I get results. Removing any other word does NOT get results. And if the query isn'

Re: String search in Dismax handler

2012-02-22 Thread Erick Erickson
Two things: 1> what version of Solr are you using? qt=dismax isn't going to any request handler I don't think. 2> what do you get when you add &debugQuery=on? Try that with both results and perhaps that will shed some light. If not, can you post the results? Best Erick On Wed, Feb 22, 2012 at 7:

Re: result present in Solr 1.4, but missing in Solr 3.5, dismax only

2012-02-22 Thread Naomi Dushay
I forgot to include the field definition information: schema.xml: solr 3.5: solr1.4: And the analysis page shows the same results for Solr 3.5 a

Re: result present in Solr 1.4, but missing in Solr 3.5, dismax only

2012-02-22 Thread Jonathan Rochkind
So I don't really know what I'm talking about, and I'm not really sure if it's related or not, but your particular query: "The Beatles as musicians : Revolver through the Anthology" With the lone "word" that's a ':', reminds me of a dismax stopwords-type problem I ran into. Now, I ran into it

Re: Fast Vector Highlighter Working for some records only

2012-02-22 Thread Koji Sekiguchi
Hi dhaivat, I think you may want to use analysis.jsp: http://localhost:8983/solr/admin/analysis.jsp Go to the URL and look into how your custom tokenizer produces tokens, and compare with the output of Solr's inbuilt tokenizer. koji -- Query Log Visualizer for Apache Solr http://soleami.com/

Trunk build errors

2012-02-22 Thread Darren Govoni
Hi, I am getting numerous errors preventing a build of solrcloud trunk. [licenses] MISSING LICENSE for the following file: Any tips to get a clean build working? thanks

need to support bi-directional synonyms

2012-02-22 Thread geeky2
hello all, i need to support the following: if the user enters "sprayer" in the desc field - then they get results for BOTH "sprayer" and "washer". and in the other direction if the user enters "washer" in the desc field - then they get results for BOTH "washer" and "sprayer". would i set up

Re: Same id on two shards

2012-02-22 Thread jerry.min...@gmail.com
Hi, I stumbled across this thread after running into the same question. The answers presented here seem a little vague and I was hoping to renew the discussion. I am using using a branch of Solr 4, distributed searching over 12 shards. I want the documents in the first shard to always be selected

Re: 'location' fieldType indexation impossible

2012-02-22 Thread Erick Erickson
Make sure that your schema file is exactly the same on both your local server and the remote server. Especially there should be a dynamic field definition like: and you should see a couple of fields appear like emploi_city_geoloc_0_coordinate and emploi_city_geoloc_1_coordinate when you index a "

Re: nutch and solr

2012-02-22 Thread alessio crisantemi
thanks for your reply, but don't work. the same message: can't convert empty path and more: impossible find class org.apache.nutch.crawl.injector .. Il giorno 22 febbraio 2012 06:14, tamanjit.bin...@yahoo.co.in < tamanjit.bin...@yahoo.co.in> ha scritto: > Try this command. > > bin/nutch crawl

RE: Unusually long data import time?

2012-02-22 Thread Devon Baumgarten
Thank you everyone for your patience and suggestions. It turns out I was doing something really unreasonable in my schema. I mistakenly edited the max EdgeNgram size to 512, when I meant to set the lengthFilter max to 512. I brought this to a more reasonable number, and my estimated time to imp

result present in Solr 1.4, but missing in Solr 3.5, dismax only

2012-02-22 Thread Naomi Dushay
I am working on upgrading Solr from 1.4 to 3.5, and I have hit a problem. I have a test checking for a search result in Solr, and the test passes in Solr 1.4, but fails in Solr 3.5. Dismax is the desired QueryParser -- I just included output from lucene QueryParser to prove the document exis

Re: solr 3.5 and indexing performance

2012-02-22 Thread mizayah
i got it all commnented in updateHandler, im prety sure there is no default autocommit iorixxx wrote > >> I wanted to switch to new version of solr, exactelly to 3.5 >> but im getting >> big drop of indexing speed. > > Could it be configuration in solrconfig.xml? > -- View this message

Re: Solr Performance Improvement and degradation Help

2012-02-22 Thread naptowndev
As an update to this... I tried running a query again the 4.0.0.2010.12.10.08.54.56 version and the newer 4.0.0.2012.02.16 (both on the same box). So the query params were the same, returned results were the same, but the 4.0.0.2010.12.10.08.54.56 returned the results in about 1.6 seconds and the

Re: org.apache.lucene.store.LockObtainFailedException: Lock obtain timed out

2012-02-22 Thread Sethi, Parampreet
Hi Uomesh, I was facing similar issues few days ago and was able to resolve it by deleting the lock file created in the index directory and restarting my solr server. I have documented the same in one of the posts at http://www.params.me/2011/12/solr-index-lock-issue.html Hope it helps! -param

org.apache.lucene.store.LockObtainFailedException: Lock obtain timed out

2012-02-22 Thread Uomesh
Hi, I am getting below error while running delta import and my index is not updated. Could you please let me know what might be causing this issue? I am using Solr 3.5 version and around 60+ documents suppose to be updated using delta import. [org.apache.solr.handler.dataimport.SolrWriter] - Er

Re: Fields, Facets, and Search Results

2012-02-22 Thread darul
And check your log file, you may have some errors at start of your server. Due to some mistake, bad syntax in your schema file for example... -- View this message in context: http://lucene.472066.n3.nabble.com/Fields-Facets-and-Search-Results-tp3765946p3767569.html Sent from the Solr - User mail

Re: Fields, Facets, and Search Results

2012-02-22 Thread darul
Well, you probably need to clear you index first..remove index director, restart your server and try again. Let me know if it works or not. -- View this message in context: http://lucene.472066.n3.nabble.com/Fields-Facets-and-Search-Results-tp3765946p3767537.html Sent from the Solr - User mailing

dih and solr cloud

2012-02-22 Thread eks dev
out of curiosity, trying to see if new cloud features can replace what I use now... how is this (batch) update forwarding solved at cloud level? imagine simple one shard and one replica case, if I fire up DIH update, is this going to be replicated to replica shard? If yes, - is it going to be sen

Re: Unusually long data import time?

2012-02-22 Thread eks dev
Davon, you ought to try to update from many threads, (I do not know if DIH can do it, check it), but lucene does great job if fed from many update threads... depends where your time gets lost, but it is usually a) analysis chain or b) database if it os a) and your server has spare cpu-cores, you

Re: Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Yury Kats
On 2/22/2012 1:24 PM, Yonik Seeley wrote: >> Looks like escaping forward slashes makes the query work, eg >> fieldName:\/a fieldName:\/a >> >> This is a bit puzzling as the forward slash is not part of the query >> language, is it? > > Regex queries were added that use forward slashes: > > http

maxClauseCount Exception

2012-02-22 Thread Darren Govoni
Hi, I am suddenly getting a maxClauseCount exception for no reason. I am using Solr 3.5. I have only 206 documents in my index. Any ideas? This is wierd. QUERY PARAMS: [hl, hl.snippets, hl.simple.pre, hl.simple.post, fl, hl.mergeContiguous, hl.usePhraseHighlighter, hl.requireFieldMatch, echoPar

maxClauseCount error

2012-02-22 Thread Darren Govoni
Hi, I am suddenly getting a maxclause count error and don't know why. I am using Solr 3.5

RE: Unusually long data import time?

2012-02-22 Thread Devon Baumgarten
Ahmet, I do not. I commented autoCommit out. Devon Baumgarten -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: Wednesday, February 22, 2012 12:25 PM To: solr-user@lucene.apache.org Subject: Re: Unusually long data import time? > Would it be unusual for an import

Re: Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Yury Kats
On 2/22/2012 1:24 PM, Yonik Seeley wrote: >> This is a bit puzzling as the forward slash is not part of the query >> language, is it? > > Regex queries were added that use forward slashes: > > https://issues.apache.org/jira/browse/LUCENE-2604 Oh, so / is a special character now? I don't think

Re: Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Yury Kats
On 2/22/2012 1:25 PM, Em wrote: > That's strange. > > Could you provide a sample dataset? Data set does not matter. The query fails to parse, long before it gets to the data.

Re: Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Em
That's strange. Could you provide a sample dataset? I'd like to try it out. Kind regards, Em Am 22.02.2012 19:17, schrieb Yury Kats: > On 2/22/2012 1:05 PM, Em wrote: >> Yury, >> >> are you sure your request has a proper url-encoding? > > Yes >

Re: Unusually long data import time?

2012-02-22 Thread Ahmet Arslan
> Would it be unusual for an import of 160 million documents > to take 18 hours?  Each document is less than 1kb and I > have the DataImportHandler using the jdbc driver to connect > to SQL Server 2008. The full-import query calls a stored > procedure that contains only a select from my target tabl

Re: Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Yonik Seeley
2012/2/22 Yury Kats : > On 2/22/2012 12:25 PM, Yury Kats wrote: >> I'm running into a problem with queries that contain forward slashes and >> more than one field. >> >> For example, these queries work fine: >> fieldName:/a >> fieldName:/* >> >> But if I have two fields with similar syntax in the

Re: Solr & HBase - Re: How is Data Indexed in HBase?

2012-02-22 Thread Jacques
>> Solr does not provide a complex enough support to rank. I believe Solr has a bunch of plug-ability to write your own custom ranking approach. If you think you can't do your desired ranking with Solr, you're probably wrong and need to ask for help from the Solr community. >> retrieving data by

Re: Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Yury Kats
On 2/22/2012 1:05 PM, Em wrote: > Yury, > > are you sure your request has a proper url-encoding? Yes

RE: Unusually long data import time?

2012-02-22 Thread Devon Baumgarten
Walter, Do you mean sub-entities in your database, or something else? The data I am feeding DIH is from a select * (no joins or WHERE clause) on a table with: int, int, varchar(32), varchar(32), varchar(512) (this one is the Name), varchar(512), datetime If it matters, the select * is happeni

Re: Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Yury Kats
On 2/22/2012 12:25 PM, Yury Kats wrote: > I'm running into a problem with queries that contain forward slashes and more > than one field. > > For example, these queries work fine: > fieldName:/a > fieldName:/* > > But if I have two fields with similar syntax in the same query, it fails. > > For

Re: Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Em
Yury, are you sure your request has a proper url-encoding? Kind regards, Em Am 22.02.2012 18:25, schrieb Yury Kats: > I'm running into a problem with queries that contain forward slashes and more > than one field. > > For example, these queries work fine: > fieldName:/a > fieldName:/* > > But

Re: How to merge an "autofacet" with a predefined facet

2012-02-22 Thread Em
Btw.: Solr has no downtime while reloading the core. It loads the new core and while loading the new one it still serves requests with the old one. When the new one is ready (and warmed up) it finally replaces the old core. Best, Em Am 22.02.2012 17:56, schrieb Xavier: > I'm not sure to understan

Re: How to check if a field is a multivalue field with java

2012-02-22 Thread SUJIT PAL
Hi Thomas, With Java (from within a custom handler in Solr) you can get a handle to the IndexSchema from the request, like so: IndexSchema schema = req.getSchema(); SchemaField sf = schema.getField(fielaname); boolean isMultiValued = sf.multiValued(); From within SolrJ code, you can use SolrDoc

Re: How to merge an "autofacet" with a predefined facet

2012-02-22 Thread Em
If you use the suggested solution, it will detect the words at indexing time. However, Solr's FilterFactory's lifecycle keeps no track on whether a file for synonyms, keywords etc. has been changed since Solr's last startup. Therefore a change within these files is not visible until you reload your

Re: Unusually long data import time?

2012-02-22 Thread Walter Underwood
In my first try with the DIH, I had several sub-entities and it was making six queries per document. My 20M doc load was going to take many hours, most of a day. I re-wrote it to eliminate those, and now it makes a single query for the whole load and takes 70 minutes. These are small documents,

Re: Solr & HBase - Re: How is Data Indexed in HBase?

2012-02-22 Thread Bing Li
Mr Gupta, Thanks so much for your reply! In my use cases, retrieving data by keyword is one of them. I think Solr is a proper choice. However, Solr does not provide a complex enough support to rank. And, frequent updating is also not suitable in Solr. So it is difficult to retrieve data randomly

How to check if a field is a multivalue field with java

2012-02-22 Thread tschiela
Hello, is there any way to check, if a field of a SolrDocument ist a multivalue field with java (solrj)? Greets Thomas -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-check-if-a-field-is-a-multivalue-field-with-java-tp3767200p3767200.html Sent from the Solr - User mai

RE: Unusually long data import time?

2012-02-22 Thread Devon Baumgarten
I changed the heap size (Xmx1582m was as high as I could go). The import is at about 5% now, and from that I now estimate about 13 hours. It's hard to say though.. it keeps going up little by little. If I get approval to use Solr for this project, I'll have them install a 64bit jvm instead, but

Problem parsing queries with forward slashes and multiple fields

2012-02-22 Thread Yury Kats
I'm running into a problem with queries that contain forward slashes and more than one field. For example, these queries work fine: fieldName:/a fieldName:/* But if I have two fields with similar syntax in the same query, it fails. For simplicity, I'm using the same field twice: fieldName:/a f

Re: How to merge an "autofacet" with a predefined facet

2012-02-22 Thread Xavier
I'm not sure to understand your solution ? When (and how) will be the 'word' detection in the fulltext ? before (by my own) or during (with) solr indexation ? -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-merge-an-autofacet-with-a-predefined-facet-tp3763988p3767059.h

Solr Performance Improvement and degradation Help

2012-02-22 Thread naptowndev
As I've mentioned before, I'm very new to Solr. I'm not a Java guy or an Apache guy. I'm a .Net guy. We have a rather large schema - some 100 + fields plus a large number of dynamic fields. We've been trying to improve performance and finally got around to implementing fastvectorhighlighting wh

Re: Fields, Facets, and Search Results

2012-02-22 Thread drLocke97
Hi darul, You're right, I was not using . So, following your suggestions, I added text and This required that I add a field "text", which is fine. I did that. Now, when I commit the for indexing, I get this error: SOLR returned a #400 Error: Error adding field "section_text_content". . .

RE: Unusually long data import time?

2012-02-22 Thread Devon Baumgarten
Oh sure! As best as I can, anyway. I have not set the Java heap size, or really configured it at all. The server running both the SQL Server and Solr has: * 2 Intel Xeon X5660 (each one is 2.8 GHz, 6 cores, 12 logical processors) * 64 GB RAM * One Solr instance (no shards) I'm not using facetin

Re: Solr on netty

2012-02-22 Thread prasenjit mukherjee
Thanks for the response. Yes we have 16 shards/partitions each on 16 different nodes and a separate master Solr receiving continuous parallel requests from 10 client threads running on a single separate machine. Our observation was that the perf degraded non linearly as the load ( no of concurrent

SnapPull failed :org.apache.solr.common.SolrException: Error opening new searcher

2012-02-22 Thread eks dev
We started observing strange failures from ReplicationHandler when we commit on master trunk version 4-5 days old. It works sometimes, and sometimes not didn't dig deeper yet. Looks like the real culprit hides behind: org.apache.lucene.store.AlreadyClosedException: this IndexWriter is clos

Re: solr 3.5 and indexing performance

2012-02-22 Thread Ahmet Arslan
> I wanted to switch to new version of solr, exactelly to 3.5 > but im getting > big drop of indexing speed. Could it be configuration in solrconfig.xml?

Re: Unusually long data import time?

2012-02-22 Thread Glen Newton
Import times will depend on: - hardware (speed of disks, cpu, # of cpus, amount of memory, etc) - Java configuration (heap size, etc) - Lucene/Solr configuration (many ...) - Index configuration - how many fields, indexed how; faceting, etc - OS configuration (this usually to a lesser degree; _usua

Unusually long data import time?

2012-02-22 Thread Devon Baumgarten
Hello, Would it be unusual for an import of 160 million documents to take 18 hours? Each document is less than 1kb and I have the DataImportHandler using the jdbc driver to connect to SQL Server 2008. The full-import query calls a stored procedure that contains only a select from my target tab

Re: Solr on netty

2012-02-22 Thread Yonik Seeley
On Wed, Feb 22, 2012 at 9:27 AM, prasenjit mukherjee wrote: > Is anybody aware of any effort regarding porting solr to a netty ( or > any other async-io based framework ) based framework. > > Even on medium load ( 10 parallel clients )  with 16 shards > performance seems to deteriorate quite sharp

Re: reader/searcher refresh after replication (commit)

2012-02-22 Thread Erick Erickson
It's certainly stable enough to start experimenting with, and I know that it's under pretty active development now. I've seen a lot of back-and-forth between Mark Miller and Jamie Johnson, Jamie trying things and Mark responding. It's part of the trunk, so be prepared for occasional re-indexing be

solr 3.5 and indexing performance

2012-02-22 Thread mizayah
Hello, I wanted to switch to new version of solr, exactelly to 3.5 but im getting big drop of indexing speed. I'm using 3.1 and after few tests i discower that 3.4 do it a lot of better then 3.5 My schema is really simple few field using "text" type field /

Re: How to handle to run testcases in ruby code for solr

2012-02-22 Thread Erik Hatcher
I'm not sure what to suggest at this point... obviously your test setup is trying to hit a Solr server that isn't running. Check the host and port that it is trying and ensure that Solr is running as your tests expect or use the mock way that I just replied about. Note, again, that solr-ruby i

Solr on netty

2012-02-22 Thread prasenjit mukherjee
Is anybody aware of any effort regarding porting solr to a netty ( or any other async-io based framework ) based framework. Even on medium load ( 10 parallel clients ) with 16 shards performance seems to deteriorate quite sharply compared another alternative ( async-io based ) solution as load in

Re: How to handle to run testcases in ruby code for solr

2012-02-22 Thread solr
Hi Erik, I have tried links which you given. while runnign rake am getting error == Errno::ECONNREFUSED: No connection could be made because the target machine acti vely refused it. - connect(2) ===

Re: reader/searcher refresh after replication (commit)

2012-02-22 Thread Em
Erick, > You'll *really like* the SolrCloud stuff going into trunk when it's baked > for a while How stable is SolrCloud at the moment? I can not wait to try it out. Kind regards, Em Am 22.02.2012 14:45, schrieb Erick Erickson: > You'll *really like* the SolrCloud stuff going into trunk whe

Re: reader/searcher refresh after replication (commit)

2012-02-22 Thread Erick Erickson
You'll *really like* the SolrCloud stuff going into trunk when it's baked for a while Best Erick On Wed, Feb 22, 2012 at 3:25 AM, eks dev wrote: > Yes, I consciously let my slaves run away from the master in order to > reduce update latency, but every now and then they sync up with master >

Re: how to mock solr server solr_sruby

2012-02-22 Thread Erik Hatcher
You can mock with Ruby really easily, but just overriding methods that would otherwise call the server and fake a response. The solr-ruby library itself was built with an extensive test suite doing this. Here's the mock base:

Re: Fast Vector Highlighter Working for some records only

2012-02-22 Thread dhaivat
Koji Sekiguchi wrote > > (12/02/22 11:58), dhaivat wrote: >> Thanks for reply, >> >> But can you please tell me why it's working for some documents and not >> for >> other. > > As Solr 1.4.1 cannot recognize hl.useFastVectorHighlighter flag, Solr just > ignore it, but due to hl=true is there, So

How is Data Indexed in HBase?

2012-02-22 Thread Bing Li
Dear all, I wonder how data in HBase is indexed? Now Solr is used in my system because data is managed in inverted index. Such an index is suitable to retrieve unstructured and huge amount of data. How does HBase deal with the issue? May I replaced Solr with HBase? Thanks so much! Best regards,

Re: Date filter query

2012-02-22 Thread ku3ia
Hi, all Thanks for your responses. I'd tried [NOW/DAY-30DAY+TO+NOW/DAY-1DAY-1SECOND] and seems it works fine for me. Thanks a lot! -- View this message in context: http://lucene.472066.n3.nabble.com/Date-filter-query-tp3764349p3766139.html Sent from the Solr - User mailing list archive at Nabbl

'location' fieldType indexation impossible

2012-02-22 Thread Xavier
Hi, When i try to index my location field i get this error for each documents : *ATTENTION: Error creating document Error adding field 'emploi_city_geoloc'='48.85,2.5525' * (so i have 0 files indexed) Here is my schema.xml : ** I really don't understand why it isnt working because, it w

Re: Fields, Facets, and Search Results

2012-02-22 Thread darul
Check you schema config file first. It looks like you have missed copy of "section_text_content" field's content to your default search field : text -- View this message in context: http://lucene.472066.n3.nabble.com/Fields-Facets-and-Search-Results-tp3765946p3766084.html Sent from the Sol

how to mock solr server solr_sruby

2012-02-22 Thread solr
Hi, Am using solr_ruby in ruby code for that am starting solr server by using start.jsr. Now i want to write mockobjects for solr connection and code written in my ruby file to search data from solr. Can anybody suggest how to do testing without stating solr server -- View this message in context:

Fields, Facets, and Search Results

2012-02-22 Thread drLocke97
I'm new to SOLR and trying to get a proper understanding of what's going on with fields, facets, and search results. I've modified the example schema.xml and solrconfig.xml that comes with SOLR to reflect some fields I want to experiment with. I've also modified the velocity templates in Solaritas

SIREn integration with SOLR

2012-02-22 Thread chitra
Hi, We would like to implement semantic search in our websites. We already have the full text search service by using SOLR. Heard that SIREn plug-in for SOLR would be able to allow to index & query the semi-structred data. Could any one of you provide me more details about SIREn, its inte

Re: reader/searcher refresh after replication (commit)

2012-02-22 Thread Em
Sounds much clearer to me than before. :) Ad-hoc I have two ideas: First: Let Replication run asynchronously. If shard1 is pulling the new index from the master and therefore very recent documents aren't available anymore, shard2 will find them in the mean-time. As soon as shard1 is up-to-date (in

Re: Unique key constraint and optimistic locking (versioning)

2012-02-22 Thread Per Steffensen
Per Steffensen skrev: Thanks a lot. We will use the UniqueKey feature and build versioning ourselves. Do you think it would be a good idea if we built a versioning feature into Solr/Lucene instead of doing it outside, so that others can benefit from the feature as well? Guess contributions wil

Re: Unique key constraint and optimistic locking (versioning)

2012-02-22 Thread Per Steffensen
Thanks a lot. We will use the UniqueKey feature and build versioning ourselves. Do you think it would be a good idea if we built a versioning feature into Solr/Lucene instead of doing it outside, so that others can benefit from the feature as well? Guess contributions will be made according to

Re: reader/searcher refresh after replication (commit)

2012-02-22 Thread eks dev
Yes, I consciously let my slaves run away from the master in order to reduce update latency, but every now and then they sync up with master that is doing heavy lifting. The price you pay is that slaves do not see the same documents as the master, but this is the case anyhow with replication, in m