Re: Solr indexing process: keep a persistent Mysql connection throu all the indexing process

2011-08-23 Thread samuele.mattiuzzo
those documents are unrelated to the database. the db i have is just storing countries - region - cities, and it's used to do a refinement on a specific solr field example: solrField "thetext" with content "Mary comes from London" updateHandler polls the database for europe - great britain - lon

Re: Solr indexing process: keep a persistent Mysql connection throu all the indexing process

2011-08-23 Thread samuele.mattiuzzo
those documents are unrelated to the database. the db i have is just storing countries - region - cities, and it's used to do a refinement on a specific solr field example: solrField "thetext" with content "Mary comes from London" updateHandler polls the database for europe - great britain - lon

Re: Solr indexing process: keep a persistent Mysql connection throu all the indexing process

2011-08-23 Thread Tom
10K documents. Why not just batch them? You could read in 10K from your database, load em into an array of SolrDocuments. and them post them all at once to the Solr server? Or do em in 1K increments if they are really big. -- View this message in context: http://lucene.472066.n3.nabble.com/S

Re: hierarchical faceting in Solr?

2011-08-23 Thread Naomi Dushay
Chris Beer just did a revamp of the wiki page at: http://wiki.apache.org/solr/HierarchicalFaceting Yay Chris! - Naomi (" ... and I helped!") On Aug 22, 2011, at 10:49 AM, Naomi Dushay wrote: Chris, Is there a document somewhere on how to do this? If not, might you create one? I coul

Re: Funky date string accepted

2011-08-23 Thread Markus Jelsma
That makes sense indeed. Wouldn't it be an idea to test for the single allowed format before parsing it? > : I see, is the leading - char just ignored then? > > i'd have to re-look at the tests/docs (i don't really want to repeat > that agonizing headache right now), but i believe what you are s

Re: Batch updates order guaranteed?

2011-08-23 Thread Yonik Seeley
On Tue, Aug 23, 2011 at 3:38 PM, Yonik Seeley wrote: > On Tue, Aug 23, 2011 at 2:17 PM, Glenn wrote: >> Question about batch updates (performing a delete and add in same >> request, as described at bottom >> of http://wiki.apache.org/solr/UpdateXmlMessages): >>

Re: Batch updates order guaranteed?

2011-08-23 Thread Glenn
Yes, I'm crafting the XML update message myself. Thanks for the confirmation. Glenn -- On 8/23/11 1:38 PM, Yonik Seeley wrote: > On Tue, Aug 23, 2011 at 2:17 PM, Glenn wrote: >> Question about batch updates (performing a delete and add in same >> request, as described at bottom >> of http://wi

Re: Batch updates order guaranteed?

2011-08-23 Thread Yonik Seeley
On Tue, Aug 23, 2011 at 2:17 PM, Glenn wrote: > Question about batch updates (performing a delete and add in same > request, as described at bottom > of http://wiki.apache.org/solr/UpdateXmlMessages): >  is the order > guaranteed?  If a is follo

Batch updates order guaranteed?

2011-08-23 Thread Glenn
Hello, Question about batch updates (performing a delete and add in same request, as described at bottom of http://wiki.apache.org/solr/UpdateXmlMessages): is the order guaranteed? If a is followed by an , will the delete always be performed f

Re: Solr indexing process: keep a persistent Mysql connection throu all the indexing process

2011-08-23 Thread Gora Mohanty
On Tue, Aug 23, 2011 at 10:25 PM, samuele.mattiuzzo wrote: > I wrote my custom update handler for my solr installation, using jdbc to > query a mysql database. Everything works fine: the updater queries the db, > gets the data i need and update it in my documents! Fantastic! > > Only issue is i ha

Re: Funky date string accepted

2011-08-23 Thread Chris Hostetter
: I see, is the leading - char just ignored then? i'd have to re-look at the tests/docs (i don't really want to repeat that agonizing headache right now), but i believe what you are seeing is a compound problem... * parsing sees the -0001 and recognizes that as a negative year. * somewhere th

Re: Sorting results by Range

2011-08-23 Thread Chris Hostetter
: I did not quite understand how that function was made. But, it does work basically the "map" function just translates values in a ranage to some fixed vald value. so if you nest two map functions (that use different ranges) inside of eachother you get a resulting curve that is flat in those

RE: Text Analysis and copyField

2011-08-23 Thread Herman Kiefus
To close, I found this article from Hoss: http://lucene.472066.n3.nabble.com/CopyField-into-another-CopyField-td3122408.html Since I cannot use one copyField directive to copy from another copyField's dest[ination], I cannot achieve what I desire: some terms that are subject to KeepWordFilterFa

Re: can i create filters of score range

2011-08-23 Thread Erick Erickson
Did you try exactly what Chris suggested? Appending sort=_docid_ asc to the query? When you say "client" I assume you're talking SolrJ, and I'm pretty sure that SolrQuery.setSortField is what you want. I suppose you could also set this as the default in your query handler. Best Erick On Tue, Aug

Re: SSD experience

2011-08-23 Thread Peter Sturge
Ah yes, the beautiful new links in Windows 6. These are 'symlinks' in name only - they operate *very* differently from LUNIX symlinks, and sadly, not quite so well. NTFS is one of the best things about Windows, but it's architecture is not well suited to 'on-the-fly' redirection, as there are many

Re: Spatial Search problems

2011-08-23 Thread Smiley, David W.
Could you reproduce a very simple example of this? For example if there is a particular indexed point in your data that should be returned from your query (a query smaller than d=4k10), then reproduce that bug in the Solr example app by supplying a dummy document with this point and running your

Solr indexing process: keep a persistent Mysql connection throu all the indexing process

2011-08-23 Thread samuele.mattiuzzo
I wrote my custom update handler for my solr installation, using jdbc to query a mysql database. Everything works fine: the updater queries the db, gets the data i need and update it in my documents! Fantastic! Only issue is i have to open and close a mysql connection for every document i read. Si

Re: SSD experience

2011-08-23 Thread Sanne Grinovero
Indeed I would never actually use it, but symlinks do exist on Windows. http://en.wikipedia.org/wiki/NTFS_symbolic_link Sanne 2011/8/23 Peter Sturge : > The Solr index directory lives directly on the SSD (running on Windows > - where the word symlink does not appear in any dictionary within a >

Re: Funky date string accepted

2011-08-23 Thread Markus Jelsma
I see, is the leading - char just ignored then? > : The following field value for a date field type is accepted: > : -0001-11-30T00:00:00Z > : > : and ends up in the index and as stored value as: > : 2-11-30T00:00:00Z > : > : I'd prefer to be punished with an exception. File a bug? > > That is

Re: Funky date string accepted

2011-08-23 Thread Chris Hostetter
: The following field value for a date field type is accepted: : -0001-11-30T00:00:00Z : : and ends up in the index and as stored value as: : 2-11-30T00:00:00Z : : I'd prefer to be punished with an exception. File a bug? That is actualy a legal date according to the format spec (although there

Re: HTTP 400 Error

2011-08-23 Thread Gora Mohanty
On Tue, Aug 23, 2011 at 6:30 PM, Lawson, Chris wrote: > I am trying to submit a search ("Cntrct:1310015") on both Prod and Model > system and after submitting with "Search" button, the result is a page > displaying HTTP 400. [...] Please show us the actual URL used to query Solr: At first guess,

RE: HTTP 400 Error

2011-08-23 Thread Lawson, Chris
I am trying to submit a search ("Cntrct:1310015") on both Prod and Model system and after submitting with "Search" button, the result is a page displaying HTTP 400. Thanks, Chris Lawson chris.law...@lfg.com (336) 691-3733 Notice of Confidentiality: **This E-mail and any of its attachments

RE: Spellcheck Phrases

2011-08-23 Thread Herman Kiefus
The angle that I am trying here is to create a dictionary from indexed terms that contain only correctly spelled words. We are doing this by having the field from which the dictionary is created utilize a type that employs solr.KeepWordFilterFactory, which in turn utilizes a text file of known

Re: SSD experience

2011-08-23 Thread Peter Sturge
The Solr index directory lives directly on the SSD (running on Windows - where the word symlink does not appear in any dictionary within a 100 mile radius of Redmond :-) Currently, the main limiting factors of SSD are cost and size. SSDs will get larger over time. Splitting indexes across multiple

Spellcheck index replication

2011-08-23 Thread Herman Kiefus
We employ one 'indexing' master that replicates to many 'query' slaves. We have also recently introduced spellchecking/DYM. It appears that replication does not 'cover' the spellchecker index. Do I understand this correctly? Further, we have seen where 'buildOnCommit' will cause the spellchec

Spatial Search problems

2011-08-23 Thread Javier Heras
Hi all, I'm new at solr. I've downloaded solr 3.3, and having tested solr querys for spatial search with examples that come in the tutorial. Everything ok. But when I substitute the tutorial index with my index, spatial search doesn't work until parameter d is greater than 4510 (km?) Any idea wha

Re: SSD experience

2011-08-23 Thread Gerard Roos
Interesting. Do you make a symlink to the indexes or is the whole Solr directory on SSD? thanks, Gerard Op 23 aug. 2011, om 12:53 heeft Peter Sturge het volgende geschreven: > Just to add a few cents worth regarding SSD... > > We use Vertex SSD drives for storing indexes, and wow, they really

Re: Full sentence spellcheck

2011-08-23 Thread Valentin
I tried your solution, it works. But it modify all the spellcheckers that I made, so that's not a good solution for me (I have an autocomplete and a regular spellcheck with separated words that I want to keep). I tried to move the line "" *into* the requestHandler, but of course it does not work.

Funky date string accepted

2011-08-23 Thread Markus Jelsma
Hi, The following field value for a date field type is accepted: -0001-11-30T00:00:00Z and ends up in the index and as stored value as: 2-11-30T00:00:00Z I'd prefer to be punished with an exception. File a bug? Thanks

RE: what's the status of droids project(http://incubator.apache.org/droids/)?

2011-08-23 Thread O. Klein
Or check http://www.crawl-anywhere.com/ Very customizable crawler. -- View this message in context: http://lucene.472066.n3.nabble.com/what-s-the-status-of-droids-project-http-incubator-apache-org-droids-tp3277367p3277698.html Sent from the Solr - User mailing list archive at Nabble.com.

Query parameter changes from solr 1.4 to 3.3

2011-08-23 Thread Samarendra Pratap
Hi, We are upgrading solr 1.4 (with collapsing patch solr-236) to solr 3.3. I was looking for the required changes in query parameters (or parameter names) if any. One thing I know for sure is that "collapse" and its sub-options are now known by "group", but didn't find anything else. Can someo

Re: SSD experience

2011-08-23 Thread Peter Sturge
Just to add a few cents worth regarding SSD... We use Vertex SSD drives for storing indexes, and wow, they really scream compared to SATA/SAS/SAN. As we do some heavy commits, it's the commit times where we see the biggest performance boost. In tests, we found that locally attached 15k SAS drives

Re: How to copy and extract information from a multi-line text before the tokenizer

2011-08-23 Thread Chantal Ackermann
Hi Michael, have you considered the DataImportHandler? You could use the the LineEntityProcessor to create fields per line and then copyField to collect everything for the AllData field. http://wiki.apache.org/solr/DataImportHandler#LineEntityProcessor Chantal On Tue, 2011-08-23 at 12:28 +02

RE: what's the status of droids project(http://incubator.apache.org/droids/)?

2011-08-23 Thread karl.wright
It's also worth looking at ManifoldCF. Karl -Original Message- From: ext Markus Jelsma Sent: 23/08/2011, 6:24 AM To: solr-user@lucene.apache.org Cc: java-u...@lucene.apache.org Subject: Re: what's the status of droids project(http://incubator.apache.org/droids/)? You should ask on the

How to copy and extract information from a multi-line text before the tokenizer

2011-08-23 Thread Michael Kliewe
Hello all, I have a custom schema which has a few fields, and I would like to create a new field in the schema that only has one special line of another field indexed. Lets use this example: field AllData (TextField) has for example this data: Title: exampleTitle of the book Author: Example Aut

Re: what's the status of droids project(http://incubator.apache.org/droids/)?

2011-08-23 Thread Markus Jelsma
You should ask on the Droids list but there's some activity in Jira. And did you consider Apache Nutch? On Tuesday 23 August 2011 10:17:50 Li Li wrote: > hi all > I am interested in vertical crawler. But it seems this project is not > very active. It's last update time is 11/16/2009

Re: Issue in indexing Zip file content with apache-solr-3.3.0

2011-08-23 Thread Jayendra Patil
Solr doesn't index the content of the files, but just the file names. you can apply patch - https://issues.apache.org/jira/browse/SOLR-2416 https://issues.apache.org/jira/browse/SOLR-2332 Regards, Jayendra On Tue, Aug 23, 2011 at 2:26 AM, Jagdish Kumar wrote: > > Hi All > > I am using apache-so

Re: can i create filters of score range

2011-08-23 Thread jame vaalet
okey, so this is something i was looking for .. the default order of result docs in lucene\solr .. and you are right, since i don care about the order in which i get the docs ideally i shouldn't ask solr to do any sorting on its "raw" result list ... though i understand your point, how do i do it a

Re: Boost or BQ?

2011-08-23 Thread Markus Jelsma
iirc boost gets multiplied into the equation whereas bq is added. Check your debug output. > What is the different between boost= and bq= ? > > I cannot find any documentationŠ

what's the status of droids project(http://incubator.apache.org/droids/)?

2011-08-23 Thread Li Li
hi all I am interested in vertical crawler. But it seems this project is not very active. It's last update time is 11/16/2009

Re: Sorting results by Range

2011-08-23 Thread Sowmya V.B.
Hi Chris Thanks a lot for the mail. I did not quite understand how that function was made. But, it does work like you said - there is a sorted list of documents now, where documents around value 20 are ranked first and documents around 10 are ranked below. (I chose a field with 0 and 100 as limit