Re: DateFormatTransformer issue with value 0000-00-00T00:00:00Z

2010-11-18 Thread gwk
'http://blogs.techrepublic.com.com/security/?p=4501&tag=nl.e036' EARTH has a Right To Life, otherwise we all die. - Original Message From: gwk To: solr-user@lucene.apache.org Sent: Wed, November 17, 2010 2:12:16 AM Subject: Re: DateFormatTransformer issue with value -00-

Re: DateFormatTransformer issue with value 0000-00-00T00:00:00Z

2010-11-17 Thread gwk
tring option, i.e. mysql://user:passw...@mysqlhost/database?zeroDateTimeBehavior=convertToNull This will make the mysql driver return those values as NULL instead of all-zero dates. Regards, gwk

Re: How to Facet on a price range

2010-11-10 Thread gwk
is available, but as Solr is just the search backend the frontend stuff you'll have to write yourself. Regards, gwk

Re: How to Facet on a price range

2010-11-09 Thread gwk
://www.mysecondhome.co.uk/search.html) to create the nice sliders Geert-Jan describes. We've also used it to add the sparklines above the sliders which give a nice indication of how the current selection is spread out. Regards, gwk On 11/9/2010 3:33 PM, Geert-Jan Brits wrote: Just to a

Re: Geographic clustering

2010-09-15 Thread gwk
r the head for that). If you want more information I'd be happy to help. Regards, gwk On 9/14/2010 8:14 PM, Charlie DeTar wrote: Hi, I'm interested in using geographic clustering of records in a Solr search index. Specifically, I want to be able to efficiently produce a map with cl

Re: Autosuggest on PART of cityname

2010-08-23 Thread gwk
name_^4 name^4 names (Where is the currently selected language on the website) - pf=name_^4 name^4 names parents Honestly, those parameters are basically just tweaked without quite understanding their meaning until I got something that worked adequately. Hope this helps. Regards, gwk

Re: Autosuggest on PART of cityname

2010-08-20 Thread gwk
.co.uk), our solution is quite similar to the one proposed by Markus however we use a separate core for the auto-completion data which is updated hourly, this is due to the fact you can complete on multiple levels of geography which would be quite hard to do with faceting. Regards, gwk

Re: Solr 1.4.1 and 3x: Grouping of query changes results

2010-08-09 Thread gwk
llowing and see if it behaves correctly: id:1234 AND (*:* AND -indexid:1 AND -indexid:2) AND -indexid:3 Regards, gwk

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-31 Thread gwk
On 5/31/2010 4:24 PM, gwk wrote: On 5/31/2010 11:50 AM, gwk wrote: On 5/31/2010 11:29 AM, Geert-Jan Brits wrote: May I ask how you implemented getting the facet counts for each interval? Do you use a facet-query per interval? And perhaps for inspiration a link to the site you implemented this

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-31 Thread gwk
On 5/31/2010 11:50 AM, gwk wrote: On 5/31/2010 11:29 AM, Geert-Jan Brits wrote: May I ask how you implemented getting the facet counts for each interval? Do you use a facet-query per interval? And perhaps for inspiration a link to the site you implemented this .. Thanks, Geert-Jan I love the

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-31 Thread gwk
d mix [] and {} but unfortunately I'm not familiar enough with it to get it to work. Regards, gwk

Re: date slider

2010-05-17 Thread gwk
is useful to you, I'll be happy to answer any questions about how I implemented this. Regards, gwk On 5/16/2010 10:07 PM, Lukas Kahwe Smith wrote: On 16.05.2010, at 21:01, Ahmet Arslan wrote: http://wiki.apache.org/solr/StatsComponent can give you min and max values. Sorry my bad,

Re: date facets without intersections

2010-04-28 Thread gwk
Hi, Several possible solutions are discussed in http://lucene.472066.n3.nabble.com/Date-Faceting-and-Double-Counting-td502014.html Regards, gwk On 4/27/2010 10:02 PM, Király Péter wrote: Dear Solr users, I am interesting, whether it is possible to get date facets without intersecting

Re: Bucketing a price field

2010-04-07 Thread gwk
Oops, the new patch only works on Trie fields, other stuff I said should still be valid. (One extra thing to be aware of is double counting, see http://n3.nabble.com/Date-Faceting-and-Double-Counting-td502014.html for example) Regards, gwk On 4/7/2010 4:03 PM, gwk wrote: Hi, A while back

Re: Bucketing a price field

2010-04-07 Thread gwk
ld work on most numerical field types. Regards, gwk On 4/7/2010 2:44 AM, Blargy wrote: What would be the best way to do range bucketing on a price field? I'm sort of taking the example from the Solr 1.4 book and I was thinking about using a PatternTokenizerFactory with a SynonymFilter

Re: Drill down a solr result set by facets

2010-03-30 Thread gwk
seful for faceting. Regards, gwk On 3/29/2010 6:07 PM, Dhanushka Samarakoon wrote: Thanks for the reply. I was just giving the above as an example. Something as simple as following is also not working. /select/?q=france+fDepartmentName:History&version=2.2& So it looks like the query par

Re: How do I create a solr core with the data from an existing one?

2010-03-24 Thread gwk
section in the wiki page). Regards, gwk On 3/24/2010 5:31 PM, Steve Dupree wrote: *Solr 1.4 Enterprise Search Server* recommends doing large updates on a copy of the core, and then swapping it in for the main core. I tried following these steps: 1. Create prep core: http://localhost

Re: distinct on my result

2010-03-11 Thread gwk
Hi, Try replacing KeywordTokenizerFactory with a WhitespaceTokenizerFactory so it'll create separate terms per word. After a reindex it should work. Regards, gwk On 3/11/2010 4:33 PM, stocki wrote: hey, okay i show your my settings ;) i use an extra core with the standard requestha

Re: distinct on my result

2010-03-11 Thread gwk
Hi, I'm no expert on the full-text search features of Solr but I guess that has something to do with your fieldtype, or query. Are you using the standard request handler or dismax for your queries? And what analysers are you using on your product name field? Regards, gwk On 3/11/2010

Re: distinct on my result

2010-03-10 Thread gwk
separate server and replicated so the indexing has no impact on the performance of the site). Regards, gwk On 3/10/2010 3:09 PM, stocki wrote: okay. thx my suggestion run in another core;) do you distinct during the import with DIH ?

Re: distinct on my result

2010-03-10 Thread gwk
hough I haven't tested it so I could be horribly horribly wrong). Regards, gwk On 3/10/2010 2:55 PM, stocki wrote: hello. i implement my suggest-function with edgengramfilter. now when i get my result , is the result not distinct. often ist the name double or more. is it possible tha

Re: Date Facets

2010-02-24 Thread gwk
Hi Liam, This happens because the range searches for date faceting are inclusive on both ends. So values on the exact edges of the intervals are counted twice. You can see some solutions at http://old.nabble.com/Date-Faceting-and-Double-Counting-td25227846.html Regards, gwk On 2/24/2010 6

Re: Question regarding wildcards and dismax

2010-02-19 Thread gwk
Have a look at the q.alt parameter (http://wiki.apache.org/solr/DisMaxRequestHandler#q.alt) which is used for exactly this issue. Basically putting q.alt=*:* in your query means you can leave out the q parameter if you want all documents to be selected. Regards, gwk On 2/19/2010 11:28 AM

Re: How does one sort facet queries?

2010-02-19 Thread gwk
The "trick" I use is to use LocalParams to give eacht facet query a well defined name. Afterwards you can loop through the names in whatever order you want. so basically facet.query={!key=price_0}[* TO 100] etc. N.B. the facet queries in your example will lead to some documents to be counted double (i.e. when the price is exactly 100, 200, 300). Regards, gwk

Re: labeling facets and highlighting question

2010-02-18 Thread gwk
There's a ! missing in there, try {!key=label}. Regards, gwk On 2/18/2010 5:01 AM, adeelmahmood wrote: okay so if I dont want to do any excludes then I am assuming I should just put in {key=label}field .. i tried that and it doesnt work .. it says undefined field {key=label}field

Re: Autosuggest and highlighting

2010-02-09 Thread gwk
s that wouldn't work for the diacritical marks without a lot of transliteration brouhaha. Regards, gwk

Autosuggest and highlighting

2010-02-09 Thread gwk
"1" catenateAll="0" splitOnCaseChange="1"/> maxGramSize="20"/> ignoreCase="true" expand="true"/> words="stopwords.txt"/> generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" splitOnCaseChange="1"/> Regards, gwk

Re: trouble with DTD

2010-02-08 Thread gwk
t [row,col {unknown-source}]: [49,23] é is an entity defined for (X)HTML. XML only uses " & ' < > and &#; So if you want to use the é character you'll have to either use the character itself or something like É Regards, gwk

Solr and Geoserver/Mapserver

2009-11-30 Thread gwk
dered on it on the server to improve performance (GTileLayerOverlay) Does anyone have any experience using geoserver, mapserver or a similar application in combination with Solr so that the application can generate tiles from a Solr query and tile position/zoom level? Regards, gwk

Re: Stop solr without losing documents

2009-11-13 Thread gwk
at the same time the index script is run where it only retrieved one of the updates, this will cause some documents to be updates multiple times but as document updates are idempotent this is no real problem.) Regards, gwk

Re: Geographic clustering

2009-09-11 Thread gwk
achieve the effect of filterqueries (including the awesome caching) by manipulating either the rangeQ or docs. And since the snippet above is called multiple times with different rangeQ but the same (filtered) DocSet I guess manipulating docs would be faster (I think). Regards, gwk gwk wrote:

Re: slow response

2009-09-09 Thread gwk
Hi Elaine, You can page your resultset with the rows and start parameters (http://wiki.apache.org/solr/CommonQueryParameters). So for example to get the first 100 results one would use the parameters rows=100&start=0 and the second 100 results with rows=100&start=100 etc. etc. Regar

Re: slow response

2009-09-09 Thread gwk
u might've meant that you only want to retrieve certain fields from each document. For that you can use the fl parameter (http://wiki.apache.org/solr/CommonQueryParameters#head-db2785986af2355759faaaca53dc8fd0b012d1ab). Hope this helps. Regards, gwk Elaine Li wrote: I want to get the 1

Re: Geographic clustering

2009-09-09 Thread gwk
finer than the requested precision and use a weighted average of the four parts of the cluster, I'll have to see if that is accurate enough. Hopefully I'll have the time to complete this today or tomorrow. I'll report back if it has worked. Regards, gwk Joe Calderon wro

Re: Geographic clustering

2009-09-08 Thread gwk
r curve mapping in Solr and clustering using facet counts on numerical ranges of that mapping but I'm not sure it will pan out. Regards, gwk Grant Ingersoll wrote: Not directly related to geo clustering, but http://issues.apache.org/jira/browse/SOLR-769 is all about a pluggable int

Re: LocalParams for faceting in nightly

2009-09-08 Thread gwk
Hi Gareth, Try removing the space between de closing bracket } and the field name, I think that should work. Regards, gwk gareth rushgrove wrote: Hi All Hoping someone might be able to help me with a problem. I downloaded and got up and running with the latest nightly release of Solr

Geographic clustering

2009-09-08 Thread gwk
ty depending on zoom level by adding latitude on longitude fields for each zoom level (so that for smaller countries you have to be zoomed in further to get the next level of clustering). I was wondering if anybody else has worked on something similar and if so what their solutions are. Regards, gwk

Re: A very complex search problem.

2009-09-02 Thread gwk
Hello Rajan, I might be mistaken, but isn't CouchDB or a similar map/reduce database ideal for situations like this? Regards, gwk rajan chandi wrote: Hi All, We are dealing with a very complex problem of person specific search. We're building a social network where people will

Re: SOLR vs SQL

2009-09-02 Thread gwk
try not to store textual descriptions but every facet a property can have. This gives us exactly the data needed to perform faceting but not so much on the full text search (which is used mind you, to find suggestions when you use the search box). Regards, gwk

Re: Date Faceting and Double Counting

2009-09-02 Thread gwk
number faceting where you can specify if : either end of each range should be inclusive or exclusive. I just ported it gwk: 1) would you mind opening a Jira issue for your date faceting improvements as well (email attachments tend to get lost, and there are legal headaches with committing them

Re: Date Faceting and Double Counting

2009-09-01 Thread gwk
, end, both and neither To maintain compatibility with solr without the patch the default is neither. I hope the meaning of the values are self-explanatory. Regards, gwk Stephen Duncan Jr wrote: If we do date faceting and start at 2009-01-01T00:00:00Z, end at 2009-01-03T00:00:00Z, with a gap of

Re: Thanks

2009-08-27 Thread gwk
Dave Searle wrote: Hi Gwk, It's a nice clean site, easy to use and seems very fast, well done! How well does it do in regards to SEO though? I noticed there's a lot of ajax going on in the background to help speed things up for the user (love the sliders), but seems to be lacking

Thanks

2009-08-27 Thread gwk
o Solr-seems-to-know-a-little-bit. We're running a nightly build of Solr 1.4 with SOLR-1240 applied for the dynamic facet count updates when using the sliders in the search screen. Again, thank you and if you have any suggestions or questions regarding our implementation, feel free to ask. Regards, gwk

Re: debugQuery=true issue

2009-07-29 Thread gwk
Hi, Thanks for your response, I'm still developing so the schema is still in flux so I guess that explains it. Oh and regarding the NPE, I updated my checkout and recompiled and now it's gone so I guess somewhere between revision 787997 and 798482 it's already been fixed.

Re: debugQuery=true issue

2009-07-28 Thread gwk
have misunderstood the purpose of optimize but that shouldn't have any effect on scoring should it? For what it's worth, I'm using dismax with the functionquery in bf. Regards, gwk Oops, it seems it's due to a fq in the same query, not because of the, there's a range q

Re: debugQuery=true issue

2009-07-28 Thread gwk
at org.apache.solr.search.QueryParsing.toString(QueryParsing.java:360) at org.apache.solr.search.QueryParsing.toString(QueryParsing.java:401) at org.apache.solr.search.QueryParsing.toString(QueryParsing.java:466) ... 23 more -Grant On Jul 27, 2009, at 10:59 AM, gwk wrote: gwk wrote: Hi,

Re: debugQuery=true issue

2009-07-27 Thread gwk
gwk wrote: Hi, I'm playing around with sorting via functionqueries, and I've set _val_ to the following: sum(product(always_on_top,5),recip(rord(publication_date),1,1000,1000)) Where the field always_on_top is a simple boolean field, where documents with always_on_top:true should

debugQuery=true issue

2009-07-27 Thread gwk
t the document in question had no publication_date field set (which is totally my fault) however it took quite a while to discover this since I couldn't turn on debugQuery. Is this a bug or expected behviour? Regards, gwk

Re: Faceting

2009-07-14 Thread gwk
lin Shekhar Mangar wrote: On Mon, Jul 13, 2009 at 7:56 PM, gwk wrote: Is there a good way to select the top X facets and include some terms you want to include as well something like facet.field=country&f.country.facet.limit=X&f.country.facet.includeterms=Narnia,Guilder or is ther

Faceting

2009-07-13 Thread gwk
Hi, I'm in the process of making a javascriptless web interface to Solr (the nice ajax-version will be built on top of it unobtrusively). Our database has a lot of fields and so I've grouped those with similar characteristics to make several different 'widgets' (like a numerical type which ge

Re: Numerical range faceting

2009-06-23 Thread gwk
Shalin Shekhar Mangar wrote: On Tue, Jun 23, 2009 at 4:55 PM, gwk wrote: I was wondering if someone is interested in a patch file and if so, where should I post it? This seems useful. Please open an issue and submit a patch. I'm sure there will be interest. Hi, I clean

Re: Numerical range faceting

2009-06-23 Thread gwk
gwk wrote: Hi, I'm currently using facet.query to do my numerical range faceting. I basically use a fixed price range of €0 to €1 in steps of €500 which means 20 facet.queries plus an extra facet.query for anything above €1. I use the inclusive/exclusive query as per my question

Numerical range faceting

2009-06-18 Thread gwk
erlooking something. Regards, gwk

Re: Range queries

2009-06-17 Thread gwk
Yes, this works perfectly, guess the "Never use equality comparison for floating point numbers"-rule was so strong in my mind I didn't even think to consider this possibility. Thanks, gwk Avlesh Singh wrote: Really sorry, this is what I meant: x:{5 TO 8} OR x:5 Cheers Avle

Range queries

2009-06-16 Thread gwk
Hi, When doing range queries it seems the query is either x:[5 TO 8] which means 5 <= x <= 8 or x:{5 TO 8} which means 5 < x < 8. But how do you get one half exclusive, the other inclusive for double fields the following: 5 <= x < 8? Is this possible? Regards, gwk

Re: How to combine facets count from multiple query into one query

2009-05-11 Thread gwk
_title}book_title Of course, this uses an 1.4 feature (tagging and excluding) Regards, gwk Jeffrey Tiong wrote: Hi, I have a schema that has the following fields, publisher_name book_title year abstract Currently if I do a facet count when I have a query "q=abstract:philosophy AND publisher_

Re: Distributed Search

2009-02-25 Thread gwk
Otis Gospodnetic wrote: Yes, that's the standard trick. :) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: gwk To: solr-user@lucene.apache.org Sent: Wednesday, February 25, 2009 5:18:47 AM Subject: Re: Distributed Search

Re: Distributed Search

2009-02-25 Thread gwk
Koji Sekiguchi wrote: gwk wrote: Hello, The wiki states 'When duplicate doc IDs are received, Solr chooses the first doc and discards subsequent ones', I was wondering whether "the first doc" is the doc of the shard which responds first or the doc in the first sha

Distributed Search

2009-02-23 Thread gwk
Hello, The wiki states 'When duplicate doc IDs are received, Solr chooses the first doc and discards subsequent ones', I was wondering whether "the first doc" is the doc of the shard which responds first or the doc in the first shard in the shards GET parameter? Regards, gwk

Facet Paging

2009-01-13 Thread gwk
ordinary query response's numFound attribute so you can determine how many pages there should be. Is it possible to request this information somehow in the same response and if possible how much does it impact performance? Regards, gwk

Re: DataImportHandler: UTF-8 and Mysql

2009-01-13 Thread gwk
Shalin Shekhar Mangar wrote: On Mon, Jan 12, 2009 at 3:48 PM, gwk wrote: 1. Posting UTF-8 data through the example post-script works and I get the proper results back when I query using the admin page. However, data imported through the DataImportHandler from a MySQL database

Re: Index is not created if my database table is large

2009-01-12 Thread gwk
ribute to your datasource, it fixed the problem for me. Regards, gwk

DataImportHandler: UTF-8 and Mysql

2009-01-12 Thread gwk
ute which acually represents that data, which comes from the RegexTransformer I understand but why then is the "column" attribute used instead of the "name"-attribute. This has confused me somewhat, any clarification would be greatly appreciated. Regards, gwk

Solr 1.3.0 with Jetty 6.1.14

2009-01-05 Thread gwk
pContext-scope, solr logs: org.apache.solr.core.SolrResourceLoader locateInstanceDir INFO: No /solr/home in JNDI org.apache.solr.core.SolrResourceLoader locateInstanceDir INFO: solr home defaulted to 'solr/' (could not find system property or JNDI) Am I doing something wrong here? Any help will be appreciated. Regards, gwk