Re: Fuzzy searching documents over multiple fields using Solr

2013-05-09 Thread Geert-Jan Brits
I didn't mention it but I'd like individual fields to contribute to the overall score on a continuum instead of 1 (match) and 0 (no match), which will lead to more fine-grained scoring. A contrived example: all other things equal a tv of 40 inch should score higher than a 38 inch tv when searching

Re: multiple dateranges/timeslots per doc: modeling openinghours.

2011-10-03 Thread Geert-Jan Brits
Interesting! Reading your previous blogposts, I gather that the to be posted 'implementation approaches' includes a way of making the SpanQueries available within SOLR? Also, would with your approach would (numeric) RangeQueries be possible as Hoss suggests? Looking forward to that 'implementation

Re: multiple dateranges/timeslots per doc: modeling openinghours.

2011-10-03 Thread Geert-Jan Brits
Thanks Hoss for that in-depth walkthrough. I like your solution of using (something akin to) FieldMaskingSpanQuery. Conceptually the Join-approach looks like it would work from paper, although

Re: multiple dateranges/timeslots per doc: modeling openinghours.

2011-10-11 Thread Geert-Jan Brits
Op 11 oktober 2011 03:21 schreef Chris Hostetter het volgende: > > : Conceptually > : the Join-approach looks like it would work from paper, although I'm not a > : big fan of introducing a lot of complexity to the frontend / querying > part > : of the solution. > > you lost me there -- i don't see

Re: social123 Data Appending Service

2012-01-26 Thread Geert-Jan Brits
No thanks, not sure which site you're talking about btw. But anyway, no thanks Op 26 januari 2012 19:41 schreef Aaron Biddar het volgende: > Hi there- > > I was on your site today and was not sure who to reach out to. My Company, > Social123, provides Social Data Appending for companies that pr

Re: Sort facet Field by name

2010-06-21 Thread Geert-Jan Brits
facet.sort=false http://wiki.apache.org/solr/SimpleFacetParameters#facet.sort 2010/6/21 Ankit Bhatnagar > Hi All, > I couldn't really figure out if we a have option for sorting the facet > field by name in ascending/descending. > > Any clues? > > Thanks > Ankit >

Re: Sort facet Field by name

2010-06-21 Thread Geert-Jan Brits
I want to be able to sort (asc/desc) the name ie toggle them > > > Ankit > > -Original Message- > From: Geert-Jan Brits [mailto:gbr...@gmail.com] > Sent: Monday, June 21, 2010 12:30 PM > To: solr-user@lucene.apache.org > Subject: Re: Sort facet Field by name >

Re: Searching across multiple repeating fields

2010-06-22 Thread Geert-Jan Brits
Perhaps my answer is useless, bc I don't have an answer to your direct question, but: You *might* want to consider if your concept of a solr-document is on the correct granular level, i.e: your problem posted could be tackled (afaik) by defining a document being a 'sub-event' with only 1 daterang

Re: Setting many properties for a multivalued field. Schema.xml ? External file?

2010-06-26 Thread Geert-Jan Brits
You can treat dynamic fields like any other field, so you can facet, sort, filter, etc on these fields (afaik) I believe the confusion arises that sometimes the usecase for dynamic fields seems to be ill-understood, i.e: to be able to use them to do some kind of wildcard search, e.g: search for a

Re: Setting many properties for a multivalued field. Schema.xml ? External file?

2010-06-26 Thread Geert-Jan Brits
> > I don't clearly see the reason of doing this. Is there a gain in terms of > performance? Or does this make programming on the client-side easier? Or > something else? > > > My other question was: in case we use Dynamic Fields, is there a > documentation about using Sol

Re: Setting many properties for a multivalued field. Schema.xml ? External file?

2010-06-26 Thread Geert-Jan Brits
btw, be careful with you delimiters: pic_url may possibly contain a '-', etc. 2010/6/26 Geert-Jan Brits > >If I understand your suggestion correctly, you said that there's NO need > to have many Dynamic Fields; instead, we can have one definitive field name, > wh

Re: How I can use score value for my function

2010-06-29 Thread Geert-Jan Brits
It's possible using functionqueries. See this link. http://wiki.apache.org/solr/FunctionQuery#query 2010/6/29 MitchK > > Ramzesua, > > this is not possible, because Solr does not know what is the resulting > score > at query-time (as far as I know). > The score will be computed, when every hit

Re: Re:Re: How to speed up solr search speed

2010-07-16 Thread Geert-Jan Brits
you mentioned that you have a lot of mem free, but your yetty containers only using between 40-60 mem. probably stating the obvious, but have you increased the -Xmx param like for instance: java -Xmx1024M -jar start.jar that way you're configuring the container to use a maximum of 1024 MB ram ins

Re: Re: How to speed up solr search speed

2010-07-17 Thread Geert-Jan Brits
>My query string is always simple like "design", "principle of design", "tom" >EG: >URL: http://localhost:7550/solr/select/?q=design&version=2.2&start=0&rows=10&indent=on IMO, indeed with these types of simple searches caching (and thus RAM usage) can not be fully exploited, i.e: there isn't reall

Re: indexing best practices

2010-07-18 Thread Geert-Jan Brits
Have you read: http://wiki.apache.org/lucene-java/ImproveIndexingSpeed http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Scaling-Lucene-and-Solr To be short there are only guidelines (see links) no definitive answers. If you followed the guidelines for improviing indexing sp

Re: Tree Faceting in Solr 1.4

2010-07-23 Thread Geert-Jan Brits
>If I am doing >facet=on & facet.field={!ex=State}State & fq={!tag=State}State:Karnataka >All it gives me is Facets on state excluding only that filter query.. But i >was not able to do same on third level ..Like facet.field= Give me the >counts of cities also in state Karantaka.. >Let me know s

Re: help with a schema design problem

2010-07-23 Thread Geert-Jan Brits
With the usecase you specified it should work to just index each "Row" as you described in your initial post to be a seperate document. This way p_value and p_type all get singlevalued and you get a correct combination of p_value and p_type. However, this may not go so well with other use-cases yo

Re: help with a schema design problem

2010-07-23 Thread Geert-Jan Brits
design my schema ? I have some solutions > but none seems to be a good solution. One way would be to define a single > field in the schema as p_value_type = "client pramod" i.e. combine the > value > from both the field and store it in a single field. > > > On Sat, Ju

Re: filter query on timestamp slowing query???

2010-07-23 Thread Geert-Jan Brits
just wanted to mention a possible other route, which might be entirely hypothetical :-) *If* you could query on internal docid (I'm not sure that it's available out-of-the-box, or if you can at all) your original problem, quoted below, could imo be simplified to asking for the last docid inserted

Re: help with a schema design problem

2010-07-23 Thread Geert-Jan Brits
alue:"Pramod" AND p_type:"Supplier" > > > > > > > > it would give me result as document 1. Which is incorrect, since in > > > > document > > > > 1 Pramod is a Client and not a Supplier. > > Would it? I would expect it to

Re: Tree Faceting in Solr 1.4

2010-07-24 Thread Geert-Jan Brits
Perhaps completely unnessecery when you have a controlled domain, but I meant to use ids for places instead of names, because names will quickly become ambiguous, e.g.: there are numerous different places over the world called washington, etc. 2010/7/24 SR > Hi Geert-Jan, > > What did you mean b

Re: Tree Faceting in Solr 1.4

2010-07-24 Thread Geert-Jan Brits
I believe we use an in-process weakhashmap to store the id-name relationship. It's not that we're talking billions of values here. For anything more mem-intensive we use no-sql (tokyo tyrant through memcached protocol at the moment) 2010/7/24 Jonathan Rochkind > > Perhaps completely unnessecery

Re: Which is a good XPath generator?

2010-07-25 Thread Geert-Jan Brits
I am assuming (like Li I think) that you want to induce a structure/schema from a html-example so you can use that schema to extract data from similiar html-structured pages. Another term often used in literature for that is "Wrapper Induction". Beside DOM, using CSS-classes often give good disti

Re: 2 type of docs in same schema?

2010-07-26 Thread Geert-Jan Brits
You can easily have different types of documents in 1 core: 1. define searchquery as a field(just as the others in your schema) 2. define type as a field (this allows you to decide which type of documents to search for, e.g: "type_normal" or "type_search") now searching on regular docs becomes: q

Re: 2 type of docs in same schema?

2010-07-26 Thread Geert-Jan Brits
" as required so you don't forget to include in in your indexing-program) 2010/7/26 > > Thanks for you answer! That's great. > > Now to index search quieries data is there something special to do? or it > stay as usual? > > > > > > >

Re: advice on creating a solr index when data source is from many unrelated db tables

2010-07-29 Thread Geert-Jan Brits
I can interprete your question in 2 different ways: 1. Do you want to index several heterogenous documents all coming from different tables? So documents of type "tableA" are created and indexed alongside documents of type "tableB", "tableC", etc. 2. Do you want to combine unrelated data from 15 ta

Re: Quering the database

2010-08-02 Thread Geert-Jan Brits
you should (as per the example) define the field as text in your solr-schema not in your RDB. something like: then search like: q=field_1:monitors the example schema illustrates a lot of the possibilities on how you to define fields and what is all means. Moreover have a look at: http://wiki.ap

Re: Quering the database

2010-08-03 Thread Geert-Jan Brits
No. With Solr is really flexible and allows for a lot of complex querying out-of-the-box. Really the Wiki is your best friend here. http://wiki.apache.org/solr/ perhaps start with: 1. http://lucene.apache.org/solr/tutorial.html 2. http://wiki.apache.org/solr/SolrQuerySyntax 3. http://wiki.apache.o

Re: Best solution to avoiding multiple query requests

2010-08-04 Thread Geert-Jan Brits
Field Collapsing (currently as patch) is exactly what you're looking for imo. http://wiki.apache.org/solr/FieldCollapsing Geert-Jan 2010/8/4 Ken Krugler > Hi all, > > I've got a situation where the key result from an initial search request > (let's

Re: Best solution to avoiding multiple query requests

2010-08-04 Thread Geert-Jan Brits
helps, Geert-jan 2010/8/4 Ken Krugler > Hi Geert-Jan, > > > On Aug 4, 2010, at 5:30am, Geert-Jan Brits wrote: > > Field Collapsing (currently as patch) is exactly what you're looking for >> imo. >> >> http://wiki.apache.org/solr/FieldCollapsing >> &g

Re: how to take a value from the query result

2010-08-05 Thread Geert-Jan Brits
you should parse the xml and extract the value. Lot's of libraries undoubtably exist for PHP to help you with that (I don't know PHP) Moreover, if all you want from the result is AUC_CAT you should consider using the fl=param like: http://172.16.17.126:8983/search/select/?q=AUC_ID:607136&fl=AUC_CA

Re: No "group by"? looking for an alternative.

2010-08-05 Thread Geert-Jan Brits
If I understand correctly: 1. products have different product variants ( in case of shoes a combination of color and size + some other fields). 2. Each product is shown once in the result set. (so no multiple product variants of the same product are shown) This would solve that IMO: 1, create 1 d

Re: XML Format

2010-08-06 Thread Geert-Jan Brits
at first glance I see no difference between the 2 documents. Perhaps you can illustrate which fields are not in the resultset that you want to be there? also use the 'fl'-param to describe which fields should be outputted in your results. Of course, you have to first make sure the fields you want

Re: How do i update some document when i use sharding indexs?

2010-08-09 Thread Geert-Jan Brits
I'm not sure if Solr has some build-in support for sharding-functions, but you should generally use some hashing-algorithm to split the indices and use the same hash-algorithm to locate which shard contains a document. http://en.wikipedia.org/wiki/Hash_function Without employing any domain knowled

Re: How do i update some document when i use sharding indexs?

2010-08-09 Thread Geert-Jan Brits
Just to be completely clear: the program that splits your index in 20 shards should employ this algo as well. 2010/8/9 Geert-Jan Brits > I'm not sure if Solr has some build-in support for sharding-functions, but > you should generally use some hashing-algorithm to split the indi

Re: how to support "implicit trailing wildcards"

2010-08-10 Thread Geert-Jan Brits
you could satisfy this by making 2 fields: 1. exactmatch 2. wildcardmatch use copyfield in your schema to copy 1 --> 2 . q=exactmatch:mount+wildcardmatch:mount*&q.op=OR this would score exact matches above (solely) wildcard matches Geert-Jan 2010/8/10 yandong yao > Hi Bastian, > > Sorry for n

Re: Filter Performance in Solr 1.3

2010-08-11 Thread Geert-Jan Brits
fq's are the preferred way to use for filtering when the same filter is often used. (since the filter-set can be cached seperately) . as to your direct question: > My question is whether there is anything that can be done in 1.3 to help alleviate the problem, before upgrading to 1.4? I don't thi

Re: How to Debug Sol-Code in Eclipse ?!

2010-08-22 Thread Geert-Jan Brits
1. download solr lib and import them in your project. 2. download solr source-code of the same version and attach in to the libraries. (I haven't got eclipse open but it is something like project -> settings -> jre/libraries?) 3. write a small program yourself which calls EmbededSolrServer and step

Re: Solr search speed very low

2010-08-25 Thread Geert-Jan Brits
have a look at http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters to see how that works. 2010/8/25 Marco Martinez > You should use the tokenizer solr.WhitespaceTokenizerFactory in your field > type to get your terms indexed, once you have indexed the data, you dont > need to use the * i

Re: solr working...

2010-08-26 Thread Geert-Jan Brits
Check out Drew Farris' explantion for remote debugging Solr with Eclipse posted a couple of days ago: http://lucene.472066.n3.nabble.com/How-to-Debug-Sol-Code-in-Eclipse-td1262050.html Geert-Jan 2010/8/26 Michael

Re: questions about synonyms

2010-08-31 Thread Geert-Jan Brits
concerning: > . I got a very big text file of synonyms. How I can use it? Do I need to index this text file first? have you seen http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#SynonymFilter ? Cheers, Geert-Jan

Re: High - Low field value?

2010-09-01 Thread Geert-Jan Brits
StatsComponent is exactly what you're looking for. http://wiki.apache.org/solr/StatsComponent Cheers, Geert-Jan 2010/9/1 kenf_nc > > I want to do range facets on a couple fields, a Price field in particular. > But Price is relative to the product typ

Re: Is there a way to fetch the complete list of data from a particular column in SOLR document?

2010-09-07 Thread Geert-Jan Brits
>Please let me know if there are any other ideas / suggestions to implement this. You're indexing program should really take care of this IMHO. Each time your indexer inserts a document to Solr, flag the corresponding entity in your RDBMS, each time you delete, remove the flag. You should implemen

Re: Is there a way to fetch the complete list of data from a particular column in SOLR document?

2010-09-09 Thread Geert-Jan Brits
e process query solr for documents in the indexing > state and set them to committed if they are queryable in solr. > > On Tue, Sep 7, 2010 at 14:26, Geert-Jan Brits wrote: > >>Please let me know if there are any other ideas / suggestions to > implement > > this. > >

Re: dynamic "stop" words?

2010-10-09 Thread Geert-Jan Brits
That might work, although depending on your use-case it might be hard to have a good controlled vocab on citynames (hotel metropole bruxelles, hotel metropole brussels, hotel metropole brussel, etc.) Also 'hotel paris bruxelles' stinks... given your example: > Doc 1 > name => "Holiday Inn" > ci

Re: How to Facet on a price range

2010-11-09 Thread Geert-Jan Brits
Just to add to this, if you want to allow the user more choice in his option to select ranges, perhaps by using a 2-sided javasacript slider for the pricerange (ala kayak.com) it may be very worthwhile to discretize the allowed values for the slider (e.g: steps of 5 dolllar) Most js-slider implemen

Re: How to Facet on a price range

2010-11-09 Thread Geert-Jan Brits
spread out. > > Regards, > > gwk > > > On 11/9/2010 3:33 PM, Geert-Jan Brits wrote: > >> Just to add to this, if you want to allow the user more choice in his >> option >> to select ranges, perhaps by using a 2-sided javasacript slider for the >> p

Re: How to Facet on a price range

2010-11-10 Thread Geert-Jan Brits
Ah I see: like you said it's part of the facet range implementation. Frontend is already working, just need the 'update-on-slide' behavior. Thanks Geert-Jan 2010/11/10 gwk > On 11/9/2010 7:32 PM, Geert-Jan Brits wrote: > >> when you drag the sliders , an update of h

Re: Facet showing MORE results than expected when its selected?

2010-11-10 Thread Geert-Jan Brits
Another option : assuming themes_raw is type 'string' (couldn't get that nugget of info for 100%) it could be that you're seeing a difference in nr of results between the 110 for fq:themes_raw and 321 from your db, because fieldtype:string (thus themes_raw) is case-sensitive while (depending on y

Re: modeling prices based on daterange using multipoints

2012-12-12 Thread Geert-Jan Brits
2012/12/12 David Smiley (@MITRE.org) > britske wrote > > Hi David, > > > > Yeah interesting (as well as problematic as far is implementing) use-case > > indeed :) > > > > 1. You mention "there are no special caches / memory requirements > inherent > > in this.". For a given user-query this would

Re: SOLR and secure content

2010-11-23 Thread Geert-Jan Brits
> When making a query these fields should be required. Is it possible to configure handlers on the solr server so that these field are required whith each type of query? So for adding documents, deleting and querying? have a look at 'invariants' (and 'appends') in the example solrconfig. They can

Re: Is this sort order possible in a single query?

2010-11-24 Thread Geert-Jan Brits
You could do it with sorting on a functionquery (which is supported from solr 1.5) http://wiki.apache.org/solr/FunctionQuery#Sort_By_Function Consider the search: http://localhost:8093/solr/select?author:'j.k.rowling' sorting like you sp

Re: How to get facet counts without fields that are constrained by themselves?

2010-11-24 Thread Geert-Jan Brits
http://wiki.apache.org/solr/SimpleFacetParameters#Tagging_and_excluding_Filters 2010/11/24 Petrov Sergey > I need to retrieve result of query and facet counts for all searchable > document fields. I can't get corre

Re: Is this sort order possible in a single query?

2010-11-24 Thread Geert-Jan Brits
ion. i'm a little confused about solr 1.5, > especially > after finding this wiki page: > > http://wiki.apache.org/solr/Solr1.5 > > Is there a stable build available for version 1.5, so i can test your > suggestion > using functionquery? > > > -robert > &

Re: schema design for related fields

2010-12-01 Thread Geert-Jan Brits
"if first is selected in the user interface and we have 10 price ranges query would be 120 cluases (12 months * 10 price ranges)" What would you intend to do with the returned facet-results in this situation? I doubt you want to display 12 categories (1 for each month) ? When a user hasn't select

Re: schema design for related fields

2010-12-01 Thread Geert-Jan Brits
Ok longer answer than anticipated (and good conceptual practice ;-) Yeah I belief that would work if I understand correctly that: 'in Jan [9] in feb [10] in march [1]' has nothing to do with pricing, but only with availability? If so you could seperate it out as two seperate issues: 1. ) showi

Re: schema design for related fields

2010-12-01 Thread Geert-Jan Brits
Also, filtering and sorting on price can be done as well. Just be sure to use the correct price- field. Geert-Jan 2010/12/1 Geert-Jan Brits > Ok longer answer than anticipated (and good conceptual practice ;-) > > Yeah I belief that would work if I understand correctly that: >

Re: schema design for related fields

2010-12-01 Thread Geert-Jan Brits
in this case > just complicated. I'll see if i can hack some facets into the proto type > tommorrow. Thanks for your help > > Lee C > > On 1 December 2010 17:57, Geert-Jan Brits wrote: > > > Ok longer answer than anticipated (and good conceptual practice ;-) >

Re: finding exact case insensitive matches on single and multiword values

2010-12-03 Thread Geert-Jan Brits
when you went from strField to TextField in your config you enabled tokenizing (which I believe splits on spaces by default), which is why you see seperate 'words' / terms in the debugQuery-explanation. I believe you want to keep your old strField config and try quoting: fq=city:"den+haag" or fq=

Re: Search based on images

2010-12-11 Thread Geert-Jan Brits
Well-known algorithms for detecting 'highly descriptive features' in images that can cope with scaling and rotation (up to a certain degree of course) are SIFT and SURF (SURF is generally considered the more mature of the two afaik) http://en.wikipedia.org/wiki/Scale-invariant_feature_transform h

Re: Consequences for using multivalued on all fields

2010-12-21 Thread Geert-Jan Brits
You should be aware that the behavior of sorting on a multi-valued field is undefined. After all, which of the multiple values should be used for sorting? So if you need sorting on the field, you shouldn't make it multi-valued. Geert-Jan 2010/12/21 J.J. Larrea > Someone please correct me if I a

Re: Sub query using SOLR?

2011-01-05 Thread Geert-Jan Brits
Bbarani probably wanted to be able to create the query without having to prefetch the ids at the clientside first. But I agree this is the only stable solution I can think of (so excluding possible patches) Geert-Jan 2011/1/5 Grijesh.singh > > Why thinking so complex,just use result of first qu

Re: Search on two core and two schema

2011-01-18 Thread Geert-Jan Brits
>>Schemas are very differents, i can't group them. In contrast to what you're saying above, you may rethink the option of combining both type of documents in a single core. It's a perfectly valid approach to combine heteregenous documents in a single core in Solr. (and use a specific field -say 't

Re: one last questoni on dynamic fields

2011-01-23 Thread Geert-Jan Brits
Yep you can. Although I'm not sure you can use a wildcard-prefix. (perhaps you can I'm just not sure) . I always use wildcard-suffixes. Cheers, Geert-Jan 2011/1/23 Dennis Gearon > Is it possible to use ONE definition of a dynamic field type for inserting > mulitple dynamic fields of that type w

Re: Faceting Question

2011-01-24 Thread Geert-Jan Brits
> &fq={!tag=tag1}tags:( |1003| |1007|) AND tags:( >|10015|)&version=2.2&start=0&rows=10&indent=on&facet=on&facet.field={!ex=tag1}category&facet.field=capacity&facet.field=brand I'm just guessing here, but perhaps {!tag=tag1} is only picking up the 'tags:( |1003| |1007|) '-part. If so {!ex=tag1} wo

Re: Function Question

2011-02-03 Thread Geert-Jan Brits
I don't have a direct answer to your question, but you could consider having fields: latCombined and LongCombined where you pairwise combine the latitudes and longitudes, e.g: latCombined: 48.0-49.0-50.0 longcombined: 2.0-3.0-4.0 Than in your custom scorer above split latC

Re: Index Not Matching

2011-02-03 Thread Geert-Jan Brits
Make sure your index is completely commited. curl 'http://localhost:8983/solr/update?commit=true' http://wiki.apache.org/solr/UpdateXmlMessages#A.22commit.22_and_.22optimize.22 for an overview: http://lucene.apache.org/solr/tutorial.html hth, Geert-Jan

Re: Sort Stability With Date Boosting and Rounding

2011-02-22 Thread Geert-Jan Brits
You could always use a secondary sort as a tie-breaker, i.e: something unique like 'documentid' or something. That would ensure a stable sort. 2011/2/23 Stephen Duncan Jr > I'm trying to use > > http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_boost_the_score_of_newer_documents > as > a bf

Re: Problem with sorting using functions.

2011-02-28 Thread Geert-Jan Brits
sort by functionquery is only available from solr 3.1 (from : http://wiki.apache.org/solr/FunctionQuery#Sort_By_Function) 2011/2/28 John Sherwood > This works: > /select/?q=*:*&sort=price desc > > This throws a 400 error: > /select/?q=*:*&sort=sum(1, 1) desc > > "Missing sort order." > > I'm us

Re: [ANNOUNCE] Web Crawler

2011-03-02 Thread Geert-Jan Brits
Hi Dominique, This looks nice. In the past, I've been interested in (semi)-automatically inducing a scheme/wrapper from a set of example webpages (often called 'wrapper induction' is the scientific field) . This would allow for fast scheme-creation which could be used as a basis for extraction. L

Re: Efficient boolean query

2011-03-02 Thread Geert-Jan Brits
If you often query X as part of several other queries (e.g: X | X AND Y | X AND Z) you might consider putting X in a filter query ( http://wiki.apache.org/solr/CommonQueryParameters#fq) leading to: q=*:*&fq=X q=Y&fq=X q=Z&fq=X Filter queries are cached seperately which means that after the firs

Re: how would you design schema?

2011-03-09 Thread Geert-Jan Brits
Would having a solr-document represent a 'product purchase per account' solve your problem? You could then easily link the date of purchase to the document as well as the account-number. e.g: fields: orderid (key), productid, product-characteristics, order-characteristics (including date of purcha

Re: Solr

2011-03-10 Thread Geert-Jan Brits
Start by reading http://wiki.apache.org/solr/FrontPage and the provided links (introduction, tutorial, etc. ) 2011/3/10 yazhini.k vini > Hi , > > I need notes and detail about solr because of Now I am working in solr so i > need help . > > > Regards , > > Yazhini . K > NCSI , > M.Sc ( Softwar

Re: Getting Category ID (primary key)

2011-03-11 Thread Geert-Jan Brits
If it works, it's performant and not too messy it's a good way :-) . You can also consider just faceting on Id, and use the id to fetch the categoryname through sql / nosql. That way your logic is seperated from your presentation, which makes extending (think internationalizing, etc.) easier. Not s

Re: Solr and Permissions

2011-03-11 Thread Geert-Jan Brits
About the 'having to reindex when permissions change'-problem: have a look at ExternalFileField http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html which enables you to reload a file withou

Re: Solr and Permissions

2011-03-12 Thread Geert-Jan Brits
Ahh yes, sorry about that. I assumed ExternalFileField would work for filtering as well. Note to self: never assume Geert-Jan 2011/3/12 Koji Sekiguchi > (11/03/12 10:28), go canal wrote: > >> Looking at the API doc, it seems that only floating value is currently >> supported, is it true? >> > >

Re: Solr query POST and not in GET

2011-03-15 Thread Geert-Jan Brits
Yes it's possible. Assuming your using SolrJ as a client-library: set: QueryRequest req = new QueryRequest(); req.setMethod(METHOD.POST); Any other client-library should have a similar method. hth, Geert-Jan 2011/3/15 Gastone Penzo > Hi, > is possible to change Solr sending query method from

Re: Solr Query

2011-03-15 Thread Geert-Jan Brits
> But it returns all resuts with MSRP = 1 and doesnt consider 2nd query at all. I believe you mean: 'it returns all results with RetailPriceCodeID = 1 while ignoring the 2nd query?' If so, please check that your default operator is set to AND in your schema config. Other than that, your syntax se

Re: Adding the suggest component

2011-03-18 Thread Geert-Jan Brits
> 2011-03-18 14:11:02.284:INFO::Started SocketConnector@0.0.0.0:8983 Solr started on port 8983 instead of this: > http://localhost/solr/admin/ try this instead: http://localhost:8983/solr/admin/ Cheers, Geert-Jan 2011/3/18 Brian Lamb > That does seem like a be

Re: working with collection : Where is default schema.xml

2011-03-22 Thread Geert-Jan Brits
Changing the default schema.xml to what you want is the way to go for most of us. It's a good learning experience as well, since it contains a lot of documentation about the options that may be of interest to you. Cheers, Geert-Jan 2011/3/22 geag34 > Ok thank. > > It is my fault. I have created

Re: copyField destination does not exist

2011-03-28 Thread Geert-Jan Brits
The error is saying you have a copyfield-directive in schema.xml that wants to copy the value of a field to the destination field 'text' that doesn't exist (which indeed is the case given your supplied fields) Search your schema.xml for 'copyField'. There's probably something configured related to

Re: Implementing hierarchical facet

2010-03-02 Thread Geert-Jan Brits
If it's a requirement to let Solr handle the facet-hierarchy please disregard this post, but an alternative would be to have your App control when to ask for which 'facet-level' (e.g: country, state, city) in the hierarchy. as follows, each doc has 3 seperate fields (indexed=true, stored=false):

Re: Implementing hierarchical facet

2010-03-02 Thread Geert-Jan Brits
Using Solr 1.4: even less changes to the frontend: &facet=on&facet.field={!key=Location}countryid ... &facet=on&facet.field={!key=Location}cityid&fq=countryid: etc. will consistently render the resulting facet under the name "Location" . 2010/3/3 Geert-Jan Brit

Re: Implementing hierarchical facet

2010-03-03 Thread Geert-Jan Brits
vels in the hierarchy. Adding any > levels beyond that requires schema modification > > Does that sound right? > > Option 3 is probably the best match for my use case. Is there any trick to > make it able to deal with arbitrary number of levels? > > Thanks. >

Re: Will Solr fit our needs?

2010-03-17 Thread Geert-Jan Brits
If you dont' plan on filtering/ sorting and/or faceting on fast-changing fields it would be better to store them outside of solr/lucene in my opinion. If you must: for indexing-performance reasons you will probably end up with maintaining seperate indices (1 for slow-changing/static fields and 1 f

Re: Multi Select Facets through Java API

2010-03-22 Thread Geert-Jan Brits
something like this? q=mainquery&fq={!tag=carfq}cars:corvette OR cars:camaro&facet=on&facet.field={!ex=carfq key=carfacet}cars -the facet: "carfacet" is indepedennt of the filter query that filters on cars. -you construct the filter query (fq={!tag=carfq}cars:corvette OR cars:camaro) yourself in

Re: Impossible Boost Query?

2010-03-25 Thread Geert-Jan Brits
Have a look at functionqueries. http://wiki.apache.org/solr/FunctionQuery You could for instance use your regular score and multiply it with RandomValueSource bound between 1.0 and 1.1 for example. This would at least break ties in a possibly natural loo

Re: limit rows by field

2010-04-13 Thread Geert-Jan Brits
I believe you're talking about Fieldcollapsing. It's available as a patch, although I'm not sure how well it applies to the current trunk. for more info check out: http://wiki.apache.org/solr/FieldCollapsing Geert-Jan 2010/4/13 Felix Zimmermann > Hi

Re: seemingly impossible query

2010-05-20 Thread Geert-Jan Brits
Would each Id need to return a different doc? If not: you could probably use FieldCollapsing: http://wiki.apache.org/solr/FieldCollapsing i.e: - collapse on listOfIds (see wiki entry for syntax) - constrain the field to only return the id's you wa

Re: seemingly impossible query

2010-05-20 Thread Geert-Jan Brits
Hi Kallin, again please look at FieldCollapsing , that should do the trick. basically: first you constrain the field: 'listOfIds' to only contain docs that contain any of the (up to) 100 random ids as you know how to do Next, in the same query, specify

Re: Personalized Search

2010-05-21 Thread Geert-Jan Brits
Just want to throw this in: If you're worried about scaling, etc. you could take a look at item-based collaborative filtering instead of user based. i.e: DO NIGHTLY/ BATCH: - calculate the similarity between items based on their properties DO ON EACH REQUEST - have a user store/update it's interes

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-27 Thread Geert-Jan Brits
Something like sliders perhaps? Of course only numerical ranges can be put into sliders. (or a concept that may be logically presented as some sort of ordening, such as "bad, hmm, good, great" Use Solr's Statscomponent to show the min and max values Have a look at tripadvisor.com for good uses/im

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-27 Thread Geert-Jan Brits
culous and a bunch of other libs) Geert-Jan 2010/5/27 Lukas Kahwe Smith > > On 27.05.2010, at 23:32, Geert-Jan Brits wrote: > > > Something like sliders perhaps? > > Of course only numerical ranges can be put into sliders. (or a concept > that > > may be

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-31 Thread Geert-Jan Brits
p. > > I guess the point is that, if it's a really important thing, then you need > to expose it in another way on mobile. > > Just passing this on, please don't shoot the messenger. ;-) > > Mark > > -- > Mark Bennett / New Idea Engineering, Inc. / mbenn...@ide

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-31 Thread Geert-Jan Brits
Interesting.. say you have a double slider with a discrete range (like tripadvisor et.al.) perhaps it would be a good guideline to use these discrete points for the quantum interval for the sparkline as well? Of course it then becomes the question which discrete values to use for the slider. I te

Re: Sites with Innovative Presentation of Tags and Facets

2010-05-31 Thread Geert-Jan Brits
May I ask how you implemented getting the facet counts for each interval? Do you use a facet-query per interval? And perhaps for inspiration a link to the site you implemented this .. Thanks, Geert-Jan I love the idea of a sparkline at range-sliders. I think if I have time, I > might add them to

Re: Interleaving the results

2010-06-01 Thread Geert-Jan Brits
Indeed, it's just a matter of ordening the results on the client-side IFF I infer correctly from your description that you are guarenteed to get results from enough different customers from SOlr in the first place to do the interleaving that you describe. (In general this is a pretty big IF). So a

Re: Regarding Facet Date query using SolrJ -- Not getting any examples to start with.

2010-06-02 Thread Geert-Jan Brits
Hi Ninad, SolrQuery q = new SolrQuery(); q.setQuery("*:*"); q.setFacet(true); q.set("facet.data", "pub"); q.set("facet.date.start", "2000-01-01T00:00:00Z") ... etc. basically you can completely build your entire query with the 'raw' set (and add) methods. The specific methods are just helpers. S

Re: exclude docs with null field

2010-06-04 Thread Geert-Jan Brits
field1:"new york"+field2:"new york"+field3:[* TO *] 2010/6/4 bluestar > hi there, > > say my search query is "new york", and i am searching field1 and field2 > for it, how do i specify that i want to exlude docs where field3 doesnt > exist? > > thanks > >

Re: exclude docs with null field

2010-06-04 Thread Geert-Jan Brits
Additionally, I should have mentioned that you can instead do: fq=field_3:[* TO *], which uses the filtercache. The method presented by Chris will probably outperform the above method but only on the first request, from then on the filtercache takes over. >From a performance standpoint it's probab

  1   2   >