Re: How to generate tag cloud in SOLR?

2010-11-23 Thread Peter Karich
Hi, another way is to use facets for the tagcloud as we did it in jetwick. Every document then needs a tag field (multivalued). See: https://github.com/karussell/Jetwick/blob/master/src/main/java/de/jetwick/ui/TagCloudPanel.java for an example with wicket and SolrJ. With that you could also

Re: commit=true has no effect

2010-11-23 Thread stockiii
Okay, sry and thx for reply. I Know the Links that you post and i Know the Most dih Settings from Wiki. Im Not New in solr ... Dih says To me After a Delta that Some documenty changed, but He Dong want to Commit. The auery is nö Broken, i check this, changed the query and expert with. But with n

Re: copyField is not tokenizing the values at index time

2010-11-23 Thread Shanmugavel SRD
Thanks Erick. -- View this message in context: http://lucene.472066.n3.nabble.com/copyField-is-not-tokenizing-the-values-at-index-time-tp1952756p1958946.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How to generate tag cloud in SOLR?

2010-11-23 Thread Otis Gospodnetic
Hi, If you go to http://search-lucene.com and start typing Tagcl , the AutoComplete functionality will point you to one way of doing this: http://search-lucene.com/m/xqRF4Pd6ec1 . That's at the Lucene level. At Solr level, you can use TermsComponent - http://wiki.apache.org/solr/TermsComponen

Re: Containers running SOLR: supported or unsupported?

2010-11-23 Thread Otis Gospodnetic
Hi James, Most of our customers use Tomcat or Jetty. We use Jetty for http://search-lucene.com/ . Both containers have commercial support available. We are very happy with Jetty and never needed tech support (jinx!). Some of our customers use JBoss for reasons similar to yours (company al

Re: Amazon like search function

2010-11-23 Thread Otis Gospodnetic
Hi Nick, You can also have a look at http://sematext.com/products/dym-researcher/index.html , which will help. That particular piece doesn't do *exactly* what you are describing, but it also includes a component that does pretty much what you are after (even though it is not described at the a

How to generate tag cloud in SOLR?

2010-11-23 Thread Pawan Darira
Hi Is there a way out in SOLR/lucene to generate a tag cloud? -- Thanks, Pawan Darira

Re: What tokenizer is good for breaking host names

2010-11-23 Thread Otis Gospodnetic
Hello, Another common and good thing to do with data like hostnames, email addresses, URLs, etc. is to reverse them on separators (e.g. www.foo.com becomes com.foo.www) and then tokenize them so you end up with multiple tokens indexed: com com.foo com.foo.www . You can see how that can be bene

RE: unknown field 'name'

2010-11-23 Thread sivaprasad
The field names in the xml and schema.xml should be matched -Original Message- From: "McGibbney, Lewis John [via Lucene]" Sent: Tuesday, November 23, 2010 4:01pm To: "sivaprasad" Subject: unknown field 'name' Good Evening List, I have been working with Nutch and due to numerous int

RE: DIH - index Multiple tables in Database?

2010-11-23 Thread sivaprasad
I think, the query it self is wrong, The query should be select id_bread,buttertype from bread,butter where a.id_bread=b.id_bread -Original Message- From: "snowyeoghan [via Lucene]" Sent: Tuesday, November 23, 2010 4:22pm To: "sivaprasad" Subject: DIH - index Multiple tables in

Re: Master/Slave High CPU Usage

2010-11-23 Thread Lance Norskog
You're welcome. On Tue, Nov 23, 2010 at 3:55 PM, Ofer Fort wrote: > ok, we ran some tests and doing the commit for the "slave" as a post commit > event of the "master" reloaded the index and allowed us to achieve a master > slave configuration, without replication > This is useful only if your ma

Re: own way synonyms

2010-11-23 Thread Dennis Gearon
Why not add an extra column for 'site created synonym tags' and fill it at index time using server side code? Then just include the tag field in the search. Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea

RE: own way synonyms

2010-11-23 Thread Jonathan Rochkind
One way synonym at index time. java, python, php => programming No synonyms at query time. So the terms "java", "python" and "php" all get indexed as both their original word and the token "programming". Then a query including the token "programming" will match those. Oops, wait, except I

RE: Special Parent / Child relationship - advice / observations welcome on how to approach this

2010-11-23 Thread Jonathan Rochkind
I gather that your solr documents are the "Title Information" units. Have you considered making your Solr document collection be the "book information" units instead? Each "book information" document will have (yes, de-normalized) the same "title" information as all the other book documents be

Re: SOLR and secure content

2010-11-23 Thread Geert-Jan Brits
> When making a query these fields should be required. Is it possible to configure handlers on the solr server so that these field are required whith each type of query? So for adding documents, deleting and querying? have a look at 'invariants' (and 'appends') in the example solrconfig. They can

Special Parent / Child relationship - advice / observations welcome on how to approach this

2010-11-23 Thread Bob Sandiford
Hi, Long post - sorry... I have a relatively special case of a Parent / Child relationship that I'm trying to model. I'm currently using Solr 1.4.1 and Lucene 2.9.3 For example, my Parent documents represent "Title Information" (e.g. bibliographic information), and each Parent document can co

Re: SOLR and secure content

2010-11-23 Thread Dennis Gearon
Solr basically does ONE thing (and related things) very well. Doing all the error messaging that yuou want would be fighting all the specialization builit into Solr/Lucene code. Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is us

Re: Master/Slave High CPU Usage

2010-11-23 Thread Ofer Fort
ok, we ran some tests and doing the commit for the "slave" as a post commit event of the "master" reloaded the index and allowed us to achieve a master slave configuration, without replication This is useful only if your master and slave are on the same machine, and it helps reducing the resources

Re: problems with class loaders when adding new FieldType subclass

2010-11-23 Thread Erick Erickson
Does this help? http://wiki.apache.org/solr/SolrPlugins See especially "How to load plugins"... Best Erick On Tue, Nov 23, 2010 at 12:28 PM, Stefanie Scherzinger wrote: > Hi, > > I am trying to define a custom FieldType for SOLR 1.4.1. > > To this end, I created the JAR file TestType.jar, I add

Re: own way synonyms

2010-11-23 Thread Markus Jelsma
Yes, you can do that. Please see the wiki for specifics and good examples: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory > a better example might be this: > > When someone searches on "programming" i want it to return results with > java OR python OR php b

Re: own way synonyms

2010-11-23 Thread solruser2010
a better example might be this: When someone searches on "programming" i want it to return results with java OR python OR php but I don't want a search for "java" to return documents with php programming,java,php,python -- View this message in context: http://lucene.472066.n3.nabble.com/own

problems with class loaders when adding new FieldType subclass

2010-11-23 Thread Stefanie Scherzinger
Hi, I am trying to define a custom FieldType for SOLR 1.4.1. To this end, I created the JAR file TestType.jar, I added a lib path in solrconfig.xml so that this JAR file as well as apache-solr-core-1.4.1.jar and lucene-core-2.9.3.jar are included. I then edited schema.xml and defined my new typ

Re: own way synonyms

2010-11-23 Thread Markus Jelsma
Even without expanding the synonym definitions, word1 and word2 will match THISKEYWORD. Although word1 != word2, both will still match THISKEYWORD and in a sense be... well, synonyms, different word/tone but same meaning. What are you trying to achieve here? Using synomyms for what they are not

Re: own way synonyms

2010-11-23 Thread sara motahari
I think you can do this by THISKEYWORD => word1,word2,word3,word4 you can try it and then see if it work by analysing it on the analyzer tab on the admin page. From: solruser2010 To: solr-user@lucene.apache.org Sent: Tue, November 23, 2010 1:44:20 PM Subject:

Re: SOLR and secure content

2010-11-23 Thread Jos Janssen
Dennis, We will we serving the content to de indexed websites. As i wrote we will be looking into setting up different cores, 1 core for each website. This wil make sure the content is sperated for each individual indexed website. The so called "error" handling is only needed, in case of bad pro

own way synonyms

2010-11-23 Thread solruser2010
Hi, Is it possible to set up synonyms to work like this THISKEYWORD = word1,word2,word3,word4 but have it so word1 != word2 != word3 in this theoretical example a search for fishing would be set up like this. fishing,sport,water,boat,bait Thanks -- View this message in context: http:/

Re: unknown field 'name'

2010-11-23 Thread Markus Jelsma
I see i missed the `name`-part. Did you really start the example with java - jar start.jar in the example directory? Name is a defined field in the shipped schema. > Hi, > > Strange, the example schema should work with the example documents. Anyway, > see your Solr output as it will show you whi

Re: unknown field 'name'

2010-11-23 Thread Markus Jelsma
Hi, Strange, the example schema should work with the example documents. Anyway, see your Solr output as it will show you which field it complains about. Cheers, > Good Evening List, > > I have been working with Nutch and due to numerous integration advantages I > decided to get to grips with t

Reading timestamp for DIH

2010-11-23 Thread Siddharth Powar
Hey, Is it possible to read the timestamp that the DataImportHandler uses for a delta-import from a location other than "conf/dataimport.properties". Thanks, Sid

unknown field 'name'

2010-11-23 Thread McGibbney, Lewis John
Good Evening List, I have been working with Nutch and due to numerous integration advantages I decided to get to grips with the Solr code base. Solr dist - 1.4.1 java version 1.6.0_22 Windows Vista Home Premium Command Prompt to execute commands I encountered the following problem very early on

Re: SOLR and secure content

2010-11-23 Thread Dennis Gearon
Solr basically does ONE thing (and related things) very well. Doing all the error messaging that yuou want would be fighting all the specialization builit into Solr/Lucene code. Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is us

Re: SOLR and secure content

2010-11-23 Thread Dennis Gearon
I can see no reason to keep separate web sites information in the same index. If it's not being served to a website at all, why have data from another website in 'accidental' proximity to it? Someday, a coder WILL make a mistake, or a library upgrade will allow access. Best at least sort dat

Re: Need Middleware between search client and solr?

2010-11-23 Thread Jan Høydahl / Cominvent
Nope. But it's quite affordable if search is important to you. For an open sourced middleware, check out www.sesat.no -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com On 23. nov. 2010, at 16.28, Lukáš Vlček wrote: > Hello, > > is the twigkit open sourced? Looks interes

Re: Need Middleware between search client and solr?

2010-11-23 Thread Lukáš Vlček
Hello, is the twigkit open sourced? Looks interesting. Regards, Lukas On Tue, Nov 23, 2010 at 2:13 PM, Jan Høydahl / Cominvent < jan@cominvent.com> wrote: > Check out for instance www.twigkit.com which is a light-weight middleware > (as well as GUI framework) for Solr. It could speed up dev

Re: commit=true has no effect

2010-11-23 Thread Erick Erickson
Patience, my friend. It's still early in the morning and people are thinking about Thanksgiving ... We need more details. My first guess is that "only the sql statement changed" means that something's wrong with the new SQL. There's a little-known debug console for DIH you might want to investigat

Re: copyField is not tokenizing the values at index time

2010-11-23 Thread Erick Erickson
I think you got fooled by what's returned as a field value. When you store a field and later return that field as part of a document, your exact input is returned *regardless* of what analysis has been done. So your *query* of spellword:[* to *] returns the stored value, not the indexed tokens. I

Re: SOLR and secure content

2010-11-23 Thread Jos Janssen
The setup of multiple cores is a good option, thanks for the advice. I agree the "required" field should be in the application layer, but i also think some "error" handling should come from the Solr server to prevent incorrect usage. If i only knew how to do this for each request. Regards, Jos

Re: ranged and boolean query

2010-11-23 Thread Peter Blokland
hi, On Wed, Nov 17, 2010 at 04:39:00PM +0100, Peter Blokland wrote: > i'm using solr and am trying to limit my resultset to documents > that either have a publication date in the range * to now, or > have no publication date set at all (field is not present). > however, using this : > > (pubdat

Re: SOLR and secure content

2010-11-23 Thread Savvas-Andreas Moysidis
Sounds like a good plan. I'd probably also set multiple cores for each website. This could give you more accurate results scoring. Good question about the "required" configuration option.. any input? Although on the other hand, this is a rule which seems to better fit in your application's Validat

Re: commit=true has no effect

2010-11-23 Thread stockii
=( anyone a idea ? -- View this message in context: http://lucene.472066.n3.nabble.com/commit-true-has-no-effect-tp1952567p1953391.html Sent from the Solr - User mailing list archive at Nabble.com.

Spellcheck in solr-nutch integration

2010-11-23 Thread Anurag
I have integrated solr and nutch using http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ this As the tutorial says, the Schema.xml & SolrConfig.xml of Solr has to be modified. I also did the same. I am using Solr-1.3. But my problem is that i am not able to implement Spellcheck in thi

Re: Need Middleware between search client and solr?

2010-11-23 Thread Jan Høydahl / Cominvent
Check out for instance www.twigkit.com which is a light-weight middleware (as well as GUI framework) for Solr. It could speed up development time considerably for your project. It has hooks to transform queries before they are sent to Solr and process responses before displaying, if needed. --

Re: SOLR and secure content

2010-11-23 Thread Jos Janssen
Hi everyone, This is how we think we should set it up. Situation: - Multiple websites indexed on 1 solr server - Results should be seperated for each website - Search results should be filtered on group access Solution i think is possible with solr: - Solr server should only be accesed through

Re: SOLR and secure content

2010-11-23 Thread Robert Muir
On Tue, Nov 23, 2010 at 5:26 AM, Peter Sturge wrote: > Document-level access control can be a real 'can of worms', and it can > be worthwhile spending a bit of time defining exactly what you need. I agree, "document-level access control" is an anti-feature. You can't just give someone access to

Saravanan Chinnadurai/Actionimages is out of the office.

2010-11-23 Thread Saravanan . Chinnadurai
I will be out of the office starting 23/11/2010 and will not return until 24/11/2010. Please email to itsta...@actionimages.com for any urgent issues. Action Images are proud to be an Official Supplier to England 2018 - www.england2018bid.com Action Images is a division of Reuters Limited a

Re: SOLR and secure content

2010-11-23 Thread Peter Sturge
Yes, as mentioned in the above link, there's SOLR-1872 for maintaing your own document-level access control. Also, if you have access to the file system documents and want to use their existing ACL, have a look at SOLR-1834. Document-level access control can be a real 'can of worms', and it can be

commit=true has no effect

2010-11-23 Thread stockii
Hello. some of my cores absolutely dont want to commit the delta-import changes. the query are all okay but i have no idea what ist wrong Solr version: Solr Specification Version: 4.0.0.2010.10.26.08.43.14 dih found the deltas, and changed it but no commit is performed !? the config files are t

Basic Solr Configurations and best practice

2010-11-23 Thread Darx Oman
Hi guys I'm kind of new to solr and I'm wondering how to configure solr to best fulfills my requirements. Requirements are as follow: I have 2 data sources: database and file system documents. Every document in the file system has related information stored in the database. Both the file conten

result of filtered field not indexed

2010-11-23 Thread Bernd Fehling
Dear list, solr/lucene has a strange problem. I'm currently using apache-solr-4.0-2010-10-12_08-05-48 I have written a MessageDigest for fields which generally works. Part of my schema.xml is: ... ... ... ... ... I have a field type "text_md" which uses the KeywordTokenizerF