Re: Indexing Different Types

2010-06-23 Thread Otis Gospodnetic
Stephen, Sure, multiple cores, one for each type is one approach. Another one is just adding a 'type' field and restricting auto-completion by type. In our AC implementation we have a piece made for very similar situations, where you have multiple types of entities, but want a single input fi
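The single-index approach with a 'type' field can be sketched roughly as follows. This is only an illustration: the field name `name_ac` and the prefix-wildcard query form are assumptions, not something stated in the thread. Only the idea of a `type` field comes from the message, and `fq` is Solr's standard filter-query parameter.

```python
from urllib.parse import urlencode


def autocomplete_params(prefix, entity_type=None):
    """Build query parameters for a prefix lookup, optionally limited to one type.

    'name_ac' is a hypothetical field holding the text to complete against;
    'type' is the extra field suggested in the message above.
    """
    params = [("q", f"name_ac:{prefix}*"), ("rows", "10")]
    if entity_type is not None:
        # Restrict suggestions to a single entity type with a filter query.
        params.append(("fq", f"type:{entity_type}"))
    return urlencode(params)


if __name__ == "__main__":
    print(autocomplete_params("par"))          # suggestions across all types
    print(autocomplete_params("par", "city"))  # suggestions for one type only
```

With this layout one input field can serve every entity type, and the filter is added only when the caller wants to narrow the suggestions.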

Re: Non-prefix, hierarchical autocomplete? Would SOLR-1316 work? Solritas?

2010-06-23 Thread Otis Gospodnetic
Hi Andy, I didn't check out SOLR-1316 yet, other than looking at the comments. Sounds more complicated than it should be, but maybe it's great and I really need to try it. Solritas uses TermsComponent, which should work well for individual terms (which country and city names are not, unless yo

Re: Nested table support ability

2010-06-23 Thread Otis Gospodnetic
Amit, I'd say it depends on the types of queries you need to run. Maybe you mentioned that already, but your reply cut it off (Nabble). I can say this with certainty: 1M is a small number and 30 fields is not a big deal. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch

Re: solr with hadoop

2010-06-23 Thread Otis Gospodnetic
I don't think it's ever been discussed - your Q below is #1 hit currently: http://search-lucene.com/?q=%2B%28dih+OR+dataimporthandler%29+hdfs Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > F

Re: anyone use hadoop+solr?

2010-06-23 Thread Otis Gospodnetic
Marc is referring to the very informative by Ted Dunning from maybe a month or so ago. For what it's worth, we just used Hadoop Streaming, JRuby, and EmbeddedSolr to speed up indexing by parallelizing it. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem searc

Re: Field missing when use distributed search + dismax

2010-06-23 Thread Otis Gospodnetic
Make sure you list it in ...&fl=ID,type or set it in the defaults section of your handler. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: Scott Zhang > To: solr-user@lucene.apache.org
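As a small sketch of the first suggestion, the field list can be passed explicitly on each request. The base URL and the query text below are assumptions for illustration; `fl` and the field names `ID` and `type` come from the message.

```python
from urllib.parse import urlencode

# Hypothetical endpoint; adjust host, port and handler path to your setup.
BASE_URL = "http://localhost:8983/solr/select"


def search_url(query, fields):
    """Return a request URL that asks Solr to return only the listed fields."""
    params = {"q": query, "fl": ",".join(fields)}
    return BASE_URL + "?" + urlencode(params)


if __name__ == "__main__":
    print(search_url("ipod", ["ID", "type"]))
```

Setting `fl` in the handler defaults, the second suggestion, avoids repeating the list on every request.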

Spatial types and DIH

2010-06-23 Thread Eric Angel
I'm using solr 4.0-2010-06-23_08-05-33 and can't figure out how to add the spatial types (LatLon, Point, GeoHash or SpatialTile) using dataimporthandler. My lat/lngs from the database are in separate fields. Does anyone know how to do this? Eric

Re: Performance related question on DISMAX handler..

2010-06-23 Thread Otis Gospodnetic
BB, Dismax could be slower than standard, depending on what kinds of queries you throw at either handler. "Millions of docs" is a bit imprecise (2M or 22M or 222M or 999M, tweet-sized docs or book sized docs), but given adequate hardware and proper treatment shouldn't be a problem. Otis --

Re: Alphabetic range

2010-06-23 Thread Otis Gospodnetic
Sophie, Go to your Solr Admin page, look for the Analysis page link, go there, enter some artist names, enter the query, check the verbose checkboxes, and submit. This will tell you what is going on with your analysis at index and at search time. Otis Sematext :: http://sematext.com/ ::

Re: fuzzy query performance

2010-06-23 Thread Otis Gospodnetic
Btw. here you can see Robert's presentation on what he did to speed up fuzzy queries: http://www.slideshare.net/otisg Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - Original Message > From: Robert Muir > To: solr-user@lucene.apache.org > Sent: Wed, June 23, 2

Re: Multiple Solr Webapps in Glassfish with JNDI

2010-06-23 Thread Otis Gospodnetic
Hi Kelly, I'm not much of a Glassfish user, but have you tried following the JNDI instructions for Tomcat? Maybe that works for Glassfish, too. http://search-lucene.com/?q=jndi&fc_project=Solr Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http:/

Re: DIH and dynamicField

2010-06-23 Thread Lance Norskog
A side comment about patches and JIRA: the second-to-last comment on SOLR-742 says 'Committed'. That means one of the committers (Shalin in this case) committed the fix. It was in 2008 so it's in Solr 1.4. https://issues.apache.org/jira/browse/SOLR-742?focusedCommentId=12643747&page=com.atlassian

Re: Setting up Eclipse with merged Lucene Solr source tree

2010-06-23 Thread Lance Norskog
I have found it easier to make these projects in my Eclipse workspace and make remote links to the parts that I really want. This cuts the total stuff in the project- cuts build times, 'search everywhere' times, menus full of classes named '*file*', etc. But git may have problems with this, and gi

Multiple Solr Webapps in Glassfish with JNDI

2010-06-23 Thread Kelly Taylor
Does anybody know how to set up multiple Solr webapps in Glassfish with JNDI? -Kelly -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-Solr-Webapps-in-Glassfish-with-JNDI-tp918383p918383.html Sent from the Solr - User mailing list archive at Nabble.com.

Some minor Solritas layout tweaks

2010-06-23 Thread Ken Krugler
I grabbed the latest & greatest from trunk, and then had to make a few minor layout tweaks. 1. In main.css, the ".query-box input" { height} isn't tall enough (at least on my Mac 10.5/FF 3.6 config), so character descenders get clipped. I bumped it from 40px to 50px, and that fixed the is

RE: Stemmed and/or unStemmed field

2010-06-23 Thread caman
Ahh, perfect. Will take a look. Thanks. From: Robert Muir [via Lucene] [mailto:ml-node+918302-232685105-124...@n3.nabble.com] Sent: Wednesday, June 23, 2010 4:17 PM To: caman Subject: Re: Stemmed and/or unStemmed field On Wed, Jun 23, 2010 at 3:58 PM, Vishal A. <[hidden email]>wrote: >

Re: Stemmed and/or unStemmed field

2010-06-23 Thread Robert Muir
On Wed, Jun 23, 2010 at 3:58 PM, Vishal A. wrote: > > Here is what I am trying to do : Someone clicks on 'Comforters & Pillows' > , we would want the results to be filtered where title has keyword > 'Comforter' or 'Pillows' but we have been getting results with word > 'comfort' in the title. I

Re: DIH and dynamicField

2010-06-23 Thread Robert Zotter
Boyd Hemphill-2 wrote: > > I am having a problem where importing with DIH and attempting to use > dynamicField produces no result. I get no error, nor do I get a message > in > the log. It would help if you posted the relevant parts of your data-config.xml and schema.xml. If you are doing a s

Re: fuzzy query performance

2010-06-23 Thread Robert Muir
On Wed, Jun 23, 2010 at 3:34 PM, Peter Karich wrote: > > So, you mean I should try it out here: > http://svn.apache.org/viewvc/lucene/dev/trunk/solr/ > > yes, the speedups are only in trunk. -- Robert Muir rcm...@gmail.com

Re: Help with sorting

2010-06-23 Thread Ahmet Arslan
> When I want to sort the > documents > which contain a certain word by date or by institution all I > get is > an > order that I don't understand. > > stored="false" /> > stored="false" > required="true" /> You need to use a sortable type: sint with Solr 1.3; tint with Solr 1.4 field nam

Re: Highlight question

2010-06-23 Thread Ahmet Arslan
> In the solrconfig.xml I have been able to change the > hl.simple.pre/post variable, but when I try to change the > hl.regex pattern or the hl.snippets they don't have any > effect. I thought the hl.snippets would allow me to find more > than one and highlight it, and well I tried a bunch of regex

RE: Can solr return pretty text as the content?

2010-06-23 Thread caman
Define pretty text. 1) Are you saying the XML/JSON returned by Solr is not pretty? If so, try indent=on in your query params. 2) Or are you talking about the data in a certain field? Solr returns what you feed it. Look at your filters for that field type. Your filters/tokenizer ma

Can solr return pretty text as the content?

2010-06-23 Thread JohnRodey
When I feed pretty text into solr for indexing from lucene and search for it, the content is always returned as one long line of text. Is there a way for solr to return the pretty formatted text to me? -- View this message in context: http://lucene.472066.n3.nabble.com/Can-solr-return-pretty-te

Stemmed and/or unStemmed field

2010-06-23 Thread Vishal A.
Hello all, One quick question, trying to find out what scenario would work best. We have a huge free-text dataset containing product titles and descriptions. Unfortunately, we don't have the data categorized so we rely on 'search relevancy + synonyms' heavily to categorize. Here is what I am tryi

Re: fuzzy query performance

2010-06-23 Thread Peter Karich
Hi Mark! > Solr trunk should have much improved fuzzy speeds (due to some very cool work that was done in Lucene) - you using 1.4? yes. So, you mean I should try it out here: http://svn.apache.org/viewvc/lucene/dev/trunk/solr/ or some 'more stable' branch? http://svn.apache.org/viewvc/lucene/solr

DIH and dynamicField

2010-06-23 Thread Boyd Hemphill
I am new to the list so any coaching on asking questions is much appreciated. I am having a problem where importing with DIH and attempting to use dynamicField produces no result. I get no error, nor do I get a message in the log. I found this: https://issues.apache.org/jira/browse/SOLR-742 whi

Help with sorting

2010-06-23 Thread Adi Neacsu
Hi everyone, I'm stuck on sorting with Solr. I have documents from several institutions, differentiated by an id named instanta. I indexed all those documents and, among other things, put in the index the date the document was created and the id of the institution. When I want to sort the docume

Highlight question

2010-06-23 Thread Gregg Hoshovsky
I just started working with the highlighting. I am using the default configurations. I have a field that I can get a single highlight to occur marking the data. What I would like to do is this, Given a word say 'tumor', and the sentence " the lower tumor grew 1.5 cm. blah blah blah we need t

Re: Help with highlighting

2010-06-23 Thread noel
Thanks, that's exactly the problem. I've tried different types, even a fieldType that had no tokenizers and that didn't work. However, text just gives me my results as wanted. -Original Message- From: "dan sutton" Sent: Wednesday, June 23, 2010 12:06pm To: solr-user@lucene.apache.org S

Re: remove from list

2010-06-23 Thread Susan Rust
Will do -- but wasn't selling -- trying to donate! Susan Rust VP of Client Services If you wish to travel quickly, go alone If you wish to travel far, go together Achieve Internet 1767 Grand Avenue, Suite 2 San Diego, CA 92109 800-618-8777 x106 8

RE: remove from list

2010-06-23 Thread Markus Jelsma
If you want to unsubscribe, then you can do so [1] without trying to sell something ;)   [1]: http://lucene.apache.org/solr/mailing_lists.html   Cheers!   -Original message- From: Susan Rust Sent: Wed 23-06-2010 18:23 To: solr-user@lucene.apache.org; Erik Hatcher ; Subject: remove fr

remove from list

2010-06-23 Thread Susan Rust
Hey SOLR folks -- There's too much info for me to digest, so please remove me from the email threads. However, if we can build you a forum, bulletin board or other web-based tool, please let us know. For that matter, we would be happy to build you a new website. Bill O'Connor is our CTO a

Re: Help with highlighting

2010-06-23 Thread dan sutton
It looks to me like a tokenisation issue, all_text content and the query text will match, but the string fieldtype fields 'might not' and therefore will not be highlighted. On Wed, Jun 23, 2010 at 4:40 PM, wrote: > Here's my request: > q=ASA+AND+minisite_id%3A36&version=1.3&json.nl > =map&rows=1

Re: Help with highlighting

2010-06-23 Thread noel
Here's my request: q=ASA+AND+minisite_id%3A36&version=1.3&json.nl=map&rows=10&start=0&wt=json&hl=true&hl.fl=%2A&hl.simple.pre=%3Cspan+class%3D%22hl%22%3E&hl.simple.post=%3C%2Fspan%3E&hl.fragsize=0&hl.mergeContiguous=false And here's what happened: It didn't return results, even when I applied an a
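When debugging a request like this, it can help to decode the query string and look at the parameter values Solr actually receives. A minimal sketch using only the request quoted above:

```python
from urllib.parse import parse_qs

# The request string exactly as posted in the message above.
REQUEST = (
    "q=ASA+AND+minisite_id%3A36&version=1.3&json.nl=map&rows=10&start=0"
    "&wt=json&hl=true&hl.fl=%2A&hl.simple.pre=%3Cspan+class%3D%22hl%22%3E"
    "&hl.simple.post=%3C%2Fspan%3E&hl.fragsize=0&hl.mergeContiguous=false"
)


def decode_params(query_string):
    """Return a dict of decoded parameters (first value of each name)."""
    return {name: values[0] for name, values in parse_qs(query_string).items()}


params = decode_params(REQUEST)

if __name__ == "__main__":
    for name in ("q", "hl.fl", "hl.simple.pre", "hl.simple.post", "hl.fragsize"):
        print(f"{name} = {params[name]}")
```

Decoded this way, `hl.fl` is a bare `*`, so highlighting is requested on every field, which is where the field-type question raised in the reply comes in.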

Re: Setting up Eclipse with merged Lucene Solr source tree

2010-06-23 Thread Erick Erickson
Did you see this page? http://wiki.apache.org/solr/HowToContribute Especially down near the end, the section "Development Environment Tips" HTH Erick On Wed, Jun 23, 2010 at 8:57 AM, Ukyo Virgden wrote: > Hi, > > I'm trying to setup and eclipse env

Re: fuzzy query performance

2010-06-23 Thread Mark Miller
On 6/23/10 9:48 AM, Peter Karich wrote: Hi! How can I improve the performance of a fuzzy search like: mihchael~0.7 through a relatively large index (~1 million docs)? It takes over 15 seconds at the moment if we perform it on the normal text search field. I searched the web and the jira and

fuzzy query performance

2010-06-23 Thread Peter Karich
Hi! How can I improve the performance of a fuzzy search like: mihchael~0.7 through a relatively large index (~1 million docs)? It takes over 15 seconds at the moment if we perform it on the normal text search field. I searched the web and the jira and couldn't find anything related to that. A
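To make the cost of a query like mihchael~0.7 concrete, here is a rough sketch of brute-force fuzzy matching: every candidate term is compared with the query by edit distance. The similarity scaling (one minus distance divided by the shorter length) is my assumption about how the classic fuzzy threshold is interpreted, and the term list is made up. This is not the approach used in trunk; it only shows why scanning a large term dictionary this way gets slow.

```python
def edit_distance(a, b):
    """Levenshtein distance computed one row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete from a
                           cur[j - 1] + 1,             # insert into a
                           prev[j - 1] + (ca != cb)))  # substitute
        prev = cur
    return prev[-1]


def similarity(query, term):
    """Assumed scaling: 1 - distance / length of the shorter string."""
    shorter = min(len(query), len(term))
    if shorter == 0:
        return 0.0
    return 1.0 - edit_distance(query, term) / shorter


def fuzzy_matches(query, terms, min_similarity):
    """Brute force: one edit-distance computation per term in the dictionary."""
    return [t for t in terms if similarity(query, t) >= min_similarity]


if __name__ == "__main__":
    terms = ["michael", "madonna", "prince"]  # hypothetical term dictionary
    print(fuzzy_matches("mihchael", terms, 0.7))
```

Each comparison is quadratic in term length, and a naive scan repeats it for every distinct term in the field.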

Re: dataimport.properties is not updated on delta-import

2010-06-23 Thread Stefan Moises
Hi, what I have experienced is that the primary key seems to be case sensitive for the delta queries, at least for some JDBC drivers... see http://lucene.472066.n3.nabble.com/Problem-with-DIH-delta-import-on-JDBC-tp763469p765262.html ... so make sure you specify it with the correct case (e.g.

Re: TermsComponent - AutoComplete - Multiple Term Suggestions & Inclusive Search?

2010-06-23 Thread Sophie M.
To build your autocompletion, you can use the NGramFilterFactory. If you type "cat", it will match "subcategory" and "the best category". If you change your mind and no longer want to match "subcategory", you can use the EdgeNGramFilterFactory. -- View this message in context: http://lucene.
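The difference between the two filters can be illustrated with a small sketch that generates the grams by hand. The gram sizes (2 to 5) are an assumption chosen for the example, and the functions only imitate what the filters emit; they are not Solr code.

```python
def ngrams(token, min_size, max_size):
    """All substrings of the token with length between min_size and max_size."""
    return {
        token[start:start + size]
        for size in range(min_size, max_size + 1)
        for start in range(len(token) - size + 1)
    }


def edge_ngrams(token, min_size, max_size):
    """Only the substrings anchored at the start of the token."""
    longest = min(max_size, len(token))
    return {token[:size] for size in range(min_size, longest + 1)}


if __name__ == "__main__":
    for word in ("subcategory", "category"):
        print(word, "ngram match:", "cat" in ngrams(word, 2, 5))
        print(word, "edge ngram match:", "cat" in edge_ngrams(word, 2, 5))
```

With plain n-grams "cat" is found inside "subcategory"; with edge n-grams it is found only where a token starts with "cat", as in "category".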

Indexing Rich Format Documents using Data Import Handler (DIH) and the TikaEntityProcessor

2010-06-23 Thread Tod
Please refer to this thread for history: http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201006.mbox/%3c4c1b6bb6.7010...@gmail.com%3e I'm trying to integrate the TikaEntityProcessor as suggested. I'm using Solr Version: 1.4.0 and getting the following error: java.lang.ClassNotFoun

dataimport.properties is not updated on delta-import

2010-06-23 Thread warb
Hello! I am having some difficulties getting dataimport (DIH) to behave correctly in Solr 1.4.0. Indexing itself works just as it is supposed to with both full-import and delta-import adding modified or newly created records to the index. The problem is however that the date and time of the last

Setting up Eclipse with merged Lucene Solr source tree

2010-06-23 Thread Ukyo Virgden
Hi, I'm trying to set up an Eclipse environment for the combined Lusolr tree. I've created a Lucene project containing /trunk/lusolr/lucene and /trunk/lusolr/modules as one project and /trunk/lusolr/solr as another. I've added the Lucene project as a dependency of the Solr project, removed solr libs from luce

Alphabetic range

2010-06-23 Thread Sophie M.
Hello all, I have been trying for several days to build an alphabetical range. I will explain all the steps (I have the Solr 1.4 Enterprise Search Server book written by Smiley and Pugh). I want to get all artists beginning with the first two letters. If I request "mi", I want to get back "michael jackson"

Re: TermsComponent - AutoComplete - Multiple Term Suggestions & Inclusive Search?

2010-06-23 Thread Chantal Ackermann
Hi Saïd, I think your problem is the field's type: String. You have to use a TextField and apply tokenizers that will find "subcategory" if you put in "cat". (Not sure which filter does that, though. I wouldn't think that the PorterStemmer cuts off prefix syllables of that kind?) If, however, you

Re: Import XML files different format?

2010-06-23 Thread scrapy
Thanks Eric for your answer. I'll try to use DIH via data-config.xml, as I might index other content with a different XML structure in the future... Will I need a different data-config for each XML structure? And then manually change between them? -Original Mess

Re: about function query

2010-06-23 Thread Yonik Seeley
See http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_boost_the_score_of_newer_documents for more info on how to do a multiplicative boost. -Yonik http://www.lucidimagination.com On Tue, Jun 22, 2010 at 11:13 PM, Li Li wrote: > I want to integrate document's timestamp into scoring of searc
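A rough sketch of what a multiplicative recency boost does to scores, assuming a reciprocal decay of the form a / (m * x + b) applied to document age in milliseconds. The constant 3.16e-11 (roughly one divided by the number of milliseconds in a year) and the choice a = b = 1 are assumptions for illustration; check the linked FAQ for the exact function it recommends.

```python
MS_PER_YEAR = 365.25 * 24 * 3600 * 1000


def recip(x, m, a, b):
    """Reciprocal decay: a / (m * x + b)."""
    return a / (m * x + b)


def boosted_score(relevance, age_ms, m=3.16e-11):
    """Multiply the relevance score by a boost that shrinks as the document ages."""
    return relevance * recip(age_ms, m, 1.0, 1.0)


if __name__ == "__main__":
    for years in (0, 1, 2, 5):
        print(years, "years old:", round(boosted_score(2.0, years * MS_PER_YEAR), 3))
```

Because the boost is multiplied in rather than added, it scales with the relevance score and cannot push an irrelevant but recent document above a strongly matching one by a fixed amount.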

Re: Import XML files different format?

2010-06-23 Thread Erik Hatcher
You can use DataImportHandler's XML/XPath capabilities to do this: or you could, of course, convert your XML to Solr's XML format. Another fine option for what this data looks like, CSV format. I'd imagin

Import XML files different format?

2010-06-23 Thread scrapy
Hi, I'm new to Solr. It looks great. I would like to add an XML document in the following format to Solr: etc... Is there a way to do this? If so, how? Or do I need to convert it with some scripts to this: Patrick Eagar Sports etc.

Solrj throws RuntimeException - Invalid version or the data is not in javabin format

2010-06-23 Thread Villemos, Gert
I have a problem injecting data using SolrJ from a Windows client to an Ubuntu server (see exception below). The same configuration works when injecting from a Windows client to a Windows server. I inject using a standard org.apache.solr.client.solrj.impl.CommonsHttpSolrServer instance, created

Re: collapse exception

2010-06-23 Thread Martijn v Groningen
That is a good idea. I'm trying to achieve something similar. I'm already busy with creating a Lucene collector that groups the result set and will eventually have the same functionality as in SOLR-236. When that is solid the idea is to integrate that into Solr. I've attached a patch in LUCENE-14

Re: Field Collapsing SOLR-236

2010-06-23 Thread Rakhi Khatwani
Hi, But there are almost no settings in my config. Here's a snapshot of what I have in my solrconfig.xml: *:* Am I going wrong anywhere? Regards, Raakhi On Wed, Jun 23, 2010 at 3:28 PM, Govind Kanshi wrote: > fieldType:analyzer without class or tokenizer & filter list seems to p

TermsComponent - AutoComplete - Multiple Term Suggestions & Inclusive Search?

2010-06-23 Thread Saïd Radhouani
Hi, I'm using the Terms Component to set up the autocomplete feature based on a String field. Here are the params I'm using: terms=true&terms.fl=type&terms.lower=cat&terms.prefix=cat&terms.lower.incl=false With the above params, I've been able to get suggestions for terms that start with the sp
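A prefix lookup of this kind can be pictured as a range scan over a sorted list of indexed terms. The sketch below imitates that with a made-up term list; it is not the Terms Component implementation, and since the field is a String field each whole value is treated as a single term.

```python
from bisect import bisect_left


def terms_with_prefix(sorted_terms, prefix):
    """Collect consecutive terms that start with the prefix, from a sorted list."""
    start = bisect_left(sorted_terms, prefix)
    matches = []
    for term in sorted_terms[start:]:
        if not term.startswith(prefix):
            break  # sorted order: once the prefix stops matching, nothing later matches
        matches.append(term)
    return matches


# Hypothetical indexed values of the 'type' field, each kept as one term.
TERMS = sorted(["category", "cat food", "subcategory", "the best category", "catalog"])

if __name__ == "__main__":
    print(terms_with_prefix(TERMS, "cat"))
```

Values such as "subcategory" or "the best category" are never reached because the scan only covers terms that begin with the prefix, which is why a plain prefix on a String field cannot give inclusive matches.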

Re: Mr Erick Re: Change the Solr searcher

2010-06-23 Thread sarfaraz masood
But how do I add this change to the running Solr server? I mean, how do I make my changes visible in a running Solr? Do I need to make a plugin, a patch, or something? -sarfaraz --- On Tue, 22/6/10, Erik Hatcher wrote: From: Erik Hatcher Subject: Re: Change the Solr searcher To: solr-

Re: Field Collapsing SOLR-236

2010-06-23 Thread Govind Kanshi
fieldType:analyzer without class or tokenizer & filter list seems to point to the config - you may want to correct. On Wed, Jun 23, 2010 at 3:09 PM, Rakhi Khatwani wrote: > Hi, >I checked out modules & lucene from the trunk. > Performed a build using the following commands > ant clean >

Re: Field Collapsing SOLR-236

2010-06-23 Thread Rakhi Khatwani
Hi, I checked out modules & lucene from the trunk. Performed a build using the following commands ant clean ant compile ant example Which compiled successfully. I then put my existing index(using schema.xml from solr1.4.0/conf/solr/) in the multicore folder, configured solr.xml and start

Re: OOM on sorting on dynamic fields

2010-06-23 Thread Matteo Fiandesio
Hi to all, we moved solr with patched lucene's FieldCache in production environment. During tests we noticed random ConcurrentModificationException calling the getCacheEntries method due to this bug https://issues.apache.org/jira/browse/LUCENE-2273 We applied that patch as well, and added an abst

Re: Searching across multiple repeating fields

2010-06-23 Thread Mark Allan
Cheers, Geert-Jan, that's very helpful. We won't always be searching with dates and we wouldn't want duplicates to show up in the results, so your second suggestion looks like a good workaround if I can't solve the actual problem. I didn't know about FieldCollapsing, so I'll definitely kee

Re: Field missing when use distributed search + dismax

2010-06-23 Thread Scott Zhang
Hi all, I found out more about the missing fields. I tried the default distributed search example, which configures 2 instances, one on 8983 and another on 7574. When I search with the standard query handler, the result fields are all right. When I search with the default dismax, some fields disapp

Re: Field Collapsing SOLR-236

2010-06-23 Thread Rakhi Khatwani
Oops, this is probably because I didn't check out the modules from the trunk. Doing that right now :) Regards Raakhi On Wed, Jun 23, 2010 at 1:12 PM, Rakhi Khatwani wrote: > Hi, >Patching did work. but when i build the trunk, i get the following > exception: > > [SolrTrunk]# ant compile > Bu

Re: Field Collapsing SOLR-236

2010-06-23 Thread Rakhi Khatwani
Hi, Patching did work. but when i build the trunk, i get the following exception: [SolrTrunk]# ant compile Buildfile: /testWorkspace/SolrTrunk/build.xml init-forrest-entities: [mkdir] Created dir: /testWorkspace/SolrTrunk/build [mkdir] Created dir: /testWorkspace/SolrTrunk/build/web c

Re: Nested table support ability

2010-06-23 Thread Govind Kanshi
Amit - it would not be apparent unless you test. The key piece is, as Otis mentioned, "flatten everything". This requires effort from your side to actually create documents in a manner suitable for your searches. The relationship needs to be "merged" into the document. To avoid storing text representations