Re: solr-user

2010-10-08 Thread Lance Norskog
Please start a new thread with this topic in the subject line. On Fri, Oct 8, 2010 at 10:37 PM, ankita shinde wrote: > -- Forwarded message -- > From: ankita shinde > Date: Sat, Oct 9, 2010 at 8:19 AM > Subject: solr-user > To: solr-user@lucene.apache.org > > > hello, > Is there

Fwd: solr-user

2010-10-08 Thread ankita shinde
-- Forwarded message -- From: ankita shinde Date: Sat, Oct 9, 2010 at 8:19 AM Subject: solr-user To: solr-user@lucene.apache.org hello, Is there any api in SolrJ that calls the dataImportHandler to execute commands like full-import and delta-import. Please help..

Snappuller opening too many threads

2010-10-08 Thread Alexander Ramos Jardim
Hi there people, I was running my Solr aplication in one server and the client application in other one. So far so good. I wanted to make some benchmarks between embedded and http access. So I put one Solr instance in the same server my application is running. Again, so far so, good. I made my be

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Lance Norskog
Is your browser caching the older search result? The example config comes with HTTP caching on, and if you comment it out the engine defaults to caching on. So, you have to use the XML to configure Solr to stop caching. On Fri, Oct 8, 2010 at 6:52 AM, Markus Jelsma wrote: > > > On Friday, October

Re: Speeding up solr indexing

2010-10-08 Thread Dennis Gearon
How does that have to work with Java's memory? In lockstep, a certain percentage, not related, what, or at all? Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others’ mistakes, so you do not

Re: Search Cloud , store stemmed Tokens?

2010-10-08 Thread Otis Gospodnetic
Alex, If I understand correctly, you are taking people's search terms and storing them in some Solr index? Before you store them you want to convert them to their root form. In that case a custom UpdateRequestProcessor (in which you'd add your code that does this conversion) should do the job.

Re: dynamic "stop" words?

2010-10-08 Thread Otis Gospodnetic
Matt, The first thing that came to my mind is that this might be interesting to try with a dictionary (of city names) if this example is not a made-up one. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original

Re: Accented Search in Solr

2010-10-08 Thread Otis Gospodnetic
Param, Note that the original value will be stored even if ISOLatin1AccentFilter removes the accept for indexing / matching purposes. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: "S

Re: Speeding up solr indexing

2010-10-08 Thread Otis Gospodnetic
Hi, Assuming your DB/network/something else is not the bottleneck, increase your ramBufferSizeMB (in solrconfig). Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message > From: sivaprasad > To: solr

Re: access control for spellcheck suggestions?

2010-10-08 Thread Peter Wolanin
Thanks for the info - I'll try out this patch. -Peter On Thu, Oct 7, 2010 at 10:43 AM, Dyer, James wrote: > Look at SOLR-2010 which has patches for 1.4.1 and trunk.  It works with the > spellcheck "collate" functionality and ensures that collations are returned > only if they can result in hit

Re: Sorting on arbitary 'custom' fields

2010-10-08 Thread Simon Wistow
On Fri, Oct 08, 2010 at 04:56:38PM -0700, kenf_nc said: > > What behavior are you trying to see? You are allowed to sort on fields that > are potentially empty, they just sort to the top or bottom depending on your > sort order. Now, if you Query on the fields that could be empty, you won't > see

Re: Sorting on arbitary 'custom' fields

2010-10-08 Thread kenf_nc
What behavior are you trying to see? You are allowed to sort on fields that are potentially empty, they just sort to the top or bottom depending on your sort order. Now, if you Query on the fields that could be empty, you won't see the result, but if your document is valid for the query, you can s

Sorting on arbitary 'custom' fields

2010-10-08 Thread Simon Wistow
We have a set of documents - which have a standard set of fields. However they can also have an arbitary number of custom fields which may each have a value. So some docs may look like id: 1 title: Document 1 created: 2010-10-09 15:23:00 custom_fields: - foo : 5 - bar : 6 id: 2 titl

Re: Speeding up solr indexing

2010-10-08 Thread Erick Erickson
Well, 10million rows is a bunch of rows, it'll take some time. But you haven't given us any clue what that means. Is it taking 5 minutes? 5 hours? 5 days? Without some dimension on the problem it's really hard to provide any suggestions, you might be seeing entirely reasonable times, we just don't

Re: Missing content stream

2010-10-08 Thread Erick Erickson
Thanks for letting us know. Erick On Fri, Oct 8, 2010 at 2:21 PM, Dennis Brundage wrote: > > Thanks to anyone who spent time looking at this. In the future, should > anyone else run across this, I got it working by adding the "name" > attribute > to the tag. Final script looks like: > > > >

Re: NumberFormatException upon reading a Trie field during search

2010-10-08 Thread Erick Erickson
Thanks for letting us know... Erick On Fri, Oct 8, 2010 at 11:42 AM, Jon Poulton wrote: > Hi all, > > Just to let you know, deleting the index and reindexing our data appears to > have fixed this problem, at least for the moment. My guess is that the old > index wasn't deleted cleanly, as I ass

Re: Accented Search in Solr

2010-10-08 Thread Erick Erickson
not that I know of. Do note that whether the query has the accent filter active or not MUST be matched with the index-time filter. In other words, if you indexed with the filter but search without it or vice-versa you won't get the resultsyou expect. Also note that no matter what, the original tex

Re: Index time boosting is not working with boosting value in document level

2010-10-08 Thread Erick Erickson
Boosting doesn't necessarily put documents in a particular order. Boosting just #tends# to make the doc score higher. What do you see if you add &debugQuery=on? That'll tell you why docs scored as they did. Best Erick On Thu, Oct 7, 2010 at 11:39 AM, Shanmugavel SRD wrote: > > We are having 10 d

Re: Trouble with exception Document [Null] missing required field DocID

2010-10-08 Thread Erick Erickson
Right. You're requiring that every document have an ID (via uniqueKey), but there's nothing magic about DIH that'll automagically parse a PDF file and map something into your ID field. So you have to create a unique ID before you send your doc to Curl. I'm pretty sure you can specify literal.ID=yo

Speeding up solr indexing

2010-10-08 Thread sivaprasad
Hi, I am indexing the data using DIH.Data coming from mysql.Each document contains 30 fields.Some of the fields are multi valued.When i am trying to index 10 million records it taking more time to index. Any body has suggestions to speed up indexing process?Any suggestions on solr admin level con

Re: Missing content stream

2010-10-08 Thread Dennis Brundage
Thanks to anyone who spent time looking at this. In the future, should anyone else run across this, I got it working by adding the "name" attribute to the tag. Final script looks like: http://localhost.:4747/solr/update"; enctype="multipart/form-data" accept-charset="UTF-8" method="POST">

Re: having problem about Solr Date Field.

2010-10-08 Thread Dennis Gearon
OK, great. No worrying about which time zone the server is running in with Solr, LOL! Databases and server side languages and cron scripts OTOH . . . Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to lea

Re: Strategy for re-indexing

2010-10-08 Thread Allistair Crossley
Thanks for your time responding to this. I have decided also to go down the route of cron-scheduled Perl LWP pings to DIH + deltaQueries. This seems to work inline with what the business requires and for the index size. Thanks again On Oct 7, 2010, at 7:46 AM, Shawn Heisey wrote: > On 10/6/201

Re: Missing content stream

2010-10-08 Thread Dennis Brundage
Markus Jelsma-2 wrote: > > I don't know if sending the wrong content type results in this error > message > but you must send the proper content type header, text/xml. > Thanks Markus. I thought uploading a file from an HTML form required "multipart/form-data". Anyway, I did try changing

Re: Help removing myself from this mailing list

2010-10-08 Thread Markus Jelsma
Life is difficult: http://lmgtfy.com/?q=unsubscribe+from+solr On Friday, October 08, 2010 05:50:42 pm Steve Reichgut wrote: > Can anyone help me on how to remove myself from this mailing list. The > emails don't come with an opt-out link nor can I find anywhere else to > go to remove my email ad

Help removing myself from this mailing list

2010-10-08 Thread Steve Reichgut
Can anyone help me on how to remove myself from this mailing list. The emails don't come with an opt-out link nor can I find anywhere else to go to remove my email address. Any help would be really appreciated! Steve

Re: NumberFormatException upon reading a Trie field during search

2010-10-08 Thread Jon Poulton
Hi all, Just to let you know, deleting the index and reindexing our data appears to have fixed this problem, at least for the moment. My guess is that the old index wasn't deleted cleanly, as I assumed it had been. Thanks Jon On 8 Oct 2010, at 12:14, Jon Poulton wrote: > Hi there, > I have

Accented Search in Solr

2010-10-08 Thread Sethi, Parampreet
Hi All, I am using Solr 1.3 in my project. Just wanted to know if there is any other way by which below mentioned queries will return the same results: Gruyère-and-Zucchini Gruyere-and-Zucchini The first query has accented characters in it. I was just going through the Solr tokenizers and fi

dynamic "stop" words?

2010-10-08 Thread Matt Mitchell
Is it possible to have certain query terms not effect score, if that same query term is present in a field? For example, I have an index of hotels. Each hotel has a name and city. If the name of a hotel has the name of the city in it's "name" field, I want to completely ignore that and not have it

Re: Missing content stream

2010-10-08 Thread Markus Jelsma
I don't know if sending the wrong content type results in this error message but you must send the proper content type header, text/xml. On Friday, October 08, 2010 03:57:33 pm Dennis Brundage wrote: > I am just getting started with Solr and have successfully run through the > tutorial. I now h

Missing content stream

2010-10-08 Thread Dennis Brundage
I am just getting started with Solr and have successfully run through the tutorial. I now have Solr up and running on Glassfish and decided to try a simple test to verify the ability to update the index, using the following HTML page: http://localhost.:4747/solr/update"; enctype="multipart/

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Markus Jelsma
On Friday, October 08, 2010 03:40:09 pm Allistair Crossley wrote: > Well, a lot of this is working but not all. > > Consider the company name Shooters Inc > > My ngram field is able to match queries to the name for shoot and hoot and > so on. This works. > > However consider the company name >

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Allistair Crossley
Well, a lot of this is working but not all. Consider the company name Shooters Inc My ngram field is able to match queries to the name for shoot and hoot and so on. This works. However consider the company name Location Scotland If I query scot I get one result back - but it's for a company c

Search Cloud , store stemmed Tokens?

2010-10-08 Thread alexander sulz
Hello dear Solr Users.. As far as I understand, I am able to process stuff with analyzers (and in there with tokenizers and filters and whatnot) before indexing, but is it also possible to do that before storing the input into a field? What I want to do is to store some search words from user

Re: case-insensitive phrase query for string fields

2010-10-08 Thread Matt Mitchell
Hey thanks guys! This all makes sense now. I'm using a text field and it's giving good results of course. Matt On Fri, Oct 8, 2010 at 6:08 AM, Erik Hatcher wrote: > Matt - > >        Erik > > > On Oct 7, 2010, at 23:38 , Jonathan Rochkind wrote:

check keyword density

2010-10-08 Thread Sushan Rungta
I have indexed my website in lucene and search is working effectively. I now want to make an algorithm wherein I could count the quantum of keywords used repetitively in documents like the one mentioned below: "gold tips, silver tips, tips for crude oil, commodity tips on mobile, free commodity t

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Allistair Crossley
Oh my. I am basically being a total monkey. Every time I was changing my schema.xml to try new things out I was then reindexing our staging server's index instead of my local dev index so no changes were occurring locally. Dear me. This is working now, surprise. On Oct 8, 2010, at 8:53 AM, Ma

Search Cloud , store stemmed Tokens?

2010-10-08 Thread alexander sulz
Hello dear Solr Users.. As far as I understand, I am able to process stuff with analyzers (and in there with tokenizers and filters and whatnot) before indexing, but is it also possible to do that before storing the input into a field? What I want to do is to store some search words from users

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Markus Jelsma
How come your query analyser spits out grams? It isn't configured to do so or you posted an older field definition. Anyway, do you actually search on your new field? On Friday, October 08, 2010 02:46:08 pm Allistair Crossley wrote: > Hi, > > Yep, I was just looking at the analyzer jsp. The ngr

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Allistair Crossley
Hi, Yep, I was just looking at the analyzer jsp. The ngrams *do* exist as expected, so it's not my configuration that is at fault (he says) Index Analyzer sh ho oo ot te er sho hoo oot ote ter shoohootooteotershoot hoote oote

Re: Getting an ngram fieldtype to work

2010-10-08 Thread Jan Høydahl / Cominvent
Hi, The first thing I would try is to go to the analysis page, enter your test data, and report back what each analysis stage prints out: http://localhost:8983/solr/admin/analysis.jsp -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com On 8. okt. 2010, at 14.19, Allistair

Getting an ngram fieldtype to work

2010-10-08 Thread Allistair Crossley
Morning all, I would like to ngram a company name field in our index. I have read about the costs of doing so in the great David Smiley Solr 1.4 book and just to get started I have followed his example in setting up an ngram field type as follows:

NumberFormatException upon reading a Trie field during search

2010-10-08 Thread Jon Poulton
Hi there, I have recently upgraded our Solr instance and have reindexed all of the items in our store, and for at least one search I am getting some unusual error messages back for a search that previously worked. It reads as follows: HTTP Status 500 - Invalid shift value in prefixCoded string (

Re: PatternReplaceFilterFactory creating empty string as a term

2010-10-08 Thread Shawn Heisey
On 10/5/2010 10:38 PM, Shawn Heisey wrote: That fixed it. Thank you. If I have time, I'll peek at the patternfilter source code and see if I can figure out how to make it optionally remove empty terms. For me, it's not terribly critical, because my database is the bottleneck in my indexing p

Re: having problem about Solr Date Field.

2010-10-08 Thread Jan Høydahl / Cominvent
Correct. You get back what you push in. Of course if your index is for users in one time zone only, you may insert the local time to Solr, and everything will work well. However, if you operate an index with international users, you'd want to make sure you convert to/from UTC in your applicatio

Re: Strange search result (or lack of)

2010-10-08 Thread Savvas-Andreas Moysidis
Hello, Try searching for name_de:(das urteil). A search for name_de:das urteil will search for "das" in *name_de* and for "urteil" in the default field (e.g. "catch all" field). Hope that helps, -- Savvas On 8 October 2010 09:00, Thomas Kellerer wrote: > Hi, > > I have the following field defi

Re: Experience running Solr on ISCSI

2010-10-08 Thread Thijs
Hi thanks for the info. The hardware guys made it possible for me to do some testing on an ISCSI device, and it looks good so far. I have no idea what the hardware is, but I'm getting about the same throughput as the local disks. We are now looking into how it will scale if multiple machines

Re: Can anyone compare Solr with Autonomy?

2010-10-08 Thread Tommaso Teofili
Hi Scott, I can say that in my experience I've seen a company dropping the Autonomy solution in favour of Apache Solr :-) It's not a comparison and nor a matter of better/worse but it can count when evaluating how market is behaving regarding that. Tommaso 2010/10/8 Otis Gospodnetic > Scott, > >

Re: case-insensitive phrase query for string fields

2010-10-08 Thread Erik Hatcher
Matt - Erik On Oct 7, 2010, at 23:38 , Jonathan Rochkind wrote: > If you are going to put explict phrase quotes in the query string like that, > an ordinary text field will match fine, on phrase searches or other searches. > That is a

Re: Experience running Solr on ISCSI

2010-10-08 Thread Peter Sturge
Hi, We've used iSCSI SANs with 6x1TB 15k SAS drives RAID10 in production environments, and this works very well for both reads and writes. We also have FibreChannel environments, and this is faster as you would expect. It's also a lot more expensive. The performance bottleneck will have more to d

Strange search result (or lack of)

2010-10-08 Thread Thomas Kellerer
Hi, I have the following field defined in my schema: The field contains the value "Das Urteil" which is thus stored as "das urteil" The following query (using Solr 1.4) returns nothing name_de:das urteil But when I run the query name_de:"das urteil" t

Re: SOLRJ - Searching text in all fields of a Bean

2010-10-08 Thread Savvas-Andreas Moysidis
Hello, What does your schema look like? Have you defined a "catch all" field and copy every value from all your other fields in it with a directive? Cheers, -- Savvas On 8 October 2010 08:30, Subhash Bhushan wrote: > Hi, > > I have two fields in the bean class, id and title. > After adding th

Re: SOLRJ - Searching text in all fields of a Bean

2010-10-08 Thread Ahmet Arslan
> I have two fields in the bean class, id and title. > After adding the bean to SOLR, I want to search for, say > "kitten", in all > defined fields in the bean, like this -- query.setQuery( > "kitten"); -- > But I get results only when I affix the bean field name > before the search > text like thi

SOLRJ - Searching text in all fields of a Bean

2010-10-08 Thread Subhash Bhushan
Hi, I have two fields in the bean class, id and title. After adding the bean to SOLR, I want to search for, say "kitten", in all defined fields in the bean, like this -- query.setQuery( "kitten"); -- But I get results only when I affix the bean field name before the search text like this -- query.

Re: having problem about Solr Date Field.

2010-10-08 Thread Dennis Gearon
So where does translation take place between the QUERIER's time zone, and the eventual VIEWER's time zone? That is done all at the application level? Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn