Re: PDFBOX 1.3.1 Parsing Error

2010-12-12 Thread Pradeep Singh
If the document is encrypted maybe it isn't meant to be indexed and publicly visible after all? On Sun, Dec 12, 2010 at 10:22 PM, pankaj bhatt wrote: > hi All, >While using PDFBOX 1.3.1 in APACHE TIKA 1.7 i am getting the > following error to parse an PDF Document. > *Error: Expected

Re: Which query parser and how to do full text on mulitple fields

2010-12-12 Thread Pradeep Singh
You said you were using a third party plugin. What do you expect people herre to know? Solr plugins don't have parameters lat, long, radius and threadCount (they have pt and dist). On Sun, Dec 12, 2010 at 4:47 PM, Dennis Gearon wrote: > Which query parser did my partner set up below, and how to I

Re: Rollback can't be done after committing?

2010-11-11 Thread Pradeep Singh
In some cases you can rollback to a named checkpoint. I am not too sure but I think I read in the lucene documentation that it supported named checkpointing. On Thu, Nov 11, 2010 at 7:12 PM, gengshaoguang wrote: > Hi, Kouta: > Any data store does not support rollback AFTER commit, rollback works

Re: command line to check if Solr is up running

2010-10-27 Thread Pradeep Singh
How about - Please do not respond to 20 emails at one time? On Wed, Oct 27, 2010 at 12:33 AM, Lance Norskog wrote: > Please start new threads for new topics. > > > Xin Li wrote: > >> As we know we can use browser to check if Solr is running by going to >> http://$hostName:$portNumber/$masterName

Re: Looking for Developers

2010-10-26 Thread Pradeep Singh
This is the second time he has sent this shit. Kill his subscription. Is it possible? On Tue, Oct 26, 2010 at 10:38 PM, Yuchen Wang wrote: > UNSUBSCRIBE > > On Tue, Oct 26, 2010 at 10:15 PM, Igor Chudov wrote: > > > UNSUBSCRIBE > > > > On Wed, Oct 27, 2010 at 12:14 AM, ST ST wrote: > > > Looki

Re: Multiple Word Facets

2010-10-26 Thread Pradeep Singh
Use this field type - On Tue, Oct 26, 2010 at 6:43 PM, Adam Estrada wrote: > All, > I am a new to Solr faceting and stuck on how to get multiple-word > facets returned from a standard Solr query. See below for what is > currently being returned. > > > > > > 89 > 87 > 87 > 87 > 84

Re: Highlighting for non-stored fields

2010-10-26 Thread Pradeep Singh
Another way you can do this is - after the search has completed, load the field in your application, write separate code to reanalyze that field/document, index it in RAM, and run it through highlighter classes. All this as part of your web application outside of Solr. Considering the size of your

Re: How to use AND as opposed to OR as the default query operator.

2010-10-25 Thread Pradeep Singh
Which query handler are you using? For a standard query handler you can set q.op per request or set defaultOperator in schema.xml. For a dismax handler you will have to work with min should match. On Mon, Oct 25, 2010 at 6:41 AM, Swapnonil Mukherjee < swapnonil.mukher...@gettyimages.com> wrote:

Re: Failing to successfully import international characters via DIH

2010-10-22 Thread Pradeep Singh
Holy cow, you already have this in place. I apologize. This looked exactly the kind of problem I have solved this way. On Fri, Oct 22, 2010 at 8:38 AM, Pradeep Singh wrote: > > >> What would you recommend changing or checking? >> >> > Tomcat *Connector* URIEncodin

Re: Failing to successfully import international characters via DIH

2010-10-22 Thread Pradeep Singh
> > What would you recommend changing or checking? > > Tomcat *Connector* URIEncoding. I have done this several times on tomcat, might be at a loss on other servers though. - Pradeep

Re: Step by step tutorial for multi-language indexing and search

2010-10-20 Thread Pradeep Singh
Here's what I would do - Search all the fields everytime regardless of language. Use one handler and specify all of these in "qf" and "pf". question_en, answer_en, question_fr, answer_fr, question_pl, answer_pl Individual field based analyzers will take care of appropriate tokenization and you w

Re: Spatial

2010-10-20 Thread Pradeep Singh
t the dynamic field type _latLon is of type double and not tdouble by default. Can I have your input about that decision? -Pradeep On Tue, Oct 19, 2010 at 6:10 PM, Grant Ingersoll wrote: > > On Oct 19, 2010, at 6:23 PM, Pradeep Singh wrote: > > > https://issues.apache.org/jir

Re: Mulitple facet - fq

2010-10-20 Thread Pradeep Singh
fq=(category:corporate category:personal) On Wed, Oct 20, 2010 at 7:39 AM, Yavuz Selim YILMAZ wrote: > Under category facet, there are multiple selections, whicih can be > personal,corporate or other > > How can I get both "personal" and "corporate" ones, I tried > fq=category:corporate&fq=

Re: Lucene vs Solr

2010-10-19 Thread Pradeep Singh
Is that right? On Tue, Oct 19, 2010 at 11:08 PM, findbestopensource < findbestopensou...@gmail.com> wrote: > Hello all, > > I have posted an article Lucene vs Solr > http://www.findbestopensource.com/article-detail/lucene-vs-solr > > Please feel free to add your comments. > > Regards > Aditya > w

Spatial

2010-10-19 Thread Pradeep Singh
https://issues.apache.org/jira/browse/LUCENE-2519 If I change my code as per 2519 to have this - public double[] coords(double latitude, double longitude) { double rlat = Math.toRadians(latitude); double rlong = Math.toRadians(longitude); double nlat = rlong * Math.cos(rlat); re

Re: Uppercase and lowercase queries

2010-10-19 Thread Pradeep Singh
Use text field. On Tue, Oct 19, 2010 at 3:19 AM, PeterKerk wrote: > > I want to query on cityname. This works when I query for example: > "Boston" > > But when I query "boston" it didnt show any results. In the database is > stored: "Boston". > > So I thought: I should change the filter on this

Re: I need to indexing the first character of a field in another field

2010-10-18 Thread Pradeep Singh
You can use regular expression based template transformer without writing a separate function. It's pretty easy to use. On Mon, Oct 18, 2010 at 2:31 PM, Renato Wesenauer < renato.wesena...@gmail.com> wrote: > Hello guys, > > I need to indexing the first character of the field "autor" in another >

Admin for spellchecker?

2010-10-18 Thread Pradeep Singh
Do we need an admin screen for spellchecker? Where you can browse the words and delete the ones you don't like so that they don't get suggested?

Re: Spell checking question from a Solr novice

2010-10-18 Thread Pradeep Singh
I think a spellchecker based on your index has clear advantages. You can spellcheck words specific to your domain which may not be available in an outside dictionary. You can always dump the list from wordnet to get a starter english dictionary. But then it also means that misspelled words from yo

Re: Spell checking question from a Solr novice

2010-10-18 Thread Pradeep Singh
I haven't yet but I was going to use the spell checker in the lucene contrib module. That spellchecker is ngram based and previously I have noticed that I get better results from ngram based spellcheck rather than fuzzy string match based ones. On Mon, Oct 18, 2010 at 12:43 PM, Xin Li wrote: > H

facet.field :java.lang.NullPointerException

2010-10-15 Thread Pradeep Singh
Faceting blows up when the field has no data. And this seems to be random. Sometimes it will work even with no data, other times not. Sometimes the error goes away if the field is set to multiValued=true (even though it's one value every time), other times it doesn't. In all cases setting facet.met