Re: Applying synonyms increase the data size from MB to GBs

2011-06-05 Thread pravesh
Since you are using expand="true", every time a matching synonym entry is found, the analyzer expands the term into all the synonyms in that set and writes them to the index. This may cause the index to grow in size. -- View this message in context: http://lucene.472066.n3.nabble.com/Applying-synonyms-increase-the-da
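
To make the effect described above concrete, here is a minimal sketch in plain Python. It is not Solr's analyzer code; it only imitates the documented meaning of expand="true" (every term in a comma-separated group maps to the whole group) versus expand="false" (every term maps to the first term of the group). The function names, the sample rules and the sample text are all made up for illustration, and only single-word synonyms are handled.

```python
def build_synonym_map(lines, expand=True):
    """Parse comma-separated synonym groups such as 'tv, television, telly'."""
    mapping = {}
    for line in lines:
        group = [t.strip().lower() for t in line.split(",") if t.strip()]
        if not group:
            continue
        # expand=True: every member maps to the whole group.
        # expand=False: every member maps to the first term only.
        targets = group if expand else [group[0]]
        for term in group:
            mapping[term] = targets
    return mapping


def analyze(text, mapping):
    """Return the tokens that would be emitted for `text` (whitespace split)."""
    out = []
    for token in text.lower().split():
        out.extend(mapping.get(token, [token]))
    return out


# Hypothetical rules and input, for illustration only.
rules = ["tv, television, telly", "car, automobile, auto, vehicle"]
text = "cheap tv for my car"

expanded = analyze(text, build_synonym_map(rules, expand=True))
reduced = analyze(text, build_synonym_map(rules, expand=False))

# Input token count, then emitted token counts with and without expansion.
print(len(text.split()), len(expanded), len(reduced))
```

In this toy input, five tokens become ten with expansion and stay at five without it. The real growth depends on how many synonym groups match and how large they are.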

Expunging deletes from a very large index

2011-06-05 Thread Simon Wistow
Due to some emergency maintenance I needed to run delete on a large number of documents in a 200Gb index. The problem is that it's taking an inordinately long amount of time (2+ hours so far and counting) and is steadily eating up disk space - presumably up to 2x index size which is getting awf

Re: Applying synonyms increase the data size from MB to GBs

2011-06-05 Thread rajini maski
I have flat files (synonym text files), each up to 200 KB. Integrating all of them made the combined text file huge, and I wanted to maintain them separately. So, in order to apply all those synonyms to the same field type, I created one filter tag for each synonym text file. Is it

TIKA INTEGRATION PERFORMANCE

2011-06-05 Thread Naveen Gupta
Hi, since it is PHP, we are using solphp to make curl-based calls. My concern is that for each user we might have 20-40 attachments that need to be indexed each day, and there are many users; we are targeting around 500-1000 users daily. Right now, if you see, we http://loca

Re: Applying synonyms increase the data size from MB to GBs

2011-06-05 Thread Gora Mohanty
On Mon, Jun 6, 2011 at 10:34 AM, rajini maski wrote: > Applying synonyms increased the data size from 28 MB to 10.3 GB > > Before enabling synonyms on a field, the data size was 28 MB. Now, > after applying synonyms, I see that the data folder size has increased to 10.3 > GB. > > Attached is sc

Applying synonyms increase the data size from MB to GBs

2011-06-05 Thread rajini maski
Applying synonyms increased the data size from 28 MB to 10.3 GB. Before enabling synonyms on a field, the data size was 28 MB. Now, after applying synonyms, I see that the data folder size has increased to 10.3 GB. Attached is the schema field type for that field:

RE: java.io.IOException: The specified network name is no longer available

2011-06-05 Thread Gaurav Shingala
Hi, Yes, you are right: I have a remote file system. I have also checked and confirmed that there was no issue with the network. One more thing I need to mention is that I found the same bug filed as SOLR-2235 on the ASF JIRA. Thanks, Gaurav > Date: Fri, 3 Jun 2011 09:13:00 -0400 > Subject: Re: java.io.I

Re: synonyms problem

2011-06-05 Thread deniz
Well, I have changed it into text... but I am still confused about how to use synonyms... and I also want to know how to call synonym methods in Java... I have tried to use SynonymMap and some other similar things but nothing happens... can anyone give me a sample or a website where I can find examples

java.lang.AbstractMethodError at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:55)

2011-06-05 Thread idivad
Hello, I am new to solr and followed the instruction on ; I am executing the following command in a shell ubuntu1@ubuntu1:/opt/solr$ curl "http://192.168.1.126:8080/solrprod/update/extract?literal.id=doc2&captureAttr=true&defaultField=text&fmap.d

Re: Solr Field name restrictions

2011-06-05 Thread Erick Erickson
I'd stay away from dashes too. It's too easy for the query parsers to mistake them for the NOT operator on a URL. You've really got two issues here: 1> what is allowable in the field name 2> what causes grief with some query parser. To avoid <2>, I'd really just stick with characters and undersco
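
The advice above can be turned into a quick check before field names go into a schema. This is only a sketch: the pattern below (letters, digits and underscores, not starting with a digit) is an assumption about what counts as a "safe" name, not a rule taken from Solr itself, and the function name is hypothetical.

```python
import re

# Assumed "safe" pattern: ASCII letters, digits and underscores,
# not starting with a digit. This is a convention, not Solr's own rule.
SAFE_FIELD_NAME = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def is_safe_field_name(name):
    """Return True if `name` avoids characters that query parsers may misread."""
    return SAFE_FIELD_NAME.fullmatch(name) is not None


for candidate in ["price_usd", "first-name", "2nd_author", "title"]:
    print(candidate, is_safe_field_name(candidate))
```

A name such as first-name is rejected here because the dash can be read as the NOT operator in a query.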

Re: Solr Indexing Patterns

2011-06-05 Thread Erick Erickson
See: http://wiki.apache.org/solr/SchemaXml By adding multiValued="true" to the field, you can add the same field multiple times in a doc, something like value1 value2 But there's no real ability in Solr to store "sub documents", so you'd have to get creative in how you encoded the
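
As a rough illustration of adding the same field several times to one document, the sketch below builds a Solr XML update message with a repeated field element. The field names "id" and "tag" and the helper name are made up for the example; the field would need multiValued="true" in schema.xml for Solr to accept more than one value.

```python
import xml.etree.ElementTree as ET


def build_add_doc(doc_id, values, field_name="tag"):
    """Build an <add><doc>...</doc></add> message with a repeated field.

    `field_name` is a hypothetical multiValued field; "id" is assumed
    to be the unique key of the schema.
    """
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    id_field = ET.SubElement(doc, "field", name="id")
    id_field.text = doc_id
    # One <field> element per value, all sharing the same name.
    for value in values:
        field = ET.SubElement(doc, "field", name=field_name)
        field.text = value
    return ET.tostring(add, encoding="unicode")


message = build_add_doc("doc1", ["value1", "value2"])
print(message)
```

The resulting string can be posted to the update handler like any other add message.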

Re: URGENT HELP: Improving Solr indexing time

2011-06-05 Thread Rohit Gupta
Thanks Fuad, I have started working on optimizing my database structure. Since the tables are huge in terms of records, the optimization is taking time. I will post the results when it is complete. Regards, Rohit From: Fuad Efendi To: "Solr-User@Lucene. Org" Sent: Sun,

Re: Search with Synonyms in two fields

2011-06-05 Thread occurred
OK, thanks for the answer. My idea now is to store both field values in one field and add a very distinctive prefix and suffix to the values from field2. The synonyms then also need the same special prefix and suffix. Cheers, Charlie On 05.06.11 01:33, Jonathan Rochkind [via Lucene] wrote: > T
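
The idea in this message could look roughly like the sketch below. Everything in it is an assumption made for illustration: the marker string, the helper names, and the choice to wrap each whitespace-separated token rather than the whole value. It only shows the string handling, not how the combined field would be analyzed by Solr.

```python
MARK = "xx2xx"  # hypothetical marker; pick something that never occurs in real text


def mark(value, marker=MARK):
    """Wrap every whitespace-separated token of `value` in the marker."""
    return " ".join(f"{marker}{token}{marker}" for token in value.split())


def combine(field1_value, field2_value):
    """Build the single combined field: field1 as-is, field2 marked."""
    return f"{field1_value} {mark(field2_value)}"


def mark_synonym_line(line):
    """Apply the same marker to each term of a comma-separated synonym line."""
    return ", ".join(mark(term.strip()) for term in line.split(","))


print(combine("red apple", "fruit"))
print(mark_synonym_line("tv, television"))
```

With this layout the marked synonym rules can only match tokens that came from field2, because unmarked tokens from field1 never carry the marker.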

Re: Search with Synonyms in two fields

2011-06-05 Thread occurred
Yeah, this is also a way to go. I was thinking about it, but it isn't a very elegant solution. Cheers, Charlie On 05.06.11 00:51, lee carroll [via Lucene] wrote: > your app can do the above search? > > On 4 June 2011 20:42, occurred <[hidden email] > > wrote: > > > No, there should be only one fie