from:"pravesh"

Multi threaded document atomic OR in-place updates

2018-04-03 Thread pravesh

I have a scenario as follows: There are 2 separate threads where each will try to update the same document in a single index for 2 separate fields, for which we are using atomic OR in-place updates. For e.g. id is the unique field in the index thread-1 will update following info: id:1001 field-1

Re: Problem with Synonyms

2013-09-03 Thread pravesh

SOLR has a nice analysis page. You can use it to get insight what is happening after each filter is applied at index/search time Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Problem-with-Synonyms-tp4087905p4087915.html Sent from the Solr - User mailing

Re: AW: Surprising score?

2013-07-05 Thread pravesh

>>Is there a way to omitNorms and still be able to use {!boost b=boost} ? OR you could let /omitNorms="false"/ as usual and have your custom Similarity implementation with the length normalization method overridden for using a constant value of 1. Regards Pravesh -- View

Re: SOLR guidance required

2013-05-10 Thread pravesh

ttp://wiki.apache.org/solr/CommonQueryParameters> http://wiki.apache.org/solr/SimpleFacetParameters <http://wiki.apache.org/solr/SimpleFacetParameters> BTW, almost all the job sites out there (whether small/medium/big) use SOLR/lucene to power their searches :) Best Pravesh -- View thi

Re: TooManyClauses: maxClauseCount is set to 1024

2013-04-18 Thread pravesh

Update: Also remove your range queries from the main query and specify it as a filter query. Best Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/TooManyClauses-maxClauseCount-is-set-to-1024-tp4056965p4056969.html Sent from the Solr - User mailing list archive at

Re: TooManyClauses: maxClauseCount is set to 1024

2013-04-18 Thread pravesh

Just increase the value of /maxClauseCount/ in your solrconfig.xml. Keep it large enough. Best Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/TooManyClauses-maxClauseCount-is-set-to-1024-tp4056965p4056966.html Sent from the Solr - User mailing list archive at

Re: Response time in client was much longer than QTime in tomcat

2013-01-21 Thread pravesh

SOLR's QTime represents actual time it spent on searching, where as your c# client response time might be the total time spent in sending HTTP request and getting back the response(which might also include parsing the results) . Regards Pravesh -- View this message in context: http://l

Re: need basic information

2012-09-02 Thread pravesh

Do logstash/graylog2 do log processing/searching in real time? Or can scale for real time need? I guess harshadmehta is looking for real-time indexing/search. Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/need-basic-information-tp4004588p4004996.html Sent

Re: need basic information

2012-08-30 Thread pravesh

=true&facet.field=action This way you have facet count for created/updated/deleted etc. Hope this is what u r looking for. Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/need-basic-information-tp4004588p4004637.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Query during a query

2012-08-30 Thread pravesh

Did you checked SOLR Field Collapsing/Grouping. http://wiki.apache.org/solr/FieldCollapsing http://wiki.apache.org/solr/FieldCollapsing If this is what you are looking for. Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Query-during-a-query

Re: Query Time problem on Big Index Solr 3.5

2012-08-30 Thread pravesh

http://lucidworks.lucidimagination.com/display/solr/Distributed+Search+with+Index+Sharding Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Query-Time-problem-on-Big-Index-Solr-3-5-tp4003660p4004630.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Query Time problem on Big Index Solr 3.5

2012-08-30 Thread pravesh

apply to other fields which require very less analysis or have limited unique fields. Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Query-Time-problem-on-Big-Index-Solr-3-5-tp4003660p4004437.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Load Testing in Solr

2012-08-30 Thread pravesh

Hi Dhaivat, JMeter is a nice tool. But it all depends what sort of load are you expecting, how complex queries are you expecting(sorting/filtering/textual searches). You need to consider all these to benchmark. Thanx Pravedsh -- View this message in context: http://lucene.472066.n3.nabble.com

Re: optimum solr core size

2012-08-30 Thread pravesh

your searches/indexing performing over the time? Are there any impact? Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/optimum-solr-core-size-tp4004251p4004424.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: solr indexing slows down after few minutes

2012-08-30 Thread pravesh

Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/solr-indexing-slows-down-after-few-minutes-tp4004337p4004421.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Maximum index size on single instance of Solr

2012-08-30 Thread pravesh

We have a 48GB index size on a single shard. 20+ million documents. Recently migrated to SOLR 3.5 But we have a cluster of SOLR servers for hosting searches. But i do see to migrate to SOLR sharding going forward. Thanx Pravesh -- View this message in context: http://lucene.472066.n3

Re: IndexWrite in Lucene/Solr 3.5 is slower?

2012-06-14 Thread pravesh

BTW, Have you changed the MergePolicy & MergeScheduler settings also? Since Lucene 3.x/3.5 onwards, there have been new MergePolicy & MergeScheduler implementations available, like TieredMergePolicy & ConcurrentMergeScheduler. Regards Pravesh -- View this message in context: h

Re: Dynamically pick dataDirs

2012-05-09 Thread pravesh

x27;s for a single core with multiple index directories in it). BTW, how big your index(es) are? Total documents? total size? etc. If each core is small(MBs/ few GBs) then you could merge few of them together. Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Dynami

Re: SOLR 3.5 Index Optimization not producing single .cfs file

2012-05-04 Thread pravesh

lters) on both the system(1.4.1 vs 3.5) and found that 3.5 searches takes longer time than the 1.4.1(around 10-20% slower). Haven't done any load test till now Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-3-5-Index-Optimization-not-producing-s

SOLR 3.5 Index Optimization not producing single .cfs file

2012-05-03 Thread pravesh

#x27;ve set the , true Is it something related to the new MergePolicy being used with SOLR 3.x onwards (I suppose it is TieredMergePolicy with 3.x version)? If yes should i change it to the LogByteSizeMergePolicy? Does this change requires complete rebuilt OR will do incrementally? Regards Pra

Re: Any way to get reference to original request object from within Solr component?

2012-03-17 Thread pravesh

Hi Sujit, The Http parameters ordering is above the SOLR level. Don't think this could be controlled at SOLR level. You can append all required values in a single Http param at then break at your component level. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabbl

Re: Basic SOLR help needed

2012-02-19 Thread pravesh

e first term KENTUCKY is searched in the INST_NAME and rest terms like TECH and PADUCAH are searched in your default search field Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Basic-SOLR-help-needed-tp3759855p375.html Sent from the Solr - User mailing lis

Re: Sorting and searching on a field

2011-12-15 Thread pravesh

be the optimal solution for doing it. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Sorting-and-searching-on-a-field-tp3584992p3587906.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr Search Across Multiple Cores not working when quering on specific field

2011-12-14 Thread pravesh

a particular field.? Since you have multiple shard infra, do the cores share the same configurations(schema.xml/solrconfig.xml etc.)?? What error/output you are getting for sharded query? Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Search-Across-Mult

Generic RemoveDuplicatesTokenFilter

2011-12-12 Thread pravesh

for the above implementation :) Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Generic-RemoveDuplicatesTokenFilter-tp3581656p3581656.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How to improve facet search?

2011-12-08 Thread pravesh

searching done in milis/secs/mins?? I am trying to understand if your search could already be performing quite good/OK. Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-improve-facet-search-tp3569910p3570048.html Sent from the Solr - User mailing list archive

Re: How to improve facet search?

2011-12-08 Thread pravesh

What is the type of the field on which you are getting facets (string, Text, int, date etc.). Is it multivalued or not? How many unique values do you have for the field? What is your filtercache setting in your solrconfig.xml? Regards Pravesh -- View this message in context: http://lucene

Re: Solr using very high I/O

2011-12-08 Thread pravesh

Can u share more info: like what is your H/W infra, CPU, RAM, HDD?? >From where you pick the records/documents to index; RDBMS, Files, Network?? Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-using-very-high-I-O-tp3567076p3569903.html Sent from the S

Re: Solr or SQL fultext search

2011-12-08 Thread pravesh

Go ahead with SOLR based text search. Thats what it is meant for and does it great. Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-or-SQL-fultext-search-tp3566654p3569894.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr sorting issue : can not sort on multivalued field

2011-12-07 Thread pravesh

Was that field multivalued="true" earlier by any chance??? Did you rebuild the index from scratch after changing it to multivalued="false" ??? Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-sorting-issue-can-not-sort-

Re: cache monitoring tools?

2011-12-07 Thread pravesh

>>facet.limit=50 your facet.limit seems too high. Do you actually require this much? Since there a lot of evictions from filtercache, so, increase the maxsize value to your acceptable limit. Regards Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com

Re: how to make effective search with fq and q params

2011-11-22 Thread pravesh

. Also, avoid using the q=*:* as it implicitly translates to matchalldocsquery Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-make-effective-search-with-fq-and-q-params-tp3527217p3527535.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: best way for sum of fields

2011-11-07 Thread pravesh

I Guess, This has nothing to do with search part. You can post process the search results(I mean iterate through your results and sum it) Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/best-way-for-sum-of-fields-tp3477517p3486536.html Sent from the Solr

Re: to prevent number-of-matching-terms in contributing score

2011-11-07 Thread pravesh

effect(also set /omitNorms="true"/ for your fields where you need this feature) Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/to-prevent-number-of-matching-terms-in-contributing-score-tp3486373p3486512.html Sent from the Solr - User mailing list

Re: to prevent number-of-matching-terms in contributing score

2011-11-07 Thread pravesh

Did you rebuild the index from scratch. Since this is index time factor, you need to build complete index from scratch. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/to-prevent-number-of-matching-terms-in-contributing-score-tp3486373p3486447.html Sent from

Re: SolrJ - threading, http clients, connection managers

2011-11-07 Thread pravesh

>1) Is it safe to reuse a single _mgr and _client across all 28 cores? both are thread-safe API as per HttpClient specs. You shld go ahead with this. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/SolrJ-threading-http-clients-connection-manag

Re: Getting single documents by fq on unique field, performance

2011-10-21 Thread pravesh

This approach seems fine. You might benchmark it through load test etc. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Getting-single-documents-by-fq-on-unique-field-performance-tp3440229p3440351.html Sent from the Solr - User mailing list archive at

Re: Painfully slow indexing

2011-10-21 Thread pravesh

Are you posting through HTTP/SOLRJ? Your script time 'T' includes time between sending POST request -to- the response fetched after successful response right?? Try sending in small batches like 10-20. BTW how many documents are u indexing??? Regds Pravesh -- View this message

Re: inconsistent results when faceting on multivalued field

2011-10-21 Thread pravesh

ich have term 1234567 only (facet.query would apply to the facets,so as to which facet to be picked/shown) Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/inconsistent-results-when-faceting-on-multivalued-field-tp3438991p3440128.html Sent from the Solr - Us

Re: hierarchical synonym

2011-10-20 Thread pravesh

If I understood correctly, this seems you are wanting facets/hierarchical facets. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/hierarchical-synonym-tp344p3440090.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: text search and data aggregation, thoughts?

2011-10-14 Thread pravesh

in the item-detail page then option-1 seems better. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/text-search-and-data-aggregation-thoughts-tp3416330p3421361.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: upgrading 1.4 to 3.x

2011-10-14 Thread pravesh

Just look into your tomcat logs in more detail.specifically the logs when tomcat loads the solr application's web context. There you might find some clues or just post the logs snapshot here. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/upgrading-1

Re: Problem while getting more than 100 records from solr 1.4.1

2011-10-12 Thread pravesh

Can you provide the tomcat logs full stack trace for further assistance Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Problem-while-getting-more-than-100-records-from-solr-1-4-1-tp3415443p3415619.html Sent from the Solr - User mailing list archive at

Re: upgrading 1.4 to 3.x

2011-10-12 Thread pravesh

This link might help: http://www.lucidimagination.com/blog/2011/04/01/solr-powered-isfdb-part-8/ http://www.lucidimagination.com/blog/2011/04/01/solr-powered-isfdb-part-8/ Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/upgrading-1-4-to-3-x-tp3415044p3415546

Re: Problem while getting more than 100 records from solr 1.4.1

2011-10-12 Thread pravesh

/wget /or /curl / locally from your solr server for search with page size >100, to check it it works. Regds Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Problem-while-getting-more-than-100-records-from-solr-1-4-1-tp3415443p3415541.html Sent from the Solr - User

Re: how to retrieve only the updated data from database using solr?

2011-10-10 Thread pravesh

If you mean using DIH? Then you need to have a timestamp column in your DB, which has to be updated to current timestamp, whenever you are modifying the record in DB. For rest just go through the http://wiki.apache.org/solr/DataImportHandler DIH wiki here Thanx Pravesh -- View this message in

RE: searching documents partially

2011-10-10 Thread pravesh

ine the index which i will be making a search in, in query time? Then you are just duplicating the index and you might stuck-up in maintenance issues Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/searching-documents-partially-tp3408429p3411743.html Sent

Re: searching documents partially

2011-10-10 Thread pravesh

Can you clarify following: 1) Is it that: You want to hide some documents from search when user is not logged-in? OR 2) Is it that: You want to hide some fields of some documents from search when user is not logged-in? For Point 2; one solution can be that while indexing the documents, you can

Re: Hierarchical faceting with Date

2011-10-05 Thread pravesh

You count index the date as a text field(or use a new text field to store date as text) and then try it on this new field Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Hierarchical-faceting-with-Date-tp3394521p3395824.html Sent from the Solr - User mailing

Re: is there a way to know which mm value was used?

2011-10-05 Thread pravesh

You can explicitly pass /mm/ for every search, and get it in your response, otherwise use /debugQuery=true/, it will give you all implicitly used defaults (but you wouldn't want to use this in production) Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/is-

Re: boosting and relevancy options from solr extensibility points -java-

2011-10-05 Thread pravesh

hat current >user's friends (source:facebook) purchased/used/... You can check https://cwiki.apache.org/confluence/display/MAHOUT/Recommender+Documentation apache mahout for this purpose. It's got recommendation engine that works pretty well. Thanx Pravesh -- View this message in c

Re: what is scheduling ? why should we do this?how to achieve this ?

2011-08-29 Thread pravesh

The Wiki link that you referred is quite old and is not into active development. I would prefer the OS based scheduling using cron jobs. You can check below link. http://wiki.apache.org/solr/CollectionDistribution http://wiki.apache.org/solr/CollectionDistribution Thanx Pravesh -- View this

Re: Viewing the complete document from within the index

2011-08-29 Thread pravesh

esire. (It's sort of reverse engineering not supported) Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Viewing-the-complete-document-from-within-the-index-tp3288076p3292111.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: how i am getting data in my search field eventhough i removed data in my remote database?

2011-08-29 Thread pravesh

http://lucene.472066.n3.nabble.com/how-to-update-solr-cache-when-i-delete-records-from-remote-database-td3291879.html http://lucene.472066.n3.nabble.com/how-to-update-solr-cache-when-i-delete-records-from-remote-database-td3291879.html -- View this message in context: http://lucene.472066.n3.na

Re: how to update solr cache when i delete records from remote database?

2011-08-29 Thread pravesh

You would have to delete them from SOLR also, and then commit it (commit will automatically refresh your caches). Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-update-solr-cache-when-i-delete-records-from-remote-database-tp3291879p3292074.html Sent

Re: what is scheduling ? why should we do this?how to achieve this ?

2011-08-29 Thread pravesh

(cron/scheduled-tasks). Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/what-is-scheduling-why-should-we-do-this-how-to-achieve-this-tp3287115p3292068.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How could I monitor solr cache

2011-07-18 Thread pravesh

This might be of some help: http://wiki.apache.org/solr/SolrJmx http://wiki.apache.org/solr/SolrJmx Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/How-could-I-monitor-solr-cache-tp3181317p3181407.html Sent from the Solr - User mailing list archive at

Re: SOLR Shard failover Query

2011-07-18 Thread pravesh

ster in load balanced manner? Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-Shard-failover-Query-tp3178175p3181400.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Start parameter messes with rows

2011-07-18 Thread pravesh

hine >?can a single core be presented in different shards ? You might look into following thread: http://lucene.472066.n3.nabble.com/difference-between-shard-and-core-in-solr-td3178214.html Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Start-parameter-m

Re: difference between shard and core in solr

2011-07-17 Thread pravesh

ted across a cluster of machines. So all shards belonging to a single core share same schema/analysis/search requirements. You go with sharding when index is not scalable on a single machine, or, when your index grows really big in size. Thanx Pravesh -- View this message in context: http://luce

Re: Deleted docs in IndexWriter Cache (NRT related)

2011-07-17 Thread pravesh

commit would be the safest way for making sure the deleted content doesn't show up. Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Deleted-docs-in-IndexWriter-Cache-NRT-related-tp3177877p3178179.html Sent from the Solr - User mailing list archive at Nabble.com.

SOLR Shard failover Query

2011-07-17 Thread pravesh

fails, or, is this handled gracefully by SOLR? Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-Shard-failover-Query-tp3178175p3178175.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Is it possible to extract all the tokens from solr?

2011-07-14 Thread pravesh

You can use lucene for doing this. It provides TermEnum API to enumerate all terms of field(s). SOLR-1.4.+ also provides a special request handler for this purpose. Check it if that helps Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Is-it-possible-to

Re: How do I specify a different analyzer at search-time?

2011-07-13 Thread pravesh

You can configure analyzer for 'index-time' & for 'search-time' for each of your field-types in schema.xml Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/How-do-I-specify-a-different-analyzer-at-search-time-tp3159463p3165593.html S

Re: POST for queries, length/complexity limit of fq?

2011-07-13 Thread pravesh

>1. I assume that it's worthwhile to rely on POST method instead of GET when issuing a search. Right? As I can see, this should work. We do restrict users search by passing unique id's(sometimes in thousands) in 'fq' and use POST method Thanx Pravesh -- View this me

Re: how to build lucene-solr (espeically if behind a firewall)?

2011-07-13 Thread pravesh

If behind proxy; then use: ant dist ${build_files:autoproxy} Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-build-lucene-solr-espeically-if-behind-a-firewall-tp3163038p3165568.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Re:OOM at solr master node while updating document

2011-07-07 Thread pravesh

You just need to allocate more heap to your JVM. BTW are you doing any complex search while indexing is in progress, like getting large set of documents. Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/OOM-at-solr-master-node-while-updating-document

Re: Removing duplicate documents from search results

2011-06-23 Thread pravesh

Would you care to even index the duplicate documents? Finding duplicacy in content fields would be not so easy as in some untokenized/keyword field. May be you could do this filtering at indexing time before sending the document to SOLR. Then the question comes, which one document should go(from a

Re: Read past EOF error due to broken connection

2011-06-22 Thread pravesh

Did you do manual copy of index from Master to Slave of servers. I suppose, it won't be copied properly. If this is the case, then you can check the size of indexes on both servers. Otherwise, you would've to recreate the indexes. Thanx Pravesh -- View this message in context: htt

Re: Search is taking long-long time.

2011-06-22 Thread pravesh

Was your searches always slow, OR, since you did some changes at index/config/schema level? Is it due to 5-mins index updation? Are you warming ur searches? Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Search-is-taking-long-long-time-tp3095306p3098552.html

Re: Read past EOF error due to broken connection

2011-06-21 Thread pravesh

First commit and then try again to search. You can also use lucene's CheckIndex tool to check & fix your index (it may remove some corrupt segments in your index) Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/Read-past-EOF-error-due-to-broken-c

Re: relevant result for query with boost factor on parameters

2011-06-20 Thread pravesh

evance) >if both the fields do not contain these terms together (show as normal one >with field1 having more relevance than field2) You wud've to experiment with different boost values to arrive at some benchmark. Start with same for field-1 & field-2, then inc. for field-1 a litt

Re: relevant result for query with boost factor on parameters

2011-06-20 Thread pravesh

t off Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/relevant-result-for-query-with-boost-factor-on-parameters-tp3079337p3085406.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: difficult sort

2011-06-17 Thread pravesh

Yes. Then I beleive you would need multiple queries -- View this message in context: http://lucene.472066.n3.nabble.com/difficult-sort-tp3075563p3075802.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Search failed even if it has the keyword .

2011-06-17 Thread pravesh

What is the type for the field's defaultquery & title in your schema.xml ? -- View this message in context: http://lucene.472066.n3.nabble.com/Search-failed-even-if-it-has-the-keyword-tp3075626p3075797.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: difficult sort

2011-06-17 Thread pravesh

I'm not sure, but have looked at Collapsing feature in SOLR yet? You may have to apply patch for 1.4.1 version, if this is what u want? -- View this message in context: http://lucene.472066.n3.nabble.com/difficult-sort-tp3075563p3075661.html Sent from the Solr - User mailing list archive at Nabb

Re: Search failed even if it has the keyword .

2011-06-17 Thread pravesh

First check, in your schema.xml, which is your default search field. Also look if you are using WordDelimiterFilterFactory in your schema.xml for the specific field. This would tokenize your words on every capital letter, so, for the word "DescribeYourImageWithAMovieTitle" will be broken into multi

Re: SOlR -- Out of Memory exception

2011-06-16 Thread pravesh

If you are sending whole CSV in a single HTTP request using curl, why not consider sending it in smaller chunks? -- View this message in context: http://lucene.472066.n3.nabble.com/SOlR-Out-of-Memory-exception-tp3074636p3075091.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: High 100% CPU usage with SOLR 1.4.1

2011-06-15 Thread pravesh

mples). My actual production load is much less than that(3hrs cycle is actually spans to 24 hrs on production). I will repeat this with actual load now. Thanx all 4 ur time :) Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/High-100-CPU-usage-with-SOLR-

Re: problem with the new IndexSearcher when snpainstaller (and commit script) happen

2011-06-15 Thread pravesh

pe this helps Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/problem-with-the-new-IndexSearcher-when-snpainstaller-and-commit-script-happen-tp3066902p3068903.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: High 100% CPU usage with SOLR 1.4.1

2011-06-15 Thread pravesh

ex of about 32GB with 100+ fields indexed,18 fields stored & using an optimized index for search Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/High-100-CPU-usage-with-SOLR-1-4-1-tp3068667p3068778.html Sent from the Solr - User mailing list archive at Nabble.com.

High 100% CPU usage with SOLR 1.4.1

2011-06-15 Thread pravesh

ibly NIOFSDirectory attributes to high CPU? Is there a mechanism in 1.4.1 to use the SimpleFSDirectory implementation for searching(would this require full re-index)? Help will be appreciated :) Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/High-100-CPU-us

Re: How do I make sure the resulting documents contain the query terms?

2011-06-06 Thread pravesh

>k0 --> A | C >k1 --> A | B >k2 --> A | B | C >k3 --> B | C >Now let q=k1, how do I make sure C doesn't appear as a result since it doesn't contain any occurence of k1? Do we bother to do that. Now that's what lucene does :) -- View this message in context: http://lucene.472066.n3.nabble.com/Ho

Re: Feature: skipping caches and info about cache use

2011-06-06 Thread pravesh

SOLR1.3+ logs only the fresh queries in the logs. If you re-run the same query then it is served from cache, and not printed on the logs(unless cache(s) are not warmed or sercher is reopened). So, Otis's proposal would definitely help in doing some benchmarks & baselining the search :) -- View t

Re: Applying synonyms increase the data size from MB to GBs

2011-06-05 Thread pravesh

Since you r using expand="true" , so, every time a matching synonym entry is found the analyzer is expanding the term with all synonyms set in the index. This may cause the index to grow in size. -- View this message in context: http://lucene.472066.n3.nabble.com/Applying-synonyms-increase-the-da

Re: Strategy --> Frequent updates in our application

2011-06-03 Thread pravesh

You can go ahead with the Master/Slave setup provided by SOLR. Its trivial to setup and you also get SOLR's operational scripts for index synch'ing b/w Master-to-Slave(s), OR the Java based replication feature. There is no need to re-invent other architecture :) -- View this message in context:

Re: Sorting

2011-06-03 Thread pravesh

BTW, why r u sorting on this field? You could also index & store this field twice. First, in its original value, and then second, by encoding to some unique code/hash and index it and sort on that. -- View this message in context: http://lucene.472066.n3.nabble.com/Sorting-tp3017285p3019055.html

Re: Strategy --> Frequent updates in our application

2011-06-03 Thread pravesh

You can use DataImportHandler for your full/incremental indexing. Now NRT indexing could vary as per business requirements (i mean delay cud be 5-mins ,10-mins,15-mins,OR, 30-mins). Then it also depends on how much volume will be indexed incrementally. BTW, r u having Master+Slave SOLR setup? -- V

Re: Re: Anyway to know changed documents?

2011-06-01 Thread pravesh

SOLR wiki will provide help on this. You might be interested in pure Java based replication too. I'm not sure,whether SOLR operational will have this feature(synch'ing only changed segments). You might need to change configuration in searchconfig.xml -- View this message in context: http://lucene

Re: Query problem in Solr

2011-06-01 Thread pravesh

>>We're using Solr to search on a Shop index and a Product index Do you have 2 separate indexes (using distributed shard search)?? I'm sure you are actually having only single index. >> Currently a Shop has a field `shop_keyword` which also contains the >> keywords of the products assigned to it.

Re: Anyway to know changed documents?

2011-06-01 Thread pravesh

If your index size if smaller (a few 100 MBs), you can consider the SOLR's operational script tools provided with distribution to sync indexes from Master to Slave servers. It will update(copies) the latest index snapshot from Master to Slave(s). SOLR wiki provides good info on how to set them as C

Can we stream binary data with StreamingUpdateSolrServer ?

2011-05-30 Thread pravesh

Hi, I'm using StreamingUpdateSolrServer to post a batch of content to SOLR1.4.1. By looking at StreamingUpdateSolrServer code, it looks it only provides the content to be streamed in XML format only. Can we use it to stream data in binary format? -- View this message in context: http://lucene

RE: FieldCache

2011-05-26 Thread pravesh

Since FieldCache is an expert level API in lucene, there is no direct control provided by SOLR/Lucene to control its size. -- View this message in context: http://lucene.472066.n3.nabble.com/FieldCache-tp2987541p2989443.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: What is omitNorms

2011-05-26 Thread pravesh

What would be the default value for omitNorms? --- Default value is "false" Is general advise to ignore this and set the value explicitly? --- Depends on your requirement. Do this on field-per-field basis. Set to "false" on fields where you want the norms, or, set to "true" on fields where you w

Re: How does Solr's MoreLikeThis component internally work to get results?

2011-05-26 Thread pravesh

This will help: http://cephas.net/blog/2008/03/30/how-morelikethis-works-in-lucene/ -- View this message in context: http://lucene.472066.n3.nabble.com/How-does-Solr-s-MoreLikeThis-component-internally-work-to-get-results-tp2938407p2988487.html Sent from the Solr - User mailing list archive at

Re: Huge performance drop in distributed search w/ shards on the same server/container

2011-05-26 Thread pravesh

Do you really require multi-shards? Single core/shard will do for even millions of documents and the search will be faster than searching on multi-shards. Consider multi-shard when you cannot scale-up on a single shard/machine(e.g, CPU,RAM etc. becomes major block). Also read through the SOLR di

Re: how to integrate solr with spring framework

2011-05-26 Thread pravesh

Just read through: http://www.springbyexample.org/examples/solr-client.html http://static.springsource.org/spring-roo/reference/html/base-solr.html -- View this message in context: http://lucene.472066.n3.nabble.com/how-to-integrate-solr-with-spring-framework-tp2955540p2988363.html Sent from th

RE: Out of memory on sorting

2011-05-26 Thread pravesh

For saving Memory: 1. allocate as much memory to the JVM (especially if you are using 64bit OS) 2. You can set "omitNorms=true" for your date & id fields (actually for all fields where index-time boosting & length normalization isn't required. This will require a full reindex) 3. Are you sorting o

Re: Too many Boolean Clause and Filter Query

2011-05-26 Thread pravesh

I'm sure you can fix this by increasing value to some max. This shld apply to filter query as well -- View this message in context: http://lucene.472066.n3.nabble.com/Too-many-Boolean-Clause-and-Filter-Query-tp2974848p2988190.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: What is document tag in data-config.xml of Solr

2011-05-26 Thread pravesh

tag represents to the actual SOLR document that will be posted by the DIH. This mapping is used by the DIH to map DB-to-index document. You can have multiple tags, as you might be pulling data from more than 1 table. You can only have one tag in you db-data-config.xml (remember, the purpose of

1 2 >

1 - 100 of 103 matches

Mail list logo