Re: EmbeddedSolrServer and StreamingUpdateSolrServer

2012-04-17 Thread pcrao
Hi Mikhail Khludnev, You are partially right. i.e. We have two separate processes accessing the same Lucene Directory but they do not run simultaneously. They run one after the other and only after the first one is completed. The commit from the EmbeddedServer is successful and I am posting the lo

Unable to execute query (Transactions not supported) on running fullimport

2012-04-17 Thread nigmail
I am using the full import option with the files as mentioned below: *data-config.xml* *schema.xml* ... ... *solrconfig.xml* data-config.xml The query runs perfec

Re: Solr TransformerException, SocketException: Broken pipe

2012-04-17 Thread JM
> > Hi Guys, > We are experiencing SEVERE exceptions in SOLR (stacktrace below) > Please let me know if anyone has experienced this and have some insight / > pointers on to where and what should I look for to resolve this. > ERROR [solr.servlet.SolrDispatchFilter] - : java.io.IOException: XSLT > t

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-17 Thread Jan Høydahl
Hi, I think Katta integration is nice, but it is not very real-time. What if you want both? Perhaps a Katta/SolrCloud integration could make the two frameworks play together, so that some shards in SolrCloud may be marked as "static" while others are "realtime". SolrCloud will handle indexing t

Re: Can I use Field Aliasing/Renaming on Solr3.3?

2012-04-17 Thread Jan Høydahl
You'll have to upgrade to 3.6. Upgrading is really easy and should be 100% back-compat. Just keep your old config and drop in the new solr.war, then you'll get the new features. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17

Jira 1540

2012-04-17 Thread Ramprakash Ramamoorthy
I am using solr to perform a distributed search. I am using version 1.3 to accommodate older indices in the already existing system. I am able to perform a search over a single shard, even faceting and highlighting works. However, when it comes to distributed search, I get an exception 500. The st

Wrong categorization with DIH

2012-04-17 Thread Ramo Karahasan
Hi, I currently face the following issue: Testing the following SQL statement, which is also used in Solr (DIH), leads to a wrong categorization in Solr: select p.title as title, p.id, p.category_id, p.pic_thumb, c.name as category, c.id as category_id from product p, category c WHERE p.category

Re: Can Solr solve this simple problem?

2012-04-17 Thread Jan Høydahl
Hi, You have many basic questions about search. Can I recommend one of the books? http://lucene.apache.org/solr/books.html Also, you'll find a lot of answers on the Solr WIKI: http://wiki.apache.org/solr/ if you're not aware of it. I think Solr may solve your performance problems well. Whether

Re: Jira 1540

2012-04-17 Thread Jan Høydahl
Simply try using Solr3.6 to read your old 1.3 indices. Chances are that it will work - without the exceptions :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. apr. 2012, at 11:08, Ramprakash Ramamoorthy wrote: > I am using

Re: Wrong categorization with DIH

2012-04-17 Thread Gora Mohanty
On 17 April 2012 14:47, Ramo Karahasan wrote: > Hi, > > > > i currently face the followin issue: > > Testing the following sql statement which is also used in SOLR (DIH) leads > to a wrong categorization in solr: > > select p.title as title, p.id, p.category_id, p.pic_thumb, c.name as > category,

AW: Wrong categorization with DIH

2012-04-17 Thread Ramo Karahasan
I've figured out that this wrong categorization occurs when doing a delta import... I'm doing the delta import as described here: http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport my data-properties.xml looks like: So I run the delta import command like:

making query in query result

2012-04-17 Thread halil
Hi List, I want to run a query against the result of a previous query. I searched the net but could not find anything. How can I do that? I need a starting point. Thanks in advance, -halil agin.

Re: Can Solr solve this simple problem?

2012-04-17 Thread Alexandr Bocharov
Thanks for your replies, you're a good expert :) I've read the basic Solr documentation; I've been familiar with it for around 2 days. The documentation looks huge at first sight :). My company and I are deciding whether to use Solr or another solution. Maybe you're right about re-implementing our sorting func

[Solr 4.0] what is stored in .tim index file format?

2012-04-17 Thread Lyuba Romanchuk
Hi, I have index ~31G where 27% of the index size is .fdt files (8.5G) 20% - .fdx files (6.2G) 37% - .frq files (11.6G) 16% - .tim files (5G) I didn't manage to find the description for .tim files. Can you help me with this? Thank you. Best regards, Lyuba
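The size breakdown quoted above can be checked with a few lines of arithmetic. This is only a sketch using the sizes stated in the message; the variable names are illustrative.

```python
# File sizes in GB, as quoted in the message above.
sizes_gb = {".fdt": 8.5, ".fdx": 6.2, ".frq": 11.6, ".tim": 5.0}

total_gb = sum(sizes_gb.values())

# Share of each file type, rounded to whole percent.
shares = {ext: round(100 * size / total_gb) for ext, size in sizes_gb.items()}

for ext, pct in shares.items():
    print(f"{ext}: {pct}% of {total_gb:.1f}G")
```

The four file types add up to about 31.3G, which matches the "~31G" total, so the listed percentages account for essentially the whole index.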

Re: Can Solr solve this simple problem?

2012-04-17 Thread Jan Høydahl
1. Just trust that Lucene will perform :) Incremental updates are actually stored in separate new index segments with own caches, so all the old existing data is left un-touched with caches in place. 2. Please explain what you expect from "semantic search" which is an overloaded word. 3. On

Re: making query in query result

2012-04-17 Thread halil
I think the answer is the "nested query". Thanks... On Tue, Apr 17, 2012 at 12:52 PM, halil wrote: > Hi List, > > I want to run a query against the result of a previous query. I searched > the net but could not find anything. How can I do that? I need a starting > point. > > > thanks in advance

Re: making query in query result

2012-04-17 Thread Jeevanandam
Halil - I'm describing scenario with sample query below: query 1: (cat:"electronics") - lets say it returns 25 docs in search result query 2: (features:"power") - will be applied on above result i.e. 'query 1'(25 docs) so final result refined to 16 docs in search result. If above scenario
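One common way to express the scenario above in Solr is to send the first query as `q` and the refining query as a filter query (`fq`), so the second condition is applied only to documents matching the first. The sketch below just builds such a request URL; the host, port and handler path are assumptions, and the truncated message above may go on to recommend something different.

```python
from urllib.parse import urlencode

# Hypothetical host and handler path; adjust to your own deployment.
SOLR_SELECT = "http://localhost:8983/solr/select"

def refine_url(main_query, filters):
    """Build a select URL where each fq narrows the result set of q."""
    params = [("q", main_query)]
    params += [("fq", f) for f in filters]
    params.append(("wt", "json"))
    return SOLR_SELECT + "?" + urlencode(params)

# 'query 1' is the main query, 'query 2' is applied on top of its result.
url = refine_url('cat:"electronics"', ['features:"power"'])
print(url)
```

Further refinements can be added as more entries in the `filters` list, each becoming its own `fq` parameter.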

SolrCloud: Programmatically create multiple collections?

2012-04-17 Thread ravi
Hi, I have recently started experimenting with solrCloud. I want to use it for below mentioned requirements: - create one collection per client - create several shards per collection (say 1 shard for each day of month) - all of the collection would follow the same schema. - may need to add

ConcurrentUpdateSolrServer - catching errors

2012-04-17 Thread ads_green
I'm trying to bulk update a number of documents using the ConcurrentUpdateSolrServer. Now under normal circumstances, this is working fine. I create the ConcurrentUpdateSolrServer with suitable queue and connection pool sizes and then submit documents to be added using multiple threads. Watching t

Elevation togehter with grouping

2012-04-17 Thread notebook99
Hi, is it possible to use query elevation together with result grouping? I tried http://localhost:8983/solr/elevate?enableElevation=true&fl=score%2C[elevated]%2Cid%2Cname&forceElevation=true&group.field=manu&group=on&indent=on&q=ipod&wt=json but the results ignored the elevation: { "respon

Re: Difference between two solr indexes

2012-04-17 Thread nutchsolruser
I'm also seeking a solution for a similar problem. -- View this message in context: http://lucene.472066.n3.nabble.com/Difference-between-two-solr-indexes-tp3916328p3917050.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Merge results of search query to multiple cores

2012-04-17 Thread Erick Erickson
Why do you have these in two cores? Why not have them in a single core (two different document types) and do a join? Best Erick On Tue, Apr 17, 2012 at 7:28 AM, nitinkhosla79 wrote: > I have setup solr with multiple cores. Each core has its own schema(with > common unique id). > Example: > Core0

Re: A tool for frequent re-indexing...

2012-04-17 Thread Ravish Bhagdev
Thanks. This is useful to know as well. I was actually after SolrEntityProcessor which I failed to notice until pointed out by previous reply because I'm using 1.4 still. Cheers, Ravish On Fri, Apr 6, 2012 at 11:01 AM, Valeriy F

How sorlcloud distribute data among shards of the same cluster?

2012-04-17 Thread emma1023
How does SolrCloud distribute data among the shards of the same cluster when you query? Does it distribute the data equally? On what basis? Which part of the code handles this? Thank you so much!

Re: Distributed FacetComponent NullPointer Exception

2012-04-17 Thread Jamie Johnson
I'm noticing that this issue seems to be occurring with facet fields which have some unexpected characters. For instance the query that I see going across the wire is as follows facet=true&tie=0.1&ids=3F2504E0-4F89-11D3-9A0C-0305E82C3301&qf=%0a++author^0.5+type^0.5+content_mvtxt^10++subje

Re: Distributed FacetComponent NullPointer Exception

2012-04-17 Thread Yonik Seeley
facet.field={!terms=$organization__terms}organization This is referring to another request parameter that Solr should have added (organization__terms) . Did you cut-n-paste all of the parameters below? -Yonik lucenerevolution.com - Lucene/Solr Open Source Search Conference. Boston May 7-10 On

Re: Distributed FacetComponent NullPointer Exception

2012-04-17 Thread Jamie Johnson
I tried to clean it up a little bit, but removed too much. Here is an example which more closely shows what is in the log. I changed the actual data, hopefully I didn't mess it up. facet=true&tie=0.1&ids=urn:sha256:0bea3adf1415c6c737063122d8abd343b24167bdfc7134a3efaef79b263c0a43&qf=%0a++

what's best to use for monitoring solr 3.6 farm on redhat/tomcat

2012-04-17 Thread Robert Petersen
Hello solr users, Is there any lightweight tool of choice for monitoring multiple solr boxes for memory consumption, heap usage, and other statistics? We have a pretty large farm of RHEL servers running solr now and up until migrating from 1.4 to 3.6 we were running the lucid gaze component on

Solr and TREC Enterprise Track 2007

2012-04-17 Thread obadayh
Dear All, Can anybody tell me how to index TREC Enterprise Track 2007 with Solr? Thanks

Re: Problem with faceting on a boolean field

2012-04-17 Thread Yonik Seeley
On Tue, Apr 17, 2012 at 2:22 PM, Kissue Kissue wrote: > Hi, > > I am faceting on a boolean field called "usedItem". There are a total of > 607601 items in the index and they all have value for "usedItem" set to > false. > > However when i do a search for *:* and faceting on "usedItem", the num > f

Re: Difference between two solr indexes

2012-04-17 Thread Pawel Rog
If there are only 100'000 documents, dump all document ids and diff them. If you're on a Linux-based system, you can do it with simple tools. Something like this may help: curl "http://your.hostA:port/solr/index/select?q=*:*&fl=id&wt=csv" > /tmp/idsA curl "http://your.hostB:port/solr/inde
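The same dump-and-diff idea can be sketched in Python once the two id lists have been saved to files. The file paths mirror the ones in the message; the helper names are made up for illustration. Note that Solr returns only 10 rows by default, so the export request needs a `rows` value large enough to cover the whole index.

```python
def read_ids(path):
    """Read one id per line, skipping blanks and the CSV header row."""
    with open(path) as fh:
        lines = [line.strip() for line in fh if line.strip()]
    # Assumption: wt=csv wrote a header line containing the field name "id".
    return lines[1:] if lines and lines[0] == "id" else lines

def diff_ids(ids_a, ids_b):
    """Return (ids only in A, ids only in B), each sorted."""
    a, b = set(ids_a), set(ids_b)
    return sorted(a - b), sorted(b - a)

# Small in-memory example; with real dumps use
# diff_ids(read_ids("/tmp/idsA"), read_ids("/tmp/idsB")).
only_a, only_b = diff_ids(["1", "2", "3"], ["2", "3", "4"])
print("missing from B:", only_a)
print("missing from A:", only_b)
```

Using sets keeps the comparison independent of the order in which each index returns its documents.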

HTML Indexing error

2012-04-17 Thread Chambeda
Hi All, I am trying to parse some text that contains embedded HTML elements and am getting the following error: FATAL: Solr returned an error #400 Unexpected close tag ; expected . My set up is as follows: schema.xml XML snippet: 1Bose's best bookshelf speak

Re: How sorlcloud distribute data among shards of the same cluster?

2012-04-17 Thread Mark Miller
On Apr 17, 2012, at 9:56 AM, emma1023 wrote: It hashes the id. The doc distribution is fairly even - but sizes may be fairly different. > How solrcloud manage distribute data among shards of the same cluster when > you query? Is it distribute the data equally? What is the basis? Which part > of
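To illustrate what "hashes the id" means in general, here is a minimal sketch of hash-based routing. It is not SolrCloud's actual hash function or shard-assignment logic; `zlib.crc32` with a modulo is swapped in purely for demonstration, and `shard_for` is a made-up name.

```python
import zlib
from collections import Counter

def shard_for(doc_id, num_shards):
    """Map a document id to a shard index with a stable hash (illustrative only)."""
    return zlib.crc32(doc_id.encode("utf-8")) % num_shards

# Route 1000 synthetic ids across 4 shards and count documents per shard.
counts = Counter(shard_for(f"doc-{i}", 4) for i in range(1000))
print(dict(counts))
```

A scheme like this spreads document counts across shards, but, as the reply notes, the shards can still differ in size on disk because the documents themselves differ in size.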

Re: SolrCloud: Programmatically create multiple collections?

2012-04-17 Thread Mark Miller
On Apr 17, 2012, at 7:07 AM, ravi wrote: > Hi, > > I have recently started experimenting with solrCloud. I want to use it for > below mentioned requirements: > > - create one collection per client > - create several shards per collection (say 1 shard for each day of month) > - all of the colle

Re: [Solr 4.0] what is stored in .tim index file format?

2012-04-17 Thread Robert Muir
This is the term dictionary for 4.0's default codec (currently uses BlockTree implementation) .tim is the on-disk portion of the terms (similar in function to .tis in previous releases) .tip is the in-memory "terms index" (similar in function to .tii in previous releases) On Tue, Apr 17, 2012 at

Re: what's best to use for monitoring solr 3.6 farm on redhat/tomcat

2012-04-17 Thread Otis Gospodnetic
Hi Robert, Have a look at SPM for Solr: http://sematext.com/spm/solr-performance-monitoring/index.html It has all Solr metrics, works with 3.*, has a bunch of system metrics, filtering, alerting, email subscriptions, no loss of granularity, and you can use it to monitor other types of systems

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-17 Thread Otis Gospodnetic
I think Jason is right - there is no index splitting in ES and SolrCloud, so one has to think ahead, "overshard", and then count on redistributing shards from oversubscribed nodes to other nodes. No resharding on demand and no index/shard splitting yet. Otis Performance Monitoring SaaS f

Populating a filter cache by means other than a query

2012-04-17 Thread Chris Collins
Hi, I am a long-time Lucene user but new to Solr. I would like to use something like the filterCache, but build such a cache from custom code rather than from a query. I guess I will ask my question using techniques and vocabulary I am familiar with. Not sure it's actually the right way, so I apologize

Re: How sorlcloud distribute data among shards of the same cluster?

2012-04-17 Thread emma1023
Thanks for your reply. In Solr 3.x, we need to manually hash the doc id to a server. How does SolrCloud do this instead? I am working on a project using SolrCloud, but we need to monitor how SolrCloud distributes the data. I cannot find which part of the source code does it. Is it from th

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-17 Thread Jason Rutherglen
> redistributing shards from oversubscribed nodes to other nodes Redistributing shards on a live system is not possible however because the updates in-flight will likely be lost. Also it is not simple technology to build from the ground-up. As is today, one would need to schedule downtime, for m

RE: what's best to use for monitoring solr 3.6 farm on redhat/tomcat

2012-04-17 Thread Robert Petersen
Wow that looks like just what the doctor ordered! Thanks Otis -Original Message- From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] Sent: Tuesday, April 17, 2012 1:29 PM To: solr-user@lucene.apache.org Subject: Re: what's best to use for monitoring solr 3.6 farm on redhat/tomcat

Different solr config under tomcat.

2012-04-17 Thread mizayah
Is there a way to use a different path for solrconfig.xml, for instance solrconfig_slave.xml, under Tomcat? I don't want to run cores. I know how to do it with cores, but I want to have a single instance. Is there any parameter I can use to tell Tomcat to use solrconfig_slave.xml? Please help

Hide results for dataimport - initArgs

2012-04-17 Thread Adolfo Carreno
Hi all. For security reasons I want to hide the result of a dataimport command, specifically the section "initArgs", in order to hide the connection parameters of the database. I removed the "datasource" tag from the config.xml and moved it into the solrconfig.xml, in the requestHandler defined for

Re: Hide results for dataimport - initArgs

2012-04-17 Thread Tomás Fernández Löbbe
I guess this should be possible by setting the "echoParams"=none or explicit as an invariant. For example: none ... I haven't tried it, but I think that should work. On Tue, Apr 17, 2012 at 6:20 PM, Adolfo Carreno wrote: > Hi all.For security reasons I want to hide the

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-17 Thread Lukáš Vlček
Hi, speaking about ES I think it would be fair to mention that one has to specify number of shards upfront when the index is created - that is correct, however, it is possible to give index one or more aliases which basically means that you can add new indices on the fly and give them same alias w

Re: Hide results for dataimport - initArgs

2012-04-17 Thread Adolfo Carreno
Thanks Tomas for your response, unfortunately didn't work, still is presenting the datasource information in each dataimport output: 00dih-tenant1-jdbc.xmlorg.postgresql.Driverjdbc:postgresql://localhost/cloududb1pdb1nbcnonestatusidleThis response format is experimental. It is likely to change in

SOLR 4 / Date Query: Spurious Results: Is it me or ... ?

2012-04-17 Thread vybe3142
I wrote a custom handler that uses externally injected metadata (bypassing Tika et al.). WRT dates, I see them associated with the correct docs when retrieving all docs: BUT: looking at the schema analyzer, things look weird: 1. Top terms = -1 2. The dates are all mixed up with some spurious 197

Re: Difference between two solr indexes

2012-04-17 Thread search engn dev
Thanks, Pawel Rog, for the much-needed reply. I'll give it a try and let you know.

Haystack - Solr recommended solr directory location

2012-04-17 Thread BillB1951
I'm using django-haystack and I am a little confused about where to put /solr and its schema.xml, solr.xml, and solrconfig.xml files. I currently have /solr in the following longer path --- /home/mydir/ solr/apache-solr3.6.0/example/ I'm thinking about moving the guts up a level (getting rid of