Re: Problems with edismax parser and solr3.6

2012-04-18 Thread Jan Høydahl
Hi, Thanks for reporting this. I've created a bug ticket for this at https://issues.apache.org/jira/browse/SOLR-3377 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 18. apr. 2012, at 17:00, Bernd Fehling wrote: > > I just look

Re: Problems with edismax parser and solr3.6

2012-04-18 Thread Erick Erickson
Happened to see that Jan confirms this as a bug, see: https://issues.apache.org/jira/browse/SOLR-3377 On Wed, Apr 18, 2012 at 11:00 AM, Bernd Fehling wrote: > > I just looked through my logs of solr 3.6 and saw several "0 hits" which were > not seen with solr 3.5. > > While tracing this down it

Re: Solr Core not able access latest data indexed by multiple server.

2012-04-18 Thread Erick Erickson
I think you're trying to do something that you shouldn't. The trunk SolrCloud stuff will address this issue, but for the 3.x code line having multiple servers opening up a shared index and writing to it will produce unpredictable results. This is really bad practice. You'd be far ahead setting u

Re: Date granularity

2012-04-18 Thread Erick Erickson
If Peter's suggestion doesn't work, please post the results of adding &debugQuery=on to your query. The date math stuff is sensitive to spaces, for instance and it's impossible to tell whether you're making a simple error like that without seeing what you're actually doing. Best Erick On Wed, Apr

Re: Multiple document structure

2012-04-18 Thread Erick Erickson
Solr does not enforce anything about documents conforming to the schema except: 1> a field specified in a doc must be present in the schema 2> any field in the schema with ' required="true" ' must be present in the doc. Additionally there is no penalty for NOT putting all the fields defined in
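The two rules Erick lists can be sketched outside Solr as a plain check. This is only an illustration of the logic, not Solr code: `check_doc`, `schema_fields` and `required_fields` are hypothetical names, and the sketch ignores dynamic fields.

```python
def check_doc(doc, schema_fields, required_fields):
    """Return a list of problems for one document (empty list means it passes).

    Rule 1: every field in the doc must be declared in the schema.
    Rule 2: every field marked required in the schema must be in the doc.
    Fields that are declared but absent from the doc are not a problem.
    """
    problems = []
    for name in doc:
        if name not in schema_fields:
            problems.append(f"unknown field: {name}")
    for name in sorted(required_fields):
        if name not in doc:
            problems.append(f"missing required field: {name}")
    return problems
```

With a schema of `{"id", "title"}` where only `id` is required, a document holding just `id` passes, which matches the point that unused schema fields carry no penalty.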

Re: Populating a filter cache by means other than a query

2012-04-18 Thread Erick Erickson
Pesky users. Life would be so much easier if they'd just leave devs alone. Right. Well, you can certainly create your own SearchComponent and attach your custom filter at that point; note how I'm skimping on the details here. From left field, you might create a custom FunctionQuery that

Re: Solr 3.6 parsing and extraction files

2012-04-18 Thread Jan Høydahl
Hi, I suppose you want to POST office docs into Solr for text extraction using the Extracting RequestHandler (SolrCell). Have you read this page? http://wiki.apache.org/solr/ExtractingRequestHandler You basically need all libs provided by contrib/extraction. You can see in the example solr/conf/

Re: minimum match and not matched words / term frequency in query result

2012-04-18 Thread Jan Høydahl
Hi, Which query terms match may of course vary from document to document, so it would be hard to globally print non-matching terms. But for each individual document match, you could deduce which terms do not match by enumerating the terms that DO match - using the explain output for instan

Re: Difference between Search result from Admin console and solr/browse

2012-04-18 Thread Jan Høydahl
Hi, The /browse Request Handler is built to showcase the xml documents in solr/example/exampledata and if you want to use it for your own data and schema you must modify the templates in solr/example/conf/velocity/ to display whatever you want to display. Given that you use an unmodified examp

Re: Date granularity

2012-04-18 Thread Peter Markey
you could use a filter query like: fq=datefield:[NOW/DAY-1DAY TO NOW/DAY+1DAY] *replace datefield with your field that contains the time info On Wed, Apr 18, 2012 at 11:11 AM, vybe3142 wrote: > A query search on a particular date: > > returns 1valid result (as expected). > > How can I alter the
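A minimal sketch of sending the filter query above over HTTP, assuming the field is called `datefield` as in Peter's placeholder. The helper name `day_filter_params` is made up for illustration. The point it shows is that the literal `+` and the spaces in the date math must be URL-encoded, otherwise the `+` is read as a space and the range no longer parses as intended.

```python
from urllib.parse import urlencode


def day_filter_params(date_field, query="*:*"):
    """Build an encoded query string with the day-granularity filter query."""
    # Date math as suggested in the reply; the field name is a placeholder.
    fq = f"{date_field}:[NOW/DAY-1DAY TO NOW/DAY+1DAY]"
    # urlencode escapes '+' as %2B and turns spaces into '+'.
    return urlencode({"q": query, "fq": fq})


if __name__ == "__main__":
    print(day_filter_params("datefield"))
```

The resulting string can be appended after `?` to whatever select URL the installation uses.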

RE: Changing precisionStep without a re-index

2012-04-18 Thread Michael Ryan
In case anyone tries to do this... If you facet on a TrieField and change the precisionStep to 0, you'll need to re-index. Changing precisionStep to 0 changes the prefix returned by TrieField.getMainValuePrefix(FieldType), which then causes facets with a value of "0" to be returned. -Michael

Re: Populating a filter cache by means other than a query

2012-04-18 Thread Chris Collins
Great question. The set could be in the millions. I oversimplified the use case somewhat to protect the innocent :-}. If a user is querying a large set of documents (for the sake of argument let's say it's high tens of millions but could be in the small billions), they want to potentially mar

Re: Solr file size limit?

2012-04-18 Thread Shawn Heisey
On 4/18/2012 6:17 AM, Bram Rongen wrote: I've been using Solr for a very short time now and I'm stuck. I'm trying to index a drupal website consisting of 1.2 million smaller nodes and 300k larger nodes (~400kb avg).. A followup to my previous reply: Your ramBufferSizeMB is only 32, the default

Difference between Search result from Admin console and solr/browse

2012-04-18 Thread srini
I have imported my xml documents from oracle database and indexed them. When I search *:* in *admin console *I do get results. My xml format is not close to what solr expects. but still when I search for any word that is part of my xml document Solr displays whole xml document. for example if I sea

Re: Solr file size limit?

2012-04-18 Thread Shawn Heisey
On 4/18/2012 6:17 AM, Bram Rongen wrote: I'm using Solr 3.5 on a dedicated Ubuntu 10.04 box with 3TB of diskspace and 16GB of memory. I've tried using the sun JRE and OpenJDK, both resulting in the same problem. Indexing works great until my .fdt file reaches the size of 4.9GB/ 5217987319b. At th

Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Jeevanandam Madanagopal
Chris - If you have defined 'last-components' in search handler, forceElevation=true may not be required. It gets invoked in search life cycle elevator -Jeevanandam On Apr 18, 2012, at 11:37 PM, Chris Warner wrote: > Thanks to those who responded. A more thorough reading of the wiki an

Suggester

2012-04-18 Thread John
Using Solr 3.6, I am trying to get suggestions for phrases. I managed to get prefix suggestions, but not suggestions from the middle of a phrase. Can this be achieved with the built-in Solr suggester, or do I need to create a special core for this purpose? Thanks in advance.

Re: hierarchical faceting?

2012-04-18 Thread Charlie Maroto
The PathHierarchyTokenizerFactory is intended for file paths and therefore assumes that all documents should be indexed with all of the paths to the parent folders, but you are trying to use it for a taxonomy, so you can't simply use the PathHierarchyTokenizerFactory. Use the analysis page ( http://loc

Date granularity

2012-04-18 Thread vybe3142
A query search on a particular date: returns 1 valid result (as expected). How can I alter the granularity of the search, for example, to all matches on the particular DAY? Reading through various docs, I attempt to append "/DAY" but this doesn't seem to work (in fact I get 0 results back when qu

Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Chris Warner
Thanks to those who responded. A more thorough reading of the wiki and I see the need for forceElevation=true in the elevate query. Cheers, Chris - Original Message - From: Otis Gospodnetic To: "solr-user@lucene.apache.org" ; Chris Warner Cc: Sent: Wednesday, April 18, 2012 10:23 AM

Re: hierarchical faceting?

2012-04-18 Thread Darren Govoni
I don't use any of that stuff in my app, so not sure how it works. I just manage my taxonomy outside of solr at index time and don't need any special fields or tokenizers. I use a string field type and insert the proper field at index time and query it normally. Nothing special required. On Wed,

Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Chris Warner
Browsing all documents and all facets, skipper. Cheers, Chris   - Original Message - From: Walter Underwood To: solr-user@lucene.apache.org Cc: Sent: Wednesday, April 18, 2012 10:29 AM Subject: Re: Can you suggest a method or pattern to consistently promote a document with any query?

Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Walter Underwood
That is not a useful test. Users don't look for *:*. Test with real queries. wunder On Apr 18, 2012, at 10:27 AM, Chris Warner wrote: > Thanks, Jeevanandam and Otis, > > I'll take another look at Elevate. My first attempts did not yield success, > as I was not able to find a way to elevate a

Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Chris Warner
Thanks, Jeevanandam and Otis, I'll take another look at Elevate. My first attempts did not yield success, as I was not able to find a way to elevate a document with a *:* query. Perhaps I'll try a * query to see what happens. Cheers, Chris   - Original Message - From: Jeevanandam Mad

Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Otis Gospodnetic
Chris, I haven't checked if Elevate Component has an easy way to push a specific doc for *all* queries, but have a look http://wiki.apache.org/solr/QueryElevationComponent Otis  Performance Monitoring SaaS for Solr - http://sematext.com/spm/solr-performance-monitoring/index.html -

Re: Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Jeevanandam Madanagopal
Chris - Take a look - QueryElevationComponent http://wiki.apache.org/solr/QueryElevationComponent -Jeevanandam On Apr 18, 2012, at 10:46 PM, Chris Warner wrote: > Hi, folks, > > Perhaps I'm overlooking an obvious solution to a common desire... I'd like to > return a specific document with ev

Can you suggest a method or pattern to consistently promote a document with any query?

2012-04-18 Thread Chris Warner
Hi, folks, Perhaps I'm overlooking an obvious solution to a common desire... I'd like to return a specific document with every query, as the first result. As well, I'd like to have that document be the first result in a *:* query. I'm looking into index time boosting using the boost attribute o

Re: hierarchical faceting?

2012-04-18 Thread sam ”
It looks like TextField is the problem. This fixed: I am assuming the text_path fields won't include whitespace characters. ?q=colors:red/pink ==> Doc2 (Doc1, which has colors = red isn't included!) Is there a tokenizer that tokeniz

Re: SOLR 4 / Date Query: Spurious Results: Is it me or ... ?

2012-04-18 Thread vybe3142
Thanks for clarifying. I figured out the (terms=-1). It was my fault. I attempted a truncate of the index in my test case setup by issuing a delete query and think the subsequent commit might not have taken effect by the time the subsequent index queries started.

RE: Maximum Open Cursors using JdbcDataSource and cacheImpl

2012-04-18 Thread Dyer, James
Keith, Can you supply your data-config.xml ? James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Keith Naas [mailto:keithn...@dswinc.com] Sent: Wednesday, April 18, 2012 11:43 AM To: solr-user@lucene.apache.org Subject: Maximum Open Cursors using

Maximum Open Cursors using JdbcDataSource and cacheImpl

2012-04-18 Thread Keith Naas
After upgrading from 3.5.0 to 3.6.0 we have noticed that when we use a cacheImpl on a nested JdbcDataSource entity, the database runs out of cursors. It does not matter what transactionIsolation, autoCommit, or holdability setting we use. I have only been using solr for a few months but after

solr stats component

2012-04-18 Thread Peter Markey
Hello, I am using the stats component and I wanted help with range like function (in facet component). To be more clear, we would like to have a similar functionality of facet.range (i.e with gap and stuff) for the statistics component. That is, with one call we would like to do faceting in stats

[Job] Search Engineer Lead at Sematext International

2012-04-18 Thread Otis Gospodnetic
Hello, If you've always wanted a full-time job working with Solr, ElasticSearch, or Lucene, we have a position that is all about that, offers path to team leadership, and will expose a person to a healthy mixture of engineering and business.  If you are interested, please send your resume to j.

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-18 Thread Jason Rutherglen
The main point being made is established NoSQL solutions (eg, Cassandra, HBase, et al) have solved the update problem (among many other scalability issues, for several years). If an update is being performed and it is not known where the record exists, the update capability of the system is ineffi

Re: pushing updates to solr from postgresql

2012-04-18 Thread Otis Gospodnetic
Hi Richard, One thing to think about here is what you will do when Solr is unavailable to take a new document for whatever reason.  If you send docs to Solr from PG, docs either get indexed or not.  So you may have to catch errors and then mark documents in PG as not indexed.  You may want to k

Solr 3.6 parsing and extraction files

2012-04-18 Thread Tod
Could someone possibly provide me with a list of jars that I need to extract from the apache-solr-3.6.0.tgz file to enable the parsing and remote streaming of office style documents? I assume (for a multicore configuration) they would go into ./tomcat/webapps/solr/WEB-INF/lib - correct? Tha

minimum match and not matched words / term frequency in query result

2012-04-18 Thread giovanni.bricc...@banzai.it
Hi, I have a dismax query with a minimum match setting; this allows some terms to be missing in query results. I would like to give feedback to the user, highlighting the words that were not matched. It would also be interesting to show the words with a very low frequency. For instance searching fo

Re: hierarchical faceting?

2012-04-18 Thread sam ”
Yah, that's exactly what PathHierarchyTokenizer does. I think I have a query time tokenizer that tokenizes at / ?q=colors:red ==> Doc1, Doc2 ?q=colors:redfoobar ==> ?q=colors:red/foobarasdfoaijao ==> Doc1, Doc2 On Wed, Apr 18, 2012 at 11:10 AM, Darren Govoni

Re: How to add/remove/customize search tabs

2012-04-18 Thread Dave Stuart
This question is probably better asked on the Drupal groups page for Apache Solr http://groups.drupal.org/lucene-nutch-and-solr as this is more of a Drupal issue than a Solr issue On 18 Apr 2012, at 16:11, Valentin, AJ wrote: > I have Apache Solr installed with my Drupal 7 site and noticed s

How to add/remove/customize search tabs

2012-04-18 Thread Valentin, AJ
I have Apache Solr installed with my Drupal 7 site and noticed some default tabs available (Content, Site, Users). Is there a way to add/change that tabs section?

Re: hierarchical faceting?

2012-04-18 Thread Darren Govoni
Put the parent term in all the child documents at index time and then re-issue the facet query when you expand the parent using the parent's term. Works perfectly. On Wed, 2012-04-18 at 10:56 -0400, sam ” wrote: > I have hierarchical colors: > stored="true" multiValued="true"/> > text_path is TextFi

Problems with edismax parser and solr3.6

2012-04-18 Thread Bernd Fehling
I just looked through my logs of solr 3.6 and saw several "0 hits" which were not seen with solr 3.5. While tracing this down it turned out that edismax doesn't like queries of type "...&q=(text:ide)&..." any more. If there are parentheses around the query term, edismax fails with solr 3.6. Can anyone

hierarchical faceting?

2012-04-18 Thread sam ”
I have hierarchical colors: text_path is TextField with PathHierarchyTokenizerFactory as tokenizer. Given these two documents, Doc1: red Doc2: red/pink I want the result to be the following: ?fq=red ==> Doc1, Doc2 ?fq=red/pink ==> Doc2 But, with PathHierarchyTokenizer, Doc1 is included for the
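The behaviour being asked for can be sketched in plain Python, assuming each stored path is expanded into all of its prefixes at index time while the filter value is matched whole (not tokenized) at query time. This is an illustration of the matching logic only, not Solr analyzer code; `path_prefixes` and `matching_docs` are hypothetical names.

```python
def path_prefixes(path, delimiter="/"):
    """Expand 'red/pink' into ['red', 'red/pink'] (index-time view)."""
    parts = path.split(delimiter)
    return [delimiter.join(parts[:i]) for i in range(1, len(parts) + 1)]


def matching_docs(docs, filter_value):
    """Return ids of docs whose indexed prefixes contain the exact filter value."""
    return [
        doc_id
        for doc_id, path in docs.items()
        if filter_value in path_prefixes(path)
    ]


if __name__ == "__main__":
    docs = {"Doc1": "red", "Doc2": "red/pink"}
    print(matching_docs(docs, "red"))       # both documents
    print(matching_docs(docs, "red/pink"))  # only Doc2
```

Because the filter value is never split on the delimiter, `red/pink` cannot match a document that only holds `red`, which is the result the original post wants.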

pushing updates to solr from postgresql

2012-04-18 Thread Welty, Richard
i have a setup right this instant where the dataimporthandler is being used to pull data for an index from a postgresql server. i'd like to switch over to push, and am looking for some validation of my approach. i have perl installed as an untrusted language on my postgresql server and am plan

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-18 Thread Lukáš Vlček
AFAIK it cannot. You can only add new shards by creating a new index and you will then need to index new data into that new index. Index aliases are useful mainly for the searching part. So it means that you need to plan for this when you implement your indexing logic. On the other hand the query logi

Re: Multiple document structure

2012-04-18 Thread Gora Mohanty
On 18 April 2012 10:05, abhijit bashetti wrote: > > Hi , > Is it possible to have 2 document structures in solr? [...] Do not think so, but why do you need it? Use two separate indices, either in a multi-core setup, or in separate Solr instances. Regards, Gora

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-18 Thread Jason Rutherglen
I'm curious how on the fly updates are handled as a new shard is added to an alias. Eg, how does the system know to which shard to send an update? On Tue, Apr 17, 2012 at 4:00 PM, Lukáš Vlček wrote: > Hi, > > speaking about ES I think it would be fair to mention that one has to > specify number

Solr file size limit?

2012-04-18 Thread Bram Rongen
Dear fellow Solr users, I've been using Solr for a very short time now and I'm stuck. I'm trying to index a drupal website consisting of 1.2 million smaller nodes and 300k larger nodes (~400kb avg).. I'm using Solr 3.5 on a dedicated Ubuntu 10.04 box with 3TB of diskspace and 16GB of memory. I've

Re: SOLR 4 / Date Query: Spurious Results: Is it me or ... ?

2012-04-18 Thread Erick Erickson
Your schema didn't come through, but... 1> why terms=-1 I don't know. I have a build from this morning and it's fine. When's yours? 2> date .vs. tdate. Yes, that's kind of confusing, but the Trie types inject some extra stuff in the field that allows the faster range queries, I thin

Solr hanging

2012-04-18 Thread Trym R. Møller
Hi I am using Solr trunk and have 7 Solr instances running with 28 leaders and 28 replicas for a single collection. After indexing a while (a couple of days) the solrs start hanging and doing a thread dump on the jvm I see blocked threads like the following: Thread 2369: (state = BLOCKED)

Re: How sorlcloud distribute data among shards of the same cluster?

2012-04-18 Thread Erick Erickson
Try looking at DistributedUpdateProcessor, there's a "hash(cmd)" method in there. Best Erick On Tue, Apr 17, 2012 at 4:45 PM, emma1023 wrote: > Thanks for your reply. In Solr 3.x, we need to manually hash the doc Id to > the server. How does SolrCloud do this instead? I am working on a project >
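For the manual 3.x approach mentioned in the quoted question, a generic sketch of hashing a document id to one of N servers might look like the following. It is an assumption-laden illustration, not the algorithm used by `DistributedUpdateProcessor`; the function name `shard_for` and the choice of MD5 are arbitrary.

```python
import hashlib


def shard_for(doc_id, num_shards):
    """Map a document id to a shard index in the range [0, num_shards)."""
    # A stable hash is used so every client picks the same shard for an id;
    # Python's built-in hash() is randomized per process for strings.
    digest = hashlib.md5(doc_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_shards


if __name__ == "__main__":
    for doc_id in ("doc-1", "doc-2", "doc-3"):
        print(doc_id, "->", shard_for(doc_id, 4))
```

A plain modulo scheme like this remaps most ids when the number of shards changes, so the shard count has to be planned up front.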

Re: Populating a filter cache by means other than a query

2012-04-18 Thread Erick Erickson
I guess my question is "what advantage are you trying to get here?" At the start, this feels like an "XY" problem. How are you intending to use the fq after you've built it? Because if there's any way to just create an "fq" clause, Solr will take care of it for you. Caching it, autowarming it when

property substitution not working with multicore

2012-04-18 Thread jmlucjav
Hi, I cannot seem to get right the configuration of using a properties file for cores (with 3.6.0). In Solr3 Entr. Search Server book they say this: "This property substitution works in solr.xml , solrconfig.xml, schema.xml, and DIH configuration files." So my solr.xml is like this:

DIH + JNDI

2012-04-18 Thread Stephen Lacy
Hi All, I'm new to solr and I don't have much experience in java. I'm trying to setup two environments with configuration files that mirror each other so that it's easy to copy files across after changes have been made. The problem is that they both access different sql servers. So I want to separ

Re: need help to integrate SolrJ with my web application.

2012-04-18 Thread Marcelo Carvalho Fernandes
Hi Vijaya, Why not just make standard HTTP calls to Solr as if it were a RESTful service? Just use an HTTP/REST client on Spring, ask Solr to return JSON responses and get rid of all those war dependencies of SolrJ --- Marcelo On Monday, April 16, 2012, Ben McCarthy wrote: > Hello, > > When I have

Solr Core not able access latest data indexed by multiple server.

2012-04-18 Thread Paresh Modi
Hi, I am using the Solr multicore approach in my app. We have two different servers (ServerA1 and ServerA2) for load balancing; both servers access the same index repository, and a request will go to either server as per the load-balancing algorithm. The problem occurs in the following way [Note that both the ser

Re: searching and text highlighting

2012-04-18 Thread darul
rpc29y wrote > > Good afternoon: > I would like to know if it can be indexed with SolR word documents or > pdf. > Yes, you may first look at Tika Solr processor. rpc29y wrote > > If so how do I modify the solrconfig.xml to search these documents and > highlight the found text? > I guess you

Re: HTML Indexing error

2012-04-18 Thread Gora Mohanty
On 18 April 2012 00:41, Chambeda wrote: > Hi All, > > I am trying to parse some text that contains embedded HTML elements and am > getting the following error: [...] > According to the documentation the should be removed correctly. > > Anything I am missing? How are you indexing the XML document