Re: Grouping based on multiple filters/criterias

2014-08-24 Thread deniz
umeshprasad wrote > Solr does support date mathematics in filters / queries . So your > timestamps intervals can be dynamic .. how would it be done for this case then? retrieving bunch of documents sorted by timestamp, then depending on some interval like 1 hour, those should be grouped all togeth

Re: embedded documents

2014-08-24 Thread Bill Bell
See my Jira. It supports it via json.fsuffix=_json&wt=json http://mail-archives.apache.org/mod_mbox/lucene-dev/201304.mbox/%3CJIRA.12641293.1365394604231.125944.1365397875874@arcas%3E Bill Bell Sent from mobile > On Aug 24, 2014, at 6:43 AM, "Jack Krupansky" wrote: > > Indexing and query of r

Re: Help with StopFilterFactory

2014-08-24 Thread Jack Krupansky
Just to confirm, the generated phrase query is generated using the analyzed terms, so if the stop filter is removing the terms, they won't appear in the generated query. It will be interesting to see what does get generated. -- Jack Krupansky -Original Message- From: heaven Sent: Sun

Re: ADDREPLICA doesn't respect requested solr_port assignment, replicas can report green w/o replicating

2014-08-24 Thread ralph tice
I created: ADDREPLICA can add a replica that reports green without actually replicating - https://issues.apache.org/jira/browse/SOLR-6420 ADDREPLICA doesn't respect :port_solr designation - https://issues.apache.org/jira/browse/SOLR-6421 DELETEREPLICA exposes an inconsistent param REPLICA_PROP - ht

Re: Help with StopFilterFactory

2014-08-24 Thread heaven
The problem is in #4: >> 4. if I index twitter.com/testuser and search for >> https://twitter.com/testuser I am getting 0 matches even though "https" >> should be filtered out by the StopFilterFactory. When I said that the stop filter factory "doesn't work" I mentioned that blacklisted words still

Re: ADDREPLICA doesn't respect requested solr_port assignment, replicas can report green w/o replicating

2014-08-24 Thread Mark Miller
Sounds like you should file 3 JIRA issues. They all look like legit stuff we should dig into on a glance. -- Mark Miller about.me/markrmiller On August 24, 2014 at 12:35:13 PM, ralph tice (ralph.t...@gmail.com) wrote: > Hi all, > > Two issues, first, when I issue an ADDREPLICA call like so: >

ADDREPLICA doesn't respect requested solr_port assignment, replicas can report green w/o replicating

2014-08-24 Thread ralph tice
Hi all, Two issues, first, when I issue an ADDREPLICA call like so: http://localhost:8983/solr/admin/collections?action=ADDREPLICA&shard=myshard&collection=mycollection&createNodeSet=solr18.mycorp.com:8983_solr It does not seem to respect the 8983_solr designation in the createNodeSet parameter

Re: Help with StopFilterFactory

2014-08-24 Thread Jack Krupansky
If autoGeneratePhraseQueries="true" (which I endorse) is working, then what's the problem? I mean, the only problem you mention is with autoGeneratePhraseQueries="false", which is clearly NOT what you want. Once again, I have to reiterate that the situation here remains very confused, mostly

Re: Help with StopFilterFactory

2014-08-24 Thread heaven
Just a guess but it seems that auto phase generation and stop filter factory don't know of each other. Here's the current field configuration: {code} {code} -- View this message in context: http://lucene.472066.n3.nabble.com/Help-with-StopFilter

Re: Help with StopFilterFactory

2014-08-24 Thread heaven
I don't see any confusions, the problem is clearly explained in the first post. The one confusion I had was with the autoGeneratePhraseQueries and my schema version, I didn't know about that attribute and that its behavior could differ per schema version. I think we now figured that out and I am us

Re: Help with StopFilterFactory

2014-08-24 Thread Jack Krupansky
I think somehow the discussion has gotten confused, so we really need to start over. 1. Make sure you're using the most current schema version. 2. Make sure autoGeneratePhraseQueries is set explicitly the way you want it, based on #1 above. 3. Yes, stop filter should remove sop words. No questi

Re: Help with StopFilterFactory

2014-08-24 Thread heaven
Unfortunately I can't change the operator and phrase query for "https://twitter.com/testuser"; doesn't work as well. It does work for "twitter.com/testuser" but that makes no sense since I then can simply use old schema version or autoGenereratePhaseQueries=true and ask users to remove http/www fr

Re: Indexing and Querying MS SQL Server 2012 Spatial

2014-08-24 Thread david.w.smi...@gmail.com
On Sun, Aug 24, 2014 at 2:29 AM, Bostic, Alex wrote: > Ok updating my field type to the below, allowed me to index :-) HUGE THANKS > > class="solr.SpatialRecursivePrefixTreeFieldType" > spatialContextFactory="com.spatial4j.core.context.jts.JtsSpatialContextFactory" > geo="false" distErrP

Re: embedded documents

2014-08-24 Thread Jack Krupansky
Indexing and query of raw JSON would be a valuable addition to Solr, so maybe you could simply explain more precisely your data model and transformation rules. For example, when multi-level nesting occurs, what does your loader do? Maybe if the fielld names were derived by concatenating the fu

Re: Exact search with special characters

2014-08-24 Thread Jack Krupansky
What precisely do you mean by the term "exact search". I mean, Solr (and Lucene) do not have that concept for tokenized text fields. Or did you simply mean "quoted phrase". In which case, you need to be aware that all the quotes do is assure that the terms occur in that order or in close proxi

Exact search with special characters

2014-08-24 Thread Shay Sofer
Hi all, I have a docs that's indexed by text field with mention schema. I have those docs names: - Test host - Test_host - Test-host - Test $host When I'm trying to do exact search like: "test host" All the results from above are shown as a results. How c

Re: Invalid chunk header Error in solr

2014-08-24 Thread lalitjangra
Thanks Steve, I am using Apache ManifoldCF to index alfresco repository and saving index in solr. Here ManifoldCf is sending index to solr in ManifoldCF job. This error is not coming from every content but it comes often, sometimes for a jpg file, sometimes for a pdf or word doc so its quite unpre