Re: Index corruption with replication

2017-03-16 Thread santosh sidnal
Hi Erick/David,

The schema is the same on both the live and stage servers. We are using the
same schema files on the stage and live servers.


   - Schema files are included in replication, but they are not being
   changed when we observe the index corruption issue (see the config
   sketch below).
   - My guess is that the core is getting corrupted because of replication.
   - Solr version used is 4.7.0.
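For reference, a replication handler that ships config files along with the
index is configured along these lines (a generic sketch, not our exact
config -- the file list, host and poll interval are placeholders):

<!-- master solrconfig.xml -->
<requestHandler name="/replication" class="solr.ReplicationHandler">
  <lst name="master">
    <str name="replicateAfter">commit</str>
    <!-- config files copied to slaves together with the index -->
    <str name="confFiles">schema.xml,stopwords.txt</str>
  </lst>
</requestHandler>

<!-- slave solrconfig.xml -->
<requestHandler name="/replication" class="solr.ReplicationHandler">
  <lst name="slave">
    <str name="masterUrl">http://master-host:8983/solr/corename</str>
    <str name="pollInterval">00:05:00</str>
  </lst>
</requestHandler>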



The exception I see in the log is:

org.apache.solr.common.SolrException log
org.apache.lucene.index.CorruptIndexException: Corrupted: docID=8195,
docBase=7, chunkDocs=249, numDocs=10596
(resource=MMapIndexInput(path="/app/IBM/WebSphere/CommerceServer70/instances/RBUATLV/search/solr/home/MC_10001/fr_FR/CatalogEntry/data/index/_5a.fdt"))
at
org.apache.lucene.codecs.compressing.CompressingStoredFieldsReader.visitDocument(CompressingStoredFieldsReader.java:236)
at org.apache.lucene.index.SegmentReader.document(SegmentReader.java:276)
at
org.apache.lucene.index.BaseCompositeReader.document(BaseCompositeReader.java:110)
at org.apache.solr.search.SolrIndexSearcher.doc(SolrIndexSearcher.java:661)
at
org.apache.solr.util.SolrPluginUtils.optimizePreFetchDocs(SolrPluginUtils.java:213)
at
org.apache.solr.handler.component.QueryComponent.doPrefetch(QueryComponent.java:568)
at
org.apache.solr.handler.component.QueryComponent.process(QueryComponent.java:475)
at

On 15 March 2017 at 22:42, Erick Erickson  wrote:

> You can specify your replication to include config files, but if the
> schema has changed you'll have to restart your Solr afterwards.
>
> How is it corrupt? what is the symptom? Any error messages in the solr
> log on the slave? What version of Solr? Details matter.
>
> Best,
> Erick
>
> On Wed, Mar 15, 2017 at 9:12 AM, David Hastings
>  wrote:
> > are you certain the schema is the same on both master and slave?  I find
> > that the schema file doesnt always go with the replication and if a field
> > is different on the slave it will cause problems
> >
> > On Wed, Mar 15, 2017 at 12:08 PM, Santosh Sidnal <
> sidnal.sant...@gmail.com>
> > wrote:
> >
> >> Hi all,
> >>
> >> I am facing index corruption at regular intervals on the live server,
> >> where I pull index data from one master server.
> >>
> >> Can anyone please give us some pointers on why we are facing this issue
> >> at regular intervals?
> >> I am aware of how to correct a corrupted index, but I am looking for
> >> some pointers on how I can stop or reduce this occurrence.
> >>
> >> Thanks in advance.
> >>
> >>
> >> Sent from my iPhone
>



-- 
Regards,
Santosh Sidnal


sum multivalued field index with banana

2017-03-16 Thread tkg_cangkul

Hi, sorry if this is a little bit off topic.

I've just started using the Banana dashboard, and I want to do a summarize
process on data that is indexed in Solr.


Can I do a sum process with the Banana dashboard when I have some
multivalued data indexed in my field?


This is my sample data in Solr:

"timestamp_dt":"2016-12-30T15:50:00Z",
"FR":["fr1"],
"EV":"89v",
"RC":[0],
"SF":["SSP"],
"CT":["POST"],
"rb.id":["rb30", "rb30"],
"rb.co":[1,  2],
"rb.lat":[47, 9]

OK, from the data above, is it possible to sum the values of
"rb.co" with EV as a group-by?

On my Banana dashboard panel, I've tried to set something like this:



but nothing happens.

Any suggestions, please?
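For reference, the roll-up I am after would look roughly like this as a plain
Solr JSON Facet request (just a sketch: the collection name is a placeholder,
and whether sum() accepts a multivalued field such as rb.co depends on the
Solr version and field definition, so that part needs verifying):

http://localhost:8983/solr/mycollection/query?q=*:*&rows=0&json.facet={
  by_ev : {
    type  : terms,
    field : EV,
    facet : { total_rb_co : "sum(rb.co)" }
  }
}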



Best Regards,

Yuza


fq performance

2017-03-16 Thread Ganesh M
Hi,

We have 1 million documents and would like to query with multiple fq values.

We have an access_control field (multivalued) which holds information about
which groups a document is accessible to.

Now, to get the list of all documents for a user, we would like to pass
multiple fq values (one for each group the user belongs to):

q:somefiled:value&
fq:access_control:g1&fq:access_control:g2&fq:access_control:g3&fq:access_control:g4&fq:access_control:g5...

Like this, there could be 100 groups for a user.

If we fire a query with 100 values in the fq, what's the penalty on
performance? Can we get the result in less than one second for 1 million
documents?

Let us know your valuable inputs on this.

Regards,


Re: fq performance

2017-03-16 Thread Michael Kuhlmann
First of all, from what I can see, this won't do what you're expecting. 
Multiple fq conditions are always combined using AND, so if a user is a
member of 100 groups, but the document is accessible to only 99 of them,
then the user won't find it.


Or in other words, if you add a user to some group, then she would get
*fewer* results than before.


But coming back to your performance question: Just try it. Having 100 fq 
conditions will of course slow down your query a bit, but not that much. 
I rather see the problem with the filter cache: It will only be fast 
enough if all of your fq filters fit into the cache. Each possible fq 
filter will take 1 million/8 == 125k bytes, so having hundreds of
possible access group conditions might blow up your filter cache (which
must fit into RAM).


-Michael


On 16.03.2017 at 13:02, Ganesh M wrote:

Hi,

We have 1 million of documents and would like to query with multiple fq values.

We have kept the access_control ( multi value field ) which holds information 
about for which group that document is accessible.

Now to get the list of all the documents of an user, we would like to pass 
multiple fq values ( one for each group user belongs to )

q:somefiled:value&
fq:access_control:g1&fq:access_control:g2&fq:access_control:g3&fq:access_control:g4&fq:access_control:g5...

Like this, there could be 100 groups for an user.

If we fire query with 100 values in the fq, whats the penalty on the 
performance ? Can we get the result in less than one second for 1 million of 
documents.

Let us know your valuable inputs on this.

Regards,





Re: fq performance

2017-03-16 Thread Shawn Heisey
On 3/16/2017 6:02 AM, Ganesh M wrote:
> We have 1 million of documents and would like to query with multiple fq 
> values.
>
> We have kept the access_control ( multi value field ) which holds information 
> about for which group that document is accessible.
>
> Now to get the list of all the documents of an user, we would like to pass 
> multiple fq values ( one for each group user belongs to )
>
> q:somefiled:value&fq:access_control:g1&fq:access_control:g2&fq:access_control:g3&fq:access_control:g4&fq:access_control:g5...
>
> Like this, there could be 100 groups for an user.

The correct syntax is fq=field:value -- what you have there is not going
to work.

This might not do what you expect.  Filter queries are ANDed together --
*every* filter must match, which means that if a document that you want
has only one of those values in access_control, or has 98 of them but
not all 100, then the query isn't going to match that document.  The
solution is one filter query that can match ANY of them, which also
might run faster.  I can't say whether this is a problem for you or
not.  Your data might be completely correct for matching 100 filters.
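If matching ANY of the groups is what you want, a sketch of that single
filter (untested, with your field and group names) would be either of:

fq=access_control:(g1 OR g2 OR g3 OR ... OR g100)

fq={!terms f=access_control}g1,g2,g3,...,g100

The second form uses the terms query parser, which takes a comma-separated
list of values and is meant for exactly this kind of long list.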

Also keep in mind that there is a limit to the size of a URL that you
can send into any webserver, including the container that runs Solr. 
That default limit is 8192 bytes, and includes the "GET " or "POST " at
the beginning and the " HTTP/1.1" at the end (note the spaces).  The
filter query information for 100 of the filters you mentioned is going
to be over 2K, which will fit in the default, but if your query has more
complexity than you have mentioned here, the total URL might not fit. 
There's a workaround to this -- use a POST request and put the
parameters in the request body.
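As a sketch of that workaround (the collection name is a placeholder), the
same request sent as a POST with the parameters in the body:

curl "http://localhost:8983/solr/mycollection/select" \
  --data-urlencode "q=somefield:value" \
  --data-urlencode "fq=access_control:(g1 OR g2 OR ... OR g100)" \
  --data-urlencode "wt=json"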

> If we fire query with 100 values in the fq, whats the penalty on the 
> performance ? Can we get the result in less than one second for 1 million of 
> documents.

With one million documents, each internal filter query result is 125
kilobytes -- the number of documents divided by eight.  That's 12.5
megabytes for 100 of them.  In addition, every time a filter is run, it
must examine every document in the index to create that 125 kilobyte
structure, which means that filters which *aren't* found in the
filterCache are relatively slow.  If they are found in the cache,
they're lightning fast, because the cache will contain the entire 125
kilobyte bitset.

If you make your filterCache large enough, it's going to consume a LOT
of java heap memory, particularly if the index gets bigger.  The nice
thing about the filterCache is that once the cache entries exist, the
filters are REALLY fast, and if they're all cached, you would DEFINITELY
be able to get results in under one second.  I have no idea whether the
same would happen when filters aren't cached.  It might.  Filters that
do not exist in the cache will be executed in parallel, so the number of
CPUs that you have in the machine, along with the query rate, will have
a big impact on the overall performance of a single query with a lot of
filters.

Also related to the filterCache, keep in mind that every time a commit
is made that opens a new searcher, the filterCache will be autowarmed. 
If the autowarmCount value for the filterCache is large, that can make
commits take a very long time, which will cause problems if commits are
happening frequently.  On the other hand, a very small autowarmCount can
cause slow performance after a commit if you use a lot of filters.
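For reference, those knobs live in the <query> section of solrconfig.xml and
look like this (the numbers here are placeholders, not a recommendation):

<!-- size: how many filter bitsets are kept; autowarmCount: how many cached
     filters are re-executed against the new searcher after a commit -->
<filterCache class="solr.FastLRUCache"
             size="1024"
             initialSize="512"
             autowarmCount="64"/>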

My reply is longer and more dense than I had anticipated.  Apologies if
it's information overload.

Thanks,
Shawn



DIH delta import with cache 5.3.1 issue

2017-03-16 Thread Sujay Bawaskar
Hi,

We are using DIH with a cache (SortedMapBackedCache) with Solr 5.3.1. We have
around 2.8 million documents in Solr and the total index size is 4 GB. DIH
delta import is dumping all values of the mapped columns into their respective
multivalued fields. This is causing the size of one Solr document to reach up
to 2 GB. Is this a known issue with Solr 5.3.1?

Thanks,
Sujay


Re: DIH delta import with cache 5.3.1 issue

2017-03-16 Thread Alexandre Rafalovitch
Could you give a bit more details. Do you mean one document gets the
content of multiple documents? And only on delta?

Regards,
Alex

On 16 Mar 2017 8:53 AM, "Sujay Bawaskar" 
wrote:

Hi,

We are using DIH with cache(SortedMapBackedCache) with solr 5.3.1. We have
around 2.8 million documents in solr and total index size is 4 GB. DIH
delta import is dumping all values of mapped columns to their respective
multi valued fields. This is causing size of one solr document upto 2 GB.
Is this a known issue with solr 5.3.1?

Thanks,
Sujay


Partial Match with DF

2017-03-16 Thread Mark Johnson
Forgive me if I'm missing something obvious -- I'm new to Solr, but I can't
seem to find an explanation for the behavior I'm seeing.

If I have a document that looks like this:
{
field1: "aaa bbb",
field2: "ccc ddd",
field3: "eee fff"
}

And I do a search where "q" is "aaa ccc", I get the document in the
results. This is because (please correct me if I'm wrong) the default "df"
is set to the "_text_" field, which contains the text values from all
fields.

However, if I do a search where "df" is "field1" and "field2" and "q" is
"aaa ccc" (words from field1 and field2) I get no results.

In a simpler example, if I do a search where "df" is "field1" and "q" is
"aaa" (a word from field1) I still get no results.

If I do a search where "df" is "field1" and "q" is "aaa bbb" (the full
value of field1) then I get the document in the results.

So I'm concluding that when using "df" to specify which fields to search
then only an exact match on the full field value will return a document.

Is that a correct conclusion? Is there another way to specify which fields
to search without requiring an exact match? The results I'd like to achieve
are:

Would Match:
q=aaa
q=aaa bbb
q=aaa ccc
q=aaa fff

Would Not Match:
q=eee
q=fff
q=eee fff

-- 
*This message is intended only for the use of the individual or entity to 
which it is addressed and may contain information that is privileged, 
confidential and exempt from disclosure under applicable law. If you have 
received this message in error, you are hereby notified that any use, 
dissemination, distribution or copying of this message is prohibited. If 
you have received this communication in error, please notify the sender 
immediately and destroy the transmitted information.*


DbVisualizer challenges with a secured solr

2017-03-16 Thread Marvin NotMyRealNameDuh
Hi,

I'm working with a product which includes solr under the covers, and
this has been secured using a custom authentication scheme. The admin UI on
port 8983 works correct once authenticated. I've also hacked the zkcli.sh
script thusly:

SOLR_ZK_CREDS_AND_ACLS="-DzkACLProvider=com.i2group.disco.search.solr.common.zookeeper.auth.internal.EncodedZkCredentialsACLProvider
\

-DzkCredentialsProvider=com.i2group.disco.search.solr.common.zookeeper.auth.internal.EncodedZkCredentialsProvider
\
  -Dsolr.solr.home=/data/cluster-nodes/clusters/is_cluster/nodes/node1"

CLASSPATH=
for i in $(ls
/i2a/deploy/wlp/usr/servers/awc/apps/awc.war/WEB-INF/lib/*.jar); do
CLASSPATH=$CLASSPATH:$i
done
for i in $(ls /i2a/deploy/wlp/usr/shared/resources/i2-common/lib/*.jar); do
CLASSPATH=$CLASSPATH:$i
done

PATH=$JAVA_HOME/bin:$PATH /opt/IBM/i2analyze/deploy/java/bin/java
$SOLR_ZK_CREDS_AND_ACLS  -Dlog4j.configuration=$log4j_config \
-classpath $CLASSPATH org.apache.solr.cloud.ZkCLI ${1+"$@"}

...and it works.

The credentials to authenticate to solr are stored in a file in
solr.solr.home - which is why that system property is needed.

 I've also hacked the launch script for dbvis to add the properties:

#!/bin/sh

# Uncomment the following line to override the JVM search sequence
# INSTALL4J_JAVA_HOME_OVERRIDE=
# Uncomment the following line to add additional VM parameters
# INSTALL4J_ADD_VM_PARAMS=
INSTALL4J_ADD_VM_PARAMS="-DzkACLProvider=com.i2group.disco.search.solr.common.zookeeper.auth.internal.EncodedZkCredentialsACLProvider
\

-DzkCredentialsProvider=com.i2group.disco.search.solr.common.zookeeper.auth.internal.EncodedZkCredentialsProvider
\
  -Dsolr.solr.home=/data/cluster-nodes/clusters/is_cluster/nodes/node1"

(no, adding these as database properties doesn't get me authenticated
to zookeeper)

...and now when I try to connect, DbVisualizer seems to connect to
zookeeper, but then I get:

2017-03-16 06:06:07.375 INFO   897 [ExecutorRunner-pool-3-thread-1 - H.??]
Exception while connecting kstephe-eia-reco
org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error
from server at http://kstephe-eia-reco.softlayer.com:8983/solr: Expected
mime type application/octet-stream but got text/html. 


Error 401 Unauthorized request, Response code: 401

HTTP ERROR 401
Problem accessing /solr/admin/info/system. Reason:
Unauthorized request, Response code: 401



at
org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:560)
at
org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:261)
at
org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:250)
at
org.apache.solr.client.solrj.SolrRequest.process(SolrRequest.java:149)
at org.apache.solr.client.solrj.SolrClient.query(SolrClient.java:942)
at org.apache.solr.client.solrj.SolrClient.query(SolrClient.java:957)
at
org.apache.solr.client.solrj.io.sql.DatabaseMetaDataImpl.getDatabaseProductVersion(DatabaseMetaDataImpl.java:124)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:95)

...and the URL is weird, because it's talking to port 8983 - which is
solr, not zookeeper. Once authenticated to solr, I can validate that the
/solr/admin/info/system URL responds correctly, but the problem seems to be
that when DbVisualizer asks zookeeper for the db metadata, it doesn't seem
to know how to authenticate to solr.

So... (1) is there something I can do to fix things, or (2) is there a
problem in the solr / zookeeper code or (3) are the problems somewhere in
zkCredentialsProvider or zkACLProvider (where I can't fix a thing)?

Thanks
Marvin the paranoid


Re: Get handler not working

2017-03-16 Thread Chris Ulicny
iqdocid is already set to be the uniqueKey value.

I tried reindexing a few documents back into the problematic cloud and am
getting the same behavior of no document found for get handler.

I've also done some testing on standalone instances as well as some quick
cloud setups (with embedded zk), and I cannot seem to replicate the
problem. For each test, I used the exact same configset that is causing the
issue for us and indexed a document from that instance as well. I can
provide more details if that would be useful in anyway.

Standalone instance worked
Cloud mode worked regardless of the use of the security plugin
Cloud mode worked regardless of explicit get handler definition
Cloud mode consistently worked with explicitly defining the get handler,
then removing it and reloading the collection

The only differences that I know of between the tests and the problematic
cloud is that solr is running as a different user and using an external
zookeeper ensemble. The running user has ownership of the solr
installation, log, and data directories.

I'm going to keep trying different setups to see if I can replicate the
issue, but if anyone has any ideas on what direction might make the most
sense, please let me know.
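For clarity, the get requests I am issuing look like this (the id value is
one of the documents mentioned earlier in the thread; the second value in the
ids form is just a placeholder):

solr/TestCollection/get?id=2957-TV-201604141900
solr/TestCollection/get?ids=2957-TV-201604141900,<another-iqdocid>

As I understand it, the id/ids parameters map to whatever field is defined as
the uniqueKey, which is iqdocid for us.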

Thanks again

On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson 
wrote:

Wait... Is iqdocid set to the uniqueKey in your schema? That might
be the missing thing.



On Wed, Mar 15, 2017 at 11:20 AM, Chris Ulicny  wrote:
> Unless the behavior's changed on the way to version 6.3.0, the get handler
> used to use whatever field is set to be the uniqueKey. We have
successfully
> been using get on a 4.9.0 standalone core with no explicit "id" field
> defined by passing in the value for the uniqueKey field to the get
handler.
> We tend to have a bunch of id fields floating around from different
> sources, so we avoid keeping any of them named as "id"
>
> iqdocid is just a basic string type
> <field name="iqdocid" type="string" ... required="true" stored="true"/>
>
> I'll do some more testing on standalone versions, and see how that goes.
>
> On Wed, Mar 15, 2017 at 1:52 PM David Hastings <
hastings.recurs...@gmail.com>
> wrote:
>
>> from your previous email:
>> "There is no "id"
>> field defined in the schema."
>>
>> you need an id field to use the get handler
>>
>> On Wed, Mar 15, 2017 at 1:45 PM, Chris Ulicny  wrote:
>>
>> > I thought that "id" and "ids" were fixed parameters for the get
handler,
>> > but I never remember, so I've already tried both. Each time it comes
back
>> > with the same response of no document.
>> >
>> > On Wed, Mar 15, 2017 at 1:31 PM Alexandre Rafalovitch <
>> arafa...@gmail.com>
>> > wrote:
>> >
>> > > Actually.
>> > >
>> > > I think Real Time Get handler has "id" as a magical parameter, not as
>> > > a field name. It maps to the real id field via the uniqueKey
>> > > definition:
>> > > https://cwiki.apache.org/confluence/display/solr/RealTime+Get
>> > >
>> > > So, if you have not, could you try the way you originally wrote it.
>> > >
>> > > Regards,
>> > >Alex.
>> > > 
>> > > http://www.solr-start.com/ - Resources for Solr users, new and
>> > experienced
>> > >
>> > >
>> > > On 15 March 2017 at 13:22, Chris Ulicny  wrote:
>> > > > Sorry, that is a typo. The get is using the iqdocid field. There is
>> no
>> > > "id"
>> > > > field defined in the schema.
>> > > >
>> > > > solr/TestCollection/get?iqdocid=2957-TV-201604141900
>> > > >
>> > > > solr/TestCollection/select?q=*:*&fq=iqdocid:2957-TV-201604141900
>> > > >
>> > > > On Wed, Mar 15, 2017 at 1:15 PM Erick Erickson <
>> > erickerick...@gmail.com>
>> > > > wrote:
>> > > >
>> > > >> Is this a typo or are you trying to use get with an "id" field and
>> > > >> your filter query uses "iqdocid"?
>> > > >>
>> > > >> Best,
>> > > >> Erick
>> > > >>
>> > > >> On Wed, Mar 15, 2017 at 8:31 AM, Chris Ulicny 
>> > wrote:
>> > > >> > Yes, we're using a fixed schema with the iqdocid field set as
the
>> > > >> uniqueKey.
>> > > >> >
>> > > >> > On Wed, Mar 15, 2017 at 11:28 AM Alexandre Rafalovitch <
>> > > >> arafa...@gmail.com>
>> > > >> > wrote:
>> > > >> >
>> > > >> >> What is your uniqueKey? Is it iqdocid?
>> > > >> >>
>> > > >> >> Regards,
>> > > >> >>Alex.
>> > > >> >> 
>> > > >> >> http://www.solr-start.com/ - Resources for Solr users, new and
>> > > >> experienced
>> > > >> >>
>> > > >> >>
>> > > >> >> On 15 March 2017 at 11:24, Chris Ulicny 
>> wrote:
>> > > >> >> > Hi,
>> > > >> >> >
>> > > >> >> > I've been trying to use the get handler for a new solr cloud
>> > > >> collection
>> > > >> >> we
>> > > >> >> > are using, and something seems to be amiss.
>> > > >> >> >
>> > > >> >> > We are running 6.3.0, so we did not explicitly define the
>> request
>> > > >> handler
>> > > >> >> > in the solrconfig since it's supposed to be implicitly
defined.
>> > We
>> > > >> also
>> > > >> >> > have the update log enabled with the default configuration.
>> > > >> >> >
>> > > >> >> > Whenever I send a get query for a document already known to
be
>> in
>> > > the
>> > > >> 

Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Aman Deep Singh
Hi,

Recently I migrated from Solr 4 to 6.
In Solr 4 the ShingleFilterFactory is working correctly.
My configuration is:



 
 
  


  
 
  
  

  



But after updating to Solr 6, shingles are not working. The schema is as below:
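(Roughly, the field type is of this shape -- the type name and the tokenizer
are assumptions on my part, since the XML did not paste cleanly:)

<fieldType name="cust_shingle" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.ShingleFilterFactory" maxShingleSize="5"
            outputUnigrams="false" outputUnigramsIfNoShingles="false"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.ShingleFilterFactory" maxShingleSize="5"
            outputUnigrams="false" outputUnigramsIfNoShingles="false"/>
  </analyzer>
</fieldType>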



 
 
  


  
 
  

  

Although the Analysis tab was showing the proper shingle result, when
used via the query parser it was not giving proper results.

My sample hit is:

http://localhost:8983/solr/shingel_test/select?q=one%20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle

It creates the parsed query as:

rawquerystring: one plus one
querystring: one plus one
parsedquery: (+())/no_coord
parsedquery_toString: +()

QParser: ExtendedDismaxQParser


Re: DIH delta import with cache 5.3.1 issue

2017-03-16 Thread Sujay Bawaskar
This behaviour is for delta import only. One document gets the field values of
all documents. These fields come from child entities which map columns to
multivalued fields.



 






On Thu, Mar 16, 2017 at 6:35 PM, Alexandre Rafalovitch 
wrote:

> Could you give a bit more details. Do you mean one document gets the
> content of multiple documents? And only on delta?
>
> Regards,
> Alex
>
> On 16 Mar 2017 8:53 AM, "Sujay Bawaskar" 
> wrote:
>
> Hi,
>
> We are using DIH with cache(SortedMapBackedCache) with solr 5.3.1. We have
> around 2.8 million documents in solr and total index size is 4 GB. DIH
> delta import is dumping all values of mapped columns to their respective
> multi valued fields. This is causing size of one solr document upto 2 GB.
> Is this a known issue with solr 5.3.1?
>
> Thanks,
> Sujay
>


Re: Get handler not working

2017-03-16 Thread Alexandre Rafalovitch
If you have the test bed, could you just enable full trace log mode and run
the two most similar tests? Then look for the differences in the logs.

It sounds like a bug, but of what kind...?

Regards,
   Alex

On 16 Mar 2017 9:16 AM, "Chris Ulicny"  wrote:

> iqdocid is already set to be the uniqueKey value.
>
> I tried reindexing a few documents back into the problematic cloud and am
> getting the same behavior of no document found for get handler.
>
> I've also done some testing on standalone instances as well as some quick
> cloud setups (with embedded zk), and I cannot seem to replicate the
> problem. For each test, I used the exact same configset that is causing the
> issue for us and indexed a document from that instance as well. I can
> provide more details if that would be useful in anyway.
>
> Standalone instance worked
> Cloud mode worked regardless of the use of the security plugin
> Cloud mode worked regardless of explicit get handler definition
> Cloud mode consistently worked with explicitly defining the get handler,
> then removing it and reloading the collection
>
> The only differences that I know of between the tests and the problematic
> cloud is that solr is running as a different user and using an external
> zookeeper ensemble. The running user has ownership of the solr
> installation, log, and data directories.
>
> I'm going to keep trying different setups to see if I can replicate the
> issue, but if anyone has any ideas on what direction might make the most
> sense, please let me know.
>
> Thanks again
>
> On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson 
> wrote:
>
> Wait... Is iqdocid set to the uniqueKey in your schema? That might
> be the missing thing.
>
>
>
> On Wed, Mar 15, 2017 at 11:20 AM, Chris Ulicny  wrote:
> > Unless the behavior's changed on the way to version 6.3.0, the get
> handler
> > used to use whatever field is set to be the uniqueKey. We have
> successfully
> > been using get on a 4.9.0 standalone core with no explicit "id" field
> > defined by passing in the value for the uniqueKey field to the get
> handler.
> > We tend to have a bunch of id fields floating around from different
> > sources, so we avoid keeping any of them named as "id"
> >
> > iqdocid is just a basic string type
> >  > required="true" stored="true"/>
> >
> > I'll do some more testing on standalone versions, and see how that goes.
> >
> > On Wed, Mar 15, 2017 at 1:52 PM David Hastings <
> hastings.recurs...@gmail.com>
> > wrote:
> >
> >> from your previous email:
> >> "There is no "id"
> >> field defined in the schema."
> >>
> >> you need an id field to use the get handler
> >>
> >> On Wed, Mar 15, 2017 at 1:45 PM, Chris Ulicny  wrote:
> >>
> >> > I thought that "id" and "ids" were fixed parameters for the get
> handler,
> >> > but I never remember, so I've already tried both. Each time it comes
> back
> >> > with the same response of no document.
> >> >
> >> > On Wed, Mar 15, 2017 at 1:31 PM Alexandre Rafalovitch <
> >> arafa...@gmail.com>
> >> > wrote:
> >> >
> >> > > Actually.
> >> > >
> >> > > I think Real Time Get handler has "id" as a magical parameter, not
> as
> >> > > a field name. It maps to the real id field via the uniqueKey
> >> > > definition:
> >> > > https://cwiki.apache.org/confluence/display/solr/RealTime+Get
> >> > >
> >> > > So, if you have not, could you try the way you originally wrote it.
> >> > >
> >> > > Regards,
> >> > >Alex.
> >> > > 
> >> > > http://www.solr-start.com/ - Resources for Solr users, new and
> >> > experienced
> >> > >
> >> > >
> >> > > On 15 March 2017 at 13:22, Chris Ulicny  wrote:
> >> > > > Sorry, that is a typo. The get is using the iqdocid field. There
> is
> >> no
> >> > > "id"
> >> > > > field defined in the schema.
> >> > > >
> >> > > > solr/TestCollection/get?iqdocid=2957-TV-201604141900
> >> > > >
> >> > > > solr/TestCollection/select?q=*:*&fq=iqdocid:2957-TV-201604141900
> >> > > >
> >> > > > On Wed, Mar 15, 2017 at 1:15 PM Erick Erickson <
> >> > erickerick...@gmail.com>
> >> > > > wrote:
> >> > > >
> >> > > >> Is this a typo or are you trying to use get with an "id" field
> and
> >> > > >> your filter query uses "iqdocid"?
> >> > > >>
> >> > > >> Best,
> >> > > >> Erick
> >> > > >>
> >> > > >> On Wed, Mar 15, 2017 at 8:31 AM, Chris Ulicny 
> >> > wrote:
> >> > > >> > Yes, we're using a fixed schema with the iqdocid field set as
> the
> >> > > >> uniqueKey.
> >> > > >> >
> >> > > >> > On Wed, Mar 15, 2017 at 11:28 AM Alexandre Rafalovitch <
> >> > > >> arafa...@gmail.com>
> >> > > >> > wrote:
> >> > > >> >
> >> > > >> >> What is your uniqueKey? Is it iqdocid?
> >> > > >> >>
> >> > > >> >> Regards,
> >> > > >> >>Alex.
> >> > > >> >> 
> >> > > >> >> http://www.solr-start.com/ - Resources for Solr users, new
> and
> >> > > >> experienced
> >> > > >> >>
> >> > > >> >>
> >> > > >> >> On 15 March 2017 at 11:24, Chris Ulicny 
> >> wrote:
> >> > > >> >> > Hi,
> >> > > >> >> >
> >> > > >> >> > I've been trying to use the get handler for a new solr cloud
> 

RE: Group by range results

2017-03-16 Thread Mikhail Ibraheem
Any help on this please?

 

From: Mikhail Ibraheem 
Sent: 15 March 2017 08:53 PM
To: solr-user@lucene.apache.org
Subject: Group by range results

 

Hi, 

Can we group by ranges? something like:

facet=true
stats=true
stats.field={!tag=piv1 min=true max=true}price
facet.range={!tag=r1}manufacturedate_dt
facet.range.start=2006-01-01T00:00:00Z
facet.range.end=NOW/YEAR
facet.range.gap=+1YEAR
facet.pivot={!stats=piv1}r1

 

Where I want the max price and min price for each range of manufacturedate_dt.

Please advise.
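In case it helps frame what I am after, the same thing expressed with the
JSON Facet API would look roughly like this (only a sketch -- the collection
name is a placeholder and I have not verified it against our version):

http://localhost:8983/solr/mycollection/query?q=*:*&rows=0&json.facet={
  prices_by_year : {
    type  : range,
    field : manufacturedate_dt,
    start : "2006-01-01T00:00:00Z",
    end   : "NOW/YEAR",
    gap   : "+1YEAR",
    facet : {
      min_price : "min(price)",
      max_price : "max(price)"
    }
  }
}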

 

Thanks


Re: Partial Match with DF

2017-03-16 Thread Alexandre Rafalovitch
df is the default field - you can only give one. To search over multiple
fields, you switch to the eDisMax query parser and the qf parameter.

Then, the question will be what type definition your fields have. When you
search the _text_ field, you are using its definition because of the
copyField. Your original fields may be strings.
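For example, a request along these lines searches both fields (the collection
name is a placeholder):

/solr/mycollection/select?defType=edismax&q=aaa%20ccc&qf=field1%20field2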

Remember to reload the core and reindex when you change definitions.

Regards,
   Alex


On 16 Mar 2017 9:15 AM, "Mark Johnson" 
wrote:

> Forgive me if I'm missing something obvious -- I'm new to Solr, but I can't
> seem to find an explanation for the behavior I'm seeing.
>
> If I have a document that looks like this:
> {
> field1: "aaa bbb",
> field2: "ccc ddd",
> field3: "eee fff"
> }
>
> And I do a search where "q" is "aaa ccc", I get the document in the
> results. This is because (please correct me if I'm wrong) the default "df"
> is set to the "_text_" field, which contains the text values from all
> fields.
>
> However, if I do a search where "df" is "field1" and "field2" and "q" is
> "aaa ccc" (words from field1 and field2) I get no results.
>
> In a simpler example, if I do a search where "df" is "field1" and "q" is
> "aaa" (a word from field1) I still get no results.
>
> If I do a search where "df" is "field1" and "q" is "aaa bbb" (the full
> value of field1) then I get the document in the results.
>
> So I'm concluding that when using "df" to specify which fields to search
> then only an exact match on the full field value will return a document.
>
> Is that a correct conclusion? Is there another way to specify which fields
> to search without requiring an exact match? The results I'd like to achieve
> are:
>
> Would Match:
> q=aaa
> q=aaa bbb
> q=aaa ccc
> q=aaa fff
>
> Would Not Match:
> q=eee
> q=fff
> q=eee fff
>
> --
> *This message is intended only for the use of the individual or entity to
> which it is addressed and may contain information that is privileged,
> confidential and exempt from disclosure under applicable law. If you have
> received this message in error, you are hereby notified that any use,
> dissemination, distribution or copying of this message is prohibited. If
> you have received this communication in error, please notify the sender
> immediately and destroy the transmitted information.*
>


Re: Get handler not working

2017-03-16 Thread Yonik Seeley
Something to do with routing perhaps? (the mapping of ids to shards,
by default is based on hashes of the id)
-Yonik


On Thu, Mar 16, 2017 at 9:16 AM, Chris Ulicny  wrote:
> iqdocid is already set to be the uniqueKey value.
>
> I tried reindexing a few documents back into the problematic cloud and am
> getting the same behavior of no document found for get handler.
>
> I've also done some testing on standalone instances as well as some quick
> cloud setups (with embedded zk), and I cannot seem to replicate the
> problem. For each test, I used the exact same configset that is causing the
> issue for us and indexed a document from that instance as well. I can
> provide more details if that would be useful in anyway.
>
> Standalone instance worked
> Cloud mode worked regardless of the use of the security plugin
> Cloud mode worked regardless of explicit get handler definition
> Cloud mode consistently worked with explicitly defining the get handler,
> then removing it and reloading the collection
>
> The only differences that I know of between the tests and the problematic
> cloud is that solr is running as a different user and using an external
> zookeeper ensemble. The running user has ownership of the solr
> installation, log, and data directories.
>
> I'm going to keep trying different setups to see if I can replicate the
> issue, but if anyone has any ideas on what direction might make the most
> sense, please let me know.
>
> Thanks again
>
> On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson 
> wrote:
>
> Wait... Is iqdocid set to the uniqueKey in your schema? That might
> be the missing thing.
>
>
>
> On Wed, Mar 15, 2017 at 11:20 AM, Chris Ulicny  wrote:
>> Unless the behavior's changed on the way to version 6.3.0, the get handler
>> used to use whatever field is set to be the uniqueKey. We have
> successfully
>> been using get on a 4.9.0 standalone core with no explicit "id" field
>> defined by passing in the value for the uniqueKey field to the get
> handler.
>> We tend to have a bunch of id fields floating around from different
>> sources, so we avoid keeping any of them named as "id"
>>
>> iqdocid is just a basic string type
>> > required="true" stored="true"/>
>>
>> I'll do some more testing on standalone versions, and see how that goes.
>>
>> On Wed, Mar 15, 2017 at 1:52 PM David Hastings <
> hastings.recurs...@gmail.com>
>> wrote:
>>
>>> from your previous email:
>>> "There is no "id"
>>> field defined in the schema."
>>>
>>> you need an id field to use the get handler
>>>
>>> On Wed, Mar 15, 2017 at 1:45 PM, Chris Ulicny  wrote:
>>>
>>> > I thought that "id" and "ids" were fixed parameters for the get
> handler,
>>> > but I never remember, so I've already tried both. Each time it comes
> back
>>> > with the same response of no document.
>>> >
>>> > On Wed, Mar 15, 2017 at 1:31 PM Alexandre Rafalovitch <
>>> arafa...@gmail.com>
>>> > wrote:
>>> >
>>> > > Actually.
>>> > >
>>> > > I think Real Time Get handler has "id" as a magical parameter, not as
>>> > > a field name. It maps to the real id field via the uniqueKey
>>> > > definition:
>>> > > https://cwiki.apache.org/confluence/display/solr/RealTime+Get
>>> > >
>>> > > So, if you have not, could you try the way you originally wrote it.
>>> > >
>>> > > Regards,
>>> > >Alex.
>>> > > 
>>> > > http://www.solr-start.com/ - Resources for Solr users, new and
>>> > experienced
>>> > >
>>> > >
>>> > > On 15 March 2017 at 13:22, Chris Ulicny  wrote:
>>> > > > Sorry, that is a typo. The get is using the iqdocid field. There is
>>> no
>>> > > "id"
>>> > > > field defined in the schema.
>>> > > >
>>> > > > solr/TestCollection/get?iqdocid=2957-TV-201604141900
>>> > > >
>>> > > > solr/TestCollection/select?q=*:*&fq=iqdocid:2957-TV-201604141900
>>> > > >
>>> > > > On Wed, Mar 15, 2017 at 1:15 PM Erick Erickson <
>>> > erickerick...@gmail.com>
>>> > > > wrote:
>>> > > >
>>> > > >> Is this a typo or are you trying to use get with an "id" field and
>>> > > >> your filter query uses "iqdocid"?
>>> > > >>
>>> > > >> Best,
>>> > > >> Erick
>>> > > >>
>>> > > >> On Wed, Mar 15, 2017 at 8:31 AM, Chris Ulicny 
>>> > wrote:
>>> > > >> > Yes, we're using a fixed schema with the iqdocid field set as
> the
>>> > > >> uniqueKey.
>>> > > >> >
>>> > > >> > On Wed, Mar 15, 2017 at 11:28 AM Alexandre Rafalovitch <
>>> > > >> arafa...@gmail.com>
>>> > > >> > wrote:
>>> > > >> >
>>> > > >> >> What is your uniqueKey? Is it iqdocid?
>>> > > >> >>
>>> > > >> >> Regards,
>>> > > >> >>Alex.
>>> > > >> >> 
>>> > > >> >> http://www.solr-start.com/ - Resources for Solr users, new and
>>> > > >> experienced
>>> > > >> >>
>>> > > >> >>
>>> > > >> >> On 15 March 2017 at 11:24, Chris Ulicny 
>>> wrote:
>>> > > >> >> > Hi,
>>> > > >> >> >
>>> > > >> >> > I've been trying to use the get handler for a new solr cloud
>>> > > >> collection
>>> > > >> >> we
>>> > > >> >> > are using, and something seems to be amiss.
>>> > > >> >> >
>>> > > >> >> > We are running 6.3.0, so we did not expl

Re: Get handler not working

2017-03-16 Thread David Hastings
I still would like to see an experiment where you change the field to id
instead of iqdocid.

On Thu, Mar 16, 2017 at 9:33 AM, Yonik Seeley  wrote:

> Something to do with routing perhaps? (the mapping of ids to shards,
> by default is based on hashes of the id)
> -Yonik
>
>
> On Thu, Mar 16, 2017 at 9:16 AM, Chris Ulicny  wrote:
> > iqdocid is already set to be the uniqueKey value.
> >
> > I tried reindexing a few documents back into the problematic cloud and am
> > getting the same behavior of no document found for get handler.
> >
> > I've also done some testing on standalone instances as well as some quick
> > cloud setups (with embedded zk), and I cannot seem to replicate the
> > problem. For each test, I used the exact same configset that is causing
> the
> > issue for us and indexed a document from that instance as well. I can
> > provide more details if that would be useful in anyway.
> >
> > Standalone instance worked
> > Cloud mode worked regardless of the use of the security plugin
> > Cloud mode worked regardless of explicit get handler definition
> > Cloud mode consistently worked with explicitly defining the get handler,
> > then removing it and reloading the collection
> >
> > The only differences that I know of between the tests and the problematic
> > cloud is that solr is running as a different user and using an external
> > zookeeper ensemble. The running user has ownership of the solr
> > installation, log, and data directories.
> >
> > I'm going to keep trying different setups to see if I can replicate the
> > issue, but if anyone has any ideas on what direction might make the most
> > sense, please let me know.
> >
> > Thanks again
> >
> > On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson 
> > wrote:
> >
> > Wait... Is iqdocid set to the uniqueKey in your schema? That might
> > be the missing thing.
> >
> >
> >
> > On Wed, Mar 15, 2017 at 11:20 AM, Chris Ulicny  wrote:
> >> Unless the behavior's changed on the way to version 6.3.0, the get
> handler
> >> used to use whatever field is set to be the uniqueKey. We have
> > successfully
> >> been using get on a 4.9.0 standalone core with no explicit "id" field
> >> defined by passing in the value for the uniqueKey field to the get
> > handler.
> >> We tend to have a bunch of id fields floating around from different
> >> sources, so we avoid keeping any of them named as "id"
> >>
> >> iqdocid is just a basic string type
> >>  >> required="true" stored="true"/>
> >>
> >> I'll do some more testing on standalone versions, and see how that goes.
> >>
> >> On Wed, Mar 15, 2017 at 1:52 PM David Hastings <
> > hastings.recurs...@gmail.com>
> >> wrote:
> >>
> >>> from your previous email:
> >>> "There is no "id"
> >>> field defined in the schema."
> >>>
> >>> you need an id field to use the get handler
> >>>
> >>> On Wed, Mar 15, 2017 at 1:45 PM, Chris Ulicny 
> wrote:
> >>>
> >>> > I thought that "id" and "ids" were fixed parameters for the get
> > handler,
> >>> > but I never remember, so I've already tried both. Each time it comes
> > back
> >>> > with the same response of no document.
> >>> >
> >>> > On Wed, Mar 15, 2017 at 1:31 PM Alexandre Rafalovitch <
> >>> arafa...@gmail.com>
> >>> > wrote:
> >>> >
> >>> > > Actually.
> >>> > >
> >>> > > I think Real Time Get handler has "id" as a magical parameter, not
> as
> >>> > > a field name. It maps to the real id field via the uniqueKey
> >>> > > definition:
> >>> > > https://cwiki.apache.org/confluence/display/solr/RealTime+Get
> >>> > >
> >>> > > So, if you have not, could you try the way you originally wrote it.
> >>> > >
> >>> > > Regards,
> >>> > >Alex.
> >>> > > 
> >>> > > http://www.solr-start.com/ - Resources for Solr users, new and
> >>> > experienced
> >>> > >
> >>> > >
> >>> > > On 15 March 2017 at 13:22, Chris Ulicny  wrote:
> >>> > > > Sorry, that is a typo. The get is using the iqdocid field. There
> is
> >>> no
> >>> > > "id"
> >>> > > > field defined in the schema.
> >>> > > >
> >>> > > > solr/TestCollection/get?iqdocid=2957-TV-201604141900
> >>> > > >
> >>> > > > solr/TestCollection/select?q=*:*&fq=iqdocid:2957-TV-201604141900
> >>> > > >
> >>> > > > On Wed, Mar 15, 2017 at 1:15 PM Erick Erickson <
> >>> > erickerick...@gmail.com>
> >>> > > > wrote:
> >>> > > >
> >>> > > >> Is this a typo or are you trying to use get with an "id" field
> and
> >>> > > >> your filter query uses "iqdocid"?
> >>> > > >>
> >>> > > >> Best,
> >>> > > >> Erick
> >>> > > >>
> >>> > > >> On Wed, Mar 15, 2017 at 8:31 AM, Chris Ulicny  >
> >>> > wrote:
> >>> > > >> > Yes, we're using a fixed schema with the iqdocid field set as
> > the
> >>> > > >> uniqueKey.
> >>> > > >> >
> >>> > > >> > On Wed, Mar 15, 2017 at 11:28 AM Alexandre Rafalovitch <
> >>> > > >> arafa...@gmail.com>
> >>> > > >> > wrote:
> >>> > > >> >
> >>> > > >> >> What is your uniqueKey? Is it iqdocid?
> >>> > > >> >>
> >>> > > >> >> Regards,
> >>> > > >> >>Alex.
> >>> > > >> >> 
> >>> > > >> >> http://www.solr-start.com/ - Resources 

Re: Partial Match with DF

2017-03-16 Thread Mark Johnson
Oh, great! Thank you!

So if I switch over to eDisMax I'd specify the fields to query via the "qf"
parameter, right? That seems to have the same result (only matches when I
specify the exact phrase in the field, not just certain words from it).

On Thu, Mar 16, 2017 at 9:33 AM, Alexandre Rafalovitch 
wrote:

> df is default field - you can only give one. To search over multiple
> fields, you switch to eDisMax query parser and fl parameter.
>
> Then, the question will be what type definition your fields have. When you
> search text field, you are using its definition because of copyField. Your
> original fields may be strings.
>
> Remember to reload core and reminded when you change definitions.
>
> Regards,
>Alex
>
>
> On 16 Mar 2017 9:15 AM, "Mark Johnson" 
> wrote:
>
> > Forgive me if I'm missing something obvious -- I'm new to Solr, but I
> can't
> > seem to find an explanation for the behavior I'm seeing.
> >
> > If I have a document that looks like this:
> > {
> > field1: "aaa bbb",
> > field2: "ccc ddd",
> > field3: "eee fff"
> > }
> >
> > And I do a search where "q" is "aaa ccc", I get the document in the
> > results. This is because (please correct me if I'm wrong) the default
> "df"
> > is set to the "_text_" field, which contains the text values from all
> > fields.
> >
> > However, if I do a search where "df" is "field1" and "field2" and "q" is
> > "aaa ccc" (words from field1 and field2) I get no results.
> >
> > In a simpler example, if I do a search where "df" is "field1" and "q" is
> > "aaa" (a word from field1) I still get no results.
> >
> > If I do a search where "df" is "field1" and "q" is "aaa bbb" (the full
> > value of field1) then I get the document in the results.
> >
> > So I'm concluding that when using "df" to specify which fields to search
> > then only an exact match on the full field value will return a document.
> >
> > Is that a correct conclusion? Is there another way to specify which
> fields
> > to search without requiring an exact match? The results I'd like to
> achieve
> > are:
> >
> > Would Match:
> > q=aaa
> > q=aaa bbb
> > q=aaa ccc
> > q=aaa fff
> >
> > Would Not Match:
> > q=eee
> > q=fff
> > q=eee fff
> >
> > --
> > *This message is intended only for the use of the individual or entity to
> > which it is addressed and may contain information that is privileged,
> > confidential and exempt from disclosure under applicable law. If you have
> > received this message in error, you are hereby notified that any use,
> > dissemination, distribution or copying of this message is prohibited. If
> > you have received this communication in error, please notify the sender
> > immediately and destroy the transmitted information.*
> >
>



-- 

Best Regards,

*Mark Johnson* | .NET Software Engineer

Office: 603-392-7017

Emerson Ecologics, LLC | 1230 Elm Street | Suite 301 | Manchester NH | 03101

  

*Supporting The Practice Of Healthy Living*









-- 
*This message is intended only for the use of the individual or entity to 
which it is addressed and may contain information that is privileged, 
confidential and exempt from disclosure under applicable law. If you have 
received this message in error, you are hereby notified that any use, 
dissemination, distribution or copying of this message is prohibited. If 
you have received this communication in error, please notify the sender 
immediately and destroy the transmitted information.*


Fwd: block join - search together at parent and childern

2017-03-16 Thread Jan Nekuda
Hi,
I have a question for which I wasn't able to find a good solution.
I have this structure of documents

A
|\
| \
B \
 \
  C
   \
\
 \
  D

Document type A has fields id_number, date_from, date_to
Document type C  has fields first_name, surname, birthdate
Document types D and B have fields street_name, house_number, city.


I want to find *all parents with block join and edismax*.
The problem is that I have found it is only possible to find children by
parent, or parents by children.
*I want to find parents by values in the parent and in the children*. I want
to use edismax with all fields from all documents (id_number, date_from,
date_to, first_name, surname, birthdate, street_name, house_number, city).
I want to write *Hynek* AND *Brojova* AND 14 and I expect that it returns
document A because it found Hynek in surname, Brojova in street and 14 in
house number.
This is easy with {!parent which=type:A}
The problem is that I'm not able to search by the condition 789 AND *Brojova*,
where 789 is id_number from type A and Brojova is street_name from D.

In short, I need to find all parents of the tree (parent and children) in
which all the words I send in the condition are matched.


My only solution is to make a root type X. Then A will be its child. Then I
can use {!parent which=type:X}.
Then this will work:

http://localhost:8983/solr/demo/select?q=*:*&fq={!parent
which=type:X}brojova*&fq={!parent which=type:X}16&wt=json&
indent=true&defType=edismax&qf=id_number date_from date_to
first_name surname birthdate street_name house_number city&stopwords=true&
lowercaseOperators=true


But I believe it can be solved much better.

X
|
A
|\
| \
B \
 \
  C
   \
\
 \
  D
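For completeness, a per-word sketch that ORs a parent-level clause with a
block-join clause over the children and then ANDs the groups together
(untested; words that hit numeric or date fields would need extra handling):

q=+(id_number:789 OR _query_:"{!parent which=type:A}street_name:789 OR surname:789")
  +(id_number:Brojova OR _query_:"{!parent which=type:A}street_name:Brojova OR surname:Brojova")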


Thanks for your help
Jan


Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread alessandro.benedetti
Hi Aman, are you using stopwords in your analysis by any chance?
Can you show us your request handler config?
With edismax you can configure stopwords to take effect at the query parsing
stage.
Let's try to figure it out first.

Cheers



-
---
Alessandro Benedetti
Search Consultant, R&D Software Engineer, Director
Sease Ltd. - www.sease.io
--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-shingles-is-not-working-in-solr-6-4-0-tp4325342p4325351.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Partial Match with DF

2017-03-16 Thread Erick Erickson
My guess: Your analysis chain for the fields is different, i.e. they
have a different fieldType. In particular, watch out for the "string"
type, people are often confused about it. It does _not_ break input
into tokens, you need a text-based field type, text_en is one example
that is usually in the configs by default.

Two tools that'll help you enormously:

admin UI>>select core (or collection) from the drop-down>>analysis
That shows you exactly how Solr/Lucene break up text at query and index time

add &debug=query to the URL. That'll show you how the query was parsed.
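As an illustration, the difference usually comes down to the field definition
(field name taken from your example; the type names are the stock ones that
ship with the default configs):

<!-- "string" indexes the whole value as a single token: only an exact
     match on "aaa bbb" will hit -->
<field name="field1" type="string" indexed="true" stored="true"/>

<!-- a tokenized text type lets q=aaa or q=aaa ccc match -->
<field name="field1" type="text_general" indexed="true" stored="true"/>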

Best,
Erick

On Thu, Mar 16, 2017 at 6:52 AM, Mark Johnson
 wrote:
> Oh, great! Thank you!
>
> So if I switch over to eDisMax I'd specify the fields to query via the "qf"
> parameter, right? That seems to have the same result (only matches when I
> specify the exact phrase in the field, not just certain words from it).
>
> On Thu, Mar 16, 2017 at 9:33 AM, Alexandre Rafalovitch 
> wrote:
>
>> df is default field - you can only give one. To search over multiple
>> fields, you switch to eDisMax query parser and fl parameter.
>>
>> Then, the question will be what type definition your fields have. When you
>> search text field, you are using its definition because of copyField. Your
>> original fields may be strings.
>>
>> Remember to reload core and reminded when you change definitions.
>>
>> Regards,
>>Alex
>>
>>
>> On 16 Mar 2017 9:15 AM, "Mark Johnson" 
>> wrote:
>>
>> > Forgive me if I'm missing something obvious -- I'm new to Solr, but I
>> can't
>> > seem to find an explanation for the behavior I'm seeing.
>> >
>> > If I have a document that looks like this:
>> > {
>> > field1: "aaa bbb",
>> > field2: "ccc ddd",
>> > field3: "eee fff"
>> > }
>> >
>> > And I do a search where "q" is "aaa ccc", I get the document in the
>> > results. This is because (please correct me if I'm wrong) the default
>> "df"
>> > is set to the "_text_" field, which contains the text values from all
>> > fields.
>> >
>> > However, if I do a search where "df" is "field1" and "field2" and "q" is
>> > "aaa ccc" (words from field1 and field2) I get no results.
>> >
>> > In a simpler example, if I do a search where "df" is "field1" and "q" is
>> > "aaa" (a word from field1) I still get no results.
>> >
>> > If I do a search where "df" is "field1" and "q" is "aaa bbb" (the full
>> > value of field1) then I get the document in the results.
>> >
>> > So I'm concluding that when using "df" to specify which fields to search
>> > then only an exact match on the full field value will return a document.
>> >
>> > Is that a correct conclusion? Is there another way to specify which
>> fields
>> > to search without requiring an exact match? The results I'd like to
>> achieve
>> > are:
>> >
>> > Would Match:
>> > q=aaa
>> > q=aaa bbb
>> > q=aaa ccc
>> > q=aaa fff
>> >
>> > Would Not Match:
>> > q=eee
>> > q=fff
>> > q=eee fff
>> >
>> > --
>> > *This message is intended only for the use of the individual or entity to
>> > which it is addressed and may contain information that is privileged,
>> > confidential and exempt from disclosure under applicable law. If you have
>> > received this message in error, you are hereby notified that any use,
>> > dissemination, distribution or copying of this message is prohibited. If
>> > you have received this communication in error, please notify the sender
>> > immediately and destroy the transmitted information.*
>> >
>>
>
>
>
> --
>
> Best Regards,
>
> *Mark Johnson* | .NET Software Engineer
>
> Office: 603-392-7017
>
> Emerson Ecologics, LLC | 1230 Elm Street | Suite 301 | Manchester NH | 03101
>
>   
>
> *Supporting The Practice Of Healthy Living*
>
> 
> 
> 
> 
> 
> 
> 
>
> --
> *This message is intended only for the use of the individual or entity to
> which it is addressed and may contain information that is privileged,
> confidential and exempt from disclosure under applicable law. If you have
> received this message in error, you are hereby notified that any use,
> dissemination, distribution or copying of this message is prohibited. If
> you have received this communication in error, please notify the sender
> immediately and destroy the transmitted information.*


Re: DIH delta import with cache 5.3.1 issue

2017-03-16 Thread Alexandre Rafalovitch
You have nested entities and accumulate the content of the inner
entities in the outer one with caching on an inner one. Your
description sounds like the inner cache is not reset on the next
iteration of the outer loop.

This may be connected to
https://issues.apache.org/jira/browse/SOLR-7843 (Fixed in 5.4)

Or it may be a different bug. I would make a simplest test case (based
on DIH-db example) and then try it on 5.3.1 and 5.4. And then 6.4 if
the problem is still there. If it is still there in 6.4, then we may
have a new bug.
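For the test case, something shaped like the DIH db example should be enough
-- a sketch with made-up table and field names:

<document>
  <entity name="parent"
          query="SELECT id, name FROM parent"
          deltaQuery="SELECT id FROM parent WHERE updated > '${dataimporter.last_index_time}'"
          deltaImportQuery="SELECT id, name FROM parent WHERE id='${dih.delta.id}'">
    <field column="name" name="name"/>
    <entity name="child"
            query="SELECT parent_id, tag FROM child"
            cacheImpl="SortedMapBackedCache"
            cacheKey="parent_id"
            cacheLookup="parent.id">
      <!-- tag maps to a multiValued field; on the delta run, check whether
           values from other parents leak into this document -->
      <field column="tag" name="tags"/>
    </entity>
  </entity>
</document>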

Regards,
   Alex.

http://www.solr-start.com/ - Resources for Solr users, new and experienced


On 16 March 2017 at 09:17, Sujay Bawaskar  wrote:
> This behaviour is for delta import only. One document get field values of
> all documents. These fields are child entities which maps column to multi
> valued fields.
>
>  query="IMPORT_QUERY"
> deltaQuery="DELTA_QUERY"
> pk="buildingUserId"
> deletedPkQuery="DELETE_QUERY"
> onError="continue">
>
>   query="SELECT_QUERY"
> transformer="RegexTransformer" cacheImpl="SortedMapBackedCache"
> cacheKey="bldId" cacheLookup="user_building.plainBuildingId"
> onError="continue">
> 
>  splitBy="," />
>  dateTimeFormat="-MM-dd" />
> 
> 
>
> On Thu, Mar 16, 2017 at 6:35 PM, Alexandre Rafalovitch 
> wrote:
>
>> Could you give a bit more details. Do you mean one document gets the
>> content of multiple documents? And only on delta?
>>
>> Regards,
>> Alex
>>
>> On 16 Mar 2017 8:53 AM, "Sujay Bawaskar" 
>> wrote:
>>
>> Hi,
>>
>> We are using DIH with cache(SortedMapBackedCache) with solr 5.3.1. We have
>> around 2.8 million documents in solr and total index size is 4 GB. DIH
>> delta import is dumping all values of mapped columns to their respective
>> multi valued fields. This is causing size of one solr document upto 2 GB.
>> Is this a known issue with solr 5.3.1?
>>
>> Thanks,
>> Sujay
>>


Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Alexandre Rafalovitch
Sanity check. Is your 'df' pointing at the field you think it is
pointing at? It really does look like all tokens were eaten and
nothing was left. But you should have seen that in the Analysis screen
too, if you have the right field.

Try adding echoParams=all to your request to see the full final
parameter list. Maybe some parameters in initParams sections override
your assumed config.

Regards,
   Alex.

http://www.solr-start.com/ - Resources for Solr users, new and experienced


On 16 March 2017 at 08:30, Aman Deep Singh  wrote:
> Hi,
>
> Recently I migrated from solr 4 to 6
> IN solr 4 shinglefilterfactory is working correctly
> my configration  i
>
>  positionIncrementGap="100">
> 
>  
>   maxShingleSize="5"
>  outputUnigrams="false" outputUnigramsIfNoShingles="false" />
>   
> 
> 
>   
>   maxShingleSize="5"
>  outputUnigrams="false" outputUnigramsIfNoShingles="false" />
>   
>   
> 
>   
>
>
>
> But after updating to solr 6 shingles is not working ,schema is as below,
>
>  positionIncrementGap="100">
> 
>  
>   maxShingleSize="5"
>  outputUnigrams="false" outputUnigramsIfNoShingles="false" />
>   
> 
> 
>   
>   maxShingleSize="5"
>  outputUnigrams="false" outputUnigramsIfNoShingles="false" />
>   
> 
>   
>
> Although in the Analysis tab is was showing proper shingle result but when
> using in the queryParser it was not giving proper results
>
> my sample hit is
>
> http://localhost:8983/solr/shingel_test/select?q=one%20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle
>
> it create the parsed query as
>
> one plus one
> one plus one
> (+())/no_coord
> +()
> 
> ExtendedDismaxQParser


Re: DIH delta import with cache 5.3.1 issue

2017-03-16 Thread Sujay Bawaskar
Thanks Alex. I will test it with 5.4 and 6.4 and let you know.

On Thu, Mar 16, 2017 at 7:40 PM, Alexandre Rafalovitch 
wrote:

> You have nested entities and accumulate the content of the inner
> entities in the outer one with caching on an inner one. Your
> description sounds like the inner cache is not reset on the next
> iteration of the outer loop.
>
> This may be connected to
> https://issues.apache.org/jira/browse/SOLR-7843 (Fixed in 5.4)
>
> Or it may be a different bug. I would make a simplest test case (based
> on DIH-db example) and then try it on 5.3.1 and 5.4. And then 6.4 if
> the problem is still there. If it is still there in 6.4, then we may
> have a new bug.
>
> Regards,
>Alex.
> 
> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>
>
> On 16 March 2017 at 09:17, Sujay Bawaskar 
> wrote:
> > This behaviour is for delta import only. One document get field values of
> > all documents. These fields are child entities which maps column to multi
> > valued fields.
> >
> >  > query="IMPORT_QUERY"
> > deltaQuery="DELTA_QUERY"
> > pk="buildingUserId"
> > deletedPkQuery="DELETE_QUERY"
> > onError="continue">
> >
> >   > query="SELECT_QUERY"
> > transformer="RegexTransformer" cacheImpl="SortedMapBackedCache"
> > cacheKey="bldId" cacheLookup="user_building.plainBuildingId"
> > onError="continue">
> > 
> >  > splitBy="," />
> >  > dateTimeFormat="-MM-dd" />
> > 
> > 
> >
> > On Thu, Mar 16, 2017 at 6:35 PM, Alexandre Rafalovitch <
> arafa...@gmail.com>
> > wrote:
> >
> >> Could you give a bit more details. Do you mean one document gets the
> >> content of multiple documents? And only on delta?
> >>
> >> Regards,
> >> Alex
> >>
> >> On 16 Mar 2017 8:53 AM, "Sujay Bawaskar" 
> >> wrote:
> >>
> >> Hi,
> >>
> >> We are using DIH with cache(SortedMapBackedCache) with solr 5.3.1. We
> have
> >> around 2.8 million documents in solr and total index size is 4 GB. DIH
> >> delta import is dumping all values of mapped columns to their respective
> >> multi valued fields. This is causing size of one solr document upto 2
> GB.
> >> Is this a known issue with solr 5.3.1?
> >>
> >> Thanks,
> >> Sujay
> >>
>


Re: Get handler not working

2017-03-16 Thread Alexandre Rafalovitch
Does real time get implementation reroutes the request internally to a
different shard? If not, then maybe the request is going to a
non-primary shard.

Regards,
   Alex.

http://www.solr-start.com/ - Resources for Solr users, new and experienced


On 16 March 2017 at 09:33, Yonik Seeley  wrote:
> Something to do with routing perhaps? (the mapping of ids to shards,
> by default is based on hashes of the id)
> -Yonik
>
>
> On Thu, Mar 16, 2017 at 9:16 AM, Chris Ulicny  wrote:
>> iqdocid is already set to be the uniqueKey value.
>>
>> I tried reindexing a few documents back into the problematic cloud and am
>> getting the same behavior of no document found for get handler.
>>
>> I've also done some testing on standalone instances as well as some quick
>> cloud setups (with embedded zk), and I cannot seem to replicate the
>> problem. For each test, I used the exact same configset that is causing the
>> issue for us and indexed a document from that instance as well. I can
>> provide more details if that would be useful in anyway.
>>
>> Standalone instance worked
>> Cloud mode worked regardless of the use of the security plugin
>> Cloud mode worked regardless of explicit get handler definition
>> Cloud mode consistently worked with explicitly defining the get handler,
>> then removing it and reloading the collection
>>
>> The only differences that I know of between the tests and the problematic
>> cloud is that solr is running as a different user and using an external
>> zookeeper ensemble. The running user has ownership of the solr
>> installation, log, and data directories.
>>
>> I'm going to keep trying different setups to see if I can replicate the
>> issue, but if anyone has any ideas on what direction might make the most
>> sense, please let me know.
>>
>> Thanks again
>>
>> On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson 
>> wrote:
>>
>> Wait... Is iqdocid set to the  in your schema? That might
>> be the missing thing.
>>
>>
>>
>> On Wed, Mar 15, 2017 at 11:20 AM, Chris Ulicny  wrote:
>>> Unless the behavior's changed on the way to version 6.3.0, the get handler
>>> used to use whatever field is set to be the uniqueKey. We have
>> successfully
>>> been using get on a 4.9.0 standalone core with no explicit "id" field
>>> defined by passing in the value for the uniqueKey field to the get
>> handler.
>>> We tend to have a bunch of id fields floating around from different
>>> sources, so we avoid keeping any of them named as "id"
>>>
>>> iqdocid is just a basic string type
>>> >> required="true" stored="true"/>
>>>
>>> I'll do some more testing on standalone versions, and see how that goes.
>>>
>>> On Wed, Mar 15, 2017 at 1:52 PM David Hastings <
>> hastings.recurs...@gmail.com>
>>> wrote:
>>>
 from your previous email:
 "There is no "id"
 field defined in the schema."

 you need an id field to use the get handler

 On Wed, Mar 15, 2017 at 1:45 PM, Chris Ulicny  wrote:

 > I thought that "id" and "ids" were fixed parameters for the get
>> handler,
 > but I never remember, so I've already tried both. Each time it comes
>> back
 > with the same response of no document.
 >
 > On Wed, Mar 15, 2017 at 1:31 PM Alexandre Rafalovitch <
 arafa...@gmail.com>
 > wrote:
 >
 > > Actually.
 > >
 > > I think Real Time Get handler has "id" as a magical parameter, not as
 > > a field name. It maps to the real id field via the uniqueKey
 > > definition:
 > > https://cwiki.apache.org/confluence/display/solr/RealTime+Get
 > >
 > > So, if you have not, could you try the way you originally wrote it.
 > >
 > > Regards,
 > >Alex.
 > > 
 > > http://www.solr-start.com/ - Resources for Solr users, new and
 > experienced
 > >
 > >
 > > On 15 March 2017 at 13:22, Chris Ulicny  wrote:
 > > > Sorry, that is a typo. The get is using the iqdocid field. There is
 no
 > > "id"
 > > > field defined in the schema.
 > > >
 > > > solr/TestCollection/get?iqdocid=2957-TV-201604141900
 > > >
 > > > solr/TestCollection/select?q=*:*&fq=iqdocid:2957-TV-201604141900
 > > >
 > > > On Wed, Mar 15, 2017 at 1:15 PM Erick Erickson <
 > erickerick...@gmail.com>
 > > > wrote:
 > > >
 > > >> Is this a typo or are you trying to use get with an "id" field and
 > > >> your filter query uses "iqdocid"?
 > > >>
 > > >> Best,
 > > >> Erick
 > > >>
 > > >> On Wed, Mar 15, 2017 at 8:31 AM, Chris Ulicny 
 > wrote:
 > > >> > Yes, we're using a fixed schema with the iqdocid field set as
>> the
 > > >> uniqueKey.
 > > >> >
 > > >> > On Wed, Mar 15, 2017 at 11:28 AM Alexandre Rafalovitch <
 > > >> arafa...@gmail.com>
 > > >> > wrote:
 > > >> >
 > > >> >> What is your uniqueKey? Is it iqdocid?
 > > >> >>
 > > >> >> Regards,
 > > >> >>Alex.
 > > >> >> 
 > > >> >> http://www.solr-start.com/ 

Re: Get handler not working

2017-03-16 Thread Chris Ulicny
Speaking of routing, I realized I completely forgot to add the routing
setup to the test cloud, so it probably has something to do with the issue.
I'll add that in and report back.

So the routing and uniqueKey setup is as follows:

Schema setup:
<uniqueKey>iqdocid</uniqueKey>
<field name="iqdocid" type="string" multiValued="false" indexed="true" required="true" stored="true"/>

I don't think it's mentioned in the documentation about using routerField
for the compositeId router, but based on the resolution of SOLR-5017
, we decided to use the
compositeId router with routerField set to 'iqroutingkey' which is using
the "!" notation. In general, the iqroutingkey field is of the form:
!!

Unless I misunderstood what was changed with that patch, that form should
still route appropriately, and it seems that it has distributed the
documents appropriately from our basic testing.

On Thu, Mar 16, 2017 at 9:42 AM David Hastings 
wrote:

i still would like to see an experiment where you change the field to id
instead of iqdocid,

On Thu, Mar 16, 2017 at 9:33 AM, Yonik Seeley  wrote:

> Something to do with routing perhaps? (the mapping of ids to shards,
> by default is based on hashes of the id)
> -Yonik
>
>
> On Thu, Mar 16, 2017 at 9:16 AM, Chris Ulicny  wrote:
> > iqdocid is already set to be the uniqueKey value.
> >
> > I tried reindexing a few documents back into the problematic cloud and
am
> > getting the same behavior of no document found for get handler.
> >
> > I've also done some testing on standalone instances as well as some
quick
> > cloud setups (with embedded zk), and I cannot seem to replicate the
> > problem. For each test, I used the exact same configset that is causing
> the
> > issue for us and indexed a document from that instance as well. I can
> > provide more details if that would be useful in anyway.
> >
> > Standalone instance worked
> > Cloud mode worked regardless of the use of the security plugin
> > Cloud mode worked regardless of explicit get handler definition
> > Cloud mode consistently worked with explicitly defining the get handler,
> > then removing it and reloading the collection
> >
> > The only differences that I know of between the tests and the
problematic
> > cloud is that solr is running as a different user and using an external
> > zookeeper ensemble. The running user has ownership of the solr
> > installation, log, and data directories.
> >
> > I'm going to keep trying different setups to see if I can replicate the
> > issue, but if anyone has any ideas on what direction might make the most
> > sense, please let me know.
> >
> > Thanks again
> >
> > On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson 
> > wrote:
> >
> > Wait... Is iqdocid set to the  in your schema? That might
> > be the missing thing.
> >
> >
> >
> > On Wed, Mar 15, 2017 at 11:20 AM, Chris Ulicny  wrote:
> >> Unless the behavior's changed on the way to version 6.3.0, the get
> handler
> >> used to use whatever field is set to be the uniqueKey. We have
> > successfully
> >> been using get on a 4.9.0 standalone core with no explicit "id" field
> >> defined by passing in the value for the uniqueKey field to the get
> > handler.
> >> We tend to have a bunch of id fields floating around from different
> >> sources, so we avoid keeping any of them named as "id"
> >>
> >> iqdocid is just a basic string type
> >>  >> required="true" stored="true"/>
> >>
> >> I'll do some more testing on standalone versions, and see how that
goes.
> >>
> >> On Wed, Mar 15, 2017 at 1:52 PM David Hastings <
> > hastings.recurs...@gmail.com>
> >> wrote:
> >>
> >>> from your previous email:
> >>> "There is no "id"
> >>> field defined in the schema."
> >>>
> >>> you need an id field to use the get handler
> >>>
> >>> On Wed, Mar 15, 2017 at 1:45 PM, Chris Ulicny 
> wrote:
> >>>
> >>> > I thought that "id" and "ids" were fixed parameters for the get
> > handler,
> >>> > but I never remember, so I've already tried both. Each time it comes
> > back
> >>> > with the same response of no document.
> >>> >
> >>> > On Wed, Mar 15, 2017 at 1:31 PM Alexandre Rafalovitch <
> >>> arafa...@gmail.com>
> >>> > wrote:
> >>> >
> >>> > > Actually.
> >>> > >
> >>> > > I think Real Time Get handler has "id" as a magical parameter, not
> as
> >>> > > a field name. It maps to the real id field via the uniqueKey
> >>> > > definition:
> >>> > > https://cwiki.apache.org/confluence/display/solr/RealTime+Get
> >>> > >
> >>> > > So, if you have not, could you try the way you originally wrote
it.
> >>> > >
> >>> > > Regards,
> >>> > >Alex.
> >>> > > 
> >>> > > http://www.solr-start.com/ - Resources for Solr users, new and
> >>> > experienced
> >>> > >
> >>> > >
> >>> > > On 15 March 2017 at 13:22, Chris Ulicny  wrote:
> >>> > > > Sorry, that is a typo. The get is using the iqdocid field. There
> is
> >>> no
> >>> > > "id"
> >>> > > > field defined in the schema.
> >>> > > >
> >>> > > > solr/TestCollection/get?iqdocid=2957-TV-201604141900
> >>> > > >
> >>> > > > solr/TestCollection/select?q=*:*&fq=iqdocid:2957-TV-201604141900
> >

Re: Partial Match with DF

2017-03-16 Thread Mark Johnson
You're right! The fields I'm searching are all "string" type. I switched to
"text_en" and now it's working exactly as I need it to! I'll do some
research to see if "text_en" or another "text" type field is best for our
needs.

Also, those debug options are amazing! They'll help tremendously in the
future.

Thank you much!

On Thu, Mar 16, 2017 at 10:02 AM, Erick Erickson 
wrote:

> My guess: Your analysis chain for the fields is different, i.e. they
> have a different fieldType. In particular, watch out for the "string"
> type, people are often confused about it. It does _not_ break input
> into tokens, you need a text-based field type, text_en is one example
> that is usually in the configs by default.
>
> Two tools that'll help you enormously:
>
> admin UI>>select core (or collection) from the drop-down>>analysis
> That shows you exactly how Solr/Lucene break up text at query and index
> time
>
> add &debug=query to the URL. That'll show you how the query was parsed.
>
> Best,
> Erick
>
> On Thu, Mar 16, 2017 at 6:52 AM, Mark Johnson
>  wrote:
> > Oh, great! Thank you!
> >
> > So if I switch over to eDisMax I'd specify the fields to query via the
> "qf"
> > parameter, right? That seems to have the same result (only matches when I
> > specify the exact phrase in the field, not just certain words from it).
> >
> > On Thu, Mar 16, 2017 at 9:33 AM, Alexandre Rafalovitch <
> arafa...@gmail.com>
> > wrote:
> >
> >> df is default field - you can only give one. To search over multiple
> >> fields, you switch to eDisMax query parser and fl parameter.
> >>
> >> Then, the question will be what type definition your fields have. When
> you
> >> search text field, you are using its definition because of copyField.
> Your
> >> original fields may be strings.
> >>
> >> Remember to reload core and reminded when you change definitions.
> >>
> >> Regards,
> >>Alex
> >>
> >>
> >> On 16 Mar 2017 9:15 AM, "Mark Johnson" 
> >> wrote:
> >>
> >> > Forgive me if I'm missing something obvious -- I'm new to Solr, but I
> >> can't
> >> > seem to find an explanation for the behavior I'm seeing.
> >> >
> >> > If I have a document that looks like this:
> >> > {
> >> > field1: "aaa bbb",
> >> > field2: "ccc ddd",
> >> > field3: "eee fff"
> >> > }
> >> >
> >> > And I do a search where "q" is "aaa ccc", I get the document in the
> >> > results. This is because (please correct me if I'm wrong) the default
> >> "df"
> >> > is set to the "_text_" field, which contains the text values from all
> >> > fields.
> >> >
> >> > However, if I do a search where "df" is "field1" and "field2" and "q"
> is
> >> > "aaa ccc" (words from field1 and field2) I get no results.
> >> >
> >> > In a simpler example, if I do a search where "df" is "field1" and "q"
> is
> >> > "aaa" (a word from field1) I still get no results.
> >> >
> >> > If I do a search where "df" is "field1" and "q" is "aaa bbb" (the full
> >> > value of field1) then I get the document in the results.
> >> >
> >> > So I'm concluding that when using "df" to specify which fields to
> search
> >> > then only an exact match on the full field value will return a
> document.
> >> >
> >> > Is that a correct conclusion? Is there another way to specify which
> >> fields
> >> > to search without requiring an exact match? The results I'd like to
> >> achieve
> >> > are:
> >> >
> >> > Would Match:
> >> > q=aaa
> >> > q=aaa bbb
> >> > q=aaa ccc
> >> > q=aaa fff
> >> >
> >> > Would Not Match:
> >> > q=eee
> >> > q=fff
> >> > q=eee fff
> >> >
> >> > --
> >> > *This message is intended only for the use of the individual or
> entity to
> >> > which it is addressed and may contain information that is privileged,
> >> > confidential and exempt from disclosure under applicable law. If you
> have
> >> > received this message in error, you are hereby notified that any use,
> >> > dissemination, distribution or copying of this message is prohibited.
> If
> >> > you have received this communication in error, please notify the
> sender
> >> > immediately and destroy the transmitted information.*
> >> >
> >>
> >
> >
> >
> > --
> >
> > Best Regards,
> >
> > *Mark Johnson* | .NET Software Engineer
> >
> > Office: 603-392-7017
> >
> > Emerson Ecologics, LLC | 1230 Elm Street | Suite 301 | Manchester NH |
> 03101
> >
> >   
> >
> > *Supporting The Practice Of Healthy Living*
> >
> > 
> > 
> > 
> > 
> > 
> > 
> >  Ecologics-EI_IE388367.11,28.htm>
> >
> > --
> > *This message is intended only for the use of the individual or entity to
> > which it is addressed and may contain information that is privileged,
> > confidential and exempt from di

Re: Get handler not working

2017-03-16 Thread Yonik Seeley
Ah, yeah, if you're using a different route field it's highly likely
that's the issue.
I was always against that "feature", and this thread demonstrates part
of the problem (complicating clients, including us human clients
trying to make sense of what's going on).

-Yonik


On Thu, Mar 16, 2017 at 10:31 AM, Chris Ulicny  wrote:
> Speaking of routing, I realized I completely forgot to add the routing
> setup to the test cloud, so it probably has something to do with the issue.
> I'll add that in and report back.
>
> So the routing and uniqueKey setup is as follows:
>
> Schema setup:
> iqdocid  multiValued="false" indexed="true" required="true" stored="true"/>  name="iqdocid" type="string" multiValued="false" indexed="true" required=
> "true" stored="true"/>
>
> I don't think it's mentioned in the documentation about using routerField
> for the compositeId router, but based on the resolution of SOLR-5017
> , we decided to use the
> compositeId router with routerField set to 'iqroutingkey' which is using
> the "!" notation. In general, the iqroutingkey field is of the form:
> !!
>
> Unless I misunderstood what was changed with that patch, that form should
> still route appropriately, and it seems that it has distributed the
> documents appropriately from our basic testing.
>
> On Thu, Mar 16, 2017 at 9:42 AM David Hastings 
> wrote:
>
> i still would like to see an experiment where you change the field to id
> instead of iqdocid,
>
> On Thu, Mar 16, 2017 at 9:33 AM, Yonik Seeley  wrote:
>
>> Something to do with routing perhaps? (the mapping of ids to shards,
>> by default is based on hashes of the id)
>> -Yonik
>>
>>
>> On Thu, Mar 16, 2017 at 9:16 AM, Chris Ulicny  wrote:
>> > iqdocid is already set to be the uniqueKey value.
>> >
>> > I tried reindexing a few documents back into the problematic cloud and
> am
>> > getting the same behavior of no document found for get handler.
>> >
>> > I've also done some testing on standalone instances as well as some
> quick
>> > cloud setups (with embedded zk), and I cannot seem to replicate the
>> > problem. For each test, I used the exact same configset that is causing
>> the
>> > issue for us and indexed a document from that instance as well. I can
>> > provide more details if that would be useful in anyway.
>> >
>> > Standalone instance worked
>> > Cloud mode worked regardless of the use of the security plugin
>> > Cloud mode worked regardless of explicit get handler definition
>> > Cloud mode consistently worked with explicitly defining the get handler,
>> > then removing it and reloading the collection
>> >
>> > The only differences that I know of between the tests and the
> problematic
>> > cloud is that solr is running as a different user and using an external
>> > zookeeper ensemble. The running user has ownership of the solr
>> > installation, log, and data directories.
>> >
>> > I'm going to keep trying different setups to see if I can replicate the
>> > issue, but if anyone has any ideas on what direction might make the most
>> > sense, please let me know.
>> >
>> > Thanks again
>> >
>> > On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson 
>> > wrote:
>> >
>> > Wait... Is iqdocid set to the  in your schema? That might
>> > be the missing thing.
>> >
>> >
>> >
>> > On Wed, Mar 15, 2017 at 11:20 AM, Chris Ulicny  wrote:
>> >> Unless the behavior's changed on the way to version 6.3.0, the get
>> handler
>> >> used to use whatever field is set to be the uniqueKey. We have
>> > successfully
>> >> been using get on a 4.9.0 standalone core with no explicit "id" field
>> >> defined by passing in the value for the uniqueKey field to the get
>> > handler.
>> >> We tend to have a bunch of id fields floating around from different
>> >> sources, so we avoid keeping any of them named as "id"
>> >>
>> >> iqdocid is just a basic string type
>> >> > >> required="true" stored="true"/>
>> >>
>> >> I'll do some more testing on standalone versions, and see how that
> goes.
>> >>
>> >> On Wed, Mar 15, 2017 at 1:52 PM David Hastings <
>> > hastings.recurs...@gmail.com>
>> >> wrote:
>> >>
>> >>> from your previous email:
>> >>> "There is no "id"
>> >>> field defined in the schema."
>> >>>
>> >>> you need an id field to use the get handler
>> >>>
>> >>> On Wed, Mar 15, 2017 at 1:45 PM, Chris Ulicny 
>> wrote:
>> >>>
>> >>> > I thought that "id" and "ids" were fixed parameters for the get
>> > handler,
>> >>> > but I never remember, so I've already tried both. Each time it comes
>> > back
>> >>> > with the same response of no document.
>> >>> >
>> >>> > On Wed, Mar 15, 2017 at 1:31 PM Alexandre Rafalovitch <
>> >>> arafa...@gmail.com>
>> >>> > wrote:
>> >>> >
>> >>> > > Actually.
>> >>> > >
>> >>> > > I think Real Time Get handler has "id" as a magical parameter, not
>> as
>> >>> > > a field name. It maps to the real id field via the uniqueKey
>> >>> > > definition:
>> >>> > > https://cwiki.apache.org/confluence/display/solr/RealTime+

Re: Partial Match with DF

2017-03-16 Thread Erick Erickson
Yeah, they've saved me on numerous occasions, glad to see they helped.

One caution BTW when you start changing fieldTypes is you have to
watch punctuation. StandardTokenizerFactory won't pass through most
punctuation.

WordDelimiterFilterFactory breaks on non-alphanumerics, including
punctuation, effectively throwing it out.

But WhitespaceTokenizer does just that and spits out punctuation as
part of tokens, i.e.
"my words." (note period) is broken up as "my" "words." and wouldn't
match a search on "words".

One other note, there's a tokenizer/filter for a zillion different
cases, you can go wild. Here's a partial
list: https://cwiki.apache.org/confluence/display/solr/Understanding+Analyzers%2C+Tokenizers%2C+and+Filters,
see the "Tokenizer", "Filters" and "CharFilters" links. There are 12
tokenizers listed and 40 or so filters... and the list is not
guaranteed to be complete.
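
For example, a minimal sketch of the whitespace-based option (the field type
name here is made up, adjust to taste):

<fieldType name="text_ws_example" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- split on whitespace only; punctuation stays attached to the tokens -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- then break on non-alphanumerics, so "words." indexes as "words" -->
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
            generateNumberParts="1" catenateWords="0" catenateNumbers="0"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

The Analysis screen is the quickest way to check what any such chain really
does to your data.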

On Thu, Mar 16, 2017 at 7:39 AM, Mark Johnson
 wrote:
> You're right! The fields I'm searching are all "string" type. I switched to
> "text_en" and now it's working exactly as I need it to! I'll do some
> research to see if "text_en" or another "text" type field is best for our
> needs.
>
> Also, those debug options are amazing! They'll help tremendously in the
> future.
>
> Thank you much!
>
> On Thu, Mar 16, 2017 at 10:02 AM, Erick Erickson 
> wrote:
>
>> My guess: Your analysis chain for the fields is different, i.e. they
>> have a different fieldType. In particular, watch out for the "string"
>> type, people are often confused about it. It does _not_ break input
>> into tokens, you need a text-based field type, text_en is one example
>> that is usually in the configs by default.
>>
>> Two tools that'll help you enormously:
>>
>> admin UI>>select core (or collection) from the drop-down>>analysis
>> That shows you exactly how Solr/Lucene break up text at query and index
>> time
>>
>> add &debug=query to the URL. That'll show you how the query was parsed.
>>
>> Best,
>> Erick
>>
>> On Thu, Mar 16, 2017 at 6:52 AM, Mark Johnson
>>  wrote:
>> > Oh, great! Thank you!
>> >
>> > So if I switch over to eDisMax I'd specify the fields to query via the
>> "qf"
>> > parameter, right? That seems to have the same result (only matches when I
>> > specify the exact phrase in the field, not just certain words from it).
>> >
>> > On Thu, Mar 16, 2017 at 9:33 AM, Alexandre Rafalovitch <
>> arafa...@gmail.com>
>> > wrote:
>> >
>> >> df is default field - you can only give one. To search over multiple
>> >> fields, you switch to eDisMax query parser and fl parameter.
>> >>
>> >> Then, the question will be what type definition your fields have. When
>> you
>> >> search text field, you are using its definition because of copyField.
>> Your
>> >> original fields may be strings.
>> >>
>> >> Remember to reload core and reminded when you change definitions.
>> >>
>> >> Regards,
>> >>Alex
>> >>
>> >>
>> >> On 16 Mar 2017 9:15 AM, "Mark Johnson" 
>> >> wrote:
>> >>
>> >> > Forgive me if I'm missing something obvious -- I'm new to Solr, but I
>> >> can't
>> >> > seem to find an explanation for the behavior I'm seeing.
>> >> >
>> >> > If I have a document that looks like this:
>> >> > {
>> >> > field1: "aaa bbb",
>> >> > field2: "ccc ddd",
>> >> > field3: "eee fff"
>> >> > }
>> >> >
>> >> > And I do a search where "q" is "aaa ccc", I get the document in the
>> >> > results. This is because (please correct me if I'm wrong) the default
>> >> "df"
>> >> > is set to the "_text_" field, which contains the text values from all
>> >> > fields.
>> >> >
>> >> > However, if I do a search where "df" is "field1" and "field2" and "q"
>> is
>> >> > "aaa ccc" (words from field1 and field2) I get no results.
>> >> >
>> >> > In a simpler example, if I do a search where "df" is "field1" and "q"
>> is
>> >> > "aaa" (a word from field1) I still get no results.
>> >> >
>> >> > If I do a search where "df" is "field1" and "q" is "aaa bbb" (the full
>> >> > value of field1) then I get the document in the results.
>> >> >
>> >> > So I'm concluding that when using "df" to specify which fields to
>> search
>> >> > then only an exact match on the full field value will return a
>> document.
>> >> >
>> >> > Is that a correct conclusion? Is there another way to specify which
>> >> fields
>> >> > to search without requiring an exact match? The results I'd like to
>> >> achieve
>> >> > are:
>> >> >
>> >> > Would Match:
>> >> > q=aaa
>> >> > q=aaa bbb
>> >> > q=aaa ccc
>> >> > q=aaa fff
>> >> >
>> >> > Would Not Match:
>> >> > q=eee
>> >> > q=fff
>> >> > q=eee fff
>> >> >
>> >> > --
>> >> > *This message is intended only for the use of the individual or
>> entity to
>> >> > which it is addressed and may contain information that is privileged,
>> >> > confidential and exempt from disclosure under applicable law. If you
>> have
>> >> > received this message in error, you are hereby notified that any use,
>> >> > dissemination, distribution or copying of this message is prohibited.
>> If

Re: Partial Match with DF

2017-03-16 Thread Charlie Hull
Hi Mark,

OpenSource Connections' excellent www.splainer.io might also be useful to
help you break down exactly what your query is doing.

Cheers

Charlie

P.S. planning a blog soon listing 'useful Solr tools'

On 16 March 2017 at 14:39, Mark Johnson 
wrote:

> You're right! The fields I'm searching are all "string" type. I switched to
> "text_en" and now it's working exactly as I need it to! I'll do some
> research to see if "text_en" or another "text" type field is best for our
> needs.
>
> Also, those debug options are amazing! They'll help tremendously in the
> future.
>
> Thank you much!
>
> On Thu, Mar 16, 2017 at 10:02 AM, Erick Erickson 
> wrote:
>
> > My guess: Your analysis chain for the fields is different, i.e. they
> > have a different fieldType. In particular, watch out for the "string"
> > type, people are often confused about it. It does _not_ break input
> > into tokens, you need a text-based field type, text_en is one example
> > that is usually in the configs by default.
> >
> > Two tools that'll help you enormously:
> >
> > admin UI>>select core (or collection) from the drop-down>>analysis
> > That shows you exactly how Solr/Lucene break up text at query and index
> > time
> >
> > add &debug=query to the URL. That'll show you how the query was parsed.
> >
> > Best,
> > Erick
> >
> > On Thu, Mar 16, 2017 at 6:52 AM, Mark Johnson
> >  wrote:
> > > Oh, great! Thank you!
> > >
> > > So if I switch over to eDisMax I'd specify the fields to query via the
> > "qf"
> > > parameter, right? That seems to have the same result (only matches
> when I
> > > specify the exact phrase in the field, not just certain words from it).
> > >
> > > On Thu, Mar 16, 2017 at 9:33 AM, Alexandre Rafalovitch <
> > arafa...@gmail.com>
> > > wrote:
> > >
> > >> df is default field - you can only give one. To search over multiple
> > >> fields, you switch to eDisMax query parser and fl parameter.
> > >>
> > >> Then, the question will be what type definition your fields have. When
> > you
> > >> search text field, you are using its definition because of copyField.
> > Your
> > >> original fields may be strings.
> > >>
> > >> Remember to reload core and reminded when you change definitions.
> > >>
> > >> Regards,
> > >>Alex
> > >>
> > >>
> > >> On 16 Mar 2017 9:15 AM, "Mark Johnson"  >
> > >> wrote:
> > >>
> > >> > Forgive me if I'm missing something obvious -- I'm new to Solr, but
> I
> > >> can't
> > >> > seem to find an explanation for the behavior I'm seeing.
> > >> >
> > >> > If I have a document that looks like this:
> > >> > {
> > >> > field1: "aaa bbb",
> > >> > field2: "ccc ddd",
> > >> > field3: "eee fff"
> > >> > }
> > >> >
> > >> > And I do a search where "q" is "aaa ccc", I get the document in the
> > >> > results. This is because (please correct me if I'm wrong) the
> default
> > >> "df"
> > >> > is set to the "_text_" field, which contains the text values from
> all
> > >> > fields.
> > >> >
> > >> > However, if I do a search where "df" is "field1" and "field2" and
> "q"
> > is
> > >> > "aaa ccc" (words from field1 and field2) I get no results.
> > >> >
> > >> > In a simpler example, if I do a search where "df" is "field1" and
> "q"
> > is
> > >> > "aaa" (a word from field1) I still get no results.
> > >> >
> > >> > If I do a search where "df" is "field1" and "q" is "aaa bbb" (the
> full
> > >> > value of field1) then I get the document in the results.
> > >> >
> > >> > So I'm concluding that when using "df" to specify which fields to
> > search
> > >> > then only an exact match on the full field value will return a
> > document.
> > >> >
> > >> > Is that a correct conclusion? Is there another way to specify which
> > >> fields
> > >> > to search without requiring an exact match? The results I'd like to
> > >> achieve
> > >> > are:
> > >> >
> > >> > Would Match:
> > >> > q=aaa
> > >> > q=aaa bbb
> > >> > q=aaa ccc
> > >> > q=aaa fff
> > >> >
> > >> > Would Not Match:
> > >> > q=eee
> > >> > q=fff
> > >> > q=eee fff
> > >> >
> > >> > --
> > >> > *This message is intended only for the use of the individual or
> > entity to
> > >> > which it is addressed and may contain information that is
> privileged,
> > >> > confidential and exempt from disclosure under applicable law. If you
> > have
> > >> > received this message in error, you are hereby notified that any
> use,
> > >> > dissemination, distribution or copying of this message is
> prohibited.
> > If
> > >> > you have received this communication in error, please notify the
> > sender
> > >> > immediately and destroy the transmitted information.*
> > >> >
> > >>
> > >
> > >
> > >
> > > --
> > >
> > > Best Regards,
> > >
> > > *Mark Johnson* | .NET Software Engineer
> > >
> > > Office: 603-392-7017
> > >
> > > Emerson Ecologics, LLC | 1230 Elm Street | Suite 301 | Manchester NH |
> > 03101
> > >
> > >   
> > >
> > > *Supporting The Practice Of Healthy Living*
> > >
> > > 

question about function query

2017-03-16 Thread Bernd Fehling
I'm testing some function queries and have some questions.

original queries:
1. q=collection:ftmuenster&fl=*
--> numFound="6029"

2. q=collection:ftmuenster+AND+-description:*&fl=*
--> numFound="1877"

3. q=collection:ftmuenster+AND+description:*&fl=*
--> numFound="4152"

This looks good.

But now with function query:

q={!func}exists(description)&fq=collection:ftmuenster&fl=*
--> numFound="6029"

I was hoping to get numFound=4152, why not?

I also tried:
q={!func}exists(description)&fq=collection:ftmuenster&q.op=AND&fl=*
--> numFound="6029"

What are the function queries equivalent to queries 2. and 3. above?
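
(One untested idea, in case it is relevant: since a {!func} query apparently
matches every document and only uses the function for scoring, a filter built
on the frange parser might be needed to actually restrict the result set,
something like

q=collection:ftmuenster&fq={!frange l=1}if(exists(description),1,0)&fl=*

but I have not verified this against the index above.)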

Regards
Bernd



Re: Get handler not working

2017-03-16 Thread Chris Ulicny
I think I've figured out where the issue is, at least superficially. It comes
down to which parameter is used to define the field to route on. I set up two
collections to use the same configset but slightly altered calls to the
Collections API.

action=CREATE&name=CollectionOne&numShards=2&router.name=compositeId&*router.field*=iqroutingkey&maxShardsPerNode=2&collection.configName=RoutingTest
action=CREATE&name=CollectionTwo&numShards=2&router.name=compositeId&*routerField*=iqroutingkey&maxShardsPerNode=2&collection.configName=RoutingTest

The get handler returns null for CollectionOne (even with a _route_
parameter), but it will return the document for CollectionTwo in any case.
I will gather and post the trace logs when I get a chance.
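
One quick way to double-check what each collection actually got (a sketch
only, using the names from the CREATE calls above) is the Collections API
CLUSTERSTATUS call, e.g.

/admin/collections?action=CLUSTERSTATUS&collection=CollectionOne&wt=json

which should show the collection's router name and, if one was registered,
the router field.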



On Thu, Mar 16, 2017 at 10:52 AM Yonik Seeley  wrote:

> Ah, yeah, if you're using a different route field it's highly likely
> that's the issue.
> I was always against that "feature", and this thread demonstrates part
> of the problem (complicating clients, including us human clients
> trying to make sense of what's going on).
>
> -Yonik
>
>
> On Thu, Mar 16, 2017 at 10:31 AM, Chris Ulicny  wrote:
> > Speaking of routing, I realized I completely forgot to add the routing
> > setup to the test cloud, so it probably has something to do with the
> issue.
> > I'll add that in and report back.
> >
> > So the routing and uniqueKey setup is as follows:
> >
> > Schema setup:
> > iqdocid  > multiValued="false" indexed="true" required="true" stored="true"/>  > name="iqdocid" type="string" multiValued="false" indexed="true" required=
> > "true" stored="true"/>
> >
> > I don't think it's mentioned in the documentation about using routerField
> > for the compositeId router, but based on the resolution of SOLR-5017
> > , we decided to use the
> > compositeId router with routerField set to 'iqroutingkey' which is using
> > the "!" notation. In general, the iqroutingkey field is of the form:
> > !!
> >
> > Unless I misunderstood what was changed with that patch, that form should
> > still route appropriately, and it seems that it has distributed the
> > documents appropriately from our basic testing.
> >
> > On Thu, Mar 16, 2017 at 9:42 AM David Hastings <
> hastings.recurs...@gmail.com>
> > wrote:
> >
> > i still would like to see an experiment where you change the field to id
> > instead of iqdocid,
> >
> > On Thu, Mar 16, 2017 at 9:33 AM, Yonik Seeley  wrote:
> >
> >> Something to do with routing perhaps? (the mapping of ids to shards,
> >> by default is based on hashes of the id)
> >> -Yonik
> >>
> >>
> >> On Thu, Mar 16, 2017 at 9:16 AM, Chris Ulicny  wrote:
> >> > iqdocid is already set to be the uniqueKey value.
> >> >
> >> > I tried reindexing a few documents back into the problematic cloud and
> > am
> >> > getting the same behavior of no document found for get handler.
> >> >
> >> > I've also done some testing on standalone instances as well as some
> > quick
> >> > cloud setups (with embedded zk), and I cannot seem to replicate the
> >> > problem. For each test, I used the exact same configset that is
> causing
> >> the
> >> > issue for us and indexed a document from that instance as well. I can
> >> > provide more details if that would be useful in anyway.
> >> >
> >> > Standalone instance worked
> >> > Cloud mode worked regardless of the use of the security plugin
> >> > Cloud mode worked regardless of explicit get handler definition
> >> > Cloud mode consistently worked with explicitly defining the get
> handler,
> >> > then removing it and reloading the collection
> >> >
> >> > The only differences that I know of between the tests and the
> > problematic
> >> > cloud is that solr is running as a different user and using an
> external
> >> > zookeeper ensemble. The running user has ownership of the solr
> >> > installation, log, and data directories.
> >> >
> >> > I'm going to keep trying different setups to see if I can replicate
> the
> >> > issue, but if anyone has any ideas on what direction might make the
> most
> >> > sense, please let me know.
> >> >
> >> > Thanks again
> >> >
> >> > On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson <
> erickerick...@gmail.com>
> >> > wrote:
> >> >
> >> > Wait... Is iqdocid set to the  in your schema? That might
> >> > be the missing thing.
> >> >
> >> >
> >> >
> >> > On Wed, Mar 15, 2017 at 11:20 AM, Chris Ulicny 
> wrote:
> >> >> Unless the behavior's changed on the way to version 6.3.0, the get
> >> handler
> >> >> used to use whatever field is set to be the uniqueKey. We have
> >> > successfully
> >> >> been using get on a 4.9.0 standalone core with no explicit "id" field
> >> >> defined by passing in the value for the uniqueKey field to the get
> >> > handler.
> >> >> We tend to have a bunch of id fields floating around from different
> >> >> sources, so we avoid keeping any of them named as "id"
> >> >>
> >> >> iqdocid is just a basic string type
> >> >>  indexed="true"
> >> >> require

Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Aman Deep Singh
Already checked that. I am sending screenshots of various scenarios.

On Thu, Mar 16, 2017 at 7:46 PM Alexandre Rafalovitch 
wrote:

> Sanity check. Is your 'df' pointing at the field you think it is
> pointing at? It really does look like all tokens were eaten and
> nothing was left. But you should have seen that in the Analysis screen
> too, if you have the right field.
>
> Try adding echoParams=all to your request to see the full final
> parameter list. Maybe some parameters in initParams sections override
> your assumed config.
>
> Regards,
>Alex.
> 
> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>
>
> On 16 March 2017 at 08:30, Aman Deep Singh 
> wrote:
> > Hi,
> >
> > Recently I migrated from solr 4 to 6
> > IN solr 4 shinglefilterfactory is working correctly
> > my configration  i
> >
> >  > positionIncrementGap="100">
> > 
> >  
> >   > maxShingleSize="5"
> >  outputUnigrams="false"
> outputUnigramsIfNoShingles="false" />
> >   
> > 
> > 
> >   
> >   > maxShingleSize="5"
> >  outputUnigrams="false"
> outputUnigramsIfNoShingles="false" />
> >   
> >   
> > 
> >   
> >
> >
> >
> > But after updating to solr 6 shingles is not working ,schema is as below,
> >
> >  > positionIncrementGap="100">
> > 
> >  
> >   > maxShingleSize="5"
> >  outputUnigrams="false"
> outputUnigramsIfNoShingles="false" />
> >   
> > 
> > 
> >   
> >   > maxShingleSize="5"
> >  outputUnigrams="false"
> outputUnigramsIfNoShingles="false" />
> >   
> > 
> >   
> >
> > Although in the Analysis tab is was showing proper shingle result but
> when
> > using in the queryParser it was not giving proper results
> >
> > my sample hit is
> >
> >
> http://localhost:8983/solr/shingel_test/select?q=one%20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle
> >
> > it create the parsed query as
> >
> > one plus one
> > one plus one
> > (+())/no_coord
> > +()
> > 
> > ExtendedDismaxQParser
>


Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Alexandre Rafalovitch
Images do not come through.

But I was wrong too. You use eDismax and pass "cust_shingle" in, so
the "df" value is irrelevant.

You definitely reloaded the core after changing definitions?

http://www.solr-start.com/ - Resources for Solr users, new and experienced


On 16 March 2017 at 12:37, Aman Deep Singh  wrote:
> Already check that i am sending sceenshots of various senarios
>
>
> On Thu, Mar 16, 2017 at 7:46 PM Alexandre Rafalovitch 
> wrote:
>>
>> Sanity check. Is your 'df' pointing at the field you think it is
>> pointing at? It really does look like all tokens were eaten and
>> nothing was left. But you should have seen that in the Analysis screen
>> too, if you have the right field.
>>
>> Try adding echoParams=all to your request to see the full final
>> parameter list. Maybe some parameters in initParams sections override
>> your assumed config.
>>
>> Regards,
>>Alex.
>> 
>> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>>
>>
>> On 16 March 2017 at 08:30, Aman Deep Singh 
>> wrote:
>> > Hi,
>> >
>> > Recently I migrated from solr 4 to 6
>> > IN solr 4 shinglefilterfactory is working correctly
>> > my configration  i
>> >
>> > > > positionIncrementGap="100">
>> > 
>> >  
>> >  > > maxShingleSize="5"
>> >  outputUnigrams="false"
>> > outputUnigramsIfNoShingles="false" />
>> >   
>> > 
>> > 
>> >   
>> >  > > maxShingleSize="5"
>> >  outputUnigrams="false"
>> > outputUnigramsIfNoShingles="false" />
>> >   
>> >   
>> > 
>> >   
>> >
>> >
>> >
>> > But after updating to solr 6 shingles is not working ,schema is as
>> > below,
>> >
>> > > > positionIncrementGap="100">
>> > 
>> >  
>> >  > > maxShingleSize="5"
>> >  outputUnigrams="false"
>> > outputUnigramsIfNoShingles="false" />
>> >   
>> > 
>> > 
>> >   
>> >  > > maxShingleSize="5"
>> >  outputUnigrams="false"
>> > outputUnigramsIfNoShingles="false" />
>> >   
>> > 
>> >   
>> >
>> > Although in the Analysis tab is was showing proper shingle result but
>> > when
>> > using in the queryParser it was not giving proper results
>> >
>> > my sample hit is
>> >
>> >
>> > http://localhost:8983/solr/shingel_test/select?q=one%20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle
>> >
>> > it create the parsed query as
>> >
>> > one plus one
>> > one plus one
>> > (+())/no_coord
>> > +()
>> > 
>> > ExtendedDismaxQParser


Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Aman Deep Singh
Yes I have reloaded the core after config changes

On 16-Mar-2017 10:28 PM, "Alexandre Rafalovitch"  wrote:

Images do not come through.

But I was wrong too. You use eDismax and pass "cust_shingle" in, so
the "df" value is irrelevant.

You definitely reloaded the core after changing definitions?

http://www.solr-start.com/ - Resources for Solr users, new and experienced


On 16 March 2017 at 12:37, Aman Deep Singh 
wrote:
> Already check that i am sending sceenshots of various senarios
>
>
> On Thu, Mar 16, 2017 at 7:46 PM Alexandre Rafalovitch 
> wrote:
>>
>> Sanity check. Is your 'df' pointing at the field you think it is
>> pointing at? It really does look like all tokens were eaten and
>> nothing was left. But you should have seen that in the Analysis screen
>> too, if you have the right field.
>>
>> Try adding echoParams=all to your request to see the full final
>> parameter list. Maybe some parameters in initParams sections override
>> your assumed config.
>>
>> Regards,
>>Alex.
>> 
>> http://www.solr-start.com/ - Resources for Solr users, new and
experienced
>>
>>
>> On 16 March 2017 at 08:30, Aman Deep Singh 
>> wrote:
>> > Hi,
>> >
>> > Recently I migrated from solr 4 to 6
>> > IN solr 4 shinglefilterfactory is working correctly
>> > my configration  i
>> >
>> > > > positionIncrementGap="100">
>> > 
>> >  
>> >  > > maxShingleSize="5"
>> >  outputUnigrams="false"
>> > outputUnigramsIfNoShingles="false" />
>> >   
>> > 
>> > 
>> >   
>> >  > > maxShingleSize="5"
>> >  outputUnigrams="false"
>> > outputUnigramsIfNoShingles="false" />
>> >   
>> >   
>> > 
>> >   
>> >
>> >
>> >
>> > But after updating to solr 6 shingles is not working ,schema is as
>> > below,
>> >
>> > > > positionIncrementGap="100">
>> > 
>> >  
>> >  > > maxShingleSize="5"
>> >  outputUnigrams="false"
>> > outputUnigramsIfNoShingles="false" />
>> >   
>> > 
>> > 
>> >   
>> >  > > maxShingleSize="5"
>> >  outputUnigrams="false"
>> > outputUnigramsIfNoShingles="false" />
>> >   
>> > 
>> >   
>> >
>> > Although in the Analysis tab is was showing proper shingle result but
>> > when
>> > using in the queryParser it was not giving proper results
>> >
>> > my sample hit is
>> >
>> >
>> > http://localhost:8983/solr/shingel_test/select?q=one%
20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle
>> >
>> > it create the parsed query as
>> >
>> > one plus one
>> > one plus one
>> > (+())/no_coord
>> > +()
>> > 
>> > ExtendedDismaxQParser


Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Aman Deep Singh
For the images, the Dropbox URL is:
https://www.dropbox.com/sh/6dy6a8ajabjtxrt/AAAoxhZQe2vp3sTl3Av71_eHa?dl=0


On Thu, Mar 16, 2017 at 10:29 PM Aman Deep Singh 
wrote:

> Yes I have reloaded the core after config changes
>
>
> On 16-Mar-2017 10:28 PM, "Alexandre Rafalovitch" 
> wrote:
>
> Images do not come through.
>
> But I was wrong too. You use eDismax and pass "cust_shingle" in, so
> the "df" value is irrelevant.
>
> You definitely reloaded the core after changing definitions?
> 
> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>
>
> On 16 March 2017 at 12:37, Aman Deep Singh 
> wrote:
> > Already check that i am sending sceenshots of various senarios
> >
> >
> > On Thu, Mar 16, 2017 at 7:46 PM Alexandre Rafalovitch <
> arafa...@gmail.com>
> > wrote:
> >>
> >> Sanity check. Is your 'df' pointing at the field you think it is
> >> pointing at? It really does look like all tokens were eaten and
> >> nothing was left. But you should have seen that in the Analysis screen
> >> too, if you have the right field.
> >>
> >> Try adding echoParams=all to your request to see the full final
> >> parameter list. Maybe some parameters in initParams sections override
> >> your assumed config.
> >>
> >> Regards,
> >>Alex.
> >> 
> >> http://www.solr-start.com/ - Resources for Solr users, new and
> experienced
> >>
> >>
> >> On 16 March 2017 at 08:30, Aman Deep Singh 
> >> wrote:
> >> > Hi,
> >> >
> >> > Recently I migrated from solr 4 to 6
> >> > IN solr 4 shinglefilterfactory is working correctly
> >> > my configration  i
> >> >
> >> >  >> > positionIncrementGap="100">
> >> > 
> >> >  
> >> >   >> > maxShingleSize="5"
> >> >  outputUnigrams="false"
> >> > outputUnigramsIfNoShingles="false" />
> >> >   
> >> > 
> >> > 
> >> >   
> >> >   >> > maxShingleSize="5"
> >> >  outputUnigrams="false"
> >> > outputUnigramsIfNoShingles="false" />
> >> >   
> >> >   
> >> > 
> >> >   
> >> >
> >> >
> >> >
> >> > But after updating to solr 6 shingles is not working ,schema is as
> >> > below,
> >> >
> >> >  >> > positionIncrementGap="100">
> >> > 
> >> >  
> >> >   >> > maxShingleSize="5"
> >> >  outputUnigrams="false"
> >> > outputUnigramsIfNoShingles="false" />
> >> >   
> >> > 
> >> > 
> >> >   
> >> >   >> > maxShingleSize="5"
> >> >  outputUnigrams="false"
> >> > outputUnigramsIfNoShingles="false" />
> >> >   
> >> > 
> >> >   
> >> >
> >> > Although in the Analysis tab is was showing proper shingle result but
> >> > when
> >> > using in the queryParser it was not giving proper results
> >> >
> >> > my sample hit is
> >> >
> >> >
> >> >
> http://localhost:8983/solr/shingel_test/select?q=one%20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle
> >> >
> >> > it create the parsed query as
> >> >
> >> > one plus one
> >> > one plus one
> >> > (+())/no_coord
> >> > +()
> >> > 
> >> > ExtendedDismaxQParser
>
>
>


Re: Partial Match with DF

2017-03-16 Thread Mark Johnson
Wow, that's really powerful! Thank you!

On Thu, Mar 16, 2017 at 11:19 AM, Charlie Hull  wrote:

> Hi Mark,
>
> Open Source Connection's excellent www.splainer.io might also be useful to
> help you break down exactly what your query is doing.
>
> Cheers
>
> Charlie
>
> P.S. planning a blog soon listing 'useful Solr tools'
>
> On 16 March 2017 at 14:39, Mark Johnson 
> wrote:
>
> > You're right! The fields I'm searching are all "string" type. I switched
> to
> > "text_en" and now it's working exactly as I need it to! I'll do some
> > research to see if "text_en" or another "text" type field is best for our
> > needs.
> >
> > Also, those debug options are amazing! They'll help tremendously in the
> > future.
> >
> > Thank you much!
> >
> > On Thu, Mar 16, 2017 at 10:02 AM, Erick Erickson <
> erickerick...@gmail.com>
> > wrote:
> >
> > > My guess: Your analysis chain for the fields is different, i.e. they
> > > have a different fieldType. In particular, watch out for the "string"
> > > type, people are often confused about it. It does _not_ break input
> > > into tokens, you need a text-based field type, text_en is one example
> > > that is usually in the configs by default.
> > >
> > > Two tools that'll help you enormously:
> > >
> > > admin UI>>select core (or collection) from the drop-down>>analysis
> > > That shows you exactly how Solr/Lucene break up text at query and index
> > > time
> > >
> > > add &debug=query to the URL. That'll show you how the query was parsed.
> > >
> > > Best,
> > > Erick
> > >
> > > On Thu, Mar 16, 2017 at 6:52 AM, Mark Johnson
> > >  wrote:
> > > > Oh, great! Thank you!
> > > >
> > > > So if I switch over to eDisMax I'd specify the fields to query via
> the
> > > "qf"
> > > > parameter, right? That seems to have the same result (only matches
> > when I
> > > > specify the exact phrase in the field, not just certain words from
> it).
> > > >
> > > > On Thu, Mar 16, 2017 at 9:33 AM, Alexandre Rafalovitch <
> > > arafa...@gmail.com>
> > > > wrote:
> > > >
> > > >> df is default field - you can only give one. To search over multiple
> > > >> fields, you switch to eDisMax query parser and fl parameter.
> > > >>
> > > >> Then, the question will be what type definition your fields have.
> When
> > > you
> > > >> search text field, you are using its definition because of
> copyField.
> > > Your
> > > >> original fields may be strings.
> > > >>
> > > >> Remember to reload core and reminded when you change definitions.
> > > >>
> > > >> Regards,
> > > >>Alex
> > > >>
> > > >>
> > > >> On 16 Mar 2017 9:15 AM, "Mark Johnson" <
> mjohn...@emersonecologics.com
> > >
> > > >> wrote:
> > > >>
> > > >> > Forgive me if I'm missing something obvious -- I'm new to Solr,
> but
> > I
> > > >> can't
> > > >> > seem to find an explanation for the behavior I'm seeing.
> > > >> >
> > > >> > If I have a document that looks like this:
> > > >> > {
> > > >> > field1: "aaa bbb",
> > > >> > field2: "ccc ddd",
> > > >> > field3: "eee fff"
> > > >> > }
> > > >> >
> > > >> > And I do a search where "q" is "aaa ccc", I get the document in
> the
> > > >> > results. This is because (please correct me if I'm wrong) the
> > default
> > > >> "df"
> > > >> > is set to the "_text_" field, which contains the text values from
> > all
> > > >> > fields.
> > > >> >
> > > >> > However, if I do a search where "df" is "field1" and "field2" and
> > "q"
> > > is
> > > >> > "aaa ccc" (words from field1 and field2) I get no results.
> > > >> >
> > > >> > In a simpler example, if I do a search where "df" is "field1" and
> > "q"
> > > is
> > > >> > "aaa" (a word from field1) I still get no results.
> > > >> >
> > > >> > If I do a search where "df" is "field1" and "q" is "aaa bbb" (the
> > full
> > > >> > value of field1) then I get the document in the results.
> > > >> >
> > > >> > So I'm concluding that when using "df" to specify which fields to
> > > search
> > > >> > then only an exact match on the full field value will return a
> > > document.
> > > >> >
> > > >> > Is that a correct conclusion? Is there another way to specify
> which
> > > >> fields
> > > >> > to search without requiring an exact match? The results I'd like
> to
> > > >> achieve
> > > >> > are:
> > > >> >
> > > >> > Would Match:
> > > >> > q=aaa
> > > >> > q=aaa bbb
> > > >> > q=aaa ccc
> > > >> > q=aaa fff
> > > >> >
> > > >> > Would Not Match:
> > > >> > q=eee
> > > >> > q=fff
> > > >> > q=eee fff
> > > >> >
> > > >> > --
> > > >> > *This message is intended only for the use of the individual or
> > > entity to
> > > >> > which it is addressed and may contain information that is
> > privileged,
> > > >> > confidential and exempt from disclosure under applicable law. If
> you
> > > have
> > > >> > received this message in error, you are hereby notified that any
> > use,
> > > >> > dissemination, distribution or copying of this message is
> > prohibited.
> > > If
> > > >> > you have received this communication in error, please notify the
> >

Re: Partial Match with DF

2017-03-16 Thread Mark Johnson
Thank you for the heads up! I think in some cases we will want to strip out
punctuation but in others we might need it (for example, "liquid courage."
should tokenize to "liquid" and "courage", while "1.5 oz liquid courage"
should tokenize to "1.5", "oz", "liquid" and "courage").

I'll have to do some experimenting to see which one will work best for us.
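
If it helps, a quick sketch to compare in the Analysis screen (the type name
is a placeholder): the standard tokenizer follows the Unicode word-break
rules, so it should keep a decimal like "1.5" as one token while dropping the
trailing period from "courage.".

<fieldType name="text_std_example" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>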

On Thu, Mar 16, 2017 at 11:09 AM, Erick Erickson 
wrote:

> Yeah, they've saved me on numerous occasions, glad to see they helped.
>
> One caution BTW when you start changing fieldTypes is you have to
> watch punctuation. StandardTokenizerFactory won't pass through most
> punctuation.
>
> WordDelimiterFilterFactory breaks on non alpha-num, including
> punctuation effectively throwing it out.
>
> But WhitespaceTokenizer does just that and spits out punctuation as
> part of tokens, i.e.
> "my words." (note period) is broken up as "my" "words." and wouldn't
> match a search on "word".
>
> One other note, there's a tokenizer/filter for a zillion different
> cases, you can go wild. Here's a partial
> list:https://cwiki.apache.org/confluence/display/solr/
> Understanding+Analyzers%2C+Tokenizers%2C+and+Filters,
> see the "Tokenizer", "Filters" and CharFilters" links. There are 12
> tokenizers listed and 40 or so filters... and the list is not
> guaranteed to be complete.
>
> On Thu, Mar 16, 2017 at 7:39 AM, Mark Johnson
>  wrote:
> > You're right! The fields I'm searching are all "string" type. I switched
> to
> > "text_en" and now it's working exactly as I need it to! I'll do some
> > research to see if "text_en" or another "text" type field is best for our
> > needs.
> >
> > Also, those debug options are amazing! They'll help tremendously in the
> > future.
> >
> > Thank you much!
> >
> > On Thu, Mar 16, 2017 at 10:02 AM, Erick Erickson <
> erickerick...@gmail.com>
> > wrote:
> >
> >> My guess: Your analysis chain for the fields is different, i.e. they
> >> have a different fieldType. In particular, watch out for the "string"
> >> type, people are often confused about it. It does _not_ break input
> >> into tokens, you need a text-based field type, text_en is one example
> >> that is usually in the configs by default.
> >>
> >> Two tools that'll help you enormously:
> >>
> >> admin UI>>select core (or collection) from the drop-down>>analysis
> >> That shows you exactly how Solr/Lucene break up text at query and index
> >> time
> >>
> >> add &debug=query to the URL. That'll show you how the query was parsed.
> >>
> >> Best,
> >> Erick
> >>
> >> On Thu, Mar 16, 2017 at 6:52 AM, Mark Johnson
> >>  wrote:
> >> > Oh, great! Thank you!
> >> >
> >> > So if I switch over to eDisMax I'd specify the fields to query via the
> >> "qf"
> >> > parameter, right? That seems to have the same result (only matches
> when I
> >> > specify the exact phrase in the field, not just certain words from
> it).
> >> >
> >> > On Thu, Mar 16, 2017 at 9:33 AM, Alexandre Rafalovitch <
> >> arafa...@gmail.com>
> >> > wrote:
> >> >
> >> >> df is default field - you can only give one. To search over multiple
> >> >> fields, you switch to eDisMax query parser and fl parameter.
> >> >>
> >> >> Then, the question will be what type definition your fields have.
> When
> >> you
> >> >> search text field, you are using its definition because of copyField.
> >> Your
> >> >> original fields may be strings.
> >> >>
> >> >> Remember to reload core and reminded when you change definitions.
> >> >>
> >> >> Regards,
> >> >>Alex
> >> >>
> >> >>
> >> >> On 16 Mar 2017 9:15 AM, "Mark Johnson" <
> mjohn...@emersonecologics.com>
> >> >> wrote:
> >> >>
> >> >> > Forgive me if I'm missing something obvious -- I'm new to Solr,
> but I
> >> >> can't
> >> >> > seem to find an explanation for the behavior I'm seeing.
> >> >> >
> >> >> > If I have a document that looks like this:
> >> >> > {
> >> >> > field1: "aaa bbb",
> >> >> > field2: "ccc ddd",
> >> >> > field3: "eee fff"
> >> >> > }
> >> >> >
> >> >> > And I do a search where "q" is "aaa ccc", I get the document in the
> >> >> > results. This is because (please correct me if I'm wrong) the
> default
> >> >> "df"
> >> >> > is set to the "_text_" field, which contains the text values from
> all
> >> >> > fields.
> >> >> >
> >> >> > However, if I do a search where "df" is "field1" and "field2" and
> "q"
> >> is
> >> >> > "aaa ccc" (words from field1 and field2) I get no results.
> >> >> >
> >> >> > In a simpler example, if I do a search where "df" is "field1" and
> "q"
> >> is
> >> >> > "aaa" (a word from field1) I still get no results.
> >> >> >
> >> >> > If I do a search where "df" is "field1" and "q" is "aaa bbb" (the
> full
> >> >> > value of field1) then I get the document in the results.
> >> >> >
> >> >> > So I'm concluding that when using "df" to specify which fields to
> >> search
> >> >> > then only an exact match on the full field value will return a
> >> document.
> >> >> >
> >> >> > Is that a correct conclusion? Is there another way to spe

Exact match works only for some of the strings

2017-03-16 Thread Gintautas Sulskus
Hi All,

I am trying to figure out why Solr returns an empty result when searching
for the following query:

nameExact:"Guardian EU-referendum"


The field definition:




The type definition:













The analysis, as expected, matches the query parameter against the stored
value. Please take a look at the attached image. I am using
KeywordTokenizer and LowerCaseFilter.
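
In essence the type is just KeywordTokenizer plus LowerCaseFilter, i.e.
something along these lines (simplified; the type name here is a placeholder):

<fieldType name="string_lowercase" class="solr.TextField">
  <analyzer>
    <!-- keep the whole field value as a single token, then lowercase it -->
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>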
What is more strange, the query below works just fine:

nameExact:"Guardian US"


Could you please provide me with some clues on what could be wrong?

Thanks,
Gintas


Re: Solr 6.3 will not stay connected to zookeeper

2017-03-16 Thread Walter Underwood
Still broken with zk 3.4.6. Even the “solr zk” commands can’t fetch the file. 
It is there. I can list it.

$ bin/solr zk ls /configs/tutors

Connecting to ZooKeeper at 
zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 ...
Getting listing for Zookeeper node /configs/tutors from ZooKeeper at 
zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 recurse: false
enumsConfig.xml
solrconfig.xml
admin-extra.menu-top.html
tutors-subject-names.txt
schema.xml
velocity
xslt
admin-extra.html
admin-extra.menu-bottom.html
$ bin/solr zk cp zk:/configs/tutors/solrconfig.xml .

Connecting to ZooKeeper at 
zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 ...
Copying from 'zk:/configs/tutors/solrconfig.xml' to '.'. ZooKeeper at 
zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
WARN  - 2017-03-16 11:27:39.418; 
org.apache.solr.common.cloud.ConnectionManager; Watcher 
org.apache.solr.common.cloud.ConnectionManager@1a74e9d1 name: 
ZooKeeperConnection 
Watcher:zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 got event WatchedEvent state:Disconnected type:None path:null path: null type: 
None
WARN  - 2017-03-16 11:27:39.420; 
org.apache.solr.common.cloud.ConnectionManager; zkClient has disconnected
WARN  - 2017-03-16 11:28:00.040; 
org.apache.solr.common.cloud.ConnectionManager; Watcher 
org.apache.solr.common.cloud.ConnectionManager@1a74e9d1 name: 
ZooKeeperConnection 
Watcher:zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 got event WatchedEvent state:Disconnected type:None path:null path: null type: 
None
WARN  - 2017-03-16 11:28:00.040; 
org.apache.solr.common.cloud.ConnectionManager; zkClient has disconnected
WARN  - 2017-03-16 11:28:20.620; 
org.apache.solr.common.cloud.ConnectionManager; Watcher 
org.apache.solr.common.cloud.ConnectionManager@1a74e9d1 name: 
ZooKeeperConnection 
Watcher:zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 got event WatchedEvent state:Disconnected type:None path:null path: null type: 
None
WARN  - 2017-03-16 11:28:20.620; 
org.apache.solr.common.cloud.ConnectionManager; zkClient has disconnected
WARN  - 2017-03-16 11:28:41.366; 
org.apache.solr.common.cloud.ConnectionManager; Watcher 
org.apache.solr.common.cloud.ConnectionManager@1a74e9d1 name: 
ZooKeeperConnection 
Watcher:zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 got event WatchedEvent state:Disconnected type:None path:null path: null type: 
None
WARN  - 2017-03-16 11:28:41.366; 
org.apache.solr.common.cloud.ConnectionManager; zkClient has disconnected
WARN  - 2017-03-16 11:29:01.485; 
org.apache.solr.common.cloud.ConnectionManager; Watcher 
org.apache.solr.common.cloud.ConnectionManager@1a74e9d1 name: 
ZooKeeperConnection 
Watcher:zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 got event WatchedEvent state:Disconnected type:None path:null path: null type: 
None
WARN  - 2017-03-16 11:29:01.486; 
org.apache.solr.common.cloud.ConnectionManager; zkClient has disconnected
WARN  - 2017-03-16 11:29:22.316; 
org.apache.solr.common.cloud.ConnectionManager; Watcher 
org.apache.solr.common.cloud.ConnectionManager@1a74e9d1 name: 
ZooKeeperConnection 
Watcher:zookeeper01.prod2.cloud.cheggnet.com:2181,zookeeper02.prod2.cloud.cheggnet.com:2181,zookeeper03.prod2.cloud.cheggnet.com,zookeeper04.prod2.cloud.cheggnet.com,zookeeper05.prod2.cloud.cheggnet.com:2181/solr-cloud
 got event WatchedEvent state:Disconnected type:None path:null path: null type: 
None
WARN  - 2017-03-16 11:29:22.317; 
org.apache.solr.common.cloud.ConnectionManager; zkClient has disconnected
WARN  - 2017-03-16 11:29:42.912; 
org.apache.solr.commo

Re: Exact match works only for some of the strings

2017-03-16 Thread Mikhail Khludnev
You can try debugQuery to understand how this query is parsed: double quotes
are hardly compatible with KeywordTokenizer. You can also check which terms
are indexed in the Schema Browser, and there is the Analysis page in the
Solr Admin UI.
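
For example, something like this (an illustrative URL - adjust the host and
core name) shows exactly what the parser produced:

http://localhost:8983/solr/yourcore/select?q=nameExact:%22Guardian%20EU-referendum%22&debugQuery=true

The parsedquery entry in the debug section is the thing to compare against the
indexed terms.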

On Thu, Mar 16, 2017 at 8:55 PM, Gintautas Sulskus <
gintautas.suls...@gmail.com> wrote:

> Hi All,
>
> I am trying to figure out why Solr returns an empty result when searching
> for the following query:
>
> nameExact:"Guardian EU-referendum"
>
>
> The field definition:
>
> 
>
>
> The type definition:
>
>  sortMissingLast="true" omitNorms="true">
>
> 
>
> 
>
> 
>
> 
>
> 
>
> The analysis, as expected, matches the query parameter against the stored
> value. Please take a look at the attached image. I am using
> KeywordTokenizer and LowerCaseFilter.
> ​
> What is more strange, the query below works just fine:
>
> nameExact:"Guardian US"
>
>
> Could you please provide me with some clues on what could be wrong?
>
> Thanks,
> Gintas
>



-- 
Sincerely yours
Mikhail Khludnev


Re: question about function query

2017-03-16 Thread Mikhail Khludnev
Hello,
A function query matches all docs. Use {!frange} if you want to select docs
with some particular values.
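
For example (an untested sketch), the rough equivalent of query 3 above would be
something along these lines:

q=*:*&fq=collection:ftmuenster&fq={!frange l=1 u=1}if(exists(description),1,0)

and flipping the range to l=0 u=0 should give the equivalent of query 2.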

On Thu, Mar 16, 2017 at 6:08 PM, Bernd Fehling <
bernd.fehl...@uni-bielefeld.de> wrote:

> I'm testing some function queries and have some questions.
>
> original queries:
> 1. q=collection:ftmuenster&fl=*
> --> numFound="6029"
>
> 2. q=collection:ftmuenster+AND+-description:*&fl=*
> --> numFound="1877"
>
> 3. q=collection:ftmuenster+AND+description:*&fl=*
> --> numFound="4152"
>
> This looks good.
>
> But now with function query:
>
> q={!func}exists(description)&fq=collection:ftmuenster&fl=*
> --> numFound="6029"
>
> I'm was hoping to get numFound=4152, why not?
>
> I also tried:
> q={!func}exists(description)&fq=collection:ftmuenster&q.op=AND&fl=*
> --> numFound="6029"
>
> What are the function queries equivalent to queries 2. and 3. above?
>
> Regards
> Bernd
>
>


-- 
Sincerely yours
Mikhail Khludnev


Re: block join - search together at parent and childern

2017-03-16 Thread Mikhail Khludnev
Hello,

It's hard to get into the problem, but you probably want to have dismax on
the child level:
q={!parent ...}{!edismax qf='childF1 childF2' v=$chq}&chq=foo bar
It's usually broken because the child query might match parents, which is not
allowed. Thus, it can probably be solved by adding +type:child into chq.
IIRC edismax supports lucene syntax.

On Thu, Mar 16, 2017 at 4:47 PM, Jan Nekuda  wrote:

> Hi,
> I have a question for which I wasn't able to find a good solution.
> I have this structure of documents
>
> A
> |\
> | \
> B \
>  \
>   C
>\
> \
>  \
>   D
>
> Document type A has fields id_number, date_from, date_to
> Document type C  has fields first_name, surname, birthdate
> Document type D AND B has fields street_name, house_number, city
>
>
> I want to find *all parents with block join and edismax*.
> The problem is that I have found that possible is find children by parent,
> or parent by children.
> *I want to find parent by values in parent and in children*. I want to use
> edismax with all fields from all documents (id_number, date_from, date_to,
> has fields first_name, surname, birthdate,street_name, house_number, city).
> I want to write *Hynek* AND *Brojova* AND 14 and I expect that it returns
> document A because it found Hynek in surname, Brojova in street and 14 in
> house number.
> This is easy with {!parent which=type:A}
> the problem is, that I'm not able to find by condition 789 AND *Brojova*
> where 789 is id_number from type A and Brojova is Street from D.
>
> In short I need to find all parents of tree (parent and childern) in which
> are matched all the word which i send to condition
>
>
> My only solution is to make root type X. Then A will be its child. Then I
> can use {!parent which=type:X}.
> Than this will work:
>
> http://localhost:8983/solr/demo/select?q=*:*&fq={!parent
> which=type:X}brojova*&fq={!parent which=type:X}16&wt=json&
> indent=true&defType=edismax&qf=id_number date_from date_to has fields
> first_name surname birthdate street_name house_number city&stopwords=true&
> lowercaseOperators=true
>
>
> But I believe it can be solved much better.
>
> X
> |
> A
> |\
> | \
> B \
>  \
>   C
>\
> \
>  \
>   D
>
>
> Thanks for your help
> Jan
>



-- 
Sincerely yours
Mikhail Khludnev


Re: Get handler not working

2017-03-16 Thread Alexandre Rafalovitch
Well, only router.field is the valid parameter as per
https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-CREATE:CreateaCollection

In the second case the parameter is ignored and the uniqueKey is used
instead, which is different for you.

But it is the first case that fails for you, so it sounds like maybe the
/get handler somehow does not route correctly. I wonder if there is
another parameter somewhere that should be set to match the field you
use, but is not.

Regards,
   Alex.


http://www.solr-start.com/ - Resources for Solr users, new and experienced


On 16 March 2017 at 12:28, Chris Ulicny  wrote:
> I think I've figured out where the issue is, at least superficially. It's
> in what parameter is used to define the field to route on. I set up two
> collections to use the same configset but slightly altered calls to the
> Collections API.
>
> action=CREATE&name=CollectionOne&numShards=2&router.name=compositeId&*router.field*=iqroutingkey&maxShardsPerNode=2&collection.configName=RoutingTest
> action=CREATE&name=CollectionTwo&numShards=2&router.name=compositeId&*routerField*=iqroutingkey&maxShardsPerNode=2&collection.configName=RoutingTest
>
> The get handler returns null for CollectionOne (even with a _route_
> parameter), but it will return the document for CollectionTwo in any case.
> I will gather and post the trace logs when I get a chance.
>
>
>
> On Thu, Mar 16, 2017 at 10:52 AM Yonik Seeley  wrote:
>
>> Ah, yeah, if you're using a different route field it's highly likely
>> that's the issue.
>> I was always against that "feature", and this thread demonstrates part
>> of the problem (complicating clients, including us human clients
>> trying to make sense of what's going on).
>>
>> -Yonik
>>
>>
>> On Thu, Mar 16, 2017 at 10:31 AM, Chris Ulicny  wrote:
>> > Speaking of routing, I realized I completely forgot to add the routing
>> > setup to the test cloud, so it probably has something to do with the
>> issue.
>> > I'll add that in and report back.
>> >
>> > So the routing and uniqueKey setup is as follows:
>> >
>> > Schema setup:
>> > iqdocid > > multiValued="false" indexed="true" required="true" stored="true"/> > > name="iqdocid" type="string" multiValued="false" indexed="true" required=
>> > "true" stored="true"/>
>> >
>> > I don't think it's mentioned in the documentation about using routerField
>> > for the compositeId router, but based on the resolution of SOLR-5017
>> > , we decided to use the
>> > compositeId router with routerField set to 'iqroutingkey' which is using
>> > the "!" notation. In general, the iqroutingkey field is of the form:
>> > !!
>> >
>> > Unless I misunderstood what was changed with that patch, that form should
>> > still route appropriately, and it seems that it has distributed the
>> > documents appropriately from our basic testing.
>> >
>> > On Thu, Mar 16, 2017 at 9:42 AM David Hastings <
>> hastings.recurs...@gmail.com>
>> > wrote:
>> >
>> > i still would like to see an experiment where you change the field to id
>> > instead of iqdocid,
>> >
>> > On Thu, Mar 16, 2017 at 9:33 AM, Yonik Seeley  wrote:
>> >
>> >> Something to do with routing perhaps? (the mapping of ids to shards,
>> >> by default is based on hashes of the id)
>> >> -Yonik
>> >>
>> >>
>> >> On Thu, Mar 16, 2017 at 9:16 AM, Chris Ulicny  wrote:
>> >> > iqdocid is already set to be the uniqueKey value.
>> >> >
>> >> > I tried reindexing a few documents back into the problematic cloud and
>> > am
>> >> > getting the same behavior of no document found for get handler.
>> >> >
>> >> > I've also done some testing on standalone instances as well as some
>> > quick
>> >> > cloud setups (with embedded zk), and I cannot seem to replicate the
>> >> > problem. For each test, I used the exact same configset that is
>> causing
>> >> the
>> >> > issue for us and indexed a document from that instance as well. I can
>> >> > provide more details if that would be useful in anyway.
>> >> >
>> >> > Standalone instance worked
>> >> > Cloud mode worked regardless of the use of the security plugin
>> >> > Cloud mode worked regardless of explicit get handler definition
>> >> > Cloud mode consistently worked with explicitly defining the get
>> handler,
>> >> > then removing it and reloading the collection
>> >> >
>> >> > The only differences that I know of between the tests and the
>> > problematic
>> >> > cloud is that solr is running as a different user and using an
>> external
>> >> > zookeeper ensemble. The running user has ownership of the solr
>> >> > installation, log, and data directories.
>> >> >
>> >> > I'm going to keep trying different setups to see if I can replicate
>> the
>> >> > issue, but if anyone has any ideas on what direction might make the
>> most
>> >> > sense, please let me know.
>> >> >
>> >> > Thanks again
>> >> >
>> >> > On Wed, Mar 15, 2017 at 5:49 PM Erick Erickson <
>> erickerick...@gmail.com>
>> >> > wrote

Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Alexandre Rafalovitch
Oh. Try your query with quotes around the phone phrase:
q="one plus one"

My hypothesis is:
Query parser splits things on whitespace before passing it down into
analyzer chain as individual match attempts. The Analysis UI does not
take that into account and treats the whole string as phrase sent. You
say
outputUnigrams="false" outputUnigramsIfNoShingles="false"
So, every single token during the query gets ignored because there is
nothing for it to shingle with.

I am not sure why it would have worked in Solr 4.

Regards,
   Alex.

http://www.solr-start.com/ - Resources for Solr users, new and experienced


On 16 March 2017 at 13:06, Aman Deep Singh  wrote:
> For images dropbox url is
> https://www.dropbox.com/sh/6dy6a8ajabjtxrt/AAAoxhZQe2vp3sTl3Av71_eHa?dl=0
>
>
> On Thu, Mar 16, 2017 at 10:29 PM Aman Deep Singh 
> wrote:
>
>> Yes I have reloaded the core after config changes
>>
>>
>> On 16-Mar-2017 10:28 PM, "Alexandre Rafalovitch" 
>> wrote:
>>
>> Images do not come through.
>>
>> But I was wrong too. You use eDismax and pass "cust_shingle" in, so
>> the "df" value is irrelevant.
>>
>> You definitely reloaded the core after changing definitions?
>> 
>> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>>
>>
>> On 16 March 2017 at 12:37, Aman Deep Singh 
>> wrote:
>> > Already check that i am sending sceenshots of various senarios
>> >
>> >
>> > On Thu, Mar 16, 2017 at 7:46 PM Alexandre Rafalovitch <
>> arafa...@gmail.com>
>> > wrote:
>> >>
>> >> Sanity check. Is your 'df' pointing at the field you think it is
>> >> pointing at? It really does look like all tokens were eaten and
>> >> nothing was left. But you should have seen that in the Analysis screen
>> >> too, if you have the right field.
>> >>
>> >> Try adding echoParams=all to your request to see the full final
>> >> parameter list. Maybe some parameters in initParams sections override
>> >> your assumed config.
>> >>
>> >> Regards,
>> >>Alex.
>> >> 
>> >> http://www.solr-start.com/ - Resources for Solr users, new and
>> experienced
>> >>
>> >>
>> >> On 16 March 2017 at 08:30, Aman Deep Singh 
>> >> wrote:
>> >> > Hi,
>> >> >
>> >> > Recently I migrated from solr 4 to 6
>> >> > IN solr 4 shinglefilterfactory is working correctly
>> >> > my configration  i
>> >> >
>> >> > > >> > positionIncrementGap="100">
>> >> > 
>> >> >  
>> >> >  > >> > maxShingleSize="5"
>> >> >  outputUnigrams="false"
>> >> > outputUnigramsIfNoShingles="false" />
>> >> >   
>> >> > 
>> >> > 
>> >> >   
>> >> >  > >> > maxShingleSize="5"
>> >> >  outputUnigrams="false"
>> >> > outputUnigramsIfNoShingles="false" />
>> >> >   
>> >> >   
>> >> > 
>> >> >   
>> >> >
>> >> >
>> >> >
>> >> > But after updating to solr 6 shingles is not working ,schema is as
>> >> > below,
>> >> >
>> >> > > >> > positionIncrementGap="100">
>> >> > 
>> >> >  
>> >> >  > >> > maxShingleSize="5"
>> >> >  outputUnigrams="false"
>> >> > outputUnigramsIfNoShingles="false" />
>> >> >   
>> >> > 
>> >> > 
>> >> >   
>> >> >  > >> > maxShingleSize="5"
>> >> >  outputUnigrams="false"
>> >> > outputUnigramsIfNoShingles="false" />
>> >> >   
>> >> > 
>> >> >   
>> >> >
>> >> > Although in the Analysis tab is was showing proper shingle result but
>> >> > when
>> >> > using in the queryParser it was not giving proper results
>> >> >
>> >> > my sample hit is
>> >> >
>> >> >
>> >> >
>> http://localhost:8983/solr/shingel_test/select?q=one%20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle
>> >> >
>> >> > it create the parsed query as
>> >> >
>> >> > one plus one
>> >> > one plus one
>> >> > (+())/no_coord
>> >> > +()
>> >> > 
>> >> > ExtendedDismaxQParser
>>
>>
>>


Re: Exact match works only for some of the strings

2017-03-16 Thread Alvaro Cabrerizo
Hello,

I've tested on an old solr 4.3 instance and the schema and the field
definition are fine. I've also checked that only the
query nameExact:"Guardian EU-referendum" gives the result, the other one
you have commented (nameExact:"Guardian US") gives 0 hits. Maybe you
forgot to re-index after the schema modification. I mean, you indexed your
data, then changed the schema, and then started querying using the new
schema, which does not match your index.

Hope it helps.

On Thu, Mar 16, 2017 at 7:50 PM, Mikhail Khludnev  wrote:

> You can try to check debugQuery to understand how this query is parsed:
> double quotes hardly compatible with KeywordTokenizer. Also you can check
> which terms are indexed in SchemaBrowser. Also, there is Analysis page at
> Solr Admin.
>
> On Thu, Mar 16, 2017 at 8:55 PM, Gintautas Sulskus <
> gintautas.suls...@gmail.com> wrote:
>
> > Hi All,
> >
> > I am trying to figure out why Solr returns an empty result when searching
> > for the following query:
> >
> > nameExact:"Guardian EU-referendum"
> >
> >
> > The field definition:
> >
> >  />
> >
> >
> > The type definition:
> >
> >  > sortMissingLast="true" omitNorms="true">
> >
> > 
> >
> > 
> >
> > 
> >
> > 
> >
> > 
> >
> > The analysis, as expected, matches the query parameter against the stored
> > value. Please take a look at the attached image. I am using
> > KeywordTokenizer and LowerCaseFilter.
> > ​
> > What is more strange, the query below works just fine:
> >
> > nameExact:"Guardian US"
> >
> >
> > Could you please provide me with some clues on what could be wrong?
> >
> > Thanks,
> > Gintas
> >
>
>
>
> --
> Sincerely yours
> Mikhail Khludnev
>


Re: block join - search together at parent and childern

2017-03-16 Thread Jan Nekuda

Hello Mikhail,

thanks for the fast answer. The problem is that I want to have the dismax
on child and parent together - to have the filter evaluated together.

I need to have documents:


path: car

type:car

color:red

first_country: CZ

name:seat



path: car\engine

type:engine

power:63KW



path: car\engine\manufacturer

type:manufacturer

name: xx

country:PL


path: car

type:car

color:green

first_country: CZ

name:skoda



path: car\engine

type:engine

power:88KW



path: car\engine\manufacturer

type:manufacturer

name: yy

country:PL


where car is the parent document, engine is its child, manufacturer is a child
of engine, and the structure can be deep.

I need to make a query with edismax over the fields color, first_country,
power, name, country across the parent and all children.

When I query "seat 63 kw" I need to get the seat car,

the same if I write only "seat", only "63kw" or only "xx",

but if I write "seat 88kw" I expect that I will get no result.

I need to return the parents in whose tree all the words that I put into
the query are matched.

As I wrote before, my solution was to split the query text, use q=*:*,
and for each word in the query add

fq={!parent which=type:car}word

with edismax and qf=color, first_country, power, name, country

Thank you for your time:)

Jan


Dne 16.03.2017 v 20:00 Mikhail Khludnev napsal(a):


Hello,

It's hard to get into the problem. but you probably want to have dismax on
child level:
q={!parent ...}{!edismax qf='childF1 childF2' v=$chq}&chq=foo bar
It's usually broken because child query might match parents which is not
allowed. Thus, it's probably can solved by adding +type:child into chq.
IIRC edismax supports lucene syntax.

On Thu, Mar 16, 2017 at 4:47 PM, Jan Nekuda  wrote:


Hi,
I have a question for which I wasn't able to find a good solution.
I have this structure of documents

A
|\
| \
B \
  \
   C
\
 \
  \
   D

Document type A has fields id_number, date_from, date_to
Document type C  has fields first_name, surname, birthdate
Document type D AND B has fields street_name, house_number, city


I want to find *all parents with block join and edismax*.
The problem is that I have found that possible is find children by parent,
or parent by children.
*I want to find parent by values in parent and in children*. I want to use
edismax with all fields from all documents (id_number, date_from, date_to,
has fields first_name, surname, birthdate,street_name, house_number, city).
I want to write *Hynek* AND *Brojova* AND 14 and I expect that it returns
document A because it found Hynek in surname, Brojova in street and 14 in
house number.
This is easy with {!parent which=type:A}
the problem is, that I'm not able to find by condition 789 AND *Brojova*
where 789 is id_number from type A and Brojova is Street from D.

In short I need to find all parents of tree (parent and childern) in which
are matched all the word which i send to condition


My only solution is to make root type X. Then A will be its child. Then I
can use {!parent which=type:X}.
Than this will work:

http://localhost:8983/solr/demo/select?q=*:*&fq={!parent
which=type:X}brojova*&fq={!parent which=type:X}16&wt=json&
indent=true&defType=edismax&qf=id_number date_from date_to has fields
first_name surname birthdate street_name house_number city&stopwords=true&
lowercaseOperators=true


But I believe it can be solved much better.

X
|
A
|\
| \
B \
  \
   C
\
 \
  \
   D


Thanks for your help
Jan








---
This message has been checked for viruses by Avast Antivirus.
https://www.avast.com/antivirus


Re: block join - search together at parent and childern

2017-03-16 Thread Mikhail Khludnev
Hello Jan,

What if you combine the child and parent dismaxes like below?
q={!edismax qf=$parentfields}foo bar {!parent ..}{!dismax qf=$childfields
v=$childclauses}&childclauses=foo bar +type:child&parentfields=...&
childfields=...

On Thu, Mar 16, 2017 at 10:54 PM, Jan Nekuda  wrote:

> Hello Mikhail,
>
> thanks for fast answer. The problem is, that I want to have the dismax on
> child and parent together - to have the filter evaluated together.
>
> I need to have documents:
>
>
> path: car
>
> type:car
>
> color:red
>
> first_country: CZ
>
> name:seat
>
>
>
> path: car\engine
>
> type:engine
>
> power:63KW
>
>
>
> path: car\engine\manufacturer
>
> type:manufacturer
>
> name: xx
>
> country:PL
>
>
> path: car
>
> type:car
>
> color:green
>
> first_country: CZ
>
> name:skoda
>
>
>
> path: car\engine
>
> type:engine
>
> power:88KW
>
>
>
> path: car\engine\manufacturer
>
> type:manufacturer
>
> name: yy
>
> country:PL
>
>
> where car is parent document engine is its child a manufacturer is child
> of engine and the structure can be deep.
>
> I need to make a query with edismax over fields color, first_country,
> power, name, country over parent and all childern.
>
> when I ask then "seat 63 kw" i need to get seat car
>
> the same if I will write only "seat" or only "63kw" or only "xx"
>
> but if I will write "seat 88kw" i expect that i will get no result
>
> I need to return parents in which tree are all the words which I wrote to
> query.
>
> How I wrote before my solution was to split the query text and use q:*:*
> and for each /word/ in query make
>
> fq={!parent which=type:car}/word//
> /
>
> //and edismax with qf=color, first_country, power, name, country
>
> Thank you for your time:)
>
> Jan
>
>
> Dne 16.03.2017 v 20:00 Mikhail Khludnev napsal(a):
>
>
> Hello,
>>
>> It's hard to get into the problem. but you probably want to have dismax on
>> child level:
>> q={!parent ...}{!edismax qf='childF1 childF2' v=$chq}&chq=foo bar
>> It's usually broken because child query might match parents which is not
>> allowed. Thus, it's probably can solved by adding +type:child into chq.
>> IIRC edismax supports lucene syntax.
>>
>> On Thu, Mar 16, 2017 at 4:47 PM, Jan Nekuda  wrote:
>>
>> Hi,
>>> I have a question for which I wasn't able to find a good solution.
>>> I have this structure of documents
>>>
>>> A
>>> |\
>>> | \
>>> B \
>>>   \
>>>C
>>> \
>>>  \
>>>   \
>>>D
>>>
>>> Document type A has fields id_number, date_from, date_to
>>> Document type C  has fields first_name, surname, birthdate
>>> Document type D AND B has fields street_name, house_number, city
>>>
>>>
>>> I want to find *all parents with block join and edismax*.
>>> The problem is that I have found that possible is find children by
>>> parent,
>>> or parent by children.
>>> *I want to find parent by values in parent and in children*. I want to
>>> use
>>> edismax with all fields from all documents (id_number, date_from,
>>> date_to,
>>> has fields first_name, surname, birthdate,street_name, house_number,
>>> city).
>>> I want to write *Hynek* AND *Brojova* AND 14 and I expect that it returns
>>> document A because it found Hynek in surname, Brojova in street and 14 in
>>> house number.
>>> This is easy with {!parent which=type:A}
>>> the problem is, that I'm not able to find by condition 789 AND *Brojova*
>>> where 789 is id_number from type A and Brojova is Street from D.
>>>
>>> In short I need to find all parents of tree (parent and childern) in
>>> which
>>> are matched all the word which i send to condition
>>>
>>>
>>> My only solution is to make root type X. Then A will be its child. Then I
>>> can use {!parent which=type:X}.
>>> Than this will work:
>>>
>>> http://localhost:8983/solr/demo/select?q=*:*&fq={!parent
>>> which=type:X}brojova*&fq={!parent which=type:X}16&wt=json&
>>> indent=true&defType=edismax&qf=id_number date_from date_to has fields
>>> first_name surname birthdate street_name house_number
>>> city&stopwords=true&
>>> lowercaseOperators=true
>>>
>>>
>>> But I believe it can be solved much better.
>>>
>>> X
>>> |
>>> A
>>> |\
>>> | \
>>> B \
>>>   \
>>>C
>>> \
>>>  \
>>>   \
>>>D
>>>
>>>
>>> Thanks for your help
>>> Jan
>>>
>>>
>>
>>
>
>
> ---
> This message has been checked for viruses by Avast Antivirus.
> https://www.avast.com/antivirus
>



-- 
Sincerely yours
Mikhail Khludnev


Re: SQL JOIN eta

2017-03-16 Thread Joel Bernstein
There isn't a Jira issue for this yet. For Solr 6.6 there are a few important
features lined up:

1) SELECT COUNT(DISTINCT)
2) Date/Time function support
3) Arithmetic function support
4) SELECT ... INTO ...

Joel Bernstein
http://joelsolr.blogspot.com/

On Tue, Mar 14, 2017 at 10:53 PM, Damien Kamerman  wrote:

> Hi all, does anyone know roughly when the SQL JOIN functionally will be
> released? Is there a Jira for this? I'm guessing this might be on Solr 6.6.
>
> Cheers,
> Damien.
>


Re: Solr 6.3 will not stay connected to zookeeper

2017-03-16 Thread Shawn Heisey
On 3/15/2017 6:45 PM, Walter Underwood wrote:
> I have a pretty good guess what happened. I requested a Zookeeper
> 3.4.6 cluster, but they built a 3.4.9 cluster. 

This is the first I've heard of problems with ZK 3.4.9 on the server
side.  That would be the server version that I would recommend for a new
install -- the list of bugfixes on each 3.4.x ZK version is quite long. 
Without checking each one I cannot say whether any of them would affect
SolrCloud, but I'd rather be running a version with the fixes.

The log messages read like it's not able to establish a TCP connection,
which is a fairly low level problem.  The problems I consider to be most
likely are a firewall or a security layer like selinux/apparmor.
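
A quick way to check the TCP path from the Solr host (assuming the standard
client port and that ZooKeeper's four-letter-word commands are enabled) is
something like:

  echo ruok | nc zookeeper01.prod2.cloud.cheggnet.com 2181

which should print "imok" if a connection can be established at all.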

I would not expect a ZK downgrade to make any difference, unless another
change is made along with the downgrade.

Thanks,
Shawn



Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Aman Deep Singh
If I give the query in quotes it converts the query into a graph query:

Graph(cust_sh6:"one plus one" hasBoolean=false hasPhrase=false)


On 17-Mar-2017 1:38 AM, "Alexandre Rafalovitch"  wrote:

> Oh. Try your query with quotes around the phone phrase:
> q="one plus one"
>
> My hypothesis is:
> Query parser splits things on whitespace before passing it down into
> analyzer chain as individual match attempts. The Analysis UI does not
> take that into account and treats the whole string as phrase sent. You
> say
> outputUnigrams="false" outputUnigramsIfNoShingles="false"
> So, every single token during the query gets ignored because there is
> nothing for it to shingle with.
>
> I am not sure why it would have worked in Solr 4.
>
> Regards,
>Alex.
> 
> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>
>
> On 16 March 2017 at 13:06, Aman Deep Singh 
> wrote:
> > For images dropbox url is
> > https://www.dropbox.com/sh/6dy6a8ajabjtxrt/
> AAAoxhZQe2vp3sTl3Av71_eHa?dl=0
> >
> >
> > On Thu, Mar 16, 2017 at 10:29 PM Aman Deep Singh <
> amandeep.coo...@gmail.com>
> > wrote:
> >
> >> Yes I have reloaded the core after config changes
> >>
> >>
> >> On 16-Mar-2017 10:28 PM, "Alexandre Rafalovitch" 
> >> wrote:
> >>
> >> Images do not come through.
> >>
> >> But I was wrong too. You use eDismax and pass "cust_shingle" in, so
> >> the "df" value is irrelevant.
> >>
> >> You definitely reloaded the core after changing definitions?
> >> 
> >> http://www.solr-start.com/ - Resources for Solr users, new and
> experienced
> >>
> >>
> >> On 16 March 2017 at 12:37, Aman Deep Singh 
> >> wrote:
> >> > Already check that i am sending sceenshots of various senarios
> >> >
> >> >
> >> > On Thu, Mar 16, 2017 at 7:46 PM Alexandre Rafalovitch <
> >> arafa...@gmail.com>
> >> > wrote:
> >> >>
> >> >> Sanity check. Is your 'df' pointing at the field you think it is
> >> >> pointing at? It really does look like all tokens were eaten and
> >> >> nothing was left. But you should have seen that in the Analysis
> screen
> >> >> too, if you have the right field.
> >> >>
> >> >> Try adding echoParams=all to your request to see the full final
> >> >> parameter list. Maybe some parameters in initParams sections override
> >> >> your assumed config.
> >> >>
> >> >> Regards,
> >> >>Alex.
> >> >> 
> >> >> http://www.solr-start.com/ - Resources for Solr users, new and
> >> experienced
> >> >>
> >> >>
> >> >> On 16 March 2017 at 08:30, Aman Deep Singh <
> amandeep.coo...@gmail.com>
> >> >> wrote:
> >> >> > Hi,
> >> >> >
> >> >> > Recently I migrated from solr 4 to 6
> >> >> > IN solr 4 shinglefilterfactory is working correctly
> >> >> > my configration  i
> >> >> >
> >> >> >  >> >> > positionIncrementGap="100">
> >> >> > 
> >> >> >  
> >> >> >   minShingleSize="2"
> >> >> > maxShingleSize="5"
> >> >> >  outputUnigrams="false"
> >> >> > outputUnigramsIfNoShingles="false" />
> >> >> >   
> >> >> > 
> >> >> > 
> >> >> >   
> >> >> >   minShingleSize="2"
> >> >> > maxShingleSize="5"
> >> >> >  outputUnigrams="false"
> >> >> > outputUnigramsIfNoShingles="false" />
> >> >> >   
> >> >> >   
> >> >> > 
> >> >> >   
> >> >> >
> >> >> >
> >> >> >
> >> >> > But after updating to solr 6 shingles is not working ,schema is as
> >> >> > below,
> >> >> >
> >> >> >  >> >> > positionIncrementGap="100">
> >> >> > 
> >> >> >  
> >> >> >   minShingleSize="2"
> >> >> > maxShingleSize="5"
> >> >> >  outputUnigrams="false"
> >> >> > outputUnigramsIfNoShingles="false" />
> >> >> >   
> >> >> > 
> >> >> > 
> >> >> >   
> >> >> >   minShingleSize="2"
> >> >> > maxShingleSize="5"
> >> >> >  outputUnigrams="false"
> >> >> > outputUnigramsIfNoShingles="false" />
> >> >> >   
> >> >> > 
> >> >> >   
> >> >> >
> >> >> > Although in the Analysis tab is was showing proper shingle result
> but
> >> >> > when
> >> >> > using in the queryParser it was not giving proper results
> >> >> >
> >> >> > my sample hit is
> >> >> >
> >> >> >
> >> >> >
> >> http://localhost:8983/solr/shingel_test/select?q=one%
> 20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle
> >> >> >
> >> >> > it create the parsed query as
> >> >> >
> >> >> > one plus one
> >> >> > one plus one
> >> >> > (+())/no_coord
> >> >> > +()
> >> >> > 
> >> >> > ExtendedDismaxQParser
> >>
> >>
> >>
>


Re: Finding time of last commit to index from SolrJ?

2017-03-16 Thread Damien Kamerman
I ended up doing something like this:

String core = "collection1_shard1_core1";
ModifiableSolrParams p = new ModifiableSolrParams();
p.set("show", "index");
GenericSolrRequest checkRequest = new GenericSolrRequest(POST, "/../" +
core + "/admin/luke", p);
NamedList checkResult = client.request("collection1", checkRequest);
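
To pull the commit time out of that response, something like this should work
(an untested sketch; note that with a stock SolrClient the arguments to
request() are the SolrRequest first and the collection second):

// the Luke handler reports the last commit time in the "index" section
NamedList<?> index = (NamedList<?>) checkResult.get("index");
Object lastModified = index.get("lastModified");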

On 16 March 2017 at 14:20, Phil Scadden  wrote:

> The admin gui displays the time of last commit to a core but how can this
> be queried from within SolrJ?
>
> Notice: This email and any attachments are confidential and may not be
> used, published or redistributed without the prior written consent of the
> Institute of Geological and Nuclear Sciences Limited (GNS Science). If
> received in error please destroy and immediately notify GNS Science. Do not
> copy or disclose the contents.
>


Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Alexandre Rafalovitch
Which is what I believe you had as a working example in your Dropbox images.

So, does it work now?

Regards,
   Alex.

http://www.solr-start.com/ - Resources for Solr users, new and experienced


On 16 March 2017 at 22:22, Aman Deep Singh  wrote:
> If I give query in quotes it converted query in to graph query as
>
> Graph(cust_sh6:"one plus one" hasBoolean=false hasPhrase=false)
>
>
> On 17-Mar-2017 1:38 AM, "Alexandre Rafalovitch"  wrote:
>
>> Oh. Try your query with quotes around the phone phrase:
>> q="one plus one"
>>
>> My hypothesis is:
>> Query parser splits things on whitespace before passing it down into
>> analyzer chain as individual match attempts. The Analysis UI does not
>> take that into account and treats the whole string as phrase sent. You
>> say
>> outputUnigrams="false" outputUnigramsIfNoShingles="false"
>> So, every single token during the query gets ignored because there is
>> nothing for it to shingle with.
>>
>> I am not sure why it would have worked in Solr 4.
>>
>> Regards,
>>Alex.
>> 
>> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>>
>>
>> On 16 March 2017 at 13:06, Aman Deep Singh 
>> wrote:
>> > For images dropbox url is
>> > https://www.dropbox.com/sh/6dy6a8ajabjtxrt/
>> AAAoxhZQe2vp3sTl3Av71_eHa?dl=0
>> >
>> >
>> > On Thu, Mar 16, 2017 at 10:29 PM Aman Deep Singh <
>> amandeep.coo...@gmail.com>
>> > wrote:
>> >
>> >> Yes I have reloaded the core after config changes
>> >>
>> >>
>> >> On 16-Mar-2017 10:28 PM, "Alexandre Rafalovitch" 
>> >> wrote:
>> >>
>> >> Images do not come through.
>> >>
>> >> But I was wrong too. You use eDismax and pass "cust_shingle" in, so
>> >> the "df" value is irrelevant.
>> >>
>> >> You definitely reloaded the core after changing definitions?
>> >> 
>> >> http://www.solr-start.com/ - Resources for Solr users, new and
>> experienced
>> >>
>> >>
>> >> On 16 March 2017 at 12:37, Aman Deep Singh 
>> >> wrote:
>> >> > Already check that i am sending sceenshots of various senarios
>> >> >
>> >> >
>> >> > On Thu, Mar 16, 2017 at 7:46 PM Alexandre Rafalovitch <
>> >> arafa...@gmail.com>
>> >> > wrote:
>> >> >>
>> >> >> Sanity check. Is your 'df' pointing at the field you think it is
>> >> >> pointing at? It really does look like all tokens were eaten and
>> >> >> nothing was left. But you should have seen that in the Analysis
>> screen
>> >> >> too, if you have the right field.
>> >> >>
>> >> >> Try adding echoParams=all to your request to see the full final
>> >> >> parameter list. Maybe some parameters in initParams sections override
>> >> >> your assumed config.
>> >> >>
>> >> >> Regards,
>> >> >>Alex.
>> >> >> 
>> >> >> http://www.solr-start.com/ - Resources for Solr users, new and
>> >> experienced
>> >> >>
>> >> >>
>> >> >> On 16 March 2017 at 08:30, Aman Deep Singh <
>> amandeep.coo...@gmail.com>
>> >> >> wrote:
>> >> >> > Hi,
>> >> >> >
>> >> >> > Recently I migrated from solr 4 to 6
>> >> >> > IN solr 4 shinglefilterfactory is working correctly
>> >> >> > my configration  i
>> >> >> >
>> >> >> > > >> >> > positionIncrementGap="100">
>> >> >> > 
>> >> >> >  
>> >> >> >  > minShingleSize="2"
>> >> >> > maxShingleSize="5"
>> >> >> >  outputUnigrams="false"
>> >> >> > outputUnigramsIfNoShingles="false" />
>> >> >> >   
>> >> >> > 
>> >> >> > 
>> >> >> >   
>> >> >> >  > minShingleSize="2"
>> >> >> > maxShingleSize="5"
>> >> >> >  outputUnigrams="false"
>> >> >> > outputUnigramsIfNoShingles="false" />
>> >> >> >   
>> >> >> >   
>> >> >> > 
>> >> >> >   
>> >> >> >
>> >> >> >
>> >> >> >
>> >> >> > But after updating to solr 6 shingles is not working ,schema is as
>> >> >> > below,
>> >> >> >
>> >> >> > > >> >> > positionIncrementGap="100">
>> >> >> > 
>> >> >> >  
>> >> >> >  > minShingleSize="2"
>> >> >> > maxShingleSize="5"
>> >> >> >  outputUnigrams="false"
>> >> >> > outputUnigramsIfNoShingles="false" />
>> >> >> >   
>> >> >> > 
>> >> >> > 
>> >> >> >   
>> >> >> >  > minShingleSize="2"
>> >> >> > maxShingleSize="5"
>> >> >> >  outputUnigrams="false"
>> >> >> > outputUnigramsIfNoShingles="false" />
>> >> >> >   
>> >> >> > 
>> >> >> >   
>> >> >> >
>> >> >> > Although in the Analysis tab is was showing proper shingle result
>> but
>> >> >> > when
>> >> >> > using in the queryParser it was not giving proper results
>> >> >> >
>> >> >> > my sample hit is
>> >> >> >
>> >> >> >
>> >> >> >
>> >> http://localhost:8983/solr/shingel_test/select?q=one%
>> 20plus%20one&wt=xml&debugQuery=true&defType=edismax&qf=cust_shingle
>> >> >> >
>> >> >> > it create the parsed query as
>> >> >> >
>> >> >> > one plus one
>> >> >> > one plus one
>> >> >> > (+())/no_coord
>> >> >> > +()
>> >> >> > 
>> >> >> > ExtendedDismaxQParser
>> >>
>> >>
>> >>
>>


Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Aman Deep Singh
No it doesn't work

On 17-Mar-2017 8:38 AM, "Alexandre Rafalovitch"  wrote:

> Which is what I believe you had as a working example in your Dropbox
> images.
>
> So, does it work now?
>
> Regards,
>Alex.
> 
> http://www.solr-start.com/ - Resources for Solr users, new and experienced
>
>
> On 16 March 2017 at 22:22, Aman Deep Singh 
> wrote:
> > If I give query in quotes it converted query in to graph query as
> >
> > Graph(cust_sh6:"one plus one" hasBoolean=false hasPhrase=false)
> >
> >
> > On 17-Mar-2017 1:38 AM, "Alexandre Rafalovitch" 
> wrote:
> >
> >> Oh. Try your query with quotes around the phone phrase:
> >> q="one plus one"
> >>
> >> My hypothesis is:
> >> Query parser splits things on whitespace before passing it down into
> >> analyzer chain as individual match attempts. The Analysis UI does not
> >> take that into account and treats the whole string as phrase sent. You
> >> say
> >> outputUnigrams="false" outputUnigramsIfNoShingles="false"
> >> So, every single token during the query gets ignored because there is
> >> nothing for it to shingle with.
> >>
> >> I am not sure why it would have worked in Solr 4.
> >>
> >> Regards,
> >>Alex.
> >> 
> >> http://www.solr-start.com/ - Resources for Solr users, new and
> experienced
> >>
> >>
> >> On 16 March 2017 at 13:06, Aman Deep Singh 
> >> wrote:
> >> > For images dropbox url is
> >> > https://www.dropbox.com/sh/6dy6a8ajabjtxrt/
> >> AAAoxhZQe2vp3sTl3Av71_eHa?dl=0
> >> >
> >> >
> >> > On Thu, Mar 16, 2017 at 10:29 PM Aman Deep Singh <
> >> amandeep.coo...@gmail.com>
> >> > wrote:
> >> >
> >> >> Yes I have reloaded the core after config changes
> >> >>
> >> >>
> >> >> On 16-Mar-2017 10:28 PM, "Alexandre Rafalovitch"  >
> >> >> wrote:
> >> >>
> >> >> Images do not come through.
> >> >>
> >> >> But I was wrong too. You use eDismax and pass "cust_shingle" in, so
> >> >> the "df" value is irrelevant.
> >> >>
> >> >> You definitely reloaded the core after changing definitions?
> >> >> 
> >> >> http://www.solr-start.com/ - Resources for Solr users, new and
> >> experienced
> >> >>
> >> >>
> >> >> On 16 March 2017 at 12:37, Aman Deep Singh <
> amandeep.coo...@gmail.com>
> >> >> wrote:
> >> >> > Already check that i am sending sceenshots of various senarios
> >> >> >
> >> >> >
> >> >> > On Thu, Mar 16, 2017 at 7:46 PM Alexandre Rafalovitch <
> >> >> arafa...@gmail.com>
> >> >> > wrote:
> >> >> >>
> >> >> >> Sanity check. Is your 'df' pointing at the field you think it is
> >> >> >> pointing at? It really does look like all tokens were eaten and
> >> >> >> nothing was left. But you should have seen that in the Analysis
> >> screen
> >> >> >> too, if you have the right field.
> >> >> >>
> >> >> >> Try adding echoParams=all to your request to see the full final
> >> >> >> parameter list. Maybe some parameters in initParams sections
> override
> >> >> >> your assumed config.
> >> >> >>
> >> >> >> Regards,
> >> >> >>Alex.
> >> >> >> 
> >> >> >> http://www.solr-start.com/ - Resources for Solr users, new and
> >> >> experienced
> >> >> >>
> >> >> >>
> >> >> >> On 16 March 2017 at 08:30, Aman Deep Singh <
> >> amandeep.coo...@gmail.com>
> >> >> >> wrote:
> >> >> >> > Hi,
> >> >> >> >
> >> >> >> > Recently I migrated from solr 4 to 6
> >> >> >> > IN solr 4 shinglefilterfactory is working correctly
> >> >> >> > my configration  i
> >> >> >> >
> >> >> >> >  >> >> >> > positionIncrementGap="100">
> >> >> >> > 
> >> >> >> >  
> >> >> >> >   >> minShingleSize="2"
> >> >> >> > maxShingleSize="5"
> >> >> >> >  outputUnigrams="false"
> >> >> >> > outputUnigramsIfNoShingles="false" />
> >> >> >> >   
> >> >> >> > 
> >> >> >> > 
> >> >> >> >   
> >> >> >> >   >> minShingleSize="2"
> >> >> >> > maxShingleSize="5"
> >> >> >> >  outputUnigrams="false"
> >> >> >> > outputUnigramsIfNoShingles="false" />
> >> >> >> >   
> >> >> >> >   
> >> >> >> > 
> >> >> >> >   
> >> >> >> >
> >> >> >> >
> >> >> >> >
> >> >> >> > But after updating to solr 6 shingles is not working ,schema is
> as
> >> >> >> > below,
> >> >> >> >
> >> >> >> >  >> >> >> > positionIncrementGap="100">
> >> >> >> > 
> >> >> >> >  
> >> >> >> >   >> minShingleSize="2"
> >> >> >> > maxShingleSize="5"
> >> >> >> >  outputUnigrams="false"
> >> >> >> > outputUnigramsIfNoShingles="false" />
> >> >> >> >   
> >> >> >> > 
> >> >> >> > 
> >> >> >> >   
> >> >> >> >   >> minShingleSize="2"
> >> >> >> > maxShingleSize="5"
> >> >> >> >  outputUnigrams="false"
> >> >> >> > outputUnigramsIfNoShingles="false" />
> >> >> >> >   
> >> >> >> > 
> >> >> >> >   
> >> >> >> >
> >> >> >> > Although in the Analysis tab is was showing proper shingle
> result
> >> but
> >> >> >> > when
> >> >> >> > using in the queryParser it was not giving proper results
> >> >> >

Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Shawn Heisey
On 3/16/2017 1:40 PM, Alexandre Rafalovitch wrote:
> Oh. Try your query with quotes around the phone phrase:
> q="one plus one"

That query with the fieldType the user supplied produces this, on 6.3.0
with the lucene parser:

"querystring":"test:\"one plus one\"",
"parsedquery":"MultiPhraseQuery(test:\"(one plus one plus one) plus
one\")",

Looks a little odd, but maybe it's correct.
> My hypothesis is:
> Query parser splits things on whitespace before passing it down into
> analyzer chain as individual match attempts. The Analysis UI does not
> take that into account and treats the whole string as phrase sent. You
> say
> outputUnigrams="false" outputUnigramsIfNoShingles="false"
> So, every single token during the query gets ignored because there is
> nothing for it to shingle with.

Might be that.

If I change both of those unigram options to "true" then this is what I
see (also on 6.3.0, q.op is AND):

"querystring":"test:(one plus one)", "parsedquery":"+test:one +test:plus
+test:one",

The really mystifying thing is ... it works on the analysis page.  The
whitespace tokenizer should (in theory at least) produce the same tokens
on the analysis page as the query parser does before analysis, so I have
no idea why analysis and query produce different results.  During query
analysis, the whitespace tokenizer should basically be a no-op, because
the input has already been tokenized.

If I change the analysis to this (keyword instead of whitespace):

<analyzer>
  <tokenizer class="solr.KeywordTokenizerFactory"/>
  <filter class="solr.ShingleFilterFactory" minShingleSize="2" maxShingleSize="5"
          outputUnigrams="false" outputUnigramsIfNoShingles="false" />
</analyzer>

Then the behavior is unchanged:

"querystring":"test:(one plus one)", "parsedquery":"",

> I am not sure why it would have worked in Solr 4.

I just tried it on 4.9-SNAPSHOT, compiled 2015-05-20 from SVN
revision 1680667, and it doesn't work.  I don't remember whether this
was compiled from branch_4x or from the 4.9 branch.  Before that test, I
had tried back to 5.2.1 with the same results:

"querystring": "test:(one plus one)", "parsedquery": "", Thanks,
Shawn



Re: Solr shingles is not working in solr 6.4.0

2017-03-16 Thread Aman Deep Singh
I also tried in 5.2.1
for the query
http://localhost:8984/solr/test/select?q=TITLE_SH:one\%20plus\%20one&wt=xml&debugQuery=true



0
1

TITLE_SH:one\ plus\ one
xml
true




TITLE_SH:one\ plus\ one
TITLE_SH:one\ plus\ one

*((TITLE_SH:one plus TITLE_SH:one plus one)/no_coord) TITLE_SH:plus one*


(TITLE_SH:one plus TITLE_SH:one plus one) TITLE_SH:plus one


LuceneQParser


while in the solr 4.3.1
query
http://localhost:8983/solr/collection1/select?q=text_sh:one\%20plus\%20one&wt=xml&debugQuery=true

output is like


0
2

text_sh:one\ plus\ one
xml
true




text_sh:one\ plus\ one
text_sh:one\ plus\ one

(text_sh:one plus text_sh:one plus one text_sh:plus one)/no_coord


*text_sh:one plus text_sh:one plus one text_sh:plus one*


LuceneQParser

On Fri, Mar 17, 2017 at 9:50 AM Shawn Heisey  wrote:

> On 3/16/2017 1:40 PM, Alexandre Rafalovitch wrote:
> > Oh. Try your query with quotes around the phone phrase:
> > q="one plus one"
>
> That query with the fieldType the user supplied produces this, on 6.3.0
> with the lucene parser:
>
> "querystring":"test:\"one plus one\"",
> "parsedquery":"MultiPhraseQuery(test:\"(one plus one plus one) plus
> one\")", Looks a little odd, but maybe it's correct.
> > My hypothesis is:
> > Query parser splits things on whitespace before passing it down into
> > analyzer chain as individual match attempts. The Analysis UI does not
> > take that into account and treats the whole string as phrase sent. You
> > say
> > outputUnigrams="false" outputUnigramsIfNoShingles="false"
> > So, every single token during the query gets ignored because there is
> > nothing for it to shingle with.
>
> Might be that.
>
> If I change both of those unigram options to "true" then this is what I
> see (also on 6.3.0, q.op is AND):
>
> "querystring":"test:(one plus one)", "parsedquery":"+test:one +test:plus
> +test:one",
>
> The really mystifying thing is ... it works on the analysis page.  The
> whitespace tokenizer should (in theory at least) produce the same tokens
> on the analysis page as the query parser does before analysis, so I have
> no idea why analysis and query produce different results.  During query
> analysis, the whitespace tokenizer should basically be a no-op, because
> the input has already been tokenized.
>
> If I change the analysis to this (keyword instead of whitespace):
>
> 
>   
>   
>maxShingleSize="5"
>  outputUnigrams="false"
> outputUnigramsIfNoShingles="false" />
> 
>
> Then the behavior is unchanged:
>
> "querystring":"test:(one plus one)", "parsedquery":"",
>
> > I am not sure why it would have worked in Solr 4.
>
> I just tried it on on 4.9-SNAPSHOT, compiled 2015-05-20 from SVN
> revision 1680667, and it doesn't work.  I don't remember whether this
> was compiled from branch_4x or from the 4.9 branch.  Before that test, I
> had tried back to 5.2.1 with the same results:
>
> "querystring": "test:(one plus one)", "parsedquery": "", Thanks,
> Shawn
>
>


Re: Alphanumeric sort with alphabets first

2017-03-16 Thread Srinivasan Narayanan
Can someone please respond?

From: Srinivasan Narayanan 
Date: Monday, March 13, 2017 at 3:51 PM
To: "solr-user@lucene.apache.org" 
Subject: Alphanumeric sort with alphabets first


Hello SOLR experts,

I am new to SOLR and I am trying to do alphanumeric sort on string field(s). 
However, in my case, alphabets should come before numbers. I also have a large 
number of such fields (~2500), any of which can be alphanumerically sorted upon 
at runtime. I’ve explored the concepts below in SOLR to arrive at a solution:

1)  Custom similarity plugin: far-fetched, and probably not even
applicable to my use case

2)  Analyzer/tokenizer and regex magic to left pad number parts with 0s : 
two disadvantages – I believe this needs extra fields (copy) to be created 
which I cannot do (2500 more fields is too much) and this will still push 
numbers before alphabets

3)  Custom function (ValueSource) and regex magic to left pad numeric 
tokens with 0s, and invoke function for sorting only – a bit better than the 
previous one, but still numbers come before alphabets.

4)  Custom function (ValueSource) and regex magic to left pad numeric 
tokens with 0s, prefix numeric tokens with tilde (~), and invoke function for 
sorting only – this is where I stand right now. Very ugly, but it works. 
Because tilde has a very high ASCII value, it pushes numbers behind alphabets.
There should obviously be a better approach I am missing. Please help!
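
For what it's worth, here is a rough, purely illustrative sketch of the
padding/prefix normalization described in approach 4) above (class and method
names are made up, and the fixed width of 20 digits is arbitrary):

import java.math.BigInteger;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class AlphaFirstSortKey {

    private static final Pattern DIGITS = Pattern.compile("\\d+");

    // Pads every digit run to a fixed width and prefixes it with '~' so that,
    // under plain lexicographic order, numeric tokens sort after alphabetic ones.
    static String sortKey(String value) {
        Matcher m = DIGITS.matcher(value.toLowerCase());
        StringBuffer sb = new StringBuffer();
        while (m.find()) {
            String padded = String.format("%020d", new BigInteger(m.group()));
            m.appendReplacement(sb, Matcher.quoteReplacement("~" + padded));
        }
        m.appendTail(sb);
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(sortKey("item2"));  // item~00000000000000000002
        System.out.println(sortKey("itemA"));  // itema (sorts before the padded numeric form)
    }
}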