Re: Solr admin client crash - caused by too many fields

2018-08-14 Thread Toke Eskildsen
On Fri, 2018-08-10 at 06:38 -0700, ruby wrote:
> I have 60 thousand fields in schema. When I go to the Analysis page
> to analyze a field content
> 
> http://localhost:8983/solr/#/collection1/analysis?analysis.fieldvalue
> =xyz&analysis.query=xyz&analysis.fieldname=field1&verbose_output=0
> 
> the admin panel crashes and shows error: Connection to Solr lost.
> Please see Solr instance. 

The default Admin GUI timeout is 10 seconds. You can raise it by
locating server/solr-webapp/webapp/js/angular/app.js and changing
config.timeout = 10000;
to a larger value in milliseconds, for example
config.timeout = 60000;
Then reload the Admin GUI.
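
While testing, it may also be possible to sidestep the GUI timeout by calling the field analysis handler directly with a client-side timeout of your own choosing. The sketch below is only an illustration: it assumes the implicit /analysis/field handler is enabled on your collection, and the base URL, the collection name and the 60-second timeout are placeholders. If the slow part is something other than the analysis request itself, this will not help.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;
import java.time.Duration;

public class FieldAnalysisRequest {

    // Builds a GET request for the field analysis handler with a caller-chosen timeout.
    // baseUrl and collection are placeholders for your own installation.
    public static HttpRequest build(String baseUrl, String collection,
                                    String fieldName, String fieldValue, Duration timeout) {
        String query = "analysis.fieldname=" + enc(fieldName)
                + "&analysis.fieldvalue=" + enc(fieldValue)
                + "&wt=json";
        URI uri = URI.create(baseUrl + "/" + collection + "/analysis/field?" + query);
        return HttpRequest.newBuilder(uri).timeout(timeout).GET().build();
    }

    // URL-encodes a single parameter value.
    private static String enc(String s) {
        return URLEncoder.encode(s, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        HttpRequest request = build("http://localhost:8983/solr", "collection1",
                "field1", "xyz", Duration.ofSeconds(60));
        System.out.println(request.uri());
        // To actually send it:
        // HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString());
    }
}
```

The request is only built here, not sent, so the class can be tried without a running Solr instance.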

I don't know why the Admin GUI has a timeout at all. It seems to me
that anyone capable of using that GUI is also capable of pressing
reload if Solr takes too long to respond. But I digress.


What Shawn & Erick say still stands: Having 60K fields is an outlier
in Solr Land and as such warrants caution and consideration.

- Toke Eskildsen, Royal Danish Library



Re: Graph/Stream requestHandlers in standalone

2018-08-14 Thread Joel Bernstein
Currently Streaming Expressions only work with Solr Cloud. This is because
Streaming Expressions were designed to use Solr Cloud features for all
distributed capabilities of the language.

Joel Bernstein
http://joelsolr.blogspot.com/

On Tue, Aug 14, 2018 at 1:59 AM, David '-1' Schmid wrote:

> Hello!
>
> I'm interested in using the graph traversal streaming expressions.
> My solr instance is running in standalone (not SolrCloud) mode, so I'm
> missing the "Stream" interface on the admin pages. The request handler
> is there (I guess because it's implicit), but no UI.
>
> Is the streaming something that's only useful with -cloud? I could not
> find any information about it.
>
> I expected the graph requestHandler to work as well, but it defaults
> to a NullPointerException.
> When I'm using the stream interface with the first example from the
> guide, I get:
>
> org.apache.solr.common.SolrException:
> java.lang.IllegalArgumentException: A HostProvider may not be empty!
>
> Which leads me to believe, that this is only possible with -cloud?
> I could not find anything about it, could anybody point out what I'm
> missing here?
>
> regards,
> David
>


Re: Help with error on indexing mongoDB document by Solr DataImportHandler

2018-08-14 Thread Wendy2
Hi Shawn,

Thank you very much for your response! 

Solr DataImportHandler cannot index MongoDB collections directly. I
used the open source SolrMongoImporter project
(https://github.com/james75/SolrMongoImporter) on top of Solr DIH to
index data from MongoDB collections directly.

What I did was write a new Java class (public class EntryTransformer
extends Transformer) to convert the ObjectId to a String and to flatten
the MongoDB document.

Here is my data-config.xml file:











So overall, I didn't change any code in the following projects. 

solr-dataimporthandler-7.3.1.jar
solr-dataimporthandler-extras-7.3.1.jar
solr-mongo-importer-1.1.0.jar

Since MongoDB and other NoSQL databases are seeing increasing usage, I
hope that Solr will develop new tools or extend its DataImportHandler
to allow indexing NoSQL documents directly.

All the best!  



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Solr admin client crash - caused by too many fields

2018-08-14 Thread ruby
Thanks for your answer.
In a previous Solr version (5.0) I had the same number of fields and didn't
run into this issue. Did anything change in Solr 6.0 and above?

Thanks





Re: Question regarding searching Chinese characters

2018-08-14 Thread Christopher Beer
Hi all,

Thanks for this enlightening thread. As it happens, at Stanford Libraries we’re 
currently working on upgrading from Solr 4 to 7 and we’re looking forward to 
using the new dictionary-based word splitting in the ICUTokenizer.

We have many of the same challenges as Amanda mentioned, and thanks to the 
advice on this thread, we’ve taken a stab at a CharFilter to do the traditional 
-> simplified transformation [1]. It seems promising, and we've sent it out to
our subject matter experts for evaluation.

Thanks,
Chris

[1] 
https://github.com/sul-dlss/CJKFilterUtils/blob/master/src/main/java/edu/stanford/lucene/analysis/ICUTransformCharFilter.java
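
To illustrate the general idea of normalizing characters before the tokenizer sees them, here is a toy, standard-library-only sketch. It is not the Stanford ICUTransformCharFilter and does not use ICU or Lucene: the class name and the three-entry mapping table (U+5B78 to U+5B66, U+570B to U+56FD, U+66F8 to U+4E66) are invented for this example, whereas a real filter would delegate to ICU's Traditional-Simplified transform.

```java
import java.io.FilterReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Map;

public class TraditionalToSimplifiedReader extends FilterReader {

    // Tiny illustrative table of traditional -> simplified code points (hypothetical subset).
    private static final Map<Character, Character> TABLE = Map.of(
            '\u5B78', '\u5B66',
            '\u570B', '\u56FD',
            '\u66F8', '\u4E66');

    public TraditionalToSimplifiedReader(Reader in) {
        super(in);
    }

    @Override
    public int read() throws IOException {
        int c = super.read();
        return c < 0 ? c : map((char) c);
    }

    @Override
    public int read(char[] buf, int off, int len) throws IOException {
        int n = super.read(buf, off, len);
        // When n is -1 (end of stream) the loop body never runs.
        for (int i = off; i < off + n; i++) {
            buf[i] = map(buf[i]);
        }
        return n;
    }

    // Returns the mapped character, or the input unchanged if it is not in the table.
    private static char map(char c) {
        return TABLE.getOrDefault(c, c);
    }

    // Convenience helper: runs a whole string through the mapping reader.
    public static String convert(String s) {
        StringBuilder sb = new StringBuilder();
        try (Reader r = new TraditionalToSimplifiedReader(new StringReader(s))) {
            int c;
            while ((c = r.read()) != -1) {
                sb.append((char) c);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sb.toString();
    }
}
```

Because the mapping happens in the character stream, whatever tokenizer comes next only ever sees the simplified forms, which is the property the CharFilter approach relies on.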

On 2018/07/24 12:54:35, Tomoko Uchida wrote:
> Hi Amanda,
>
> > do all I need to do is modify the settings from smartChinese to the ones
> > you posted here
>
> Yes, the settings I posted should work for you, at least partially.
> If you are happy with the results, it's OK!
> But please take this as a starting point because it's not perfect.
>
> > Or do I need to still do something with the SmartChineseAnalyzer?
>
> Try the settings, then if you notice something strange and want to know why
> and how to solve it, that may be the time to dive deep into. ;)
>
> I cannot explain how analyzers works here... but you should start off with
> the Solr documentation.
> https://lucene.apache.org/solr/guide/7_0/understanding-analyzers-tokenizers-and-filters.html
>
> Regards,
> Tomoko
>
>
> On Tue, Jul 24, 2018 at 21:08, Amanda Shuman wrote:
>
> > Hi Tomoko,
> >
> > Thanks so much for this explanation - I did not even know this was
> > possible! I will try it out but I have one question: do all I need to do is
> > modify the settings from smartChinese to the ones you posted here:
> >
> > id="Traditional-Simplified"/>
> >
> > Or do I need to still do something with the SmartChineseAnalyzer? I did not
> > quite understand this in your first message:
> >
> > " I think you need two steps if you want to use HMMChineseTokenizer
> > correctly.
> >
> > 1. transform all traditional characters to simplified ones and save to
> > temporary files.
> > I do not have clear idea for doing this, but you can create a Java
> > program that calls Lucene's ICUTransformFilter
> > 2. then, index to Solr using SmartChineseAnalyzer."
> >
> > My understanding is that with the new settings you posted, I don't need to
> > do these steps. Is that correct? Otherwise, I don't really know how to do
> > step 1 with the java program
> >
> > Thanks!
> > Amanda
> >
> >
> > --
> > Dr. Amanda Shuman
> > Post-doc researcher, University of Freiburg, The Maoist Legacy Project
> >
> > PhD, University of California, Santa Cruz
> > http://www.amandashuman.net/
> > http://www.prchistoryresources.org/
> > Office: +49 (0) 761 203 4925



solr 3.4 do not want to apply synonym mapping term to search matching docs

2018-08-14 Thread surya
The following data is being indexed into our Solr.

doc1:
University of Virginia 
doc2:
Katrina Uva 
doc3:
University of new york 

synonym.txt
   University of Virginia, uva

search term:
   University of Virginia

Expected result:
   doc1

Actual result:
   doc1 and doc2 

The second document is returned because the synonym term "uva" matches
doc2: Katrina Uva.
Requirement:
We do not want the synonym (uva) to bring in doc2 (Katrina Uva).
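
The behaviour described above can be reproduced with a toy model of query-time synonym expansion. This is only a sketch of the mechanism, not Solr's synonym filter: the class and method names are made up, matching is a lowercased whole-phrase check, and it assumes the synonym line expands the query into both alternatives.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class SynonymExpansionDemo {

    // synonym.txt line "University of Virginia, uva", modelled as an expansion table.
    static final Map<String, List<String>> SYNONYMS = Map.of(
            "university of virginia", List.of("university of virginia", "uva"));

    // The three documents from the message, in insertion order.
    static final Map<String, String> DOCS = new LinkedHashMap<>();
    static {
        DOCS.put("doc1", "University of Virginia");
        DOCS.put("doc2", "Katrina Uva");
        DOCS.put("doc3", "University of new york");
    }

    // Returns the ids of documents matching the query or any of its synonyms.
    public static List<String> search(String query) {
        String q = query.toLowerCase(Locale.ROOT);
        List<String> alternatives = SYNONYMS.getOrDefault(q, List.of(q));
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, String> doc : DOCS.entrySet()) {
            // Pad with spaces so that only whole words or phrases match.
            String text = " " + doc.getValue().toLowerCase(Locale.ROOT) + " ";
            for (String alt : alternatives) {
                if (text.contains(" " + alt + " ")) {
                    hits.add(doc.getKey());
                    break;
                }
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        System.out.println(search("University of Virginia")); // prints [doc1, doc2]
    }
}
```

In this model doc2 matches only because the expanded alternative "uva" equals the lowercased surname, which is the effect we want to avoid.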

The following are the analyzers in our Solr 3.4 setup:

/




  
   

  







  


/





Semantic Knowledge Graph query using SolrJ

2018-08-14 Thread Pratik Patel
I am trying to use Semantic Knowledge Graph in my Java-based application.

I have a Semantic Knowledge Graph query which works fine if I trigger it
from the browser using Restlet Client. The query is as follows.

{
  "queries": [
    "foo:\"5a6127a7234e76473a816f1c\""
  ],
  "compare": [
    {
      "type": "bar",
      "limit": 30
    }
  ]
}

Now, I want to trigger the same query through the SolrJ client. I have tried
the following code, but it gives me an error:

{"error":{"msg":"KnowledgeGraphHandler requires POST data","code":400}}

The Java code is:

SolrQuery request = new SolrQuery();
request.setRequestHandler("/skg");
request.setShowDebugInfo(true);
request.setParam("wt", "json");
// The SKG query is passed as a "json" request parameter.
request.setParam("json",
    "{\"queries\":[\"foo:\\\"5a6127a7234e76473a816f1c\\\"\"],\"compare\":[{\"type\":\"bar\",\"limit\":30}]}");
request.set("rows", 10);
request.setParam("qf", "conceptname^10 tags^3 textproperty^2 file_text^4");
try {
    QueryResponse response =
        getStore().getEnvironment().getSolr().query(request, SolrRequest.METHOD.POST);
    NamedList<Object> rsp = response.getResponse();

    ArrayList<?> skg_resp = (ArrayList<?>) rsp.get("clusters");

    if (skg_resp != null) {

    }
} catch (SolrServerException | IOException e) {
    throw new RuntimeException(e);
}

Any idea what is wrong here? Any pointer to documentation on how to
construct a request for Semantic Knowledge Graph through SolrJ would be
very helpful.

Thanks

Pratik
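
One possible reading of the error message is that the handler wants the JSON as the raw request body, whereas the code above sends it as a "json" parameter. The sketch below shows that variant with the JDK HTTP client instead of SolrJ. It is untested against a real Semantic Knowledge Graph installation: the collection URL is a placeholder, and the assumption that /skg accepts a plain JSON body is inferred only from the error text.

```java
import java.net.URI;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;

public class SkgPostRequest {

    // The query from the message above, to be sent as the request body.
    public static final String BODY =
            "{\"queries\":[\"foo:\\\"5a6127a7234e76473a816f1c\\\"\"],"
          + "\"compare\":[{\"type\":\"bar\",\"limit\":30}]}";

    // Builds a POST request carrying BODY as JSON; collectionUrl is a placeholder.
    public static HttpRequest build(String collectionUrl) {
        return HttpRequest.newBuilder(URI.create(collectionUrl + "/skg"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(BODY, StandardCharsets.UTF_8))
                .build();
    }

    public static void main(String[] args) {
        HttpRequest request = build("http://localhost:8983/solr/collection1");
        System.out.println(request.method() + " " + request.uri());
        // To actually send it:
        // HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString());
    }
}
```

If this variant works from plain Java, the same body could presumably be sent through SolrJ with a request type that carries a content stream, but that part is not shown here.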


SolrCloud CDCR with 3+ DCs

2018-08-14 Thread cdatta
Hello..

Is there any restriction, from the CDCR standpoint, on running 3 DCs in an
ACTIVE-ACTIVE-ACTIVE scenario?

We are seeing the following:

DC1: We are indexing 1000 Documents.
DC2: Random 300 (some number) documents are getting forwarded here from DC1.
(Not 1000)
DC3: Random 700 (some number) documents are getting forwarded here from DC1.
(Not 1000)

If we run a reload in DC1 after some time, documents start showing up in
DC2/DC3, but again not all 1000.

Running with  Solr 7.3.1.


Thanks,
Chandi





Re: SolrCloud CDCR issue

2018-08-14 Thread Amrit Sarkar
Hi,

Yeah, if you look above, I have cited the same Jira. I see your question on
3 DCs with an Active-Active scenario and will respond there.

Amrit Sarkar
Search Engineer
Lucidworks, Inc.
415-589-9269
www.lucidworks.com
Twitter http://twitter.com/lucidworks
LinkedIn: https://www.linkedin.com/in/sarkaramrit2
Medium: https://medium.com/@sarkaramrit2


On Mon, Aug 13, 2018 at 9:43 PM cdatta  wrote:

> And I was thinking about this one:
> https://issues.apache.org/jira/browse/SOLR-11959.
>
>
>
> --
> Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
>


Lucene/Solr bug list caused by JVM's implementations

2018-08-14 Thread Yasufumi Mizoguchi
Hi,

I am looking for a list of Lucene/Solr bugs caused by JVM implementations.
I found the following page, but it does not seem to be kept up to date.
https://wiki.apache.org/lucene-java/JavaBugs

Where can I check the latest one?

Thanks,
Yasufumi