Re: running solr 6.x in Eclipse for debugging

2016-10-05 Thread John Bickerstaff
build 'ant server dist' from the install_dir/solr directory first. > > The 'ant eclipse' target just builds the project structure for Eclipse, > > it doesn't build Solr. > > > > YMMV of course since you're using a different IDE. > >

Re: running solr 6.x in Eclipse for debugging

2016-10-05 Thread John Bickerstaff
_6x > > On Wed, Oct 5, 2016 at 5:51 PM, John Bickerstaff > > wrote: > > > Mikhail -- which version of Solr are you using to do this [ant example > > -Dexample.debug=true] > > > > I may be wrong, but it seems that "example" no longer works

Re: running solr 6.x in Eclipse for debugging

2016-10-05 Thread John Bickerstaff
roblem mode" here - My goal is: Add plugin source code, build, make config changes where needed, and see source when I put a breakpoint in code. On Wed, Oct 5, 2016 at 12:04 PM, John Bickerstaff wrote: > Thanks Mikhail! > > On Wed, Oct 5, 2016 at 11:29 AM, Mikhail Khludnev wro

Re: running solr 6.x in Eclipse for debugging

2016-10-05 Thread John Bickerstaff
irectories...? On Wed, Oct 5, 2016 at 1:30 PM, John Bickerstaff wrote: > OK - I'm running now in debug mode. My intent is to add and test a "hello > world"

Re: running solr 6.x in Eclipse for debugging

2016-10-05 Thread John Bickerstaff
Thank you very much Eric - I'll try that tomorrow. On Wed, Oct 5, 2016 at 7:57 PM, Erick Erickson wrote: > John: > > The simple answer is "cheat" > > It takes a little fiddling, but here's what I do in IntelliJ: > > 1> use IntelliJ to create an

Re: Average of Averages in Solr

2016-10-06 Thread John Bickerstaff
This may help? Note the "Bloomberg Analytics" at the bottom of the post... https://dzone.com/articles/solr-not-just-for-text-anymore Quote from article: - *Bloomberg Analytics Component for Solr*: Bloomberg Financial Services uses Solr extensively, and found the existing statistical pack

Re: Writing Solr Custom Components

2016-10-06 Thread John Bickerstaff
Thank you Otis! On Thu, Oct 6, 2016 at 10:28 AM, Otis Gospodnetić < otis.gospodne...@gmail.com> wrote: > John, if it helps, here are a few examples of custom Solr SearchComponents: > > https://github.com/sematext/query-segmenter > https://github.com/sematext/solr-researcher &g

Re: solr 5 leaving tomcat, will I be the only one fearing about this?

2016-10-07 Thread John Bickerstaff
I won't speak for the committers, but I'm guessing you won't find a lot of support for the idea of continuing to provide a WAR file with the standard SOLR releases... I feel for you and your situation however - I've had to wrestle with a number of situations where a somewhat monolithic architectur

Re: solr 5 leaving tomcat, will I be the only one fearing about this?

2016-10-07 Thread John Bickerstaff
Rajesh was right on Renee -- the only big concern might be if that other code is tightly-coupled to Tomcat or to other things which *must* have Tomcat. But it sounds to me as if your multi-lingual processors - if they just work with Solr/Tomcat - out to "just work" with Solr/Jetty - or work with m

Re: solr 5 leaving tomcat, will I be the only one fearing about this?

2016-10-10 Thread John Bickerstaff
For what it's worth / in case it's helpful... I haven't dealt with JDBC in this context so I can't offer anything useful there... You can reference the data in Zookeeper when creating a new collection - so you don't need to put the configs anywhere on the Solr boxes themselves. It's not automati

Re: solr 5 leaving tomcat, will I be the only one fearing about this?

2016-10-10 Thread John Bickerstaff
Renee - you have probably already thought of this, but just in case it helps... (It helped me a lot several years ago and I hadn't thought of it at the time...) If you end up needing to do a big re-index, Production doesn't have to be affected (assuming you have the hardware/cloud resources). You

Re: PDF writer

2016-10-17 Thread John Bickerstaff
It's not fun to build a .pdf this way, but this may help... http://itextpdf.com/ On Mon, Oct 17, 2016 at 12:15 PM, Matthew Roth wrote: > Thanks Erick. That is as anticipated. Scouring my other resources didn't > indicate the existence of a PDF writer. I thought I'd try the group be > embarking

Re: PDF writer

2016-10-17 Thread John Bickerstaff
Awesome - I learned something - thanks! On Mon, Oct 17, 2016 at 3:06 PM, Markus Jelsma wrote: > Did someone miss https://pdfbox.apache.org/ ? It can write PDF documents, > is ASF and has a ton of examples to learn from. > M. > > > > -Original message- > > From:John Bickerstaff > > Sent:

Re: Advice on implementing SOLR Cloud

2016-10-17 Thread John Bickerstaff
ne in the set called something like 6.1_final.txt and that will contain a step-by-step way to set up the Solr Cloud. You can modify for your situation. Hope this helps... https://www.linkedin.com/pulse/actual-solrcloud-vms-zookeeper-nodes-john-bickerstaff?trk=hp-feed-article-title-publish Oh, by the

Re: Advice on implementing SOLR Cloud

2016-10-17 Thread John Bickerstaff
Excellent! Glad to know the efforts were useful to someone else. On Oct 17, 2016 11:20 PM, "Sadheera Vithanage" wrote: Thank you very much John, I fixed it. Your steps helped big time.. Thanks again.. On Tue, Oct 18, 2016 at 10:17 AM, John Bickerstaff wrote: > I had quite

Re: Query by distance

2016-10-18 Thread John Bickerstaff
Just in case it helps, I had good success on multi-word synonyms using this plugin... https://github.com/healthonnet/hon-lucene-synonyms IIRC, the instructions are clear and fairly easy to follow - especially for Solr 6.x Ping back if you run into any problems setting it up... On Tue, Oct 18,

Re: Migration from Solr 4

2016-10-18 Thread John Bickerstaff
You can go here for the link... https://www.linkedin.com/pulse/actual-solrcloud-vms-zookeeper-nodes-john-bickerstaff On Tue, Oct 18, 2016 at 11:28 AM, sputul wrote: > We are using Solr 4.3, sing Zoopeeker on development manage Solr Cloud > having > one or two nodes. Will it

Re: Migration from Solr 4

2016-10-18 Thread John Bickerstaff
ale (hundreds > of Solr instances and/or collections), and even in these cases it's > usually best to use Observers > > FWIW, > Erick > > On Tue, Oct 18, 2016 at 2:28 PM, John Bickerstaff > wrote: > > For what it's worth, (and it may not work for your situ

Re: Migration from Solr 4

2016-10-18 Thread John Bickerstaff
, Oct 18, 2016 at 2:05 PM, Shawn Heisey wrote: > On 10/18/2016 12:28 PM, John Bickerstaff wrote: > > For what it's worth, (and it may not work for your situation) I > > decided not to upgrade, but to "upgrade by replacing". In other words, > > I just installe

Public/Private data in Solr :: Metadata or ?

2016-10-18 Thread John Bickerstaff
I have a question that I suspect I'll need to answer very soon in my current position. How (or is it even wise) to "segregate data" in Solr so that some data can be seen by some users and some data not be seen? Taking the case of "public / private" as a (hopefully) simple, binary example... Let'

Re: Public/Private data in Solr :: Metadata or ?

2016-10-18 Thread John Bickerstaff
Thanks Markus, In your case that client's key is fairly static, yes? It doesn't change at any time, but tends to live on the data more or less permanently? On Tue, Oct 18, 2016 at 4:07 PM, Markus Jelsma wrote: > In case you're not up for Doug or Jan's anwers; we have relied on HTTP > proxies (

Re: Public/Private data in Solr :: Metadata or ?

2016-10-18 Thread John Bickerstaff
, 2016 at 3:32 PM, Jan Høydahl wrote: > https://wiki.apache.org/solr/SolrSecurity#Document_Level_Security < > https://wiki.apache.org/solr/SolrSecurity#Document_Level_Security> > > -- > Jan Høydahl, search solution architect > Cominvent AS - www.cominvent.com > > > 18

Empty facets on TextField

2016-10-18 Thread John Davis
n, it feels like this should just work. I would appreciate any input. Search query: /select/?facet.field=FACET_FIELD_NAME&facet=on&indent=on&q=QUERY_STRING&wt=json Interestingly facets are returned if I change facet.method to enum instead of default fc. John

Re: Empty facets on TextField

2016-10-18 Thread John Davis
l exist in the index for this > field (just with no values), and that normal faceting would use those. > Forcing facet.method=enum forces the use of the index instead of > docvalues (or the fieldcache if the field is configured w/o > docvalues). > > -Yonik > > On Tue, Oct 18

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread John Bickerstaff
Thank you both! Very helpful. On Wed, Oct 19, 2016 at 8:48 AM, Shawn Heisey wrote: > On 10/18/2016 3:00 PM, John Bickerstaff wrote: > > How (or is it even wise) to "segregate data" in Solr so that some data > > can be seen by some users and some data not be seen? >

Re: Public/Private data in Solr :: Metadata or ?

2016-10-19 Thread John Bickerstaff
always better to do > the work up-front during queries > > Best, > Erick > > On Wed, Oct 19, 2016 at 12:07 PM, John Bickerstaff > wrote: > > Thank you both! Very helpful. > > > > On Wed, Oct 19, 2016 at 8:48 AM, Shawn Heisey > wrote: > > > >>

Re: Result Grouping vs. Collapsing Query Parser -- Can one be deprecated?

2016-10-19 Thread John Bickerstaff
Thank you for posting that. I'll be saving it in my "important painful lessons learned by others" mail folder. On Oct 19, 2016 4:51 PM, "Mike Lissner" wrote: > Hi all, > > I've had a rotten day today because of Solr. I want to share my experience > and perhaps see if we can do something to fix

Re: (solrcloud) Importing documents into "implicit" router

2016-10-20 Thread John Bickerstaff
e document belongs. If the field specified is missing in the document, however, the document will be rejected. You could also use the _route_ parameter to name a specific shard. On Thu, Oct 20, 2016 at 12:12 PM, John Bickerstaff wrote: > This may help? https://cwiki.apache.org/confluence

Re: (solrcloud) Importing documents into "implicit" router

2016-10-20 Thread John Bickerstaff
This may help? https://cwiki.apache.org/confluence/display/solr/Shards+and+Indexing+Data+in+SolrCloud On Thu, Oct 20, 2016 at 12:09 PM, Customer wrote: > Hey, > > I hope you all are doing well.. > > I got a router with "router.name=implicit" with couple of shards (lets > call them shardA and sha

Re: Load balancing with solr cloud

2016-10-20 Thread John Bickerstaff
Others on the list are more expert, but I think your #1 Zookeeper will not get hammered. As I understand it, Solr itself (the leader) will handle farming out the work to the other two Solr nodes. The amount of traffic on the Zookeeper instances should be minimal. Now - could your SolrCloud of 3

Re: Query formulation help

2016-10-26 Thread John Bickerstaff
It looks to me as if it's blowing up on syntax. I don't have access to the Admin UI right now, but I would suggest attempting to submit this query via the UI and examining the URL that comes back. That frequently solves my more frustrating syntax problems. I.E. try putting the cost:[...] in

Re: Query formulation help

2016-10-26 Thread John Bickerstaff
Ahh - I see what you're after (I think) This page should be helpful for you: https://cwiki.apache.org/confluence/display/solr/Function+Queries again, I'd try using the Admin UI as a test phase to get things right (and see the syntax in the URL that comes back on the response) Open the edismax s

Re: Query formulation help

2016-10-26 Thread John Bickerstaff
For what it's worth- you can do some complex stuff - including using document fields as "variables" -- I did it on an Solr query endpoint (like /search) because I had stuff that was constant for every query. The syntax is challenging, but it can be done. I won't confuse the issue more unless you

Re: How-To: Secure Solr by IP Address

2016-11-04 Thread john saylor
hi any firewall worth it's name should be able to do this. in fact, that is one of several things that a firewall was designed to do. also, you are stopping this traffic at the application, which is good; but you'd prolly be better off stopping it at the network interface [using a firewall,

Facets based on sampling

2016-11-04 Thread John Davis
hould be able to extrapolate well. Thank you in advance for any advice! John

Re: load balancer for solr

2016-11-07 Thread John Bickerstaff
So from a "load balancing" perspective, you may want to consider using SolrCloud... It was designed (as I understand it) to be able to handle large loads, although to understand how it will work in your situation you will have to test. On Mon, Nov 7, 2016 at 6:29 AM, Mugeesh Husain wrote: > Tha

Re: Editing schema and solrconfig files

2016-11-09 Thread John Bickerstaff
I never found a way to do it through the UI... and ended up using "nano" on linux for simple things. For more complex stuff, I scp'd the file (or the whole conf directory) up to my dev box (a Mac in my case) and edited in a decent UI tool, then scp'd the whole thing back... I wrote a simple bash

empty strings outputting to numeric field types

2016-11-15 Thread John Blythe
hi all. i'm outputting our data to xml format for solr to consume. i have several fields storing dollar values as tdouble. they don't always exist in the outputted rows, however, at which point they throw an error and fail at indexing because the field is seen as an empty string (the log message:

Re: empty strings outputting to numeric field types

2016-11-16 Thread John Blythe
Ahh, makes sense. Thanks for the clarity, Hoss. Best- -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Tue, Nov 15, 2016 at 5:05 PM, Chris Hostetter wrote: > > : fields storing dollar

Failure to load shards

2017-06-09 Thread John Bickerstaff
Hi all, Here's my situation... In AWS with zookeeper / solr. When trying to spin up additional Solr boxes from an "auto scaling group" I get this failure. The code used is exactly the same code that successfully spun up the first 3 or 4 solr boxes in each "auto scaling group" Below is a copy o

Re: Failure to load shards

2017-06-09 Thread John Bickerstaff
inging up Solr boxes (although I need to check) I appreciate the hint to check the Overseer queue - I'll be doing that for sure... On Fri, Jun 9, 2017 at 12:19 PM, Erick Erickson wrote: > John: > > First place I'd look is the ZooKeeper Overseer queue. Prior to 6.6 > t

Re: Failure to load shards

2017-06-13 Thread John Bickerstaff
roduced in 6 that the patch uses/depends on? Thanks... On Fri, Jun 9, 2017 at 12:03 PM, John Bickerstaff wrote: > Hi all, > > Here's my situation... > > In AWS with zookeeper / solr. > > When trying to spin up additional Solr boxes from an "auto scaling group&qu

stats and pivots and sums

2017-07-06 Thread John Blythe
hi all. i'm attempting to find similar purchases for a user. the volume of purchase helps dictate the price point that they can expect. as such, i'm attempting to determine the sum of the quantity field across all purchases per user. i've got something like this as of yet: facet=on& > stats=true

Re: stats and pivots and sums

2017-07-06 Thread John Blythe
will check it out, thanks- -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Thu, Jul 6, 2017 at 12:37 PM, Erick Erickson wrote: > Have you looked at the JSON facet capabilities? It might work

missing documents after restart

2017-08-08 Thread John Blythe
hi all. i have a core that contains about 22 million documents. when the solr server is restarted it drops to 200-400k. the dashbaord says that it's both optimized and current. is there config issues i need to address in solr or the server? not really sure where to begin in hunting this down. th

JSON Logs in SOLR 5.x

2017-08-08 Thread John Bickerstaff
I'm running Solr 5.x and have the need to push logs into AWS's kinesis firehose. As I understand it, I need the logs to be in JSON format. This page: https://cwiki.apache.org/confluence/display/solr/Configuring+Logging Tells me that SOLR is using Log4J version 1.2 I've played with Log4J con

Re: missing documents after restart

2017-08-08 Thread John Blythe
u kill Solr with evil intent. But that's a long chain > of "ifs" > > Best, > Erick > > On Tue, Aug 8, 2017 at 6:02 AM, John Blythe wrote: > > hi all. > > > > i have a core that contains about 22 million documents. when the solr > > ser

Re: missing documents after restart

2017-08-11 Thread John Blythe
Looks like part of our nightly processing was restarting the solr server before all indexing was done bc of using a blunt object approach of doing so at designated times, doh! On Tue, Aug 8, 2017 at 9:35 PM John Blythe wrote: > Thanks Erick. I don't think all of those ifs are in place.

Re: missing documents after restart

2017-08-12 Thread John Blythe
haha, can't have that now! -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Fri, Aug 11, 2017 at 2:44 PM, Erick Erickson wrote: > Thanks for closing this out, I was breaking out in hives

Re: Comparison of Solr with Sharepoint Search

2017-08-14 Thread John Bickerstaff
Anything that Solr searches must exist in the Solr index. You could do it, but you'd need to get those documents into Solr "format" and indexed by Solr... Then you could return results that include a link to the actual doc in SharePoint. On Mon, Aug 14, 2017 at 11:08 AM, jmahuang wrote: > Sir,

entities, keepwords, and confusion

2017-08-16 Thread John Blythe
hi all, i'm attempting to define some entities in a keepwords file that a group of fields will use as other fields are copied into them. my index analyzer is defined with a KeyWord tokenizer followed by the Keepword filter and a LowerCase for the finale. my query analyzer uses WhiteSpace for the

Solr Logs to ELK / AWS Firestream

2017-08-17 Thread John Bickerstaff
All, I'm trying to get Solr logs into AWS Firestream. Not having a lot of luck. Does anyone out there have any experience getting Solr logs into an ELK stack? Or, better yet, getting Solr Logs into AWS Firestream? We direct logs to SLF4J and use logback as our SLF4j implementation. I have a n

solr-map-reduce:jar:6.6.0

2017-08-29 Thread John Bickerstaff
All, I'm looking at upgrading some java projects to solr and lucene 6.6.0. I get an error when trying to "mvn package" one of them - it complained about the solr-map-reduce:jar:6.6.0 not being available. I went here: https://mvnrepository.com/artifact/org.apache.solr/solr-map-reduce And it look

Re: solr-map-reduce:jar:6.6.0

2017-08-29 Thread John Bickerstaff
A quick search of the code's imports suggests we aren't actually using it. However I see it explicitly mentioned in the POM. I'll have to do more digging to be sure... On Tue, Aug 29, 2017 at 3:44 PM, Hrishikesh Gadre wrote: > Hi John, > > The map-reduce contrib modu

Re: solr-map-reduce:jar:6.6.0

2017-08-29 Thread John Bickerstaff
If we do need that functionality somehow, what is the recommended approach? Was it replaced by something? On Tue, Aug 29, 2017 at 4:27 PM, John Bickerstaff wrote: > A quick search of the code's imports suggests we aren't actually using it. > However I see it explicitly ment

Re: Solrcloud configuration

2017-09-19 Thread John Bickerstaff
This may be of some assistance... http://lucene.apache.org/solr/guide/6_6/ There is a section discussing sharding and another section that includes the schema. On Tue, Sep 19, 2017 at 1:42 PM, Shashi Roushan wrote: > Hello David > > No, I didn't read any documentation on the schema and DIH. >

Re: Solrcloud configuration

2017-09-19 Thread John Bickerstaff
This may also be of some assistance: https://gist.github.com/maxivak/3e3ee1fca32f3949f052 I haven't tested, just found it. On Tue, Sep 19, 2017 at 4:10 PM, John Bickerstaff wrote: > This may be of some assistance... > > http://lucene.apache.org/solr/guide/6_6/ > >

Re: Solrcloud configuration

2017-09-19 Thread John Bickerstaff
Addendum: It's not sql server, but I imagine the steps will be similar if not identical except for the details of the JDBC driver you need. On Tue, Sep 19, 2017 at 4:11 PM, John Bickerstaff wrote: > This may also be of some assistance: > > https://gist.github.com/maxivak/3e3ee1

tipping point for using solrcloud—or not?

2017-09-29 Thread John Blythe
er discussion at the conference i'm combing through our configs to make sure we trim any fat we can. also wanting to get optimization scheduled more regularly to help out w segmentation and garbage heap. not sure how far those two alone will get us, though. thanks for any thoughts! -- John Blythe

Re: tipping point for using solrcloud—or not?

2017-10-02 Thread John Blythe
splitting up the machine if there isn't a jvm load issue we're currently experiencing? i can def provide more info that could help in the discussion. help me know the best way / stuff to send if you can please. thanks again for the help guys- -- John Blythe On Fri, Sep 29, 201

Re: tipping point for using solrcloud—or not?

2017-10-02 Thread John Blythe
. > > But otherwise I'd be using SolrCloud. > > Best, > Erick > > On Mon, Oct 2, 2017 at 1:48 PM, John Blythe wrote: > > thanks for the responses, guys. > > > > erick: we do need NRT in several cases. also in need of HA pending where > > the line is dra

Re: Solr cloud planning

2017-10-03 Thread John Bickerstaff
Others will comment I'm sure, but I'm suspicious the answer is that you can only know by testing. The kind and quantity of data in your indexes - the type of queries you normally receive, etc, etc... All these things will have an affect on your speed. For sure you don't want 15 Zookeepers. That

length of indexed value

2017-10-03 Thread John Blythe
field to be indexed with the length of what i know will end up being the indexed value's length am i missing out on an easier, more straight forward solution? thanks! -- John Blythe

Re: length of indexed value

2017-10-04 Thread John Blythe
be counted as a match. -- John Blythe On Wed, Oct 4, 2017 at 6:22 AM, alessandro.benedetti wrote: > Are the norms a good approximation for you ? > If you preserve norms at indexing time ( it is a configuration that you can > operate in the schema.xml) you can retrieve them with this

Re: length of indexed value

2017-10-04 Thread John Blythe
ah, thanks for the link. -- John Blythe On Wed, Oct 4, 2017 at 9:23 AM, Erick Erickson wrote: > Check. The problem is they don't encode the exact length. I _think_ > this patch shows you'd be OK with shorter lengths, but check: > https://issues.apache.org/jira/browse/LUCENE-

Schemaless detecting multivalued fields

2017-10-18 Thread John Davis
tlong while if it is list of longs then use tlongS. Thanks! John

Re: Facets based on sampling

2017-10-20 Thread John Davis
een on my TODO list for the JSON Facet API. > How much it would help depends on where the bottlenecks are, but that > in conjunction with a hashing approach to collection (assuming field > cardinality is high) should definitely help. > > -Yonik > > > On Fri, Nov 4, 2016 at 3

Really slow facet performance in 6.6

2017-10-23 Thread John Davis
{ "appliedMethod": "UIF", "elapse": 8310, "field": "level", "inputDocSetSize": 476, "requestedMethod": "UIF" } ] } Thanks John

Re: Facets based on sampling

2017-10-23 Thread John Davis
Docvalues don't work for multivalued fields. I just started a separate thread with more debug info. It is a bit surprising why facet computation is so slow even when the query matches hundreds of docs. On Mon, Oct 23, 2017 at 6:53 AM, alessandro.benedetti wrote: > Hi John, > first of

Re: Facets based on sampling

2017-10-24 Thread John Davis
> Erick > > > On Tue, Oct 24, 2017 at 1:08 AM, Emir Arnautović > wrote: > > Hi John, > > Did you mean “docValues don’t work for analysed fields” since it works > for multivalue string (or other supported types) fields. What you need to > do is to convert your

DelimitedPayloadTokenFilterFactory missing from ref guide

2017-11-22 Thread John Anonymous
DelimitedPayloadTokenFilterFactory appears to be missing from this page: https://lucene.apache.org/solr/guide/7_1/filter-descriptions.html

Re: DelimitedPayloadTokenFilterFactory missing from ref guide

2017-11-22 Thread John Anonymous
umentation section here: > https://wiki.apache.org/solr/HowToContribute > > Best, > Erick > > On Wed, Nov 22, 2017 at 3:37 PM, John Anonymous wrote: > > DelimitedPayloadTokenFilterFactory appears to be missing from this page: > > https://lucene.apache.org/solr/guide/7_1/filter-descriptions.html >

does the payload_check query parser have support for simple query parser operators?

2017-11-29 Thread John Anonymous
I would like to use wildcards and fuzzy search with the payload_check query parser. Are these supported? {!payload_check f=text payloads='NOUN'}apple~1 {!payload_check f=text payloads='NOUN'}app* Thanks

Re: does the payload_check query parser have support for simple query parser operators?

2017-11-30 Thread John Anonymous
Ok, thanks. Do you know if there are any plans to support special syntax in the future? On Thu, Nov 30, 2017 at 5:04 AM, Erik Hatcher wrote: > No it doesn’t. The payload parsers currently just simple tokenize with > no special syntax supported. > > Erik > > > On Nov 30,

Solr index size statistics

2017-12-02 Thread John Davis
Hello, Is there a way to get index size statistics for a given solr instance? For eg broken by each field stored or indexed. The only things I know of is running du on the index data files and getting counts per field indexed/stored, however each field can be quite different wrt size. Thanks John

PayloadScoreQuery always returns score of zero

2017-12-13 Thread John Anonymous
The PayloadScoreQuery always returns a score of zero, regardless of payloads. The PayloadCheckQParser works fine, so I know that I am successfully indexing the payloads. Details below *payload field that I am searching on:* *definition of payload field type:*

SolrCloud

2017-12-15 Thread John Davis
SolrCloud? Thanks, John

Re: SolrCloud

2017-12-15 Thread John Davis
== > new_collection, basically all your routing is the same. You can create > aliases pointing to multiple collections or specify multiple > collections on the query, don't know if that fits your use case or not > though. > > > Best, > Erick > > On Fri, Dec 15, 2017 a

Highlighting, offsets -- external doc store

2016-11-29 Thread John Bickerstaff
All, One of the questions I've been asked to answer / prove out is around the question of highlighting query matches in responses. BTW - One assumption I'm making is that highlighting is basically a function of storing offsets for terms / tokens at index time. If that's not right, I'd be gratefu

Documentation of Zookeeper's specific roles and functions in Solr Cloud?

2016-11-29 Thread John Bickerstaff
All, I've thought I understood that Solr search requests are made to the Solr servers and NOT Zookeeper directly. (I.E. Zookeeper doesn't decide which Solr server responds to requests and requests are made directly to Solr) My new place tells me they're sending requests to Zookeeper - and those

Re: Documentation of Zookeeper's specific roles and functions in Solr Cloud?

2016-11-29 Thread John Bickerstaff
s sent to Zookeeper to determine the live nodes. If indexing, > >> CloudSolrClient can find the leader and send documents directly there. > The > >> client then uses that information to query the correct nodes directly. > >> > >> Zookeeper is not forwarding request

Re: Highlighting, offsets -- external doc store

2016-11-29 Thread John Bickerstaff
hing in the index is what's made it through the > >> analysis chains. So stopwords are missing. Stemming > >> has been done. You could even have put a phonetic filter > >> in there and have terms like ARDT KNTR which would > >> be...er...not very useful to show t

Re: Highlighting, offsets -- external doc store

2016-11-29 Thread John Bickerstaff
have been used in any Solr > 4.10) > > -Simon > > On Tue, Nov 29, 2016 at 11:43 AM, John Bickerstaff < > j...@johnbickerstaff.com > > wrote: > > > All, > > > > One of the questions I've been asked to answer / prove out is around the > > questi

Re: boost solr results using users behavior

2016-11-29 Thread John Bickerstaff
This slideshow / presentation may give you some idea of the complexity involved... No, nothing like this in Solr itself. At least one approach is to mine your logs for user behavior and use that information as a starting point for either an external machine learning piece, or for just fine-tuning

Re: solr audit logging

2016-12-06 Thread John Bickerstaff
If you can identify currently-logged messages that give you what you need (even if you have to modify or process them afterwards) you can easily make a custom log4j config that grabs ONLY what you want and dumps it into a separate file... I'm pretty sure I've seen all the request coming through in

Re: solr audit logging

2016-12-06 Thread John Bickerstaff
ng back and let us know what you find in the logs and if you want more "advice" -- which you should always take with a grain of salt... On Tue, Dec 6, 2016 at 3:56 PM, John Bickerstaff wrote: > If you can identify currently-logged messages that give you what you need > (even i

cores vs. instances vs. zookeeper vs. cloud vs ?

2016-12-16 Thread John Blythe
good morning everyone. i've got a crowing number of cores that various parts of our application are relying upon. i'm having difficulty figuring out the best way to continue expanding for both sake of scale and convenience. i need two extra versions of each core due to our demo instance and our d

Re: cores vs. instances vs. zookeeper vs. cloud vs ?

2016-12-16 Thread John Blythe
loud or something it's even capable of? thanks again, erick! -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Fri, Dec 16, 2016 at 11:38 AM, Erick Erickson wrote: > It's not quite clear

Re: cores vs. instances vs. zookeeper vs. cloud vs ?

2016-12-16 Thread John Blythe
ts on solrcloud? seems like it to me, but hope to confirm that before investing time in the wrong direction. thanks! -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Fri, Dec 16, 2016 at 1:59 PM, Erick

Re: cores vs. instances vs. zookeeper vs. cloud vs ?

2016-12-16 Thread John Blythe
cloud, two entirely different approaches to scaling out. very much helpful to see how off balance i was on that assumption! thanks shawn- -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Fri, Dec 16, 2

Stats component's percentiles are incorrect

2016-12-19 Thread John Blythe
hi, all. i've begun recruiting solr stats for some nifty little insights for our users' data. it seems to be running just fine in most cases, but i have noticed that there is a fringe group of results that seem to have incorrect data. for instance, one query returns the following output; 3900

Re: Stats component's percentiles are incorrect

2016-12-19 Thread John Blythe
d to the range itself rather than the concentration of distinct values? thanks for any continued insight here! -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Mon, Dec 19, 2016 at 3:12 PM, Toke Eskilds

Re: Stats component's percentiles are incorrect

2016-12-19 Thread John Blythe
gotcha. yup, that was the back up plan so i think i'll go that route for now. thanks for the info! best, -- *John Blythe* Product Manager & Lead Developer 251.605.3071 | j...@curvolabs.com www.curvolabs.com 58 Adams Ave Evansville, IN 47713 On Mon, Dec 19, 2016 at 3:41 PM, Toke

Re: Stats component's percentiles are incorrect

2016-12-19 Thread John Blythe
very good point, walter. i think we could find some cool ways to leverage this intelligence for our users after serving up the flattened version based on the simple range that they're expecting to see. the clarity is helpful in getting some creative ideas moving, so thanks. best, --

Re: Is it possible to rewrite part of the solr response?

2016-12-21 Thread John Bickerstaff
It would be custom code and I have something along those lines, although it throws an error instead of changing the response... Rushing now and can't go into more detail right now. On Dec 21, 2016 6:57 PM, "Mike Thomsen" wrote: > We're trying out some ideas on locking down solr and would like t

Re: Is it possible to rewrite part of the solr response?

2016-12-21 Thread John Bickerstaff
Awesome explanation Eric, I'll be filing this away for future reference. On Dec 21, 2016 7:08 PM, "Erick Erickson" wrote: "grab the response" is a bit ambiguous here in Solr terms. Sure, a SearchComponent (you can write a plugin) gets the response, but it only sees the final list being returned

ordering results by the min and max difference / converting mysql into solr query

2016-12-30 Thread John Blythe
hi everyone. hope you all had a great christmas! i'm having trouble converting an example mysql script into a solr query. here's my preliminary query: select vendorItem, min(unitPrice), max(unitPrice), -(min(unitPrice) - > max(unitPrice)) as `diff` > from transactions > where orgId IN (x,y,z) > a

Re: Empty facets on TextField

2017-01-06 Thread John Davis
On Tue, Oct 18, 2016 at 11:02 PM, Yonik Seeley wrote: > > A delete-by-query of *:* may do it (because it special cases to > > removing the index). > > The underlying issue is when lucene merges a segment without docvalues > > with a segment that has them. > > -Yo

Referencing a !key and !stat in facet.pivot

2017-01-12 Thread John Blythe
hi all i'm having an issue with an attempt to assign a key to a facet.pivot while simultaneously referencing one of my stat fields. i've got something like this: stats.field={!tag=pivot_stats}lastPrice& > ... > facet.pivot={!key=pivot} {!stats=pivot_stats}buyer,vendor& ... i've attempted it wi

Re: Referencing a !key and !stat in facet.pivot

2017-01-15 Thread John Blythe
aceting#Faceting-CombiningStatsComponentWithPivots > > > https://cwiki.apache.org/confluence/display/solr/Faceting#Faceting-LocalParametersforFaceting > > > https://cwiki.apache.org/confluence/display/solr/The+Stats+Component#TheStatsComponent-LocalParameters > > > > > > : Date: Thu, 12 J

<    1   2   3   4   5   6   7   8   9   >