Re: Why use a different analyzer for "index" and "query"?

2020-09-10 Thread Tim Casey
okens in a time field, so you dont get names of people 'june' while searching for 'jun', for instance. tim On Thu, Sep 10, 2020 at 10:08 AM Walter Underwood wrote: > It is very common for us to do more processing in the index analysis > chain. In general, we do that

Re: Dynamic Stopwords

2020-05-15 Thread Tim Casey
er to have an honest index and allow the post analysis to change. This way you can change it 10 times a day and no one will care. If you are interested in a word cloud I would suspect people have done a reasonable job around this using a solr index already. tim On Fri, May 15, 2020 at 1:48 PM A

Re: Dynamic Stopwords

2020-05-15 Thread Tim Casey
, you can build the symbol space from bigrams. If I ever write a book the title is going to be "The The". I hope it has multi-lingual translations. Although, at this point, it is a very short book :/ tim On Fri, May 15, 2020 at 11:43 AM Walter Underwood wrote: > Right. I might us

Re: Tuning for 500+ field schemas

2020-03-18 Thread Tim Robertson
> Erick > > On Wed, Mar 18, 2020, 12:04 Edward Ribeiro > wrote: > > > What are your hard and soft commit settings? This can have a large > > impact on the writing throughput. > > > > Best, > > Edward > > > > On Wed, Mar 18, 2020 at 11:43 AM Tim Ro

Re: Tuning for 500+ field schemas

2020-03-18 Thread Tim Robertson
f fields and/or many > rows, this shouldn’t run “for many minutes”, but it’s something to look for. > > When this happens, what is your query response time like? I’m assuming > it’s very slow. > > But these are all shots in the dark, some thread dumps would be where I’d > start. &g

Tuning for 500+ field schemas

2020-03-18 Thread Tim Robertson
you would do to tune Solr for large amounts of dynamic fields? Does anyone have a guess on what the single high CPU node is doing (some kind of metrics aggregation maybe?). Thank you all, Tim [1] [image: image.png]

Re: cursorMark and shards? (6.6.2)

2020-02-10 Thread Tim Casey
Walter, When you do the query, what is the sort of the results? tim On Mon, Feb 10, 2020 at 8:44 PM Walter Underwood wrote: > I’ll back up a bit, since it is sort of an X/Y problem. > > I have an index with four shards and 17 million documents. I want to dump > all the docs in

Re: ConcurrentModificationException in SolrInputDocument writeMap

2019-11-18 Thread Tim Swetland
stException (java.lang.String cannot be cast to java.util.Map) on the replica as in issue SOLR-13471 <https://issues.apache.org/jira/browse/SOLR-13471>. Anyway, thanks for the insight everyone, Tim On Fri, Nov 8, 2019 at 12:26 AM Shawn Heisey wrote: > On 11/6/2019 8:17 AM, Tim Swetland wrot

Re: ConcurrentModificationException in SolrInputDocument writeMap

2019-11-06 Thread Tim Swetland
Nevermind my comment on not having this problem in 8.1. We do have it there as well, I just didn't look far enough back in our logs on my initial search. Would still appreciate whatever thoughts anyone might have on the exception. On Wed, Nov 6, 2019 at 10:17 AM Tim Swetland wrote:

ConcurrentModificationException in SolrInputDocument writeMap

2019-11-06 Thread Tim Swetland
I'm currently running into a ConcurrentModificationException ingesting data as we attempt to upgrade from Solr 8.1 to 8.2. It's not every document, but it definitely appears regularly in our logs. We didn't run into this problem in 8.1, so I'm not sure what might have changed. I feel like this is p

BlobRepository "runtme.lib.size"

2019-11-04 Thread Tim Swetland
rrect me if I'm wrong. Thanks, Tim

Re: Position search

2019-10-16 Thread Tim Casey
c segments. I think you will find the last N tokens of a document have some odd categories within the search results. I might guess you have a different purpose in mind. Either way, you would likely do better to segment what you are searching. tim On Mon, Oct 14, 2019 at 11:25 PM Kaminski,

Re: Position search

2019-10-15 Thread Tim Casey
particularly short messages. So I would expect a small set of side fields remarking this. This would allow you to carry the measures along with the data. tim On Tue, Oct 15, 2019 at 12:19 PM Alexandre Rafalovitch wrote: > Is the 100 words a hard boundary or a soft one? > > If it is a

Re: Re: Need urgent help with Solr spatial search using SpatialRecursivePrefixTreeFieldType

2019-09-30 Thread Tim Casey
https://stackoverflow.com/questions/48348312/solr-7-how-to-do-full-text-search-w-geo-spatial-search On Mon, Sep 30, 2019 at 10:31 AM Anushka Gupta < anushka_gu...@external.mckinsey.com> wrote: > Hi, > > I want to be able to filter on different cities and also sort the results > based on geoproxi

Re: Indexing information on number of attachments and their names in EML file

2019-08-02 Thread Tim Allison
I'd strongly recommend rolling your own ingest code. See Erick's superb: https://lucidworks.com/post/indexing-with-solrj/ You can easily get attachments via the RecursiveParserWrapper, e.g. https://github.com/apache/tika/blob/master/tika-parsers/src/test/java/org/apache/tika/parser/RecursiveParse

Re: Encrypting Solr Index

2019-06-25 Thread Tim Casey
My two cents worth of comment, For our local lucene indexes we use AES encryption. We encrypt the blocks on the way out, decrypt on the way in. We are using a C version of lucene, not the java version. But, I suspect the same methodology could be applied. This assumes the data at rest is the at

Re: Solr query with long query

2019-05-30 Thread Tim Casey
if need be. (Be wary of over generation if one of the categories turns out to be 'thin'). Then in the filter query you can query over a category, or simply require a category:thing to be in the query. tim On Thu, May 30, 2019 at 3:33 PM Shawn Heisey wrote: > On 5/30/2019 4:13 PM, V

Re: problem indexing GPS metadata for video upload

2019-05-10 Thread Tim Allison
de Solr as soon as Tika is out (I also mean it this time). *TM by Erick Erickson On Fri, May 3, 2019 at 3:44 AM Where is Where wrote: > > Thank you very much Tim, I wonder how to make the Tika change apply to > Solr? I saw Tika core, parse and xml jar files tika-core.jar > tika-parse

Re: problem indexing GPS metadata for video upload

2019-05-02 Thread Tim Allison
Sorry build #182: https://builds.apache.org/job/tika-branch-1x/ On Thu, May 2, 2019 at 12:01 PM Tim Allison wrote: > > I just pushed a fix for TIKA-2861. If you can either build locally or > wait a few hours for Jenkins to build #182, let me know if that works > with straight

Re: problem indexing GPS metadata for video upload

2019-05-02 Thread Tim Allison
I just pushed a fix for TIKA-2861. If you can either build locally or wait a few hours for Jenkins to build #182, let me know if that works with straight tika-app.jar. On Thu, May 2, 2019 at 5:00 AM Where is Where wrote: > > Thank you Alex and Tim. > I have looked at the solrconfig.xm

Re: problem indexing GPS metadata for video upload

2019-05-01 Thread Tim Allison
Related? https://issues.apache.org/jira/plugins/servlet/mobile#issue/TIKA-2861 On Wed, May 1, 2019 at 8:09 AM Alexandre Rafalovitch wrote: > What happens when you run it against a standalone Tika (recommended option > anyway)? Do you see the relevant fields? > > Not every Tika field is capture

Re: SOLR Text Field

2019-04-06 Thread Tim Allison
TextField is a classname. Look in managedschema and pick a field type by name, e.g. text_general On Sat, Apr 6, 2019 at 9:00 AM Dave Beckstrom wrote: > Hi Everyone, > > I'm really hating SOLR. All I want is to define a text field that data > can be indexed into and which is searchable. Should

Spatial Search using two separate fields for lat and long

2019-04-03 Thread Tim Hedlund
? The reason I want to keep the fields as two separate ones is that I want to be able to export from solr back to exact same excel file structure, i.e. solr fields maps exactly to excel columns. I'm using solr 7. Any thoughts or suggestions would be appreciated. Regards Tim

Re: Java 9 & solr 7.7.0

2019-03-25 Thread Tim Underwood
/index.html -Tim On Mon, Mar 25, 2019 at 10:51 AM Jay Potharaju wrote: > I just learnt that java 11 is . Is anyone using open jdk11 in > production? > Thanks > > > > On Mar 23, 2019, at 5:15 PM, Jay Potharaju > wrote: > > > > I have not kept up with jdk vers

Re: Java 9 & solr 7.7.0

2019-03-23 Thread Tim Underwood
We are successfully running Solr 7.6.0 (and 7.5.0 before it) on OpenJDK 11 without problems. We are also using G1. We do not use Solr Cloud but do rely on the legacy replication. -Tim On Sat, Mar 23, 2019 at 10:13 AM Erick Erickson wrote: > I am, in fact, trying to get a summary of all t

Why is elevate not working when I convert a request to local parameters?

2019-03-22 Thread Tim Allison
4.x...y, I know... What am I doing wrong? How can I fix this? Thank you. Best, Tim

Re: Help with a DIH config file

2019-03-15 Thread Tim Allison
Haha, looks like Jörn just answered this... onError="skip|continue" >greatly preferable if the indexing process could ignore exceptions Please, no. I'm 100% behind the sentiment that DIH should gracefully handle Tika exceptions, but the better option is to log the exceptions, store the stacktrace

Re: [CDCR]Unable to locate core

2019-02-07 Thread Tim
So it looks like I'm having an issue with this fix: https://issues.apache.org/jira/browse/SOLR-11724 So I've messed around with this for a while and every time the leader to leader replica portion works fine. But the Recovery portion (implemented as part of the fix above) fails. I've run a few t

Re: by: java.util.zip.DataFormatException: invalid distance too far back reported by Solr API

2019-02-05 Thread Tim Allison
>At the end of the day it would be a much better architecture to parse the > PDFs using plain standalone TikaServer +1 Also, note that we added a -spawnChild switch to tika-server that will run the server in a child process and kill+restart the child process if there is an infinite loop/oom/segfa

Re: [CDCR]Unable to locate core

2019-02-02 Thread Tim
Thank you for the reply. Sorry I did not include more information in the first post. So maybe there's some confusion here from my end. So both the target and source clusters are running in cloud mode. So I think you're correct that it is a different issue. So it looks like the source leader to ta

Re: [CDCR]Unable to locate core

2019-02-01 Thread Tim
After some more investigation it seems that we're running into the same bug found here . However if my understanding is correct that bug in 7.3 was patched out. Unfortunately we're running into the same behavior in 7.5 CDCR is replicating succe

[CDCR]Unable to locate core

2019-01-30 Thread Tim
I'm trying to setup CDCR but I'm running into an issue where one or two shards/replicas will not be replicated but the rest will out of the six cores. The only error that appears in the logs is: "Unable to locate core". Occasionally restarting the instance will fix this but then the issue will r

TokenizerChain.getMultiTermAnalyzer().normalize() no longer normalizes multiterms in 8.x?!

2019-01-25 Thread Tim Allison
All, I don't know if this change was intended, but it feels like a bug to me... TokenFilterFactory[] filters = new TokenFilterFactory[2]; filters[0] = new LowerCaseFilterFactory(Collections.EMPTY_MAP); filters[1] = new ASCIIFoldingFilterFactory(Collections.EMPTY_MAP); TokenizerChain chain = new

Re: 8.0.0-SNAPSHOT snapshot repo poms broken?

2019-01-17 Thread Tim Allison
User error..please ignore. On Thu, Jan 17, 2019 at 4:36 PM Tim Allison wrote: > > All, > I recently tried to upgrade a project that relies on the snapshot > repos[1], but maven wasn't able to pull lucene-highlighter, > lucene-test-framework, lucene-memory, among a

8.0.0-SNAPSHOT snapshot repo poms broken?

2019-01-17 Thread Tim Allison
ually checked that the jars and poms for the artifacts that maven wasn't able to pull were in fact there. Is this user error or something wrong with the poms or something else? Thank you. Best, Tim [1] apache-snapshot

Re: Content from EML files indexing from text/html (which is not clean) instead of text/plain

2019-01-17 Thread Tim Allison
Y, I tracked this down within Solr. This is a feature, not a bug. I found a solution (set {{captureAttr}} to {{true}}): https://issues.apache.org/jira/browse/TIKA-2814?focusedCommentId=16745263&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16745263 Please, though,

Re: How to handle List in Solr 6.6

2018-11-06 Thread Tim Underwood
itly delete the parent and child documents. There are a number of JIRA tickets floating around relating to cleaning up the user experience for this. -Tim [1] https://lucene.apache.org/solr/guide/6_6/uploading-data-with-index-handlers.html#UploadingDatawithIndexHandlers-NestedChildDocuments [2] http

Re: Solr OCR Support

2018-11-02 Thread Tim Allison
to ding Nuance (or tesseract), I just wish to point out that > what to OCR is important, because OCR works well when it has good input. > > > -Original Message- > > From: Tim Allison > > Sent: Friday, November 2, 2018 11:03 AM > > To: solr-user@lucene.apach

Re: Solr OCR Support

2018-11-02 Thread Tim Allison
OCR'ing of PDFs is fiddly at the moment because of Tika, not Solr! We have an open ticket to make it "just work", but we aren't there yet (TIKA-2749). You have to tell Tika how you want to process images from PDFs via the tika-config.xml file. You've seen this link in the links you mentioned: ht

Re: Tesseract language

2018-10-27 Thread Tim Allison
Martin, Let’s move this over to user@tika. Rohan, Is there something about Tika’s use of tesseract for image files that can be improved? Best, Tim On Sat, Oct 27, 2018 at 3:40 AM Rohan Kasat wrote: > I used tess4j for image formats and Tika for scanned PDFs and images wit

Re: Tesseract language

2018-10-26 Thread Tim Allison
Tika relies on you to install tesseract and all the language libraries you'll need. If you can successfully call `tesseract testing/eurotext.png testing/eurotext-dan -l dan`, Tika _should_ be able to specify "dan" with your code above. On Fri, Oct 26, 2018 at 10:49 AM Martin Frank Hansen (MHQ) wr

Re: Reading data using Tika to Solr

2018-10-26 Thread Tim Allison
ion: > https://wiki.apache.org/tika/RecursiveMetadata > > But thanks again for all your help! > > -Original Message- > From: Martin Frank Hansen (MHQ) > Sent: 26. oktober 2018 10:14 > To: solr-user@lucene.apache.org > Subject: RE: Reading data using Tika to Sol

Re: Reading data using Tika to Solr

2018-10-26 Thread Tim Allison
you’re wondering why you might upgrade to 1.19.1, look no further than: https://tika.apache.org/security.html On Fri, Oct 26, 2018 at 4:14 AM Martin Frank Hansen (MHQ) wrote: > Hi Tim, > > It is msg files and I added tika-app-1.14.jar to the build path - and now > it works 😊 But

Re: Reading data using Tika to Solr

2018-10-25 Thread Tim Allison
If you’re processing actual msg (not eml), you’ll also need poi and poi-scratchpad and their dependencies, but then those msgs could have attachments, at which point, you may as just add tika-app. :D On Thu, Oct 25, 2018 at 2:46 PM Martin Frank Hansen (MHQ) wrote: > Hi Erick and Tim, > &g

Re: Reading data using Tika to Solr

2018-10-25 Thread Tim Allison
To follow up w Erick’s point, there are a bunch of transitive dependencies from tika-parsers. If you aren’t using maven or similar build system to grab the dependencies, it can be tricky to get it right. If you aren’t using maven, and you can afford the risks of jar hell, consider using tika-app or

Re: Help with multi-lang searches

2018-10-22 Thread Tim Casey
is a set of probable languages. From there, you can pivot the results based on the user expectations. tim On Mon, Oct 22, 2018 at 11:18 AM Alexandre Rafalovitch wrote: > Additional possibilities: > 1) omitNorms and maybe omitTermFreqAndPositions for the fields to > avoid frequen

Re: Encoding issue in solr

2018-10-05 Thread Tim Allison
This is probably caused by an encoding detection problem in Nutch and/or Tika. If you can share the file on the Tika user’s list, I can take a look. On Fri, Oct 5, 2018 at 7:11 AM UMA MAHESWAR wrote: > HI ALL, > > while i am using nutch for crawling and indexing in to solr,while storing > data i

Re: solr and diversification

2018-09-28 Thread Tim Allison
If you haven’t already, might want to check out maximal marginal relevance...original paper: Carbonell and Goldstein. On Thu, Sep 27, 2018 at 7:29 PM Joel Bernstein wrote: > Yeah, I think your plan sounds fine. > > Do you have a specific use case for diversity of results. I've been > wondering i

Re: solr crypto mining hack...

2018-08-25 Thread Tim Casey
I am not sure how solr is exactly set up currently, much less on any specific system. But, for operations which are largely reading, *maybe* like a query, you might be able run on a read only partition. A firewall is a lot less work and a good start, like 90% of the problem. To do this, you brin

Re: Memory Leak in 7.3 to 7.4

2018-08-06 Thread Tim Allison
+1 to Shawn's and Erick's points about isolating Tika in a separate jvm. Y, please do let us know: u...@tika.apache.org We might be able to help out, and you, in turn, can help the community figure out what's going on; see e.g.: https://issues.apache.org/jira/browse/TIKA-2703 On Sun, Aug 5, 2018

Re: Exact Phrase search not returning results.

2018-07-20 Thread Tim Casey
Deepti, I am going to guess the analyzer part of the .net application is cutting off the last token. If you try the queries on the console of the running solr cluster, what do you get? If you dump that specific field for all the docs, can you find it with grep? tim On Fri, Jul 20, 2018 at 10

Re: Index protected zip

2018-05-29 Thread Tim Allison
t; > the info is in our "official" place but the real story is in another > > place, > > > one we alternately tell people to sometimes ignore but sometimes keep > up > > to > > > date? Even I'm confused. > > > > > > On Sat, May 26, 20

Re: Index protected zip

2018-05-26 Thread Tim Allison
W00t! Thank you, Shawn! The "don't use ERH in production" response comes up frequently enough > that I have created a wiki page we can use for responses: > > https://wiki.apache.org/solr/RecommendCustomIndexingWithTika > > Tim, you are extremely well-qualified t

Re: simple enrich uploaded binary documents with sha256 hashes

2018-05-26 Thread Tim Allison
standing by on the user list for Tika when you have questions. :) Cheers, Tim On Fri, May 25, 2018 at 11:10 AM Erick Erickson wrote: > I'd consider using a separate Java program that uses Tika directly, or > one of various services. Then you can assemble whatever you please >

Re: Index protected zip

2018-05-26 Thread Tim Allison
...@mail.gmail.com%3e On Sat, May 26, 2018 at 6:34 AM Tim Allison wrote: > You’ll need to provide a PasswordProvider in the ParseContext. I don’t > think that is currently possible in the Solr integration. Please open a > ticket if SolrJ doesn’t meet your needs. > > On Thu, May 24,

Re: Index protected zip

2018-05-26 Thread Tim Allison
You’ll need to provide a PasswordProvider in the ParseContext. I don’t think that is currently possible in the Solr integration. Please open a ticket if SolrJ doesn’t meet your needs. On Thu, May 24, 2018 at 1:03 PM Alexandre Rafalovitch wrote: > Hmm. If it works, then it is Tika magic. Which m

Re: Zookeeper 3.4.12 with Solr 6.6.2?

2018-05-22 Thread Tim Casey
We have 3.4.10 and have *tested* at a functional level 6.6.2. So far it works. We have not done any stress/load testing. But would have to do this prior to release. On Tue, May 22, 2018 at 9:44 AM, Walter Underwood wrote: > Is anybody running Zookeeper 3.4.12 with Solr 6.6.2? Is that a recomme

Re: Date Query Confusion

2018-05-17 Thread Tim Casey
date range, when the source material has date ranges built into it is kinda odd. But it occurs. If you query from noon-1p does that include meeting notes which started at 1130a, but went for an hour? You have to choose what to do. tim On Thu, May 17, 2018 at 6:11 AM, Terry Steichen wrote: >

Re: Title Search scoring issues with multivalued field & norm

2018-01-31 Thread Tim Casey
For smaller length documents TFIDFSimilarity will weight towards shorter documents. Another way to say this, if your documents are 5-10 terms, the 5 terms are going to win. You might think about having per token, or token pair, weight. I would be surprised if there was not something similar out t

Re: Howto search for § character

2017-12-07 Thread Tim Casey
separate tokens. So what happens to "§!" or "§?" or "?§", and in the case of things like text/email what happens to "§". In any event, my 2 pence worth tim On Thu, Dec 7, 2017 at 10:00 AM, Shawn Heisey wrote: > On 12/7/2017 9:37 AM, Bernd Schmi

Re: Java profiler?

2017-12-06 Thread Tim Casey
I really like Profiler. It takes a little bit of set up, but it works. tim On Wed, Dec 6, 2017 at 2:04 AM, Peter Sturge wrote: > Hi, > We'be been using JPRofiler (www.ej-technologies.com) for years now. > Without a doubt, the most comprehensive and useful profiler for java. >

Re: Solr query help

2017-08-18 Thread Tim Casey
29 30 31), which does all the days in march and apr. A more complicated nested query would do more complicated date ranges. I don't know if there is a way to get repeating date range queries, like the fourth tuesday for all months in a year. The date support is usually about querying a spec

Re: Arabic words search in solr

2017-08-02 Thread Tim Casey
There should be a way to use a phrasal query for the specific names. On Wed, Aug 2, 2017 at 2:15 PM, Phil Scadden wrote: > Hopefully changing to default AND solves your problem. If so, I would be > quite interested in what your index config looks like in the end. I also > have upcoming need to i

Re: Spatial Search based on the amount of docs, not the distance

2017-06-22 Thread Tim Casey
e are going to suggest some heuristic which avoids this problem. What Erick is suggesting is down this path. Query around a point and sort by distance taking the top K results. The result is taking a linear slice of the n^2 distance attribute. tim On Wed, Jun 21, 2017 at 7:50 PM, Erick Erickson

Re: model building

2017-03-21 Thread Tim Casey
l the result. tim On Mon, Mar 20, 2017 at 4:42 PM, Joel Bernstein wrote: > I've only tested with the training data in it's own collection, but it was > designed for multiple training sets in the same collection. > > I suspect you're training set is too small to

Re: query rewriting

2017-03-07 Thread Tim Casey
et of required terms/phrases, then you can add terms to boost or remove non-required terms without any loss to the overall result set. This changes the order in which items are returned, so may impact user perception of recall, but is possible under for specific reasons. tim On Sun, Mar 5, 2017 at 11:

Re: Question about best way to architect a Solr application with many data sources

2017-02-22 Thread Tim Casey
I would possibly extend this a bit futher. There is the source, then the 'normalized' version of the data, then the indexed version. Sometimes you realize you miss something in the normalized view and you have to go back to the actual source. This will be as likely as there are number of sources

child doc filter

2016-11-03 Thread Tim Williams
ryWrapperFilter right after the child query is created[1] but before I did that, I wanted to see if there's a built-in way to achieve the same behavior? Thanks, --tim [1] - https://github.com/apache/lucene-solr/blob/master/solr/core/src/java/org/apache/solr/search/join/BlockJoinParentQParser.java#L69

Configuration options/concerns for multiple Solr versions

2016-10-14 Thread Tim Parker
o do so - any thoughts? Am I missing something?] -- Tim Parker Senior Engineer PaperThin, Inc. 300 Congress Street, Suite 303 Quincy, MA 02169 Ph: 617.471.4440 x203 CommonSpot helps organizations improve engagement across the web, mobile devices, and social media outlets to achieve better marketing

Solr 4.10 Joins: Slow performance with millions of documents

2016-08-14 Thread Tim Frey
Hi there. I'm trying to fix a performance problem I have with queries that use Solr's Join feature. The query is intended to find all Job Applications that have an Interview in a particular state. There are 20 million Job Applications and around 7 million Interviews, with 1 million Interviews in

RE: Solr Cloud with 5 servers cluster failed due to Leader out of memory

2016-08-09 Thread Tim Chen
Guys, (@Erick & @Shawn), Thanks for the great suggestions! I have increased Tomcat MaxThreads from 200 to 1 on our staging environment. So far so good. I will perform some more indexing test and see how it goes. Many thanks, Tim -Original Message- From: Shawn Heisey [mailto

RE: Solr Cloud with 5 servers cluster failed due to Leader out of memory

2016-08-07 Thread Tim Chen
ck where you started. And just as a sanity check: You didn't perchance increase the maxWarmingSearchers parameter in solrconfig.xml, did you? If so, that's really a red flag. Best, Erick On Fri, Aug 5, 2016 at 12:41 AM, Tim Chen wrote: > Thanks Guys. Very very helpful. > > I will

RE: Solr Cloud with 5 servers cluster failed due to Leader out of memory

2016-08-07 Thread Tim Chen
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at org.apache.tomcat.util.threads.TaskThread$WrappingRunnable.run(TaskThread.java:61) at java.lang.Thread.run(Thread.java:745) Many thanks, Tim -Original Message- From: Erick Erickson [mailto:erickerick..

RE: Solr Cloud with 5 servers cluster failed due to Leader out of memory

2016-08-05 Thread Tim Chen
restart it. A bit dirty, but may work for a short term. I don't know too much about how documents indexed, and how to save memory from that. Will probably work with a developer on this as well. Many Thanks guys. Cheers, Tim -Original Message- From: Shawn Heisey [mailt

Solr Cloud with 5 servers cluster failed due to Leader out of memory

2016-08-04 Thread Tim Chen
uster. Am I right? 2, Apparently we should not pushing too many documents to Solr, how do you guys handle this? Set a limit somewhere? Thanks, Tim [Premiere League Starts Saturday 13 August 9.30pm on SBS]<http://theworldgame.sbs.com.au/>

RE: Is it possible to force a Shard Leader change?

2016-07-28 Thread Tim Chen
election. Thanks again. Cheers, Tim -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, 28 July 2016 12:10 AM To: solr-user Subject: Re: Is it possible to force a Shard Leader change? The REBALANCELEADERS stuff was put in to deal with 100s of leaders winding

Is it possible to force a Shard Leader change?

2016-07-26 Thread Tim Chen
shutdown a single Replica that is currently a Shard Leader and force Solr to elect a different replica to be new Shard Leader? Thanks guys! Regards, Tim [Roots Wednesday 27 July 8.30pm]<http://www.sbs.com.au/programs/roots/>

RE: Solr Cloud 2nd Server Recover Stuck

2016-06-29 Thread Tim Chen
Hi Erick, I have followed your instruction to added as new replica and deleted the old replica - works great! Everything back to normal now. Thanks mate! Cheers, Tim -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, 30 June 2016 1:49 AM To: solr

Solr Cloud 2nd Server Recover Stuck

2016-06-29 Thread Tim Chen
quot;}, "core_node4":{ "state":"down", "core":"cr_dev_shard1_replica2", "node_name":"10.2.11.244:8983_solr", "base_url":"http://10.2.11.244:8983/solr"}}}, &q

Re: Indexing 700 docs per second

2016-04-19 Thread Tim Robertson
30 concurrent users steady state (not all hitting SOLR though). It seems realistic with that kind of hardware in my experience, but you didn't mention what else was going on that might affect it (e.g. reads). HTH, Tim On Tue, Apr 19, 2016 at 7:12 PM, Erick Erickson wrote: > Make very sur

NPE when executing clustering query search

2016-03-22 Thread Tim Hearn
Hi everyone, I am trying to execute a clustering query to my single-core master-slave solr setup and it is returning a NullPointerException. I checked the line in the source code where it is being thrown, and it looks like the null object is some sort of 'filt' object, which doesn't make sense.

Re: Running Lucene/SOR on Hadoop

2016-01-04 Thread Tim Williams
Apache Blur (Incubating) has several approaches (hive, spark, m/r) that could probably help with this ranging from very experimental to stable. If you're interested, you can ask over on blur-u...@incubator.apache.org ... Thanks, --tim On Fri, Dec 25, 2015 at 4:28 AM, Dino Chopins wrote:

Re: mlt and document boost

2015-12-24 Thread Tim Hearn
One workaround is to use the 'important terms' feature to grab the query generated by the MLT handler, then parse that list into your own solr query to use through a standard search handler. That way, you can get the same results as if you used the MLT handler, and you can also use filter querying

Geospatial search question - document with multiple locations

2015-12-24 Thread Tim Hearn
-70 is not? Thanks much! Tim

Query to count matching terms and disable 'coord' multiplication

2015-10-06 Thread Tim Hearn
Hello everyone, I have two questions 1) Is there a way to query solr to rank results based purely on the amount of terms in the query which are contained in the document? Example: doc1: 'foo bar poo car foo' q1: 'foo, car, two, start' score(doc1, q1) = 2 (since both foo and car both occur in doc1

Re: solr training

2015-09-17 Thread Tim Dunphy
> > How about in Denver? Nah dude. I'm in Jersey. Denver's like a half a country away! On Thu, Sep 17, 2015 at 12:18 AM, William Bell wrote: > How about in Denver? > > On Sun, Sep 13, 2015 at 7:53 PM, Otis Gospodnetić < > otis.gospodne...@gmail.com> wrote

Re: solr training

2015-09-13 Thread Tim Dunphy
Cool, I'll check it out. Thanks! On Sun, Sep 13, 2015 at 9:53 PM, Otis Gospodnetić < otis.gospodne...@gmail.com> wrote: > Hi Tim, > > A slightly delayed reply ;) > We are running Solr training in NYC next month - > http://sematext.com/training/solr-training.

solr errors integrating with drupal

2015-09-09 Thread Tim Dunphy
to the same name: /update/json ignoring: org.apache.solr.handler.UpdateRequestHandler How can I correct these errors? I'll be able to show you whatever config files you think may lead to a solution. I'll just need to know which ones to show you, as I am still new to solr. Thanks! Tim

NPE when faceting with MLT Query from upgrade to Solr 5.1.0

2015-05-18 Thread Tim H
Hi everyone, Recently I upgraded to solr 5.1.0. When trying to generate facets using the more like this handler, I now get a a NullPointerException. I never got this exception while using Solr 4.10.0 Details are below: Stack Trace: at org.apache.solr.request.SimpleFacets.getHeatmapCounts(Simple

NPE with faceting query on MoreLikeThis handler

2015-05-18 Thread Tim Hearn
Hi everyone, Recently I upgraded to solr 5.1.0. When trying to generate facets using the more like this handler, I now get a a NullPointerException. I never got this exception while using Solr 4.10.0 Details are below: Stack Trace: at org.apache.solr.request.SimpleFacets.getHeatmapCounts(Simple

NPE when Faceting with MoreLikeThis handler in Solr 5.1.0

2015-05-15 Thread Tim Hearn
Hi everyone, Recently I upgraded to solr 5.1.0. When trying to generate facets using the more like this handler, I now get a a NullPointerException. I never got this exception while using Solr 4.10.0 Details are below: Stack Trace: at org.apache.solr.request.SimpleFacets.getHeatmapCounts(Simple

NPE when Faceting with MoreLikeThis handler in Solr 5.1.0

2015-05-14 Thread Tim Hearn
Hi everyone, Recently I upgraded to solr 5.1.0. When trying to generate facets using the more like this handler, I now get a a NullPointerException. I never got this exception while using Solr 4.10.0 Details are below: Stack Trace: at org.apache.solr.request.SimpleFacets.getHeatmapCounts(Simple

Re: solr 3.6.2 under tomcat 8 missing corename in path

2015-05-07 Thread Tim Dunphy
tion1 there: [root@aoadbld00032la solr]# ls -ld admin/cores/collection1 drwxr-xr-x. 5 root root 4096 May 6 17:29 admin/cores/collection1 So, if I assume correctly, that the way I reference the collection1 directory is the problem, how can I express this differently in my solr.xml file so that it wo

solr 3.6.2 under tomcat 8 missing corename in path

2015-05-06 Thread Tim Dunphy
169 May 6 14:38 solr.xml Yet, when I bounce tomcat, this is the result that I get: HTTP Status 404 - missing core name in path type Status report message missing core name in path description The requested resource is not available. Cany anyone tell me what I'm doin

Re: apache 5.1.0 under apache web server

2015-05-04 Thread Tim Dunphy
t;http-nio-8080"] 04-May-2015 15:48:27.257 INFO [main] org.apache.coyote.AbstractProtocol.start Starting ProtocolHandler ["ajp-nio-8009"] 04-May-2015 15:48:27.258 INFO [main] org.apache.catalina.startup.Catalina.start Server startup in 3350 ms However it sounds like you're sure it'

apache 5.1.0 under apache web server

2015-05-04 Thread Tim Dunphy
ickly. Sigh.. But thank you for your advice! Tim -- GPG me!! gpg --keyserver pool.sks-keyservers.net --recv-keys F186197B

solr training

2015-05-02 Thread Tim Dunphy
Hey guys, My company has a training budget that it wants me to use. So what I'd like to find out is if there is any instructor lead courses in the NY/NJ area, or courses online that are instructor lead that you could recommend? Thanks, Tim -- GPG me!! gpg --keyserver pool.sks-keyserver

Re: Solrcloud (to HDFS) poor indexing performance

2015-02-04 Thread Tim Smith
replication would be redundant. No? Each document/event is only a few hundred bytes (less than 500 bytes, I would say). Thanks, Tim On Tue, Feb 3, 2015 at 5:03 PM, Mark Miller wrote: > What is your replication factor and doc size? > > Replication can affect performance a fair amount mo

Solrcloud (to HDFS) poor indexing performance

2015-02-03 Thread Tim Smith
annels get saturated within seconds. Any clues/pointers for troubleshooting will be appreciated? Thanks, Tim

import community cassandra into solr

2015-02-01 Thread Tim Dunphy
Hey folks, Is there any way that we know of to import the open source Apache version of the Cassandra database into Solr? I know that the pay for Datastax version has a connection built in. But we're not using that. Any thoughts or observations will be appreciated. Thanks, Tim -- G

  1   2   3   4   5   >