Re: Setting up MiniSolrCloudCluster to use pre-built index

2018-10-23 Thread Ken Krugler
n > use the merge API or something to merge your index into the empty > collection. > > - Mark > > On Sat, May 19, 2018 at 5:25 PM Ken Krugler > wrote: > >> Hi all, >> >> Wondering if anyone has experience (this is with Solr 6.6) in setting up >> Mi

Re: Storing & using feature vectors

2018-10-22 Thread Ken Krugler
to discuss but he didn’t seem to mention it in his talk. > If this is a huge importance to you, I might also suggest looking at vespa, > which makes tensors a first-class citizen and makes matrix-math pretty > seamless: http://vespa.ai Interesting, though my client is pretty much lo

Storing & using feature vectors

2018-10-19 Thread Ken Krugler
following the same pattern as geospatial support - so a new field type and query/parser, plus plumbing to hook it into Solr. Before I go much further, is there anything like this already done, or in the works? Thanks, — Ken -- Ken Krugler +1 530-210-6378 http

Is router.field an explicit shard name, or hashed?

2018-07-13 Thread Ken Krugler
rect. Thanks, — Ken ---------- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com Custom big data solutions & training Flink, Solr, Hadoop, Cascading & Cassandra

Setting up MiniSolrCloudCluster to use pre-built index

2018-05-19 Thread Ken Krugler
case)? Thanks! — Ken PS - yes, we’re aware of the routing issue with generating our own shards…. -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com Custom big data solutions & training Flink, Solr, Hadoop, Cascading & Cassandra

Handling of local params in QParserPlugin.createParser

2017-04-03 Thread Ken Krugler
to poke around? Thanks, — Ken -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

Shingles from WDFF

2017-03-24 Thread Ken Krugler
actually a way to make this work with Solr 5/6? Thanks, — Ken -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

RE: how to update billions of docs

2016-03-19 Thread Ken Krugler
t; solr 5.3 (other fields are quite big). > > Any suggestions ? > > -Mohsin -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

RE: Embedded Solr now deprecated?

2015-08-05 Thread Ken Krugler
e of the SolrClient classes and > use a separate Solr deployment from your application. > > Thanks, > Shawn > -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

Re: When not to use NRTCachingDirectory and what to use instead.

2014-04-19 Thread Ken Krugler
mapping are not something you'd want to > give up. Tom - did you ever get any useful results from testing here? I'm also curious about the impact of various xxxDirectoryFactory implementations for batch indexing. Thanks, -- Ken -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

Re: Enabling other SimpleText formats besides postings

2014-03-31 Thread Ken Krugler
segments.gen and segments_XX files; what, those aren't pluggable?!?! :) -- Ken -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

Re: Enabling other SimpleText formats besides postings

2014-03-31 Thread Ken Krugler
r? Or is this currently not possible? Thanks, -- Ken > On Fri, Mar 28, 2014 at 8:53 AM, Ken Krugler > wrote: >> Hi all, >> >> I've been using the SimpleTextCodec in the past, but I just noticed >> something odd... >> >> I'm r

Enabling other SimpleText formats besides postings

2014-03-28 Thread Ken Krugler
E-3074 is about adding a simple text format for DocValues. I can walk the code to figure out what's up, but I'm hoping I just need to change some configuration setting. Thanks! -- Ken ---------- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big

RegexReplaceProcessorFactory replacement string support for match groups

2013-10-15 Thread Ken Krugler
t's making it hard for me to write up a simple solution to a training exercise, where students need to clean up incorrectly formatted dates :) Thanks, -- Ken ---------- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & trainin

WikipediaTokenizer documentation - never mind

2013-10-03 Thread Ken Krugler
But is there any way to get , for example? Thanks, -- Ken ------ Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr -- Ken Krugler +1 530-210-6378 ht

WikipediaTokenizer documentation

2013-10-03 Thread Ken Krugler
"body". But is there any way to get , for example? Thanks, -- Ken ------ Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

Re: Grouping by field substring?

2013-09-12 Thread Ken Krugler
iginal Message----- From: Ken Krugler > Sent: Wednesday, September 11, 2013 8:24 PM > To: solr-user@lucene.apache.org > Subject: Grouping by field substring? > > Hi all, > > Assuming I want to use the first N characters of a specific field for > grouping results, is such a

Grouping by field substring?

2013-09-11 Thread Ken Krugler
Hi all, Assuming I want to use the first N characters of a specific field for grouping results, is such a thing possible out-of-the-box? If not, then what would the next best option be? E.g. a custom function query? Thanks, -- Ken -- Ken Krugler +1 530-210-6378 http

Re: Filter cache pollution during sharded edismax queries

2013-08-26 Thread Ken Krugler
why. > > Otis > -- > Solr & ElasticSearch Support -- http://sematext.com/ > Performance Monitoring -- http://sematext.com/spm > > > > On Tue, Jul 2, 2013 at 3:01 PM, Ken Krugler > wrote: >> Hi all, >> >> After upgrading from Solr 3.5 to 4.2.1, I noticed

Blog posts on extracting text features using Solr

2013-07-21 Thread Ken Krugler
edly has some things that are unclear or even incorrect, so please comment :) Regards, -- Ken ------ Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

Filter cache pollution during sharded edismax queries

2013-07-02 Thread Ken Krugler
em_::" The net result of the above is that even with a very big filterCache size of 2K, the hit ratio is still only 60%. Thanks for any insights, -- Ken -- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

Solr 4.2.1 behavior with field names that use "|" character

2013-05-11 Thread Ken Krugler
s this a known issue? Is there any way to disable the parsing of field names in a field list? Thanks, -- Ken ---------- Ken Krugler +1 530-210-6378 http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Cassandra & Solr

Javadocs issue on Solr web site

2012-07-04 Thread Ken Krugler
apache.org/solr/api-4_0_0-ALPHA/org/apache/solr/client/solrj/impl/StreamingUpdateSolrServer.html -- Ken ------ Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

Re: Error with distributed search and Suggester component (Solr 3.4)

2012-05-02 Thread Ken Krugler
Hi Robert, On May 1, 2012, at 7:07pm, Robert Muir wrote: > On Tue, May 1, 2012 at 6:48 PM, Ken Krugler > wrote: >> Hi list, >> >> Does anybody know if the Suggester component is designed to work with shards? > > I'm not really sure it is? They would prob

Re: Error with distributed search and Suggester component (Solr 3.4)

2012-05-01 Thread Ken Krugler
0.0 true Thanks, -- Ken On May 1, 2012, at 3:48pm, Ken Krugler wrote: > Hi list, > > Does anybody know if the Suggester component is designed to work with shards? > > I'm asking because the documentation implies that it should (since > ...Suggester reuses muc

Error with distributed search and Suggester component (Solr 3.4)

2012-05-01 Thread Ken Krugler
m wondering if my configuration is just borked and this should work, or the fact that the Suggester doesn't return a response field means that it just doesn't work with shards. Thanks, -- Ken -------- http://about.me/kkrugler +1 530-210-6378

Re: JSON & XML response writer issues with short & binary fields

2012-01-13 Thread Ken Krugler
On Jan 13, 2012, at 1:39pm, Yonik Seeley wrote: > -Yonik > http://www.lucidimagination.com > > > > On Fri, Jan 13, 2012 at 4:22 PM, Yonik Seeley > wrote: >> On Fri, Jan 13, 2012 at 4:04 PM, Ken Krugler >> wrote: >>> I finally got around to looking

JSON & XML response writer issues with short & binary fields

2012-01-13 Thread Ken Krugler
ethods that take an explicit XMLWriter object as a parameter. -- Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

Re: Solr core as a dispatcher

2012-01-11 Thread Ken Krugler
p N hits from the shards. And there's something wonky with the way that distributed HTTP requests are queued up & processed - under load, I see IOExceptions where it's always N-1 shards that succeed, and one shard request fails. But I don't have a good reproducible case yet to debu

Re: strange performance issue with many shards on one server

2011-12-29 Thread Ken Krugler
www.hathitrust.org/blogs/large-scale-search/tuning-search-performance >>>>> Regards >>>>> Vadim >>>>> >>>>> >>>>> 2011/9/28 Frederik Kraus >>>> (mailto:frederik.kr...@gmail.com) (mailto: >>>> frederi

SearchComponents and ShardResponse

2011-12-15 Thread Ken Krugler
lr.war with my custom SearchHandler, but that's pretty painful. Any other ideas/input? Thanks, -- Ken -- Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

Distributed search and binary fields w/Solr 3.4

2011-11-13 Thread Ken Krugler
y raises their hand, and then walk it. Thanks, -- Ken ------ Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

Re: overwrite=false support with SolrJ client

2011-11-10 Thread Ken Krugler
h (unless you do some interesting helicopter stunts). So yes, by default the index is always being rebuilt from scratch. And thus as long as the primary key is being used as the reduce-phase key, it's easy to ensure uniqueness in the index. Thanks again, -- Ken -- Ken Krugler http://www.scaleunlimited.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

overwrite=false support with SolrJ client

2011-11-04 Thread Ken Krugler
based workflows, it's straightforward to ensure that the unique key field is really unique, thus if the performance gain is significant, I might look into figuring out some way (with a trigger lock) of re-enabling this support in SolrJ. Thanks, -- Ken ------ Ken Kr

Re: indexing key value pair into lucene solr index

2011-10-24 Thread Ken Krugler
so you only index the key. E.g. -- Ken ------ Ken Krugler +1 530-210-6378 http://bixolabs.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

Re: Want to support "did you mean xxx" but is Chinese

2011-10-21 Thread Ken Krugler
> This is basic function of commercial search engine especially in > Chinese processing. I wonder how to implements in SOLR and where is > the start point. > > Floyd -- Ken Krugler +1 530-210-6378 http://bixolabs.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

Re: Does anybody has experience in Chinese soundex(sounds like) of SOLR?

2011-10-20 Thread Ken Krugler
nese soundex(sounds like) of SOLR? >> >> Hi there, >> >> There are many English soundex implementation can be referenced, but I >> wonder how to do Chinese soundex(sounds like) filter (maybe). >> >> any idea? >> >> Floyd >> >> >&g

Re: Multi CPU Cores

2011-10-16 Thread Ken Krugler
>> >> >> 6495 root 15 0 42416 3892 1740 S 0.4 0.0 9:34.71 >> openvpn >> >>

Re: strange performance issue with many shards on one server

2011-09-28 Thread Ken Krugler
ards >>>>> Vadim >>>>> >>>>> >>>>> 2011/9/28 Frederik Kraus >>>> (mailto:frederik.kr...@gmail.com) (mailto: >>>> frederik.kr...@gmail.com (mailto:frederik.kr...@gmail.com))> >>>>> >>>>>> Hi, >>>>>> >>>>>> >>>>>> I am experiencing a strange issue doing some load tests. Our setup: >>>>>> >>>>>> - 2 server with each 24 cpu cores, 130GB of RAM >>>>>> - 10 shards per server (needed for response times) running in a single >>>>>> tomcat instance >>>>>> - each query queries all 20 shards (distributed search) >>>>>> >>>>>> - each shard holds about 1.5 mio documents (small shards are needed due >>>> to >>>>>> rather complex queries) >>>>>> - all caches are warmed / high cache hit rates (99%) etc. >>>>>> >>>>>> >>>>>> Now for some reason we cannot seem to fully utilize all CPU power (no >>>> disk >>>>>> IO), ie. increasing concurrent users doesn't increase CPU-Load at a >>>> point, >>>>>> decreases throughput and increases the response times of the individual >>>>>> queries. >>>>>> >>>>>> Also 1-2% of the queries take significantly longer: avg somewhere at >>>> 100ms >>>>>> while 1-2% take 1.5s or longer. >>>>>> >>>>>> Any ideas are greatly appreciated :) >>>>>> >>>>>> Fred. > -- Ken Krugler +1 530-210-6378 http://bixolabs.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

Re: two cores but have single result set in solr

2011-09-23 Thread Ken Krugler
with a different conf dir, and in that separate conf/solrschema.xml you can set up a request handler that just dispatches to the two real cores. -- Ken ------ Ken Krugler +1 530-210-6378 http://bixolabs.com custom big data solutions & training Hadoop, Cascading, Mahout & Solr

Re: Distinct elements in a field

2011-09-17 Thread Ken Krugler
asy to use. What do you think ? Thank you If you turn on facets in your query (facet=true&facet.field=) then you'll get back all of the distinct values, though might have to play with other settings (e.g. facet.limit=-1) to get the results you need. -- Ken --

Re: Will Solr/Lucene crawl multi websites (aka a mini google with faceted search)?

2011-09-11 Thread Ken Krugler
will be added to the discussion > below: >> > http://lucene.472066.n3.nabble.com/Will-Solr-Lucene-crawl-multi-websites-aka-a-mini-google-with-faceted-search-tp3328314p3328340.html >> >> To unsubscribe from Will Solr/Lucene crawl multi websites (aka a mini > google with face

Re: performance crossover between single index and sharding

2011-08-02 Thread Ken Krugler
shing >>> returns and a performance decrease, but I wouldn't worry about that at all >>> until you've got many terabytes -- I don't know how many but don't worry >>> about it. >>> >>> ~ David >>> >>> - >>> Author: https://www.packtpub.com/solr-1-4-enterprise-search-server/book >>> -- >>> View this message in context: >>> http://lucene.472066.n3.nabble.com/performance-crossover-between-single-in >>> dex-and-sharding-tp3218561p3219397.html Sent from the Solr - User mailing >>> list archive at Nabble.com. -- Ken Krugler +1 530-210-6378 http://bixolabs.com custom data mining solutions

Re: Processing/Indexing CSV

2011-06-09 Thread Ken Krugler
On Jun 9, 2011, at 2:21pm, Helmut Hoffer von Ankershoffen wrote: > Hi, > > btw: there seems to somewhat of a non-match regarding efforts to Enhance DIH > regarding the CSV format (James Dyer) and the effort to maintain the > CSVLoader (Ken Krugler). How about merging your effort

Re: Processing/Indexing CSV

2011-06-09 Thread Ken Krugler
own list of fieldnames and optionally ignore the >> first line of the CSV file (assuming it contains the field names). >> http://wiki.apache.org/solr/UpdateCSV#fieldnames >> >> -Yonik >> http://www.lucidimagination.com >> -- Ken Krugler +1 530-210-6378 http://bixolabs.com custom data mining solutions

Re: Solr monitoring: Newrelic

2011-06-09 Thread Ken Krugler
> war file, only >> jetty-***.jar files >> >> Same error, could not locate a jetty instance. >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/Solr-monitoring-Newrelic-tp3042889p3043080.html >> Sent from t

Re: Hitting the URI limit, how to get around this?

2011-06-03 Thread Ken Krugler
http://lucene.472066.n3.nabble.com/Hitting-the-URI-limit-how-to-get-around-this-tp3017837p3020185.html > Sent from the Solr - User mailing list archive at Nabble.com. -- Ken Krugler +1 530-210-6378 http://bixolabs.com custom data mining solutions

Re: Difference between Solr and Lucidworks distribution

2011-04-03 Thread Ken Krugler
On Apr 3, 2011, at 6:56am, yehosef wrote: > How can they require payment for something that was developed under the > apache license? It's the difference between free speech and free beer :) See http://en.wikipedia.org/wiki/Gratis_versus_libre -- Ken ------ Ken

Re: boilerpipe solr tika howto please

2011-01-14 Thread Ken Krugler
ing like: return new BoilerpipeContentHandler(new ContentHandlerDecorator( Though from a quick look at that code, I'm curious why it doesn't use BodyContentHandler, versus the current ContentHandlerDecorator. -- Ken -- Ken Krugler +1 530-210-6378 http://bi

Re: How to let crawlers in, but prevent their damage?

2011-01-10 Thread Ken Krugler
blocking those with a bad ratio of those two - bots that crawl a lot but don't bring a lot of value. Any other ideas? Thanks, Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Ken

Re: How to let crawlers in, but prevent their damage?

2011-01-10 Thread Ken Krugler
hem, minimizing their negative side-effects, while still letting them crawl you? Thanks, Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ ---------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t

Re: entire farm fails at the same time with OOM issues

2010-12-01 Thread Ken Krugler
x27;ll see a really big chunk used for a sorted field. See http://wiki.apache.org/solr/SolrCaching and http://wiki.apache.org/solr/SolrPerformanceFactors for more details. -- Ken -Original Message----- From: Ken Krugler [mailto:kkrugler_li...@transpac.com] Sent: Tuesday, November 30, 20

Re: entire farm fails at the same time with OOM issues

2010-11-30 Thread Ken Krugler
un JRE 1.6.0_18 on dual quad xeon machines with 64GB memory etc etc <http://ken-blog.krugler.org> +1 530-265-2225 ---------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Dinamically change master

2010-11-30 Thread Ken Krugler
laves are configured to use a VIP to talk to the master, so that it's easy to dynamically change which master they use, via updates to the load balancer config. -- Ken ------ Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: A Newbie Question

2010-11-14 Thread Ken Krugler
e recommended way of posting documents to Solr. Could someone please tell me what is the preferred approach in such an environment? I am not a programmer and would appreciate some hand-holding here :o) Thanks in advance, Sesh -- Lance Norskog goks...@gmail.com -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Dynamic creating of cores in solr

2010-11-10 Thread Ken Krugler
the server side to do is realize by itself if the cores exists or not, and if not - create it One other restriction - I can't change anything in the client side - calling to the server can only make the calls it's doing now - for index and search, and cannot make calls for cores creation via the CoreAdminHandler. All I can do is something in the server itself What can I do to get it done? Write some RequestHandler? REquestProcessor? Any other option? Thanks, nizan -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Inconsistent slave performance after optimize

2010-10-27 Thread Ken Krugler
fish for an explanation or ideas that might explain this inconsistent performance. Obviously, we'd like to be able to reproduce the performance of the 3rd slave, and avoid the poor performance of the first two slaves the next time we decide it's time to optimize our index. thanks in

Re: Multiple Word Facets

2010-10-27 Thread Ken Krugler
type="query". Please advise on how to group or cluster document terms so that they can be used as facets. Many thanks in advance, Adam Estrada -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: using HTTPClient sending solr ping request wont timeout as specified

2010-10-13 Thread Ken Krugler
: http://lucene.472066.n3.nabble.com/using-HTTPClient-sending-so lr-ping-request-wont-timeout-as-specified-tp1691292p1691355.html Sent from the Solr - User mailing list archive at Nabble.com. ---------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: PatternReplaceFilterFactory creating empty string as a term

2010-10-05 Thread Ken Krugler
tp://ken-blog.krugler.org> +1 530-265-2225 ------ Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: getting a list of top page-ranked webpages

2010-09-16 Thread Ken Krugler
here are other free collections of data around, though none that I know of which target top-ranked pages. -- Ken ---------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Color search for images

2010-09-15 Thread Ken Krugler
just being smart about color-specific keywords found in associated text? -- Ken -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Is semicolon a character that needs escaping?

2010-09-02 Thread Ken Krugler
to Solr, and it (IIRC) tries to be smart about handling this type of escaping for you. Dismax is not (yet) an option because we need the full lucene syntax within the query. OK - in that case sounds like you're stuck with escaping. -- Ken -- Ken Krugler +1 530-21

Re: Is semicolon a character that needs escaping?

2010-09-02 Thread Ken Krugler
#x27;ll save yourself some pain and suffering. Also, since I did the above code the DisMaxRequestHandler has been added to Solr, and it (IIRC) tries to be smart about handling this type of escaping for you. -- Ken -- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Restricting HTML search?

2010-08-25 Thread Ken Krugler
On Aug 25, 2010, at 7:22pm, Lance Norskog wrote: This assumes that the HTML is good quality. I don't know exactly what your use case is. If you're crawling the web you will find some very screwed-up HTML. On Wed, Aug 25, 2010 at 6:45 AM, Ken Krugler wrote: On Aug 24, 2010, at 10:5

Re: Restricting HTML search?

2010-08-25 Thread Ken Krugler
w to SOLR and wondering if the following is possible: in addition to normal full text search, my users want to have the option to search only HTML heading innertext, i.e. content inside of , , or tags. ------------ Ken Krugler +1 530-210-6378 http://bixolabs.c

Re: indexing???

2010-08-17 Thread Ken Krugler
rm.pdf 1. This URL doesn't work for me. 2. Please include the full stack trace from the RuntimeException. 3. What version of Tika are you using? Thanks, -- Ken -------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Best solution to avoiding multiple query requests

2010-08-04 Thread Ken Krugler
log.jteam.nl/2009/10/20/result-grouping-field-collapsing-with-solr/ Yup, that's the one - http://blog.jteam.nl/2009/10/20/result-grouping-field-collapsing-with-solr/comment-page-1/#comment-1249 So with some modifications to that patch, it could work...thanks for the info! -- Ken 2010/8/

Re: Best solution to avoiding multiple query requests

2010-08-04 Thread Ken Krugler
is the best solution to create my own request handler? 3. And in that case, any input/tips on developing this type of custom request handler? Thanks, -- Ken Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Best solution to avoiding multiple query requests

2010-08-03 Thread Ken Krugler
Solr as-is? 2. if not, is the best solution to create my own request handler? 3. And in that case, any input/tips on developing this type of custom request handler? Thanks, -- Ken ------------ Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: SolrCore has a large number of SolrIndexSearchers retained in "infoRegistry"

2010-07-27 Thread Ken Krugler
gives you access to a SolrIndexSearcher is documented very clearly on how to "release" it when you are done with it so the ref count can be decremented. -Hoss -------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: faceted search with job title

2010-07-22 Thread Ken Krugler
es and used those, otherwise you get "Senior Bottlewasher" and "Sr. Bottlewasher" and "Sr Bottlewasher" as separate facet values. Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Problem building Nightly Solr

2010-07-06 Thread Ken Krugler
that compile fro mthat anyway. Note that you'll need to "ant compile" from the top of the lucene directory first, before trying any of the solr-specific builds from inside of the /solr sub-dir. Or at least that's what I ran into when trying to build a solr dist recently.

Re: document level security: indexing/searching techniques

2010-07-06 Thread Ken Krugler
LDAP server. This then becomes a fairly well-bounded list of "terms" for an OR query against the "acl-groups" field in each file/project document. Just don't forget to set the boost to 0 for that portion of the query :) -- Ken --------

Re: IOException: read past EOF when opening index built directly w/Lucene

2010-07-01 Thread Ken Krugler
On Jul 1, 2010, at 1:03pm, Ken Krugler wrote: I've got a version 2.3 index that appears to be valid - I can open it with Luke 1.0.1, and CheckIndex reports no problem. [snip] and Luke overview says: This time as text: Index version: 12984d2211c Index format: -4 (Lucene 2.3)

IOException: read past EOF when opening index built directly w/Lucene

2010-07-01 Thread Ken Krugler
d Luke overview says: -------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: SolrJ/EmbeddedSolrServer

2010-06-27 Thread Ken Krugler
you should be able to sort what you need On Fri, May 21, 2010 at 7:55 PM, Ken Krugler wrote: I've got a situation where my data directory (a) needs to live elsewhere besides inside of Solr home, (b) moves to a different location when updating indexes, and (c) setting up a symlink from /dat

Re: SolrJ/EmbeddedSolrServer

2010-06-27 Thread Ken Krugler
eload of a core? This, of course, assumes that I'm able to programmatically change the location of the dataDir, which is another issue. Thanks, -- Ken On Fri, May 21, 2010 at 7:55 PM, Ken Krugler wrote: I've got a situation where my data directory (a) needs to live elsewh

Re: SolrJ/EmbeddedSolrServer

2010-06-27 Thread Ken Krugler
s that I'm able to programmatically change the location of the dataDir, which is another issue. Thanks, -- Ken On Fri, May 21, 2010 at 7:55 PM, Ken Krugler wrote: I've got a situation where my data directory (a) needs to live elsewhere besides inside of Solr home, (b) moves to

Re: SolrJ/EmbeddedSolrServer

2010-06-27 Thread Ken Krugler
yan McKinley wrote: Check: http://wiki.apache.org/solr/CoreAdmin Unless I'm missing something, I think you should be able to sort what you need On Fri, May 21, 2010 at 7:55 PM, Ken Krugler wrote: I've got a situation where my data directory (a) needs to live elsewhere besides inside of

Re: [ANN] Solr 1.4.1 Released

2010-06-26 Thread Ken Krugler
maven artifact coordinates. Regards, Stevo. -- Ken ------------ Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Some minor Solritas layout tweaks

2010-06-23 Thread Ken Krugler
s for these types of things, but I also don't want to add to noise on the list. Which approach is preferred? Thanks, -- Ken Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Minor bug with Solritas and price data

2010-06-21 Thread Ken Krugler
Jun 21, 2010, at 11:36am, Chris Hostetter wrote: : Here's what's in my schema: : : : : Which is exactly what was in the original example schema. but what does hte "version" property of your schema say (at the top) this is what's in the example... -Hoss -----

Re: Minor bug with Solritas and price data

2010-06-21 Thread Ken Krugler
k Circles Rug ... Any other ideas what might be going on? Thanks, -- Ken On Jun 19, 2010, at 9:12 PM, Ken Krugler wrote: I noticed that my prices weren't showing up, even though I've got a price field. I think the issue is with this line

Minor bug with Solritas and price data

2010-06-19 Thread Ken Krugler
ce')) ...since getFirstValue() returns a single value without brackets. -- Ken Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Minor bug in Solritas with post-facet search

2010-06-19 Thread Ken Krugler
is the right way to fix it. -- Ken -------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Autocompletion with Solritas

2010-06-19 Thread Ken Krugler
uggest in there easily. Erik On Jun 18, 2010, at 7:54 PM, Ken Krugler wrote: Hi Erik, On Jun 17, 2010, at 8:34pm, Erik Hatcher wrote: Your wish is my command. Check out trunk, fire up Solr (ant run- example), index example data, hit http://localhost:8983/solr/ browse - type in search b

Re: Autocompletion with Solritas

2010-06-18 Thread Ken Krugler
olr/terms?limit=10&terms.fl=product_name&q=rug&terms.sort=count&terms.prefix=rug " Then I get the expected XML response: < Content-Type: text/xml; charset=utf-8 < Content-Length: 225 < Server: Jetty(6.1.22) < 0name="QTime">0name="product_na

Re: Autocompletion with Solritas

2010-06-18 Thread Ken Krugler
handler where q is used for the actual query for filtering terms on. Cool?! I think so! :) Erik On Jun 17, 2010, at 8:03 PM, Ken Krugler wrote: I don't believe Solritas supports autocompletion out of the box. So I'm wondering if anybody has experience using the LucidWorks

Re: Autocompletion with Solritas

2010-06-17 Thread Ken Krugler
s used for the actual query for filtering terms on. Cool?! I think so! :) Erik On Jun 17, 2010, at 8:03 PM, Ken Krugler wrote: I don't believe Solritas supports autocompletion out of the box. So I'm wondering if anybody has experience using the LucidWorks distro

Autocompletion with Solritas

2010-06-17 Thread Ken Krugler
jQuery Autocomplete plugin, and hooking it up to Solr facets, but I was curious if there were any tricks or traps in getting it all to work. Thanks, -- Ken ------------ Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Need help on Solr Cell usage with specific Tika parser

2010-06-14 Thread Ken Krugler
t" does not match any known parser. Thanks Olivier -------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Tika language extraction

2010-06-10 Thread Ken Krugler
a language. -- Ken -------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Indexing HTML

2010-06-09 Thread Ken Krugler
er.org> +1 530-265-2225 ------------ Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Build query programmatically with lucene, but issue to solr?

2010-05-28 Thread Ken Krugler
/Solrj -- Ken ------------ Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

SolrJ/EmbeddedSolrServer

2010-05-21 Thread Ken Krugler
king around with low-level SolrCore instantiation. Any other approaches? Thanks, -- Ken -------- Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

"Special Circumstances" for embedded Solr

2010-05-20 Thread Ken Krugler
y other commonly compelling reasons to use SolrJ? Thanks, -- Ken ------------ Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: Personalized Search

2010-05-20 Thread Ken Krugler
e recommendation engine, then you could use this to adjust search results. I'm waiting for Hoss to jump in here on how best to handle that :) -- Ken Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: How to query for similar documents before indexing

2010-05-10 Thread Ken Krugler
n I really build a request such as mydoc.title:wordexample~ AND mydoc.content:( all the content words)~0.9 ? Thank you for your help Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g

Re: MoreLikeThis: How to get quality terms from html from content stream?

2009-08-08 Thread Ken Krugler
point the stream.url param to file:///parsedfile.txt it works great. -Jay ---------- Ken Krugler TransPac Software, Inc. <http://www.transpac.com> +1 530-210-6378

  1   2   >