RE: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Van Tassell, Kristian
"wi", "fi" and "WiFi"). So really, you need to craft your own analysis chain to fit the kind of data you are working with. Upayavira On Mon, Sep 30, 2013, at 06:50 PM, Van Tassell, Kristian wrote: > I have a search term "multi-CAD" being issued on tokenized text.

Searching on (hyphenated/capitalized) word issue

2013-09-30 Thread Van Tassell, Kristian
I have a search term "multi-CAD" being issued on tokenized text. The problem is that you cannot get any search results when you type "multicad" unless you add a hyphen (multi-cad) or type "multiCAD" (omitting the hyphen, but correctly adding the CAPS into the spelling). However, for the simi
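
The mismatch described here comes down to the index and the query producing different tokens for the same word. A toy sketch in Python of the idea, not Solr's actual analysis chain: if both sides are reduced to one normalized form, the spelling variants collapse to a single token. The `normalize` helper is hypothetical; in Solr the equivalent would be configured in the field type's analyzer (for example with a word-delimiter filter that also emits the catenated form), and the exact settings depend on the data.

```python
def normalize(term: str) -> str:
    """Drop hyphens and lowercase, so spelling variants share one token."""
    return term.replace("-", "").lower()

# The variants mentioned in the message above.
variants = ["multi-CAD", "multicad", "multi-cad", "multiCAD"]

# Every variant maps to the same normalized key.
keys = {normalize(v) for v in variants}
```

The point is only that index-time and query-time analysis must agree on the final token; which filters achieve that is a schema decision.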

saxon exception on startup

2013-07-03 Thread Van Tassell, Kristian
I made a minor change in my solr schema and suddenly Solr won't start (4.2 running in Tomcat). I have the same files and configuration on another machine (running 4.2 and Tomcat), and the same configuration on yet another (a JBoss one). All other webapps start ok. Has anyone seen this before?

RE: How to query Solr for empty field or specific value

2013-07-02 Thread Van Tassell, Kristian
Message- From: Van Tassell, Kristian Sent: Tuesday, July 02, 2013 3:47 PM To: solr-user@lucene.apache.org Subject: How to query Solr for empty field or specific value Hello, I'm using Solr 4.2 and am trying to get a specific value (blue) or null field (no color) returned by my fil

How to query Solr for empty field or specific value

2013-07-02 Thread Van Tassell, Kristian
Hello, I'm using Solr 4.2 and am trying to get a specific value (blue) or null field (no color) returned by my filter query. My results should yield 3 documents (If I execute the two separate filters in different queries, I get 2 hits for one query and 1 for the other). I've tried this (blue o
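
A common way to express "this value OR no value at all" in Lucene query syntax is to pair the positive clause with a match-all clause minus an open-ended range. The sketch below only builds the filter string; the field name `color` and the helper `value_or_missing` are assumptions for illustration, since the actual field name is not shown in the message.

```python
from urllib.parse import urlencode

def value_or_missing(field: str = "color", value: str = "blue") -> str:
    """Build a filter matching docs with field:value or with no value in field.

    A purely negative clause matches nothing on its own inside a boolean
    query, so it is anchored to *:* (all documents) before subtracting.
    """
    return f"{field}:{value} OR (*:* -{field}:[* TO *])"

fq = value_or_missing()
# URL-encoded request parameters (q=*:* is just a placeholder main query).
params = urlencode({"q": "*:*", "fq": fq})
```

Without the `*:*` anchor, the negated range clause has nothing to subtract from, which is a frequent reason the combined filter returns fewer hits than the two separate filters.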

RE: Dynamically create new fields

2013-06-12 Thread Van Tassell, Kristian
Great, thank you! -----Original Message----- From: Chris Hostetter [mailto:hossman_luc...@fucit.org] Sent: Wednesday, June 12, 2013 2:27 PM To: solr-user@lucene.apache.org Subject: Re: Dynamically create new fields : Dynamically adding fields to schema is yet to get released.. : : https://issu

Dynamically create new fields

2013-06-12 Thread Van Tassell, Kristian
We have a need to dynamically create new fields. These fields would mostly be used for new facet types. While I could modify, as needed, the schema, that presents some deployment issues (such as needing to restart the Solr service). Whereas, something such as elasticsearch's schema-free model,

RE: Error while starting Solr on Websphere

2013-04-24 Thread Van Tassell, Kristian
r home entirely from the WebLogic instance (just to be sure). -----Original Message----- From: Gora Mohanty [mailto:g...@mimirtech.com] Sent: Wednesday, April 24, 2013 3:31 PM To: solr-user@lucene.apache.org Subject: Re: Error while starting Solr on Websphere On 25 April 2013 01:42, Van Tassell, K

RE: Error while starting Solr on Websphere

2013-04-24 Thread Van Tassell, Kristian
I'm having this same issue (versions and all). Was there ever a response to this question? I can't seem to find one. Thanks in advance! From divz80 Subject Error while starting Solr on Websphere Date Wed, 20 Mar 2013 23:13:07 GMT Hi, i'm attempting to setup Solr 4.2.0 o

SSLInitializationException on startup

2013-04-22 Thread Van Tassell, Kristian
I'm configuring a number of servers to support Solr 4.2 and have come across one that will not start. This is a pre-existing application server (running Tomcat) and I'm not quite sure what to look for. Has anyone seen this before and solved it? Thanks in advance! INFO: Creating new http client

RE: AW: AW: java.lang.OutOfMemoryError: Map failed

2013-04-03 Thread Van Tassell, Kristian
I just posted a similar error and discovered that decreasing the Xmx fixed the problem for me. The "free" command/top, etc. indicated I was flying just below the threshold for my allowed memory, and with swap/virtual space available, so I'm still confused as to what the issue is, but you may try

SolrException: Error opening new searcher

2013-04-03 Thread Van Tassell, Kristian
We're suddenly seeing an error when trying to do updates/commits. This is on Solr 4.2 (Tomcat, solr war deployed to webapps, on Linux SuSE 11). Based off of some initial searching on things related to this issue, I have set ulimit in Linux to 'unlimited' and verified that Tomcat has enough memor

RE: Confusion over Solr highlight hl.q parameter

2013-04-03 Thread Van Tassell, Kristian
M To: solr-user@lucene.apache.org Subject: Re: Confusion over Solr highlight hl.q parameter (13/04/03 5:27), Van Tassell, Kristian wrote: > Thanks Koji, this helped with some of our problems, but it is still not > perfect. > > This query, for example, returns no highlight

RE: Confusion over Solr highlight hl.q parameter

2013-04-02 Thread Van Tassell, Kristian
6:14 PM To: solr-user@lucene.apache.org Subject: Re: Confusion over Solr highlight hl.q parameter (13/03/16 4:08), Van Tassell, Kristian wrote: > Hello everyone, > > If I search for a term “baz” and tell it to highlight it, it highlights just > fine. > > If, however, I search

RE: Highlighting result problems

2013-03-31 Thread Van Tassell, Kristian
") then I do not always get good highlighting results back (highlighting fields are sometimes empty). -----Original Message----- From: Van Tassell, Kristian [mailto:kristian.vantass...@siemens.com] Sent: Friday, March 29, 2013 7:43 AM To: solr-user@lucene.apache.org Subject: Highlighting result p

Cannot find word with accent

2013-03-29 Thread Van Tassell, Kristian
I'm trying to find documents with this word: général It returns one hit for a document containing "General". If I search for g*ral I get 230 hits, of which some contain the word général. I'm not sure where to begin looking, I believe everything is encoded correctly. The text_fr (French) fieldT

What to expect when testing Japanese search index

2013-03-21 Thread Van Tassell, Kristian
I’m trying to set up our search index to handle Japanese data, and while some searches yield results, others do not. This is especially true the smaller the search term. For example, searching for this term: 更 Yields no results even though I know it appears in the text. I understand that this

RE: Getting back highlights almost always works...

2013-03-20 Thread Van Tassell, Kristian
That was it, thank you! -----Original Message----- From: Koji Sekiguchi [mailto:k...@r.email.ne.jp] Sent: Tuesday, March 19, 2013 5:17 PM To: solr-user@lucene.apache.org Subject: Re: Getting back highlights almost always works... (13/03/20 6:14), Van Tassell, Kristian wrote: > ...but

Getting back highlights almost always works...

2013-03-19 Thread Van Tassell, Kristian
...but I'm finding some examples where the stored text is so big (14,000 words) that Solr fails to highlight anything. But the data is definitely in the text field and is returning due to that hit. Does anyone have any ideas why this happens?
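
One possible cause, stated here as an assumption because the reply's text is not shown in this listing: the standard highlighter only analyzes a limited number of characters per field (the `hl.maxAnalyzedChars` parameter), so a match that occurs late in a very long stored field is never seen. A minimal Python sketch of request parameters that raise that limit; the field name `text` and the value `1000000` are illustrative, not taken from the thread.

```python
from urllib.parse import urlencode

def highlight_params(query: str, field: str = "text",
                     max_chars: int = 1_000_000) -> dict:
    """Request parameters asking the highlighter to scan more of the field."""
    return {
        "q": query,
        "hl": "true",
        "hl.fl": field,
        # Assumed culprit: characters examined per field when highlighting.
        "hl.maxAnalyzedChars": str(max_chars),
    }

query_string = urlencode(highlight_params("baz"))
```

Raising the limit makes highlighting of large documents slower, so it is worth setting it only as high as the largest stored field requires.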

RE: Boosting based on filter query

2013-03-11 Thread Van Tassell, Kristian
ismax/edismax you can use bq=filetype:pdf^10 (where "filetype:pdf" is a valid Lucene query parser expression for your documents). Erik On Mar 11, 2013, at 09:31 , Van Tassell, Kristian wrote: > I want to be able to boost results where the filetype is a pdf: > > Here is som

Boosting based on filter query

2013-03-11 Thread Van Tassell, Kristian
I want to be able to boost results where the filetype is a pdf: Here is some pseudo code so I don't misrepresent/misinterpret via a URL: search("foobar") foreach result (where filetype==pdf) { boost^10 } Is there a way to do this? Thanks in advance!
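
The reply in this thread suggests a boost query (`bq=filetype:pdf^10`) with the dismax/edismax parser. A short Python sketch of how those request parameters might be assembled; the helper name `boosted_search` is hypothetical, and `filetype` is the field named in the question.

```python
from urllib.parse import urlencode

def boosted_search(term: str, boost_clause: str = "filetype:pdf^10") -> str:
    """Build a query string that boosts, rather than filters, matching docs."""
    params = {
        "q": term,
        "defType": "edismax",  # bq is a dismax/edismax parameter
        "bq": boost_clause,    # adds score to docs matching the clause
    }
    return urlencode(params)

query_string = boosted_search("foobar")
```

Unlike a filter query, `bq` does not exclude non-PDF documents; it only raises the score of those that match the clause.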

RE: Unable to match partial word

2013-03-05 Thread Van Tassell, Kristian
you need to be aware of the performance issues. If the words are related parts of speech, like singular and plural, you can use a stemmer to index a root form. You can also configure synonyms at index time, for things like "TV" and "television". wunder On Mar 5, 2013, at

Unable to match partial word

2013-03-05 Thread Van Tassell, Kristian
I'm doing a search for "prod" and would assume it would pull back matches for product, production, etc. but I get zero hits. Any ideas? Here is my field type:

RE: Search returns hits but highlighting does not work for certain field

2013-03-04 Thread Van Tassell, Kristian
wData fl=*,score defType=edismax hl=true hl.fl=rawData hl.simple.pre=BBB hl.simple.post=EEE hl.fragsize=10 hl.snippets=1000 but contains no highlighting results (ie - ). -----Original Message----- From: Van Tassell, Kristian [mailto:kristian.vantass...@siemens.com] Sent: Friday, March 01, 2013

RE: Defining tokenizer pattern with < character

2013-03-01 Thread Van Tassell, Kristian
owe wrote: > Kristian, > > I think what you want is pattern="<[^>]>" (untested) - that is, you > probably don't want to regex-escape the character class brackets "[" and "]", > and you should html-escape the angle brackets. >

Highlighting in non-tokenized field

2013-02-26 Thread Van Tassell, Kristian
I have a field (non tokenized) that I want to search and then highlight on: Schema- Query- q=creates%20a%20new%20layer qf=text_exact fl=* defType=edismax hl=true hl.fl=text_exact hl.simple.pre=BBB hl.simple.post=EEE hl.fragsize=10 hl.snippets=1000 The highlighting comes back as tokenized,

RE: Matching an exact word

2013-02-21 Thread Van Tassell, Kristian
really is no way to do what you are looking for within Solr. I'd suggest you'll need to do some parsing at your side and, if you find quotes, do the query against a different field. Upayavira On Thu, Feb 21, 2013, at 06:17 PM, Van Tassell, Kristian wrote: > I'm trying to mat

Matching an exact word

2013-02-21 Thread Van Tassell, Kristian
I'm trying to match the word "created". Given that it is surrounded by quotes, I would expect an exact match to occur, but instead the entire stemming results show for words such as create, creates, created, etc. q="created"&wt=xml&rows=1000&qf=text&defType=edismax If I copy the text field to a
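
The reply in this thread recommends parsing on the client side and, when the user typed quotes, querying a different field. A minimal sketch of that routing in Python, assuming a stemmed field `text` (from the query above) and a hypothetical unstemmed copy named `text_exact`; both the helper and the second field name are illustrative.

```python
def pick_query_field(user_query: str, stemmed: str = "text",
                     exact: str = "text_exact") -> str:
    """Route fully quoted input to the unstemmed field, everything else
    to the stemmed one."""
    q = user_query.strip()
    is_quoted = len(q) >= 2 and q.startswith('"') and q.endswith('"')
    return exact if is_quoted else stemmed

qf_for_quoted = pick_query_field('"created"')
qf_for_plain = pick_query_field("created")
```

The chosen field name would then be passed as `qf`, so that quoted input never goes through the stemmer.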

Efficiently mining or parsing data out of XML source files

2012-05-31 Thread Van Tassell, Kristian
I'm just wondering what the general consensus is on indexing XML data to Solr in terms of parsing and mining the relevant data out of the file and putting them into Solr fields. Assume that this is the XML file and resulting Solr fields: XML data: foo garbage data Solr Fields: Id=1234 Title

RE: Crawling an SCM to update a Solr index

2012-04-22 Thread Van Tassell, Kristian
/solr-performance-monitoring/index.html > > From: "Van Tassell, Kristian" >To: "solr-user@lucene.apache.org" >Sent: Friday, April 20, 2012 3:26 PM >Subject: Crawling an SCM to update a Solr index > >Hello everyone, > >

Crawling an SCM to update a Solr index

2012-04-20 Thread Van Tassell, Kristian
Hello everyone, I'm in the process of pulling together requirements for a SCM (source code manager) crawling mechanism for our Solr index. I probably don't need to argue the need for a crawler, but to be specific, we have an index which receives its updates from a custom built application. I wo

Another zero match issue

2012-02-03 Thread Van Tassell, Kristian
Hi everyone! I'm also having some zero match weirdness. When I execute this search: ?q=Create+a+self+contained+Part+Module&defType=edismax&qf=location^0.9+text^0.8+fileName^8.0+title^4.0 I get ZERO results. If I remove the fileName qf parameter (an indexed but not stored field), I get 5 hits.

RE: UTF-8 support during indexing content

2012-02-01 Thread Van Tassell, Kristian
id you confirm the document contains the right quote marks immediately prior to uploading? If you just visually inspected it, then use whatever tool you viewed it in to see what the character set is. cheers, Travis On Wed, Feb 1, 2012 at 9:17 AM, Van Tassell, Kristian < kristian.vantass...@s

UTF-8 support during indexing content

2012-02-01 Thread Van Tassell, Kristian
Hello everyone, I have a question that I imagine has been asked many times before, so I apologize for the repeat. I have a basic text field with the following text: the word ”stemming” in quotes Uploading the data yields no errors, however when it is indexed, the text looks like this: