I'm on Windows Vista, using the trunk. Some of the JSP pages do not
execute, but instead Jetty downloads them.
solr/admin/get-properties.jsp for example. This is called by the 'JAVA
PROPERTIES' button in the main admin page.
Is this a known problem/quirk for Windows? Or fallout from a jetty
chang
How to handle special character when constructing filter query?
for example, i want to do something like:
http://.&fq=genre:ACTION & ADVENTURE
How do i handle the space and & in the filter query part?
Thanks.
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-to
You would want to evaluate the size and the number of searched, plus how often
the index will need changed data.
There is no recipe just good experience.
Bill Bell
Sent from mobile
On Feb 25, 2011, at 3:06 AM, Stefan Matheis
wrote:
> Hi Grant,
>
> Multi Sites == Multi Cores? :) http://wiki
Once i change the query to be:
+name:memory features:battery^100
+name:memory features:battery^100
+name:memory features:battery^100
+name:memori features:batteri^100.0
+name:memori features:batteri^100.0
Then it gets rid of result that only result which does not match
name:memory.
However
For the solr example(exampleDIH), how do i achieve the following with
standard queryparser?
search all docs which name field contains "memory" (primary query logic),
Within that resultset, boost the doc matches features:battery (boosting
logic).
Note that I have to use standard queryparse
Hi Andreas,
> java -jar tika-app-0.9.jar --list-met-models
> TikaMetadataKeys
> PROTECTED
> RESOURCE_NAME_KEY
> TikaMimeKeys
> MIME_TYPE_MAGIC
> TIKA_MIME_FILE
>
> Both 0.8 and 0.9 give me the same list. Is that a configuration issue?
Strange -- those are the only met models you're seeing liste
Hi Chris,
java -jar tika-app-0.9.jar --list-met-models
TikaMetadataKeys
PROTECTED
RESOURCE_NAME_KEY
TikaMimeKeys
MIME_TYPE_MAGIC
TIKA_MIME_FILE
Both 0.8 and 0.9 give me the same list. Is that a configuration issue?
I'm a bit unclear if that gets me to what I was looking for - metadata
like
(11/02/26 5:24), Rosa (Anuncios) wrote:
Hi Koji,
Yes of course i have RegexTransformer" in my .
What i'm not sure is the syntax of this i
don't need any other parameter here?
Hi Rosa,
So I've mentioned groupNames attribute for field element in previous mail.
Did you try it?
Koji
--
http://
Ahmet Arslan yahoo.com> writes:
>
> > I want a string field that is case
> > insensitive. This is what I tried:
> >
> > > sortMissingLast="true"
> > omitNorms="true">
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> >
> I want a string field that is case
> insensitive. This is what I tried:
>
> sortMissingLast="true"
> omitNorms="true">
>
>
>
>
>
>
>
>
>
>
>
> However, it is matching "opengl" for "opengl128". I
> want exact string
Yep it's fixed in 0.9.
Cheers,
Chris
On Feb 25, 2011, at 2:37 PM, Andreas Kemkes wrote:
> According to the Tika release notes, it's fixed in 0.9. Haven't tried it
> myself.
>
> A critical backwards incompatible bug in PDF parsing that was introduced in
> Tika
> 0.8 has been fixed. (TIKA-548
I want a string field that is case insensitive. This is what I tried:
However, it is matching "opengl" for "opengl128". I want exact string matches,
but I want them case-insensitive. What did I do wrong?
According to the Tika release notes, it's fixed in 0.9. Haven't tried it
myself.
A critical backwards incompatible bug in PDF parsing that was introduced in
Tika
0.8 has been fixed. (TIKA-548)
Andreas
From: Darx Oman
To: solr-user@lucene.apache.org
Sent: F
Hi Chris,
Thank you so much - that's a great start.
Andreas
From: "Mattmann, Chris A (388J)"
To: "solr-user@lucene.apache.org"
Cc: "u...@tika.apache.org"
Sent: Fri, February 25, 2011 1:21:33 PM
Subject: Re: Tika metadata extracted per supported document form
Hi Andreas,
In Tika 0.8+, you can run the --list-met-models command from tika-app:
java -jar tika-app-.jar --list-met-models
And get a print out of the met keys that Tika supports. Some parsers add their
own that aren't part of this met listing, but this is a relatively
comprehensive list.
Ch
Hi Koji,
Yes of course i have RegexTransformer" in my .
What i'm not sure is the syntax of this xpath="/product/url" regex=" /> i don't need any other parameter here?
Rosa
Le 25/02/2011 12:21, Koji Sekiguchi a écrit :
Hi Rosa,
Are you sure you have transformer="RegexTransformer" in your
Hello,
I've asked this on the Tika mailing list w/o an answer, so apologies for
cross-posting.
I'm trying to find information that tells me specifically what metadata is
provided for the different supported document formats. Unfortunately all I was
able to find so far is "The Metadata produce
: I'm using the edismax handler, although my question is probably the same for
: dismax. When the user types a long query, I use the "mm" parameter so that
: only 75% of terms need to match. This works fine, however, sometimes documents
: that only match 75% of the terms show up higher in my resul
On Fri, Feb 25, 2011 at 1:57 PM, Jan Høydahl wrote:
> I also have a case (yellow-page) where IDF comes in and destroys the rank.
> A company listing with a word which occurs in few other listings is not
> necessarily better than others just because of that. When it gets to the
> extreme value of
Jan,
You are correct, you'll need your own Similarity class.
Have a look at SweetSpotSimilarity
(http://lucene.apache.org/java/3_0_3/api/contrib-misc/org/apache/lucene/misc/SweetSpotSimilarity.html)
On 2/25/11 10:57 AM, Jan Høydahl wrote:
I also have a case (yellow-page) where IDF comes in a
I also have a case (yellow-page) where IDF comes in and destroys the rank.
A company listing with a word which occurs in few other listings is not
necessarily better than others just because of that. When it gets to the
extreme value of IDF=1, we get an artificially high IDF boost.
It is not kil
hi
if you want to index pdf files then use tika 0.6
because 0.7 and 0.8 does not detect the correctly the pdfParse
You guys are great.. I will stick for now to the release version and if I
have problem parsing I will give the branch jars a try the reason I am
looking for upgrading tika is because tika keeps improving on things like
languages, mime type support, and so on
thanks again
JO
--
View thi
If the dictionary is a Lucene index, wouldn't it be as simple as delete
using a term query? Something like this:
IndexReader sdreader = new IndexReader();
sdreader.delete(new Term("word", "sherri"));
...
sdreader.optimize();
sdreader.close();
I am guessing your dictionary is built dynamically usi
I'm using an index based spellcheck dictionary and I was wondering if there
were a way for me to manually remove certain words from the dictionary.
Some of my content has some mis-spellings, and for example when I search for
the word sherrif (which should be spelled sheriff), it get recommendation
Hi Jo,
You may consider checking out Tika trunk, where we recently have a Tika JAX-RS
web service [1] committed as part of the tika-server module. You could probably
wire DIH into it and accomplish the same thing.
Cheers,
Chris
[1] https://issues.apache.org/jira/browse/TIKA-593
On Feb 24, 201
Or how can I perform a query on the current state of the index from
within an UpdateProcessor?
Thanks
On 2/25/11 6:30 AM, Mark wrote:
I am trying to write my own custom UpdateHandler that extends
DirectUpdateHandler2.
I would like to be able to query the current state of the index within
th
I am trying to write my own custom UpdateHandler that extends
DirectUpdateHandler2.
I would like to be able to query the current state of the index within
the addDoc method. How would I be able to accomplish this?
I tried something like the following but it was a big fat fail as it
quickly c
Check the "Need help in understanding output of searcher.explain()
function" thread.
http://mail-archives.apache.org/mod_mbox/lucene-java-user/201008.mbox/%3CAANLkTi=m9a1guhrahpeyqaxhu9gta9fjbnr7-8-zi...@mail.gmail.com%3E
Regards,
Jayendra
On Fri, Feb 25, 2011 at 6:57 AM, Bagesh Sharma wrote:
>
> I am trying to configure tomcat multi instances with that many number of
> services configured too. Right now that particular tomcat exe let create
> only one. If the same exe run again and tried to configure at other
> destination folder ,It throws an exception as service already exists.How can
(11/02/25 18:30), Tarjei Huse wrote:
Hi,
On 02/25/2011 02:06 AM, Koji Sekiguchi wrote:
(11/02/24 20:18), Tarjei Huse wrote:
Hi,
I got an index where I have two fields, body and caseInsensitiveBody.
Body is indexed and stored while caseInsensitiveBody is just indexed.
The idea is that by not s
Some of these links may help...
http://www.google.com/search?client=safari&rls=en&q=apache+tomcat+download&ie=UTF-8&oe=UTF-8
Adam
On Feb 25, 2011, at 3:16 AM, rajini maski wrote:
> Can anybody help me to get the source code of the Tomcat exe
> file i.e, source code of the installa
Hi Markus,
the result of my investigation is that Lucene currently can only handle
UTF-8 code within BMP [Basic Multilingual Plane] (plane 0) <= 0x.
Any code above BMP might end in unpredictable results which is bad.
If you get invalid UTF-8 from the index and use wt=xml it gives the error
pa
Hi sir ,
Can anyone explain me how this score is being calculated. i am searching
here "software engineer" using dismax handler. Total documents indexed are
477 and query results are 28.
Query is like that -
q=software+engineer&fq=location%3Adelhi
dismax setting is -
Any idea?
On Thu, Feb 24, 2011 at 6:49 PM, Chamnap Chhorn wrote:
> There are many product names. How could I list them all, and the list is
> growing fast as well?
>
>
> On Thu, Feb 24, 2011 at 5:25 PM, Grijesh wrote:
>
>>
>> may synonym will help
>>
>> -
>> Thanx:
>> Grijesh
>> http://lucid
On Fri, Feb 25, 2011 at 3:42 PM, rajini maski wrote:
> I am trying to configure tomcat multi instances with that many number of
> services configured too. Right now that particular tomcat exe let create
> only one. If the same exe run again and tried to configure at other
> destination folder ,It
On Feb 1, 2011, at 08:58 , Estrada Groups wrote:
> Has anyone noticed the rails application that installs with Solr4.0? I am
> interested to hear some feedback on that one...
I guess you're talking about the client/ruby/flare stuff? It's been untouched
for quite a while and has not been upgr
Hi Rosa,
Are you sure you have transformer="RegexTransformer" in your ?
My question was more about the solr DIH syntax. It doesn't work either with the
new regex.
Especially the syntax for this:
<--- Is it correct? (not
the regex, the syntax)?
In this case, I think you need to have two f
You don't want to use 0.8 if you're parsing PDF.
> Your best bet is perhaps upgrading to latest 1.4 branch, i.e. 1.4.2-dev
> (http://svn.apache.org/repos/asf/lucene/solr/branches/branch-1.4/) It
> includes Tika 0.8-SNAPSHOT and is a compatible drop-in (war/jar)
> replacement with lots of other bug
I am trying to configure tomcat multi instances with that many number of
services configured too. Right now that particular tomcat exe let create
only one. If the same exe run again and tried to configure at other
destination folder ,It throws an exception as service already exists.How can
I fix th
Your best bet is perhaps upgrading to latest 1.4 branch, i.e. 1.4.2-dev
(http://svn.apache.org/repos/asf/lucene/solr/branches/branch-1.4/)
It includes Tika 0.8-SNAPSHOT and is a compatible drop-in (war/jar) replacement
with lots of other bug fixes you'd also like (check changes.txt).
svn co http
Why do you want it?
Try asking on the Tomcat list :)
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
On 25. feb. 2011, at 09.16, rajini maski wrote:
> Can anybody help me to get the source code of the Tomcat exe
> file i.e, source code of the installation e
Hi,
On 02/25/2011 02:06 AM, Koji Sekiguchi wrote:
> (11/02/24 20:18), Tarjei Huse wrote:
>> Hi,
>>
>> I got an index where I have two fields, body and caseInsensitiveBody.
>> Body is indexed and stored while caseInsensitiveBody is just indexed.
>>
>> The idea is that by not storing the caseInsensit
Hi Grant,
Multi Sites == Multi Cores? :) http://wiki.apache.org/solr/MultiCore have a look
Regards
Stefan
On Fri, Feb 25, 2011 at 3:15 AM, Grant Longhurst
wrote:
> Hi,
>
>
>
> We are a e-commerce service provider and are looking at using solr for
> all the site searches. Was just wondering what
Since my users wanted to have a partial search functionality I had to
introduce following. I have declared two EdgeNGram filters with both side
"back" and "front" since they wanted to have partial search working from any
side.
When executing search (which brings back 4K plus reco
I have the following field defined in my schema:
I have the default field set to "person" and have indexed the
following document:
The following queries return the result as expec
I have tried the steps indicated here:
http://wiki.apache.org/solr/ExtractingRequestHandler
http://wiki.apache.org/solr/ExtractingRequestHandler
and when I try to parse a document nothing would happen, no error.. I have
copied the jar files everywhere, and nothing.. can anyone give me the steps
I have used this without issue. In the example solrconfig.xml replace
this line:
with this one:
Thanks,
Matt Weber
On Thu, Feb 24, 2011 at 7:47 PM, Bill Bell wrote:
> Thanks - yeah that is why I asked how to use it. But I still don't know
> how to use it.
>
> https://hudson.apache.org/huds
Hi Koji,
My question was more about the solr DIH syntax. It doesn't work either
with the new regex.
Especially the syntax for this:
/> <--- Is it correct? (not the regex, the syntax)?
Example: url=http://www.abcd.com/product.php?id=324 --> i want to index
source = abcd.com
thanks for you
From looking at the source, I see only the following option available for me to
write search results displaying jsp's: adjust SolrDispatchFilter to treat a
JspResponseWriter specially by:
- enriching the http-request with the search queries and reponses
- forward the request down the chain
It s
Can anybody help me to get the source code of the Tomcat exe
file i.e, source code of the installation exe .
Thanks..
51 matches
Mail list logo