On Dec 18, 2007 9:41 PM, Kamran Shadkhast <[EMAIL PROTECTED]> wrote:
> ...it would be great if we could dynamically control this during
> search if we want to search with stemming or not
The easiest is probably to have two copies of your field, using
, one stemmed and one not, and search in on
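A minimal sketch of the two-field idea from the client side, in Python. The field names body_stemmed and body_exact, the helper build_select_url and the base URL are assumptions for illustration only; none of them come from this thread, and the schema still has to define both fields with the appropriate analyzers.

```python
from urllib.parse import urlencode

# Hypothetical field names: two copies of the same content, one analyzed
# with a stemmer and one without. Neither name comes from the thread.
STEMMED_FIELD = "body_stemmed"
EXACT_FIELD = "body_exact"


def build_select_url(base_url, term, stemming=True):
    """Build a select URL that queries the stemmed or the unstemmed copy."""
    field = STEMMED_FIELD if stemming else EXACT_FIELD
    query = urlencode({"q": f"{field}:{term}"})
    return f"{base_url}/select?{query}"


if __name__ == "__main__":
    base = "http://localhost:8983/solr"  # example setup URL, adjust as needed
    print(build_select_url(base, "running", stemming=True))
    print(build_select_url(base, "running", stemming=False))
```

The choice between stemmed and exact matching is then made per request, simply by picking which field the query targets.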
On 10/16/07, Nishant Soni <[EMAIL PROTECTED]> wrote:
> ...So is there a way to query solr about which of the tokens in the query
> actually matched ?...
The analyzer admin page should help, see
http://wiki.apache.org/solr/FAQ#head-b25df8c8393bbcca28f1f344c432975002e29ca9
-Bertrand
On 9/20/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote:
> ...Bertrand, does the French Snowball work fine?...
I've seen some weirdnesses, like "tennis" and "tenir" (means to hold)
both stemmed to "ten", but in all of our (simple) tests it was ok.
The application where we're using it does not requ
On 9/20/07, Thierry Collogne <[EMAIL PROTECTED]> wrote:
> ...Thank you very much. Moving the up in the chain fixed it
Yes, the problem was the EnglishPorterFilterFactory before the accents
removal: the stemmer doesn't know about accents, so no stemming
occurred on "matthé" whereas "matthe" wa
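To illustrate why the order of the filters matters, here is a small Python sketch. It is not the Porter algorithm: toy_stem is a made-up stand-in that only strips one trailing ASCII "e", and analyze is a hypothetical helper that applies filters in sequence, roughly like a chain of token filters.

```python
import unicodedata


def strip_accents(token):
    """Remove combining accent marks, e.g. 'matthé' -> 'matthe'."""
    decomposed = unicodedata.normalize("NFD", token)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


def toy_stem(token):
    """Toy stand-in for a stemmer: strips one trailing ASCII 'e' only."""
    return token[:-1] if token.endswith("e") else token


def analyze(token, filters):
    """Apply the filters in order, like a chain of token filters."""
    for f in filters:
        token = f(token)
    return token


if __name__ == "__main__":
    for chain_name, chain in (
        ("stemmer first", [toy_stem, strip_accents]),
        ("accents first", [strip_accents, toy_stem]),
    ):
        results = [analyze(t, chain) for t in ("matth\u00e9", "matthe")]
        print(chain_name, results)
```

With the stemmer first, the accented form passes through it unchanged and the two spellings end up as different terms; with accent removal first, both reach the stemmer in the same form.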
On 9/20/07, Thierry Collogne <[EMAIL PROTECTED]> wrote:
> ..when we search for "matthé" or for "matthe", we get two totally
> different results
The analyzer admin tool should help you find out what's happening, see
http://wiki.apache.org/solr/FAQ#head-b25df8c8393bbcca28f1f344c432975002e29ca9
On 8/31/07, Tim Archambault <[EMAIL PROTECTED]> wrote:
> ...I'm thinking of sending a similar
> list-serv item out, but I noticed this is a solr-user list, not necessarily
> a developers list so I thought I'd ask
Note that there's also [EMAIL PROTECTED] for such purposes, see
http://www.apach
On 7/21/07, Alessandro Ferrucci <[EMAIL PROTECTED]> wrote:
... the user could enter the following combinations of words:
...
...where the second instance is either OR
. ...
The dismax handler can indeed search terms in several fields, but I'd
also suggest, as an alternative, copying all n
On 6/13/07, Vish D. <[EMAIL PROTECTED]> wrote:
...Wonder if TOM could be useful to integrate?
http://tom.library.upenn.edu/convert/sofar.html...
It might be interesting, and as I understand it the goal of Tika is
mostly to be a framework for plugging in various types of analyzers.
So plugging in m
On 6/12/07, Vish D. <[EMAIL PROTECTED]> wrote:
...Sounds interesting. I can't seem to find any clear dates on the project
website. Do you know? ...V1 shipping date?...
Not at the moment, Tika just entered incubation and it's impossible to
predict what will happen.
But help is welcome, of cours
On 6/12/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
... I think Tika will be the way forward (some of the code for Tika is
coming from LIUS)...
Work has indeed started to incorporate the Lius code into Tika, see
https://issues.apache.org/jira/browse/TIKA-7 and
http://incubator.apache.org/proj
On 6/7/07, Ian Holsman <[EMAIL PROTECTED]> wrote:
. it's called XSLT. most modern browsers can do the transform on the
client side.
otherwise there are some server-side tools (cocoon I think does this) to
do the transform on the server before sending it out
Solr also does server-side XSLT,
On 4/25/07, brian beard <[EMAIL PROTECTED]> wrote:
...Has anyone tried using solr over https?
Is there any reason why this might not be feasible?...
As with any servlet-based system, using Apache httpd as an SSL
front-end with mod_proxy will work, and it has other benefits besides
not needin
Hi Jack,
On 4/27/07, Jack L <[EMAIL PROTECTED]> wrote:
...Is there a build script that automagically grab files from jetty's
source tree (local) and build a solr release? In other words, I can
try building with a newer version of jetty if it doesn't take too
much work - I don't know much about j
On 4/26/07, guruprasad <[EMAIL PROTECTED]> wrote:
...Is it only for Linux or can I install
Solr on my Windows Desktop too?...
Solr itself should run fine on any JVM 1.5, including Windows (and
several Solr developers are working on Windows IIUC).
Some of our docs refer to auxiliary scripts th
On 4/25/07, Jack L <[EMAIL PROTECTED]> wrote:
...Regardless, I think it's a good idea to use a newer, released (not RC)
version in general, considering 5.1 is one major version behind
Agreed, but note that we don't have any factual evidence that the
Jetty RC that we use is indeed the cause
On 4/25/07, Gal Nitzan <[EMAIL PROTECTED]> wrote:
When I search
Q=title:"dog%20da*"
...
Probably has to do with the way your "title" field is analyzed.
The best way to debug such problems is with the analyzer admin tool:
http://localhost:8983/solr/admin/analysis.jsp - that page will show
On 4/25/07, Jack L <[EMAIL PROTECTED]> wrote:
...Maybe it's time to think about upgrading Jetty...
It's in the pipeline, see https://issues.apache.org/jira/browse/SOLR-128
-Bertrand
On 4/25/07, Jack L <[EMAIL PROTECTED]> wrote:
...solr stops functioning properly after running for a few days.
The symptom is search returning nothing, or when I go to /solr/admin/,
I get file browsing page showing a list of files (.css, etc),...
Are your symptoms similar to those of
https://i
On 4/24/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
...++ . /apache-solr-1.1.0-incubating/example/solr/conf/
scripts.conf
+++ $'\r'
: command not found /apache-solr-1.1.0-incubating/example/...
Not sure what $'\r' means exactly, but it might indicate invalid
characters either in the snapsho
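For reference, $'\r' is bash's notation for a carriage return character, so a message like this usually means that a file being sourced has Windows-style (CRLF) line endings. A small Python sketch to check for that; the function names are illustrative, and the sample bytes stand in for the contents of whichever script or config file is suspected.

```python
def count_crlf_lines(data):
    """Return how many lines in the given bytes end with CRLF."""
    return sum(
        1 for line in data.splitlines(keepends=True) if line.endswith(b"\r\n")
    )


def to_unix_line_endings(data):
    """Convert CRLF line endings to LF, leaving everything else untouched."""
    return data.replace(b"\r\n", b"\n")


if __name__ == "__main__":
    # Stand-in for open(path, "rb").read() on the suspect file.
    sample = b"user=\r\n# a comment\r\nsolr_port=\n"
    print("lines with CRLF endings:", count_crlf_lines(sample))
    print("after conversion:", count_crlf_lines(to_unix_line_endings(sample)))
```

Reading the file in binary mode matters here, since text mode may translate the line endings before they can be counted.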
On 4/23/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
...The error says something about command not found line 15, but all the
files I looked at, line 15 was a comment...
Running your script with
bash -x myscript
should help, it will echo commands before executing them.
-Bertrand
On 4/17/07, Bill Tantzen <[EMAIL PROTECTED]> wrote:
...However, when I search with 'q=ethics' in solr, I get almost 10,000 matches.
With my client, I get 0
What kind of analyzer do you use when indexing that field? If you have
a stemmer, for example, "ethics" might be indexed without the e
On 4/16/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
...Yes, it's a typo.
Fixed in revision 529367.
-Bertrand
On 4/16/07, Jack L <[EMAIL PROTECTED]> wrote:
Is the lucene query syntax available in solr? ...
The syntax depends on the request handler used, if you're using the
standard one the docs are at
http://wiki.apache.org/solr/StandardRequestHandler
-Bertrand
On 4/16/07, Erik Hatcher <[EMAIL PROTECTED]> wrote:
...Further details on this: SolrCore has a finalizer() method that
closes the update handler. I'm not clear on finalizer() though. How/
when is that invoked? I know about Object.finalize(), but not
finalizer()...
Looking at the code, it s
On 4/6/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
...What "form" of logo do you prefer, A or B?
B
-Bertrand (a Tex Avery fan ;-)
On 4/6/07, Suresh Kannan <[EMAIL PROTECTED]> wrote:
I would like to post PDF, DOC, TXT into SOLR to do the indexing.
There's no way to do that directly at the moment, you'll need to
convert them to the XML format that Solr expects.
The Lucene FAQ at http://wiki.apache.org/lucene-java/LuceneFAQ
On 4/4/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
...We have been running solr for months as a band-aid, this release
integrates solr deeply...
Awesome - thanks for sharing this!
If you don't mind, it'd be cool to add some info to
http://wiki.apache.org/solr/PublicServers
-Bertrand
On 3/27/07, Shridhar Venkatraman <[EMAIL PROTECTED]> wrote:
...Reposting unABLE to match
No need to repost if your message made it to the list.
If it hasn't been answered yet, it either means that no one knows the
answer or that no one has had the time to answer yet. We're all
volunteers here.
On 3/24/07, Dimitar Ouzounov <[EMAIL PROTECTED]> wrote:
...I must be doing something wrong, maybe in the schema. Does anyone
have any suggestions?..
The best way to debug such problems is with the analyzer admin tool:
http://localhost:8983/solr/admin/analysis.jsp
You can try various combinati
Hi everybody,
Please note that the Early Bird registration for the ApacheCon Europe
2007 (May 1-4, Amsterdam) ends soon, see below for the official
announcement.
See http://www.eu.apachecon.com/ for more details and for the program.
Hope to see you there!
-Bertrand
---
On 3/21/07, Thierry Collogne <[EMAIL PROTECTED]> wrote:
I used the new jar file and removed -Dfile.encoding=UTF-8 from my jar call
and the problem isn't there anymore...
ok, thanks for the feedback!
-Bertrand
On 3/21/07, Thierry Collogne <[EMAIL PROTECTED]> wrote:
...What would be the best way of building the SimplePostTool.java
You can use "ant example" in the top-level directory of the Solr source code.
I have attached the current post.jar to SOLR-194 for convenience.
-Bertrand
On 3/21/07, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote:
...For now, using this as a workaround should help:
java -Dfile.encoding=UTF-8 -jar post.jar
http://localhost:8983/solr/update utf8-example.xml..
Should be fixed now, if you can grab the latest SimplePostToolCode [1]
it shoul
On 3/21/07, Thierry Collogne <[EMAIL PROTECTED]> wrote:
...I am using the post.jar file to update the search indexes. Problem is that
foreign characters like é, à, ... don't work correctly...
You're right, I have entered the issue in
https://issues.apache.org/jira/browse/SOLR-194
For now, usi
On 3/21/07, Thierry Collogne <[EMAIL PROTECTED]> wrote:
...I mean if I do the following.
- delete all documents from the index
- add all documents
- do a commit.
Will this result in a temporary empty index, or will I always have results?...
Changes to the index are invisibl
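A sketch of the three update messages from the question, built in Python. Assumptions: the messages are POSTed in this order to the update URL, the field names in the usage example are placeholders, delete-by-query with *:* is supported by the Solr version in use, and nothing else (an autocommit setting, for example) triggers a commit in between.

```python
from xml.sax.saxutils import escape


def delete_all_message():
    """Delete-by-query message matching every document (assumes *:* works)."""
    return "<delete><query>*:*</query></delete>"


def add_message(docs):
    """Build an <add> message from a list of {field_name: value} dicts."""
    parts = ["<add>"]
    for doc in docs:
        parts.append("<doc>")
        for name, value in doc.items():
            safe_name = escape(name, {'"': "&quot;"})
            parts.append(f'<field name="{safe_name}">{escape(str(value))}</field>')
        parts.append("</doc>")
    parts.append("</add>")
    return "".join(parts)


def full_reindex_messages(docs):
    """Messages for a full rebuild: delete all, add all, then commit once."""
    return [delete_all_message(), add_message(docs), "<commit/>"]


if __name__ == "__main__":
    # "id" and "name" are placeholder field names for the example.
    for message in full_reindex_messages([{"id": "1", "name": "A & B"}]):
        print(message)
```

Keeping the single commit as the last message is the point of the sketch: the delete and the adds are sent first, and the commit comes only once the new documents are all in.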
Hi Thierry,
On 3/21/07, Thierry Collogne <[EMAIL PROTECTED]> wrote:
...I was wondering if it is possible to do something similar using Solr...
Looks like you need faceted search, see the examples at the end of
http://wiki.apache.org/solr/SimpleFacetParameters
-Bertrand
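A rough Python sketch of what a faceted request URL can look like, using the facet, facet.field and facet.limit parameters described on that wiki page. The helper build_facet_url and the field names cat and brand are assumptions for illustration; use whatever fields your schema actually has.

```python
from urllib.parse import urlencode


def build_facet_url(base_url, query, facet_fields, limit=10):
    """Build a select URL that asks for facet counts on the given fields."""
    params = [
        ("q", query),
        ("rows", "0"),  # only the facet counts are wanted, not the documents
        ("facet", "true"),
        ("facet.limit", str(limit)),
    ]
    # One facet.field parameter per field to count on.
    params += [("facet.field", field) for field in facet_fields]
    return f"{base_url}/select?{urlencode(params)}"


if __name__ == "__main__":
    print(build_facet_url("http://localhost:8983/solr", "camera", ["cat", "brand"], limit=5))
```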
On 3/16/07, Brian Whitman <[EMAIL PROTECTED]> wrote:
...It's my understanding that the javax.xml.transform used by the
XSLTResponseWriter is 1.0 only-- at least, it does not understand any
of the 2.0 stuff I've thrown at it. I am new to world of XSL, thanks
to Solr, but it seems Saxon-B is a goo
On 3/12/07, stefano nicolai <[EMAIL PROTECTED]> wrote:
...All of these items have a field containing the date they were created
(it's a string field at the moment, as i have this type inside my DB).
I want to give a higher score to the ones with the most recent date...
You should be able to u
On 3/10/07, Walter Underwood <[EMAIL PROTECTED]> wrote:
If it does something different, that is a bug. RFC 3023 is clear. --wunder..
Sure - just wanted to confirm what I'm seeing, thanks!
-Bertrand
On 3/10/07, Walter Underwood <[EMAIL PROTECTED]> wrote:
It is better to use "application/xml". See RFC 3023.
Using "text/xml; charset=UTF-8" will override the XML
encoding declaration. "application/xml" will not...
I agree, but did you try this with our example setup, started with
"java -jar st
On 3/10/07, Morten Fangel <[EMAIL PROTECTED]> wrote:
...I send a document like the following:
---
...
I assume you're using your own code to "send" the document?
Currently you need to include a "Content-type: text/xml;
charset=UTF-8" header in your HTTP POST request, and (as you're doing)
th
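A minimal Python sketch of such a POST, assuming your own client code builds the request. The helper name build_update_request and the sample document are illustrative; the request is only constructed here, not sent.

```python
import urllib.request


def build_update_request(update_url, xml_text):
    """Build (but do not send) a POST request carrying UTF-8 encoded XML."""
    body = xml_text.encode("utf-8")  # encode explicitly, never rely on defaults
    return urllib.request.Request(
        update_url,
        data=body,
        headers={"Content-Type": "text/xml; charset=UTF-8"},
        method="POST",
    )


if __name__ == "__main__":
    doc = '<add><doc><field name="id">caf\u00e9</field></doc></add>'
    req = build_update_request("http://localhost:8983/solr/update", doc)
    print(req.get_method(), req.full_url)
    print(req.get_header("Content-type"))
    # To actually send it: urllib.request.urlopen(req)
```

The important part is that the bytes in the body and the charset announced in the header agree.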
On 3/9/07, rubdabadub <[EMAIL PROTECTED]> wrote:
...The site is a local portal and the traffic is very high and I am not
sure if Jetty is enough maybe it is
Just an additional note on this: asking four people about what "very
high" traffic means might also give you five different answers ;
On 3/9/07, rubdabadub <[EMAIL PROTECTED]> wrote:
...I am wondering what everyone is using when it comes to app server i.e.
Jetty, Resin, Tomcat etc
I suspect that asking four people might give you five different
answers on this one ;-)
Whichever servlet container you use, IMHO the importa
On 3/7/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
Oops, my bad I didn't see either 186 or 187 before entering 188. :-)
I have closed SOLR-186 and SOLR-187 as duplicates, please add relevant
info to SOLR-188 if needed.
-Bertrand
On 3/3/07, Dimitar Ouzounov <[EMAIL PROTECTED]> wrote:
...I tried using JDK 1.5.0_05, 1.5.0_10, 1.5.0_11, and 1.6.0. I used the Java
-server mode, tried increasing the heap size, but the JVM
still crashes...
Makes me wonder if it's not your new machine that's at fault (hardware
or OS problem m
On 3/3/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
...The rationale with the solrconfig stuff is that a broken config should
behave as best it can. This is great if you are running a real site
with people actively using it - it is a pain in the ass if you are
getting started and don't notice e
On 3/2/07, Jed Reynolds <[EMAIL PROTECTED]> wrote:
...my first try at defining a schema.xml file was tough because my
only feedback for a long time was "NullPointerException" from SolrCore
when I was trying to add content...
Can you give us enough information to reproduce the problem? What was
On 2/26/07, Ken Krugler <[EMAIL PROTECTED]> wrote:
...I was trying out the MoreLikeThis support, and getting some odd results...
Thanks for the info, I have added a link to your message at
https://issues.apache.org/jira/browse/SOLR-69
-Bertrand
On 2/14/07, Kainth, Sachin <[EMAIL PROTECTED]> wrote:
...The problem is that the tutorial seems to suggest that you
need Unix to run solr...
Solr itself should run fine on Windows, but you're right that most or
all of the scripts provided with Solr require a unixish shell.
Installing cygwin s
On 2/14/07, Erik Hatcher <[EMAIL PROTECTED]> wrote:
...Sorry if I'm sending things mangled somehow - and if anyone has
suggestions on correcting I'm all ears
For long links I tend to use http://tinyurl.com/, but it's a bit
painful to do that for all links.
-Bertrand
On 2/13/07, escher2k <[EMAIL PROTECTED]> wrote:
...At least from looking at the snapshooter script, it doesn't
seem to be doing anything specific...
The snapshooter script only makes an "instant snapshot" of the index
directory using cp -lr. This does not involve any copying of index
data.
The
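To show what a hard-link "copy" amounts to, here is a Python sketch that mimics cp -lr on a throwaway directory. It is an illustration of the idea, not the snapshooter script itself; hardlink_snapshot and the file names are made up, and it assumes a filesystem that supports hard links, with the snapshot directory outside the index directory.

```python
import os
import tempfile


def hardlink_snapshot(index_dir, snapshot_dir):
    """Recreate index_dir's tree under snapshot_dir, hard-linking every file."""
    for root, _dirs, files in os.walk(index_dir):
        rel = os.path.relpath(root, index_dir)
        target_root = os.path.normpath(os.path.join(snapshot_dir, rel))
        os.makedirs(target_root, exist_ok=True)
        for name in files:
            # A hard link is a second directory entry for the same data.
            os.link(os.path.join(root, name), os.path.join(target_root, name))


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        index = os.path.join(tmp, "index")
        os.makedirs(index)
        with open(os.path.join(index, "segments"), "w") as f:
            f.write("index data")
        snapshot = os.path.join(tmp, "snapshot")
        hardlink_snapshot(index, snapshot)
        same = os.path.samefile(
            os.path.join(index, "segments"), os.path.join(snapshot, "segments")
        )
        print("snapshot shares data with the index:", same)
```

Because both names point at the same data on disk, the snapshot costs almost no time or space, whatever the size of the index.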
On 2/5/07, Erik Hatcher <[EMAIL PROTECTED]> wrote:
...What numbers are folks capturing? What techniques are you using to
capture numbers?...
I've been using my httpstone utility
(http://code.google.com/p/httpstone/) along with ab
(http://httpd.apache.org/docs/2.2/programs/ab.html) to generate
On 1/31/07, Andrew Nagy <[EMAIL PROTECTED]> wrote:
>... Yes, there's a patch in http://issues.apache.org/jira/browse/SOLR-69 -...
Any word on something like this being incorporated into the official SOLR
release?
The patch is quite simple, I think we could commit it soon if the
other committer
On 1/31/07, Brian Whitman <[EMAIL PROTECTED]> wrote:
Does Solr have support for the Lucene query-contrib "MoreLikeThis"
query type or anything like it? ...
Yes, there's a patch in http://issues.apache.org/jira/browse/SOLR-69 -
if you try it, please add your comments on that page.
-Bertrand
On 1/29/07, Leandro Saad <[EMAIL PROTECTED]> wrote:
...I'd like to know if solr can index Word, Excel and PDF files or I must
create a xml representation of those files matching my schema?...
Currently you must create the XML yourself outside of Solr.
This might change, see https://issues.apac
On 10/31/06, Yu-Hui Jin <[EMAIL PROTECTED]> wrote:
I kept the solr jetty server running on my box for a couple of days. Today I
found I can no longer access the admin page. It gives the following error
page:
HTTP ERROR: 404...
I've seen the same thing today on one of my Solr in
On 1/22/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
...When we get to it, I'd like to hear why it (things like PDF parsing)
should be inside Solr rather than outside using our update interfaces
Same here.
I haven't had time to follow the recent (rich) design discussions
about this stuff, b
On 1/17/07, Luis Neves <[EMAIL PROTECTED]> wrote:
...I see that is possible to use
"Boost Functions" to influence the score. How would that work in order to
improve the score of recent documents? (I have a timestamp field in the
schema)...
I've been using expressions like these in boolean quer
Hi Jorg,
On 1/16/07, Jörg Pfründer <[EMAIL PROTECTED]> wrote:
...is there anyone who has experience on internationalization
(internationalisation) with SOLR?...
I've been setting up a French-language index over the last few months, and
it works very well.
There are some pointers on how to analyz
On 1/16/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote:
...Have a look at
https://issues.apache.org/jira/browse/SOLR-86...
Right, I should have mentioned this one as well. I have linked SOLR-20
and SOLR-86 now, so that people can see the various options for Java
clients.
-Bertrand
On 1/16/07, Pavel Penchev <[EMAIL PROTECTED]> wrote:
...What about the case where solr and my application are deployed in the
same instance of say tomcat. Is there a way to skip the http requests
and use a direct api?...
The javax.servlet.RequestDispatcher interface allows you to access
other
On 1/16/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
...and how would you do it calling it from another web application, let's
say from a servlet or so?...
Doesn't make much difference if your client is a standalone or a web
application: your Solr client class will need to be configured wit
On 1/16/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
...Could someone give me some code examples on how Solr requests can be
called by Java code...
Although our Java client landscape is still a bit fuzzy (there are
several variants floating around), you might want to look at the code
found
On 1/16/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
Nick - this depends on the analyzer used to index the field as well as the
analyzer used in your search query
Note that the Solr "analysis" page, in the admin interface, allows you
to see exactly how your field's content is converted for
On 1/9/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
...I would like to use faceted browsing to group documents by year,
month, and day. I can think of a few ways to do this, but I'd like to
see what folks think before I start down the wrong track
Dunno if you've already read it, but I foun
On 12/23/06, Alan Burlison <[EMAIL PROTECTED]> wrote:
...As well as centralising the index, I also want
to centralise the handling of the different document types...
My "Subversion and Solr" presentation from the last Cocoon GetTogether
might give you ideas for how to handle this, see the link
On 12/18/06, Linda Tan <[EMAIL PROTECTED]> wrote:
I just learned no attachments are allowed on this list. I've put the
image in the jira..
Thanks, it looks good indeed!
-Bertrand
On 12/15/06, Fuad Efendi <[EMAIL PROTECTED]> wrote:
...Can we play more with same base
idea?...
You're welcome to create modified or different proposals of course!
The logo was created by our trainee Nicolas, who's finishing his time
at my office next week, and he won't have much time (if any
Hi Solr users,
This is also being discussed on solr-dev, but we thought we'd gather
opinions from the whole community.
We are considering a new logo and would like to have your input:
please tell us which you like better, the current logo or the new
proposal, see http://issues.apache.org/jira/br
On 11/30/06, Mike Klaas <[EMAIL PROTECTED]> wrote:
...Try something like:
wget http://localhost:/gaz/solr/f0.xml -O - | curl
http://localhost:8983/solr/update --data-binary - -H
'Content-type:text/xml; charset=utf-8'
and if you use curl you can use it on both sides to avoid the
dependency
Hi Thorsten, good to see you here!
On 11/28/06, Thorsten Scherler
<[EMAIL PROTECTED]> wrote:
...Following the tutorial and looking at the examples it seems that solr
only supports one document type.
3007WFP
Dell Widescreen UltraSharp 3007WFP
...
That's right, to add documents to a So
On 11/23/06, Nicolas St-Laurent <[EMAIL PROTECTED]> wrote:
...I index huge Oracle tables with Lucene with a custom made
indexer/search engine. But I would prefer to use Solr instead...
Instead of using Lucene's API directly, with Solr you'll have to add
your documents to the index using HTTP P
On 9/24/06, Erik Hatcher <[EMAIL PROTECTED]> wrote:
...perhaps some authentication/
authorization as well as HTTPS should eventually make it into the
core, but getting more fine grained is unnecessary...
If meaningful URLs are used (admin/stats, admin/config,
admin/analysis, etc.), it is relat
Hi Simon,
...are there solr servers in production...
You can see a list at http://wiki.apache.org/solr/PublicServers -
there's some solid stuff running on Solr already!
-Bertrand
On 8/30/06, Diogo Matos <[EMAIL PROTECTED]> wrote:
...org.apache.solr.core.SolrException: Error instantiating class class
org.apache.lucene.analysis.snowball.SnowballAnalyzer
at org.apache.solr.core.Config.newInstance(Config.java:213)...
It looks like SnowballAnalyzer cannot be used in tha
On 8/10/06, Andrew May <[EMAIL PROTECTED]> wrote:
...I'm using the 28th July nightly build, which I believe contains all the
recent fixes...
Does your build contain the
http://issues.apache.org/jira/browse/SOLR-38 patch, and if so did you
try posting the utf8-example.xml document with post.sh
On 7/28/06, Yonik Seeley <[EMAIL PROTECTED]> wrote:
...Getting all the little details of connection handling correct can be
tough... it's probably a good idea if we work toward common client
libraries so everyone doesn't have to reinvent them
Jakarta's HttpClient [1] is IMHO a good base fo
On 7/19/06, Yonik Seeley <[EMAIL PROTECTED]> wrote:
...Can anyone else shed some light on this?..
I have to run now but I *think* there are encoding settings in
web.xml, and IIRC they might be different for Tomcat or Jetty. Setting
UTF-8 everywhere should help.
-Bertrand
On 7/19/06, Tricia Williams <[EMAIL PROTECTED]> wrote:
...What I called the _solr url encoding_ was the q= parameter
translated into encoding in the url...
I think I've seen the same problem, haven't investigated deeper but
IIUC the encoding used when posting a form is related to both the
enc
Hi Yonik,
Thanks very much for your replies!
Le 12 avr. 06 à 16:45, Yonik Seeley a écrit :
On 4/12/06, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote:
...The project that I'm looking at is currently single-language
(French), which I assume can be handled by static configura
Hi Solr users,
I'm investigating indexers for a project, played a bit with both Solr
and Nutch recently, and the Solr "RESTful indexing component" concept
fits our needs quite well.
Before I dig too deep, are there any known limitations w.r.t indexing
of non-english text?
I know Lucene