Hi,
You can use curl with a file if you put the "@" char in front of it's name.
(Otherwise curl expects the data on the commandline).
curl http://localhost:8080/solr/update --data-binary @articles.xml
-Ursprüngliche Nachricht-
Von: Sean Bowman [mailto:[EMAIL PROTECTED]
Gesendet: Donner
I thought that too, I opened it up via vi and nothing was there.
Usually if I have a PC encoding issue ( I use EditPlus as a text
editor) it will show up in vi.
On Apr 26, 2007, at 5:19 PM, Ryan McKinley wrote:
paladin:/data/solr mtorgler1$ curl http://localhost:8080/solr/
update --data-bin
Wrapping is purely via email, the text xml is single line (on
purpose, originally it was a 3 paragraph field that was html encoded,
I simplified the text as a 'just in case')
On Apr 26, 2007, at 5:09 PM, Cody Caughlan wrote:
For the storyText "field" element, is that wrapping only in this em
The curl_multi is probably the most effective way, using straight PHP.
Another option would be to spawn several jobs, assuming unix/linux, and wait
for them to get done. It doesn't give you very good error handling (well,
none at all actually!) but would let you run multiple indexing jobs at once
On 4/26/07, Michael Kimsal <[EMAIL PROTECTED]> wrote:
We're (and by 'we' I mean my esteemed colleague!) working on patching a few
of these items to be in the solrconf.xml file and should likely have some
patches submitted next week. It's being done on 'company time' and I'm not
sure about the ex
Can you point me to the process for submitting these small patches? I'm
looking at the jira site but don't see much of anything there outlining a
process for submitting patches. Sorry to be so basic about this, but I'm
trying to follow correct procedures on both sides of the aisle, so to speak.
Moved to:
https://issues.apache.org/jira/browse/SOLR-217
patch included.
- will
-Original Message-
From: Will Johnson [mailto:[EMAIL PROTECTED]
Sent: Wednesday, April 25, 2007 2:54 PM
To: solr-user@lucene.apache.org
Subject: unused fields
Is there any way to make Solr not throw an Exc
That's an awesome tip to keep in ol' toolbox, Christian.
Michael Kimsal wrote:
Can you point me to the process for submitting these small patches? I'm
looking at the jira site but don't see much of anything there outlining a
process for submitting patches. Sorry to be so basic about this, but I'm
trying to follow correct procedures on both sides of t
Hello,
I'm running into some strange results for some facets of mine. Below you'll
see the XML returned from solr. I did a query using the standard request
handler. Notice the duplicated values returned (american standard, delta,
etc). There is actually quite a few of them. At first I though it ma
Have you tried using the schema.xml that is in example/solr/conf. It that
works then the problem is definitely in your schema.xml.
Bill
On 4/26/07, James liu <[EMAIL PROTECTED]> wrote:
but it is ok when i use tomcat.
2007/4/26, Ken Krugler <[EMAIL PROTECTED]>:
>
> >3.0.23 yesterday i try and
Hello Bertrand,
Is there a build script that automagically grab files from jetty's
source tree (local) and build a solr release? In other words, I can
try building with a newer version of jetty if it doesn't take too
much work - I don't know much about jetty or solr at the code level.
--
Best re
Hopefully someone can chime in with some tips, thanks!
It's likely you have the facet category added more than once for one
or more docs. Like this;
american standard
american standard
Are you adding the facet values on-the-fly? This happened to me and I
solved it by removing the duplicat
On Apr 27, 2007, at 4:24 AM, Burkamp, Christian wrote:
curl http://localhost:8080/solr/update --data-binary @
I think the issue was with a bad file in /data/solr/conf. After a
bunch of testing out of the installtion directory, I was able to use
the post.sh script to post correctly into t
So I just create my own 'issue' first? OK. Thanks.
On 4/27/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
Michael Kimsal wrote:
> Can you point me to the process for submitting these small patches? I'm
> looking at the jira site but don't see much of anything there outlining
a
> process for su
What's the procedure then for something to get included in the next
release?
Thanks again all!
On 4/27/07, Michael Kimsal <[EMAIL PROTECTED]> wrote:
So I just create my own 'issue' first? OK. Thanks.
On 4/27/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
>
> Michael Kimsal wrote:
> > Can you
Hi,
I'm considering adding support for caching results in memcached. Questions:
1. Has anyone already done this? (searched, didn't find anything)
2. Can anyone think of any reasons why this might not be a good idea? (I *just*
started considering this)
3. I read http://wiki.apache.org/solr/SolrC
Once the code/patch in the issue is put/committed to SVN, it means it will be
in the next release. You get your patch committed faster if it's clear, well
written and explained, if it comes with a unit test if it's a code change, and
so on.
Otis
. . . . . . . . . . . . . . . . . . . . . . . .
Hi Jack,
On 4/27/07, Jack L <[EMAIL PROTECTED]> wrote:
...Is there a build script that automagically grab files from jetty's
source tree (local) and build a solr release? In other words, I can
try building with a newer version of jetty if it doesn't take too
much work - I don't know much about j
Would creating a new QueryRateFilter servlet filter be a good place to put
this? This way it could stay out of the Solr core and coult be turned on/off
via web.xml.
Otis
- Original Message
From: Chris Hostetter <[EMAIL PROTECTED]>
To: "solr-user@lucene.apache.org"
Sent: Thursday, Apr
The example EmbeddedSolr class on the wiki makes use of getUpdateHandler
which was added after 1.1 (so it seems to be available only on trunk).
I'd really like to move to an embedded Solr sooner rather than later. My
questions are:
- Would it be easy/possible to work around the lack of
getUp
: Has anyone tried using solr over https?
not personally, my Solr instances are usually protected behind a firewall
(just like my databases) so i'm not typically worried about
securingconnections to them from my applications
: Is there any reason why this might not be feasible?
none that i can
: AND does not controll scoring, only matching. If you want dismax to
: be purely additive, pass tie=1.0 to the handler.
more specificly: the defaultOperator option in the schema.xml does not
affect the dismax parser used on the q param at all (only the stock
SolrQueryParser used for things like
Daniel Einspanjer wrote:
The example EmbeddedSolr class on the wiki makes use of getUpdateHandler
which was added after 1.1 (so it seems to be available only on trunk).
I'd really like to move to an embedded Solr sooner rather than later. My
questions are:
- Would it be easy/possible to work
Additional questions regarding EmbeddedSolr (for using the Solr API directly
without HTTP):
- Can I use separate JVMs for same Directory object? One process will
create/update/delete, and another search.
- Can I use separate JEE contexts inside same JVM?
Looks like "singleton" is a must, but sep
On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote:
Hello,
I'm running into some strange results for some facets of mine. Below you'll
see the XML returned from solr. I did a query using the standard request
handler. Notice the duplicated values returned (american standard, delta,
etc). There is actual
Is it possible / is it an ok idea to have multiple solr instances
running on the same machine pointing to the same index files?
Essentially, I have two distinct needs - in some cases i need a commit
immediately after indexing one document, but most of the time it is fine
to wait 10 mins for ch
If you store internal docids, then you need to add the specific reader
(or index version?) as part of the key since the ids are transient.
-Yonik
On 4/27/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:
Hi,
I'm considering adding support for caching results in memcached. Questions:
1. Has any
On 4/25/07, brian beard <[EMAIL PROTECTED]> wrote:
...Has anyone tried using solr over https?
Is there any reason why this might not be feasible?...
As with any servlet-based system, using apache httpd's as an SSL
front-end with mod_proxy will work, and it has other benefits besides
not needin
On 4/27/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
Is it possible / is it an ok idea to have multiple solr instances
running on the same machine pointing to the same index files?
If only one at a time is used to update the index, then yes it is possible.
Essentially, I have two distinct nee
Hi Otis,
I'm considering adding support for caching results in memcached. Questions:
1. Has anyone already done this? (searched, didn't find anything)
Not exactly, but we do something similar to this for Nutch searches
using ehcache (http://krugle.com/kse/projects/eFNJEmX). But we store
th
On 4/26/07, Erik Hatcher <[EMAIL PROTECTED]> wrote:
I think we should open up as many of the switches as we can to
QueryParser, allowing users to tinker with them if they want, setting
the defaults to the most common reasonable settings we can agree upon.
I think we should also try and handle w
On 4/27/07, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:
Would creating a new QueryRateFilter servlet filter be a good place to put
this? This way it could stay out of the Solr core and coult be turned on/off
via web.xml.
There's already gotta be some nice external tools that parse log files
This is for monitoring -- what happened in the last 30 seconds.
Log file analysis doesn't really do that.
I think the XML output in admin/stats.jsp may be enough for us.
That gives the cumulative requests on each handler. Those are
counted in StandardRequestHandler DisMaxRequestHandler and
are ava
: 2. Can anyone think of any reasons why this might not be a good idea? (I
: *just* started considering this)
: 3. I read http://wiki.apache.org/solr/SolrCaching , but I think the
: whole cache discarding and warming wouldn't be needed if what I store in
: memcached is: StringRepresentationOfQuer
I have a dynamic field setup for facets. It looks like this:
I do this, because we add facets quite often, so having to modify the schema
every time would be unfeasible.
I'm currently reindexing from scratch, so I cannot try wt=python for little
bit longer. Once it's done indexing I'll give t
: - Can I use separate JVMs for same Directory object? One process will
: create/update/delete, and another search.
: - Can I use separate JEE contexts inside same JVM?
:
: Looks like "singleton" is a must, but separate "search" should be
: possible...
in theory it should work, Solr doens't do an
: > Essentially, I have two distinct needs - in some cases i need a commit
: > immediately after indexing one document, but most of the time it is fine
: > to wait 10 mins for changes if that has better performance.
:
: Sounds like a configuration issue... set autocommit to 10 minutes, but
: explic
On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote:
I have a dynamic field setup for facets. It looks like this:
I do this, because we add facets quite often, so having to modify the schema
every time would be unfeasible.
I'm currently reindexing from scratch, so I cannot try wt=python for little
b
Walter Underwood wrote:
This is for monitoring -- what happened in the last 30 seconds.
Log file analysis doesn't really do that.
I think the XML output in admin/stats.jsp may be enough for us.
That gives the cumulative requests on each handler. Those are
counted in StandardRequestHandler DisMax
: i will use /usr/solrapp/conf/solr1_solrconfig.xml, solr2_solrconfig.xml,
: solr3_solrconfig.xml...and so.
:
: when i test these instance, i just stay in /usr/solrapp.conf/,,,not like
: now,
:
: i have to change
: /usr/solrapp/solr1/conf,,,/usr/solrapp/solr2/conf,,,/usr/solrapp/solr3/conf,
Hmmm.
: The "solr" on the rsync command line is just a label which is defined in
: rsyncd.conf on the master. rsyncd.conf is created on the fly by the script
: rsyncd-start:
...
: This label is then mapped to the path defined in $data_dir.
Ah... right, i forgot about that.
: > Why does it nee
Ok, I just finished indexing about 20k in documents. I took a look at so far
the problem has not appearred again. What I'm thinking caused it was I was
not adding overwritePending & overwriteCommited in the add process. Therefor
over time as data was being cleaned up, it was just appending to the
Chris Hostetter wrote:
: > Essentially, I have two distinct needs - in some cases i need a commit
: > immediately after indexing one document, but most of the time it is fine
: > to wait 10 mins for changes if that has better performance.
:
: Sounds like a configuration issue... set autocommit to
On 4/27/07, realw5 <[EMAIL PROTECTED]> wrote:
Ok, I just finished indexing about 20k in documents. I took a look at so far
the problem has not appearred again. What I'm thinking caused it was I was
not adding overwritePending & overwriteCommited in the add process. Therefor
over time as data was
: It's likely you have the facet category added more than once for one
: or more docs. Like this;
:
: american standard
: american standard
:
: Are you adding the facet values on-the-fly? This happened to me and I
: solved it by removing the duplicate facet fields.
that's really odd ... i can't t
On 4/26/07, Michael Kimsal <[EMAIL PROTECTED]> wrote:
My colleague, after some digging, found in SolrQueryParser
(around line 62)
setLowercaseExpandedTerms(false);
The default for Lucene is true. Was this intentional? Or an oversight?
Way back before Solr was opensourced, and Chris
Hi everybody,
I just wanted to point out that there has been some discussion going on on
the solr-dev list about making a Solr 1.2 release in the near future.
There is no ETA on when this will happen, at this point it's mainly a
discussion of "what pending Jira patches should we ensure to include
Hi,
We experience some encoding probs with the unicode characters getting
out of solr.
Let me explain our flow:
-fetch a web page
-decode entities and unicode characters(such as $#149; ) using Neko
library
-get a unicode String in Java
-Sent it to SOLR through XML created by SAX, with the right
: writer, I notice a space at the end. I can fix this issue by triming all my
: values before sening them to solr :-)
The built in Field Faceting works on the indexed values, so Solr can solve
this for you if you use something like this for your facet field type...
In our experience, setting a LowercaseFilter in the query did not work; we had
to call setLowercaseExpandedTerms(true) to get wildcard queries to be
case-insensitive.
Here's our analyzer definition from our solr schema:
On 4/27/07, Michael Pelz Sherman <[EMAIL PROTECTED]> wrote:
In our experience, setting a LowercaseFilter in the query did not work; we had
to call setLowercaseExpandedTerms(true) to get wildcard queries to be
case-insensitive.
Correct, because in that case the QueryParser does not invoke anal
On 4/27/07, HUYLEBROECK Jeremy RD-ILAB-SSF
-In the query output from SOLR (XML message), the character is not
encoded as an entity (not •) but the character itself is used
(character 149=95 hexadecimal).
That's fine, as they are equivalent representations, and that
character is directly represe
: -fetch a web page
: -decode entities and unicode characters(such as $#149; ) using Neko
: library
: -get a unicode String in Java
: -Sent it to SOLR through XML created by SAX, with the right encoding
: (UTF-8) specified everywhere( writer, header etc...)
: -it apparently arrives clean on the SO
: when we do a search on a nonexisting field, we get a SolrException:
: undefined field
: (this was for query "nonfield:test")
:
: but when we use wildcards in our query, we dont get the undefined field
: exception,
: so the query "nonfield:*test" works fine ... just zero results...
:
: is this n
PLEASE REMOVE ME FROM THIS MAILING LIST!!!
Whoever manages this list, can you please remove me i have tried sending
emails to the unsubscribe email, but i just keep getting more emails. This
is really an issue for me... so your help would be great!
Thanks,
Paul
From: Chris Hostetter <
HMmm
mkdir+ln-s => solve my problem
thks hoss.
2007/4/28, Chris Hostetter <[EMAIL PROTECTED]>:
: i will use /usr/solrapp/conf/solr1_solrconfig.xml, solr2_solrconfig.xml,
: solr3_solrconfig.xml...and so.
:
: when i test these instance, i just stay in /usr/solrapp.conf/,,,not like
: now,
:
i used to think cache data with memcached.
why i think that?
for example, i have 45 solr instance, and i have to merge their results into
on array and sort by score or datetime, if i use rows=10, it means i will
get max 45*10 results,,but it only show 10result per page.
how to do with 440 resul
im stuck:
Have a facet, and field in an document called estimatedRepairs, it is
declared in the schema.xml as
I execute a query with the below parameters
q=state%3Avirgina;
&facet.query=estimatedRepairs:[*+TO+1000.0]
&facet.query=estimatedRepairs:[1000.0+TO+*]
&fac
I think performance u should notice.
when one instance do index,commit,optimize,,,it will cose much CPU,RAM.
i think it will disturb your search(when they use other solr instance)
unless u know the time they will search or search's frequency is low.
2007/4/28, Ryan McKinley <[EMAIL PROTECTED]>:
yes, i tried and failed.
afternoon i will redownload solr and test .
2007/4/28, Bill Au <[EMAIL PROTECTED]>:
Have you tried using the schema.xml that is in example/solr/conf. It that
works then the problem is definitely in your schema.xml.
Bill
On 4/26/07, James liu <[EMAIL PROTECTED]> wrot
i think curl_multi is slow.
thks, i will try.
2007/4/27, Michael Kimsal <[EMAIL PROTECTED]>:
The curl_multi is probably the most effective way, using straight PHP.
Another option would be to spawn several jobs, assuming unix/linux, and
wait
for them to get done. It doesn't give you very good
now i test the newest solr (nothing modified)
i failed to start solr with resin 3.0
2007/4/28, James liu <[EMAIL PROTECTED]>:
yes, i tried and failed.
afternoon i will redownload solr and test .
2007/4/28, Bill Au <[EMAIL PROTECTED]>:
>
> Have you tried using the schema.xml that is in exampl
I think the real-time-ness of this is the key. What's the current QPS? How
many in-flight queries do we have? What is the average or mean response time?
What's the response time for the 90% percentile? etc. Anyhow, not my current
itch, just trying to point out what Wunder is after.
Otis
64 matches
Mail list logo