Hello,
"Recently someone mentioned that it would be possible to have a 'replace
existing document' feature rather than just dropping and adding documents
with the same unique id."
AFAIK, this is not possible. You have the update in lucene, but internally it
just does a delete/add operation
"We
A big update was just posted to the Solrsharp project. This update now
provides for first-class support for highlighting in the library.
The implementation is really robust and provides the following features:
- Structured highlight parameter assignment based on the SolrField
object
- F
Hi
Is there any way to define the number/type of fields of the documents
returned in the "moreLikeThis" part of the response, when "mlt" is
set to true?
Currently I'm using morelikethis to show the number and sources of
similar documents - therefore I'd need only the "source" field of
th
On Aug 21, 2007, at 9:25 PM, Lance Norskog wrote:
Recently someone mentioned that it would be possible to have a
'replace
existing document' feature rather than just dropping and adding
documents
with the same unique id.
There is such a patch: https://issues.apache.org/jira/browse/SOLR-13
Hello,
Sorry for the stupid question. I'm trying to index an html file as one of
the fields in Solr. I've set up an appropriate analyzer in the schema but I'm
not sure how to add html content to Solr. Encapsulating HTML content
within a field tag is obviously not valid. How do I add html content?
Hope the query
You need to encode your html content so it can be included as a normal
'string' value in your xml element.
As far as I remember, the only unsafe characters you have to encode as
entities are:
< -> &lt;
> -> &gt;
" -> &quot;
& -> &amp;
(google xml entities to be sure).
I don't know what language you use, but fo
Thanks Jérôme!
It seems to work now. I just hope the provided
HTMLStripWhitespaceTokenizerFactory will strip the right tags now.
I use Java and used HtmlEncoder provided in
http://itext.ugent.be/library/api/ for encoding with success. (just
in case someone happens to search this thread)
Ravi
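For anyone searching this thread later, here is a minimal sketch of the entity encoding discussed above, written in Python using only the standard library. The field names "id" and "content" and the helper name solr_add_xml are assumptions made for illustration; use whatever fields your schema defines.

```python
from xml.sax.saxutils import escape


def solr_add_xml(doc_id: str, html: str) -> str:
    """Build an <add> message with the HTML escaped as plain field text.

    The field names "id" and "content" are hypothetical placeholders.
    """
    # escape() encodes &, < and >; the extra mapping also covers double quotes.
    body = escape(html, {'"': "&quot;"})
    return (
        "<add><doc>"
        f'<field name="id">{escape(doc_id)}</field>'
        f'<field name="content">{body}</field>'
        "</doc></add>"
    )


if __name__ == "__main__":
    print(solr_add_xml("1", '<p class="x">a & b</p>'))
```

The ampersand is encoded first, so entities produced for the other characters are not escaped a second time.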
Hello,
Currently in our application we are using the master/slave setup and
have a batch update/commit about every 5 minutes.
There are a couple queries that we would like to run almost realtime so
I would like to have it so our client sends an update on every new
document and then have solr
I understand - thanks, Yonik.
I notice that LuceneQueryOptimizer is still used in
SolrIndexSearcher.search(Query, Filter, Sort) - is the idea then that this
method is deprecated, or that the config parameter
query/boolTofilterOptimizer is no longer to be used? As for the other
search() methods, t
On 8/22/07, Jonathan Woods <[EMAIL PROTECTED]> wrote:
> I notice that LuceneQueryOptimizer is still used in
> SolrIndexSearcher.search(Query, Filter, Sort) - is the idea then that this
> method is deprecated,
Hmmm, so it is. I hadn't noticed because that method is not called
from any query handle
Not high priority, but a few thoughts occur, then:
- perhaps it would be better to use org.apache.lucene.search.Searcher by
composition and have SolrIndexSearcher merely implement Searchable.
- or... perhaps search(...) should perform optimally cache-aware searches -
else integrators might wrongl
Hello, I was thinking that solr - with its built in faceting - would make for a
great apache log file storage system. I was wondering if anyone knows of any
module or library for apache to write log files directly to solr or to a lucene
index?
Thanks
Andrew
Sorry for revisiting this 3 weeks old thread.
I downloaded the nightly yesterday.
I noticed that some classes have API docs (.html) but no source code
(.java).
For example, there is a javadoc for
org.apache.solr.client.solrj.util.ClientUtils
but no ClientUtils.java:
bash-3.00$ find . -type f | gre
Recently I ran into this topic. I googled it a little and didn't find much
information.
It would be great to have solr working with RAMDirectory and Terracotta. We
could stop using crons for rsync, right?
Has anyone tried that out?
Are the score values generated in Solr relative to the index or are they
against an absolute standard?
Is it possible to create a scoring algorithm with this property? Are there
parts of the score inputs that are absolute?
My use case is this: I would like to do a parallel search against two Solr
Indexes cannot be directly compared unless they have similar collection
statistics. That is the same terms occur with the same frequency
across all indexes and the average document lengths are about the same
(though the default similarity in Lucene may not care about average
document length--I
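A small sketch of why collection statistics matter here. It assumes the inverse document frequency formula used by Lucene's default similarity at the time, idf = ln(numDocs / (docFreq + 1)) + 1; check this against the Similarity class in your Lucene version. The index sizes below are made-up numbers.

```python
import math


def idf(doc_freq: int, num_docs: int) -> float:
    """Inverse document frequency, assuming idf = ln(numDocs / (docFreq + 1)) + 1."""
    return math.log(num_docs / (doc_freq + 1)) + 1.0


if __name__ == "__main__":
    # The same term, matching the same number of documents, in two
    # hypothetical indexes of different size.
    small = idf(doc_freq=9, num_docs=1_000)
    large = idf(doc_freq=9, num_docs=100_000)
    print(f"small index idf={small:.3f}, large index idf={large:.3f}")
```

Under this assumption the term weighs more in the larger index, so a raw score from one index is not on the same scale as a raw score from the other.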
tried it, didn't work that well...so I ended up making my own little faceted
Search engine directly using RAMDirectory and clustering it via
Terracotta...not as good as SOLR(smile), but it worked.
i actually posted some questions awhile back in trying to get it to work. so
terracotta can "hook"
How come it didn't work? How did you add RAMDir support to solr?
On 8/22/07, Jeryl Cook <[EMAIL PROTECTED]> wrote:
>
> tried it, didn't work that well...so I ended up making my own little
> faceted Search engine directly using RAMDirectory and clustering it via
> Terracotta...not as good as SOLR(s
Hi All,
This is the scenario, I have two search SOLR instances running on two
different partitions, I am treating one of the servers strictly
read-only (for search) (search server) and the other
Instance (index server) for indexing. The index file data directory
reside on a NFS partition, I am
Hello,
in my custom request handler, I want to determine which fields are
constrained by the user.
E.g. the query (q) might be "ipod AND brand:apple" and there might
be a filter query (fq) like "color:white" (or more).
What I want to know is that "brand" and "color" are constrained.
AFAICS I co
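One rough way to get at the constrained field names is to scan the raw q and fq strings for "field:" prefixes. The sketch below is in Python and is only a naive text scan, not Solr's or Lucene's query parser: it ignores the default search field, escaped colons and local syntax extensions. The function name constrained_fields is made up for illustration.

```python
import re

# Quoted phrases are blanked out first so a colon inside a phrase is ignored.
_QUOTED = re.compile(r'"[^"]*"')
# A field name starts with a letter or underscore and is directly followed by ':'.
_FIELD = re.compile(r'(?<![\w\\])([A-Za-z_]\w*):')


def constrained_fields(q, fqs=()):
    """Return field names used as 'field:' prefixes, in first-seen order."""
    seen = []
    for clause in (q, *fqs):
        unquoted = _QUOTED.sub('""', clause)
        for name in _FIELD.findall(unquoted):
            if name not in seen:
                seen.append(name)
    return seen


if __name__ == "__main__":
    print(constrained_fields("ipod AND brand:apple", ["color:white"]))
```

Inside a request handler, walking the parsed Query objects would be more reliable than scanning text; this only illustrates the idea.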
Jeryl,
I remember you asking about how to hook in the RAMDirectory a while back.
It seemed like there was maybe some support within Solr that you needed. I
assume you're suggesting adding an issue in the Solr JIRA, right?
Is there something that the Terracotta team can do to help?
Cheers,
Or
Hello!
I was wondering if anyone has written a script that displays any
stats from SOLR.. queries per second, number of docs added.. this
sort of thing.
Sort of a general dashboard for SOLR.
I'd rather not write it myself if I don't need to, and I didn't see
anything conclusive in the ar
Hi Stefan,
Currently there is no way to specify the list of fields to be returned by
the MoreLikeThis handler. I've been looking to address this issue in
https://issues.apache.org/jira/browse/SOLR-295 (point 3) however in the
broader scheme of things, it seems logical to wait until
https://issues
Matthew,
Maybe the SOLR Statistics page would suit your purpose?
(click on "statistics" from the main solr page or use the following url)
http://localhost:8983/solr/admin/stats.jsp
cheers,
Piete
On 23/08/07, Matthew Runo <[EMAIL PROTECTED]> wrote:
>
> Hello!
>
> I was wondering if anyone has w
At Infoseek, we ran a separate search index with today's updates
and merged that in once each day. It requires a little bit of
federated search to prefer the new content over the big index,
but the daily index can be very nimble for update.
wunder
On 8/22/07 7:58 AM, "mike topper" <[EMAIL PROTECT
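The "prefer the new content" step described above could look roughly like the following Python sketch. It is not Infoseek's implementation; the hit structure (dicts with an "id" key) and the function name merge_hits are assumptions made for illustration.

```python
def merge_hits(daily_hits, main_hits, rows=10):
    """Merge two result lists, letting the daily index win on duplicate ids.

    Hits from the small, frequently updated index come first; hits from the
    big index are kept only if the daily index did not return the same id.
    """
    fresh_ids = {hit["id"] for hit in daily_hits}
    merged = list(daily_hits)
    merged += [hit for hit in main_hits if hit["id"] not in fresh_ids]
    return merged[:rows]


if __name__ == "__main__":
    daily = [{"id": "a", "title": "updated today"}]
    main = [{"id": "a", "title": "stale copy"}, {"id": "b", "title": "older doc"}]
    for hit in merge_hits(daily, main):
        print(hit["id"], hit["title"])
```

The lists are concatenated rather than re-sorted by score, since scores from two indexes with different statistics are not directly comparable.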
If I am not wrong once you have the RAMDir feature mounting Terracotta
should be transparent and fast, right?
On 8/22/07, Orion Letizi <[EMAIL PROTECTED]> wrote:
>
>
> Jeryl,
>
> I remember you asking about how to hook in the RAMDirectory a while back.
> It seemed like there was maybe some support
: I noticed that some classes have API docs (.html) but no source code
: (.java).
: For example, there is a javadoc for
: org.apache.solr.client.solrj.util.ClientUtils
: but no ClientUtils.java:
i believe the issue is that none of the source from the client
directory is included in the builds at
:
: There are a couple queries that we would like to run almost realtime so
: I would like to have it so our client sends an update on every new
: document and then have solr configured to do an autocommit every 5-10
: seconds.
:
: reading the Wiki, it seems like this isn't possible because of the
: 3) I had to bounce the tomcat search SOLR Webapp instance for it to
: read the index files, is it mandatory? In a distributed environment, do
: we always have to
:
: Bounce the SOLR Webapp instances to reflect the changes in the index
: files?
it sounds like you essentially have a master/sl
: in my custom request handler, I want to determine which fields are
: constrained by the user.
:
: E.g. the query (q) might be "ipod AND brand:apple" and there might
: be a filter query (fq) like "color:white" (or more).
:
: What I want to know is that "brand" and "color" are constrained.
techni
: aren't expandable at query time. It would be quite cool if Solr could do
: query-time expansions of dynamic fields (e.g. hl.fl=page_*) however that
: would require some knowledge of the dynamic fields already stored in the
: index, which I don't think is currently available in either Solr or Lu
Hello,
i am using faceting in a project and would like to do date faceting with
facet.date. That works fine, but as well returns dates which have no
resulting pages underneath, i.e. the facet count equals 0. Is it possible to
constrain this just to dates for which results exist similar to
facet.m
Note that Hoss was earlier calling for someone to submit an implementation
of SolrDirectoryFactory...
http://www.nabble.com/forum/ViewPost.jtp?post=12260989&framed=y
Jon
> -Original Message-
> From: Jonathan Ariel [mailto:[EMAIL PROTECTED]
> Sent: 23 August 2007 03:23
> To: solr-user@lu