Hi, I have two physical servers that run my application and Solr. I use an
ExternalFileField to do some search result ranking.
According to the wiki page, ExternalFileField data needs to reside in the
{solr}\data directory. The EFF data is generated by my application. How
can I push this file to
Is there any API that supports uploading a file for ExternalFileField to the /data/
directory, or any good practice for this?
My application and the Solr server are physically separated in two places.
The application calculates a score and generates a file for
ExternalFileField.
Thanks for any input.
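
I am not aware of a dedicated upload API for this file, so the usual pattern is to generate it on the application side and copy it over (scp, rsync, a shared mount). A minimal sketch of the generation step, assuming the external file is named external_<fieldname> and holds one key=value line per document; the field name rankingField, the function name and the sample scores are only illustrations:

```python
import os
import tempfile


def write_external_file(scores, data_dir, field_name="rankingField"):
    """Write scores as key=value lines to <data_dir>/external_<field_name>.

    field_name is a hypothetical default; use the name from your schema.
    """
    target = os.path.join(data_dir, "external_" + field_name)
    # Write to a temporary file in the same directory first, so a reader
    # never sees a half-written file.
    fd, tmp_path = tempfile.mkstemp(dir=data_dir, prefix=".eff_")
    with os.fdopen(fd, "w", encoding="utf-8") as out:
        for doc_id, score in sorted(scores.items()):
            out.write(f"{doc_id}={score}\n")
    # Atomic rename on the same filesystem.
    os.replace(tmp_path, target)
    return target


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as demo_dir:
        path = write_external_file({"doc1": 1.25, "doc2": 0.5}, demo_dir)
        print(open(path, encoding="utf-8").read())
```

The file then still has to be transferred to the Solr data directory, and Solr has to pick it up again (for example on a new searcher); both steps depend on your setup and are not shown here.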
I have many plain-text XML files that I convert to the Solr XML format.
But every time I send them to Solr, I hit an OOM exception.
How do I configure Solr to "eat" these big XML files?
Please point me in the right direction. Thanks
floyd
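
Besides server-side settings, one client-side option is to split a huge update file into smaller <add> messages before posting. A rough sketch, assuming the input is a single <add> document containing <doc> elements; the function name and batch size are made up for illustration, and the HTTP POST itself is left out:

```python
import io
import xml.etree.ElementTree as ET


def batched_add_messages(source, batch_size=1000):
    """Yield '<add>...</add>' strings with at most batch_size docs each.

    source is a filename or a binary file object with Solr update XML.
    """
    batch = []
    # iterparse streams the file instead of loading it all at once.
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == "doc":
            batch.append(ET.tostring(elem, encoding="unicode").strip())
            elem.clear()  # drop the doc's children to free memory
            if len(batch) == batch_size:
                yield "<add>" + "".join(batch) + "</add>"
                batch = []
    if batch:
        yield "<add>" + "".join(batch) + "</add>"


if __name__ == "__main__":
    sample = io.BytesIO(
        b'<add><doc><field name="id">1</field></doc>'
        b'<doc><field name="id">2</field></doc>'
        b'<doc><field name="id">3</field></doc></add>'
    )
    for message in batched_add_messages(sample, batch_size=2):
        print(len(message), "characters")
```

Each yielded message can then be posted separately, so neither the client nor Solr has to hold one giant request in memory.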
h to using DataImportHandler to pull from XML instead of pushing
> 5. Increase amount of memory to Solr (-X command line flags)
>
> Regards,
>Alex.
>
> Personal website: http://www.outerthoughts.com/
> Current project: http://www.solr-start.com/ - Accelerating your Solr
> pr
to Solr. Generally, I'd say letting a user hit Solr directly is
> a bad thing - especially a user who doesn't know the details of how Solr
> works.
>
> Upayavira
>
> On Mon, Mar 31, 2014, at 07:17 AM, Floyd Wu wrote:
> > Hi Alex,
> >
> > Thanks for your r
An IR system is usually measured using precision and recall.
But it depends on what kind of system you are developing and which scenario it has to fit.
Take a look at
http://en.wikipedia.org/wiki/Precision_and_recall
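
As a small illustration of the two measures, here is a sketch that computes them for one query from a list of retrieved document ids and a set of ids judged relevant. The ids and the function name are invented for the example:

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    precision = relevant retrieved / all retrieved
    recall    = relevant retrieved / all relevant
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    p, r = precision_recall(["d1", "d2", "d3", "d4"], ["d1", "d3", "d7"])
    print(f"precision={p:.2f} recall={r:.2f}")
```

In practice you would average these over a set of test queries with relevance judgments.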
2014-04-01 10:23 GMT+08:00 azhar2007 :
> Hi people. Ive developed a search engine to implement and i
Hi,
I have many XML message files formatted like this:
https://wiki.apache.org/solr/UpdateXmlMessages
These files are generated by my index builder daily.
Currently I am sending these files to Solr through HTTP POST, but sometimes I
hit an OOM exception or end up with too many pending tlogs.
Do you have better way t
rent project: http://www.solr-start.com/ - Accelerating your Solr
> proficiency
>
>
> On Sun, Jun 15, 2014 at 3:44 PM, Floyd Wu wrote:
> > Hi,
> > I have many XML Message file formatted like this
> > https://wiki.apache.org/solr/UpdateXmlMessages
> >
> > These fi
tting roughly ten huge files in
> parallel is a way to perform good. Once again, nuke tlog.
>
>
> On Sun, Jun 15, 2014 at 12:44 PM, Floyd Wu wrote:
>
> > Hi,
> > I have many XML Message file formatted like this
> > https://wiki.apache.org/solr/UpdateXmlMessage
so far. Submitting roughly ten huge files in
> > parallel is a way to perform good. Once again, nuke tlog.
> >
> >
> > On Sun, Jun 15, 2014 at 12:44 PM, Floyd Wu wrote:
> >
> >> Hi,
> >> I have many XML Message file formatted like this
> >>
Hi Shawn,
I've tried setting a 4GB heap for Solr; the OOM exceptions are really reduced
and performance has improved as well.
Floyd
2014-06-16 0:00 GMT+08:00 Shawn Heisey :
> On 6/15/2014 2:54 AM, Floyd Wu wrote:
> > Thank you Alex.
> > I'm doing a commit every 100 files.
Hi Mikhail
Thanks for your suggestions.
Floyd
2014-06-16 17:28 GMT+08:00 Mikhail Khludnev :
> On Mon, Jun 16, 2014 at 6:57 AM, Floyd Wu wrote:
>
> > Hi Mikhail,
> > What is the pros. to disable tlog?
> >
> I consumes the heap much providing the benefits (real-time
Will these awesome features be implemented in Solr soon?
On 2014/6/20 at 10:43 PM, "Yonik Seeley" wrote:
> On Fri, Jun 20, 2014 at 10:15 AM, Yago Riveiro
> wrote:
> > Yonik,
> >
> > This native code uses in any way the docValues?
>
> Nope... not yet. It is something I think we should look into in the
Hi Yonik, I don't understand the relationship between Solr and Heliosearch,
since you were a committer of Solr.
I'm just curious.
On 2014/6/21 at 12:07 AM, "Yonik Seeley" wrote:
> On Fri, Jun 20, 2014 at 11:16 AM, Floyd Wu wrote:
> > Will these awesome features being implemented in
After re-indexing 2 XML files and doing commit and optimize many times, I
still have many tlog files in the data/tlog directory.
Why?
How do I remove those files (delete them directly, or just ignore them)?
What difference does it make whether the tlog files exist or not?
Please kindly guide me.
Thanks
Floyd
mmit-in-sorlcloud/
>
> If that doesn't help, what version of Solr? How
> big are you tlog files? Details matter.
>
> Best,
> Erick
>
>
> On Sun, Nov 3, 2013 at 10:03 AM, Floyd Wu wrote:
>
> > After re-index 2 xml files and done commit, optimization many t
Hi there,
I don't fully understand what kind of use cases DocValues can be
used for.
When I set docValues=true on a field, do I need to change anything in the XML that I
send to Solr for indexing?
Please point me in the right direction.
Thanks
Floyd
PS: I've googled and read lots of DocValues discussions but am still confused.
need to add the tag in
> the solrconfig.xml and the docValuesFormat=true on the fieldType definition.
>
> --
> Yago Riveiro
> Sent with Sparrow (http://www.sparrowmailapp.com/?sig)
>
>
> On Wednesday, November 20, 2013 at 9:38 AM, Floyd Wu wrote:
>
> > Hi the
ttp://searchhub.org/2013/04/02/fun-with-docvalues-in-solr-4-2/
>
> --
> Yago Riveiro
> Sent with Sparrow (http://www.sparrowmailapp.com/?sig)
>
>
> On Wednesday, November 20, 2013 at 10:15 AM, Floyd Wu wrote:
>
> > Hi Yago
> >
> > Thanks for you reply. I
Hi there,
I've built a SolrCloud cluster from the example, but I have some questions.
When I send a query to one leader (say
http://xxx.xxx.xxx.xxx:8983/solr/collection1), there is no problem and everything
works fine.
When I shut down that leader, the other replica(
http://xxx.xxx.xxx.xxx:9983/solr/collection1)
hat has gone away.
>
> Also, ZK aware SolrJ Java client that load-balances across all nodes in
> cluster.
>
>
> On Wed, Jul 10, 2013 at 2:52 PM, Floyd Wu wrote:
>
> > Hi there,
> >
> > I've built a SolrCloud cluster from example, but I have some questi
ct any query specifically to a leader. It
> will
> > automatically be routed to the right leader.
> > You may put a load balancer on top to just fix the problem with querying
> a
> > node that has gone away.
> >
> > Also, ZK aware SolrJ Java client that load-balan
loke
>
>
> On Wed, Jul 10, 2013 at 4:11 PM, Furkan KAMACI >wrote:
>
> > By the this is not related to your question but this may help you for
> > connecting Solr via C#: http://solrsharp.codeplex.com/
> >
> > 2013/7/10 Floyd Wu
> >
> > > Hi Furka
When using StandardAnalyzer to tokenize the string "Pacific_Rim", I get a single token:
ST
text         raw_bytes                           start  end  type  position
pacific_rim  [70 61 63 69 66 69 63 5f 72 69 6d]  0      11         1
How can I make this string be tokenized into the two tokens "Pacific" and
"Rim"?
Should I set _ as a stopword?
Please kindly help with this.
Many thanks.
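
Setting _ as a stopword would not help, because stop filters remove whole tokens after tokenization and here the token is pacific_rim, not _. What is needed is a split on the underscore inside the analysis chain (in Solr that is typically done with a word-delimiter style filter or by replacing the character before tokenizing). The sketch below only illustrates the target behaviour in Python; it is not Solr code, and the function name is invented:

```python
import re


def split_on_delimiters(text):
    """Lowercase and split on any run of non-alphanumeric characters."""
    return [tok.lower() for tok in re.split(r"[^0-9A-Za-z]+", text) if tok]


if __name__ == "__main__":
    print(split_on_delimiters("Pacific_Rim"))
```

The same splitting has to be applied at both index time and query time, otherwise a query for "rim" will not match what was indexed.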
F
: Re: How to avoid underscore sign indexing problem?
>
>
> On 8/21/2013 7:54 PM, Floyd Wu wrote:
>
>> When using StandardAnalyzer to tokenize string "Pacific_Rim" will get
>>
>> ST
>> textraw_**bytesstartendtypeposition
>> pacific_rim[70 61 63 69 6
Floyd
2013/8/22 Floyd Wu
> Thank you all.
> By the way, Jack, I'm going to buy your book. Where can I buy it?
> Floyd
>
>
> 2013/8/22 Jack Krupansky
>
>> "I thought that the StandardTokenizer always split on punctuation, "
>>
>> Proving that you haven'
. Although this decreases search quality a little,
users need higher recall than precision.
Thank you all.
Floyd
2013/8/22 Floyd Wu
> After trying some search case and different params combination of
> WordDelimeter. I wonder what is the best strategy to index string
> "2DA
Can anybody point me in a direction?
Many thanks.
2013/3/20 Floyd Wu
> Hi everyone,
>
> I have a problem and have no luck to figure out.
>
> When I issue a query to
> Query 1
>
> http://localhost:8983/solr/select?q={!boost+b=recip(ms(NOW/HOUR,last_modified_datetime),3.
Hi Michael, how do I configure the PostingsHighlighter on my Solr 4.2 box?
Please kindly point me in the right direction. Many thanks.
On 2013/6/15 at 10:48 PM, "Michael McCandless" wrote:
> You could also try the new[ish] PostingsHighlighter:
>
> http://blog.mikemccandless.com/2012/12/a-new-lucene-highlighter-is-born.html
>
> Mike M
In my test case, it seems this new highlighter is not working.
When a field is set to multivalue=true, the stored text in this field cannot be
highlighted.
Am I missing something? Or is this a current limitation? I have had no luck finding
any documentation that mentions this.
Floyd
ultiValued (not capital 'V'). This probably isn't the
> problem, but just in case.
>
> Anything in the logs? What is the field definition?
> Did you re-index after changing to multiValued?
>
> Best
> Erick
>
> On Tue, Jun 18, 2013 at 11:01 PM, Floyd Wu w
Any ideas that could help with this?
2013/6/22 Erick Erickson
> Unfortunately, from here I need to leave it to people who know
> the highlighting code
>
> Erick
>
> On Wed, Jun 19, 2013 at 8:40 PM, Floyd Wu wrote:
> > Hi Erick,
> >
> > "multival
Hi there,
There are many English soundex implementations that can be referenced, but I
wonder how to do a Chinese soundex (sounds-like) filter.
Any ideas?
Floyd
ave to be storing the phonetic forms for all of the words.
>
> -- Ken
>
>
>>> From: Floyd Wu
>>> To: solr-user@lucene.apache.org
>>> Sent: Thursday, October 20, 2011 5:43 AM
>>> Subject: Does anybody has experience in Chinese soundex(sounds like) of
Does anybody know how to implement this idea in Solr? Please kindly
point me in a direction.
For example, a user wants to enter the Chinese keyword "貝多芬" (this is
Beethoven in Chinese)
but keys in a wrong combination of characters, "背多分" (which is
pronounced the same as the previous keyword "貝多芬").
There
's pinyin is chang in 长沙,you should segment query and
> words/phrases first. word segmentation is a basic problem is Chinese IR
>
>
> 2011/10/21 Floyd Wu
>
>> Does anybody know how to implement this idea in SOLR. Please kindly
>> point me a direction.
>>
>
Hi Jason,
I'm very curious about how you build (or rebuild) such a big index efficiently.
Sorry for hijacking this topic.
Floyd
2011/11/1 Jason Biggin :
> Wondering if anyone has experience with replicating large indexes. We have a
> Solr deployment with 1 master, 1 master/slave and 5 slaves. Our
Hi there,
Is it possible to separate the ACL index from the document index and still
search by user role in Solr?
Currently my implementation indexes the ACL with the document, but the
document itself changes frequently. I have to rebuild the index
every time when ACL change. It's heavy for whole syst
ument ACL will need to
re-build index "with document content".
It makes no sense to rebuild when I only change the ACL.
Any ideas? Or am I just misunderstanding these patches?
Floyd
2011/11/23 Floyd Wu :
> Hi there,
>
> Is it possible to separate ACL index and document index an
f user has
> granular ACL over may documents or groups. I've seen it work ok with
> up to 1000 or so ACLs per user query. So you build that filter query
> from the client using some external database to lookup user ACLs
> before sending request to SOLR.
>
> Bob
>
>
Hi there
I have a problem. When I issue a query to a single instance,
Solr responds with XML like the following.
As you can see, the score is normal()
===
0
23
_l_title,score
0
_l_unique_key:12
*
true
999
1.9808292
GTest
12
===
But wh
Can anybody help with this?
Many thanks
2010/9/29 Floyd Wu
> Hi there
>
> I have a problem, the situation is when I issue a query to single instance,
> Solr response XML like following
> as you can see, the score is normal()
> ===
>
>
> 0
Hi Chris
Thanks. But do you have any suggestion or work-around to deal with it?
Floyd
2010/10/2 Chris Hostetter
>
> : But when I issue the query with shard(two instances), the response XML
> will
> : be like following.
> : as you can see, that score has bee tranfer to a element of
>..
Hi there,
What is the difference between the Lucid distribution of Solr and the Apache
distribution?
And can I use the Lucid distribution for free in my commercial project?
Hi there,
If I don't need MoreLikeThis, spellcheck, or highlighting,
can I remove those configuration sections in solrconfig.xml?
In other words, does Solr load and use these SearchComponents on startup and
during runtime?
Will removing this configuration speed up queries or not?
Thanks
I tried this work-around, but it does not seem to work for me.
I still get an array of scores in the response.
I have two physical servers, A and B:
localhost --> A
test --> B
I issue a query to A like this:
http://localhost:8983/solr/core0/select?shards=test:8983/solr,localhost:8983/solr/core0&indent=on&version=2.
irmed this situation is
not coincidence.
2010/10/20 Floyd Wu
> I tried this work-around, but seems not work for me.
> I still get array of score in the response.
>
> I have two physical server A and B
>
> localhost --> A
> test -->B
>
> I issue query to A li
Hi there,
I have a field (an ExternalFileField called rankingField) whose
value (type=float) is calculated by the client app.
In Solr's original scoring model, changing the boost value results in a
different ranking. So I think product(score,rankingField) may be equivalent to
the Solr scoring model.
W
ere is a tie in ranking (two docs
> have the same rank value)
> 1. the reverse of 2.
>
> Otis
> --
> Performance Monitoring - http://sematext.com/spm/index.html
> Search Analytics - http://sematext.com/search-analytics/index.html
>
>
>
>
> On Mon, Nov 19, 2012 at 9
0.0
0.0
0.0
0.0
6.0
3.0
0.0
0.0
0.0
0.0
3.0
2012/11/20 Otis Gospodnetic
> Hi Floyd,
>
> Use &debugQuery=true and let's see it.:)
>
> Otis
> --
> Performance Monitoring - http://sematext.com/spm/index.html
> Search Analytics - http://se
s is being used? (remove it, put some junk
> in it, see if that gives an error)
>
> Otis
> --
> Performance Monitoring - http://sematext.com/spm/index.html
> Search Analytics - http://sematext.com/search-analytics/index.html
>
>
>
>
> On Mon, Nov 19, 2012 at 10:16
> --
> Performance Monitoring - http://sematext.com/spm/index.html
> Search Analytics - http://sematext.com/search-analytics/index.html
>
>
>
>
> On Mon, Nov 19, 2012 at 9:29 PM, Floyd Wu wrote:
>
> > Hi there,
> >
> > Before ExternalFielField introduced,
Hi Chris,
Thanks! Before your great suggestions, I gave up using a function query to
calculate the product of score and rankingField, and I am using exactly the same
approach as your boost query solution. Of course it works fine. The next step will be to
design suitable function to output a ranking value that also cons
roduct(query($q),2) desc,score
> desc&fl=score,_score_:product(query($q),2),[explain]
>
> Cheers,
> Dan
>
> On Tue, Nov 20, 2012 at 2:29 AM, Floyd Wu wrote:
>
> > Hi there,
> >
> > Before ExternalFielField introduced, change document boost value to
>
lities of Lucene and Solr.
>
> Upayavira
>
> On Wed, Nov 28, 2012, at 10:15 AM, Floyd Wu wrote:
> > Hi there,
> >
> > If I have a list that is key-value pair in text filed or database table.
> > How do I achieve dynamic ranking based on search term? That say when
Hi there,
I'm not sure if I understand this clearly.
With 'bf', is the value returned by bf added to the score to get the final score?
For example -> score + bf = final score
With 'boost', is the score multiplied by the value returned by boost?
For example -> score * boost = final score
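
To make the additive versus multiplicative distinction concrete, here is a toy calculation. It ignores query normalization and everything else eDismax does, and the document names, scores and function values are invented purely for illustration:

```python
# Hypothetical base relevance scores and function values for two documents.
docs = {
    "A": {"score": 4.0, "fn": 0.5},
    "B": {"score": 1.0, "fn": 3.0},
}


def rank(combine):
    """Return document names ordered by the combined score, best first."""
    return sorted(
        docs,
        key=lambda name: combine(docs[name]["score"], docs[name]["fn"]),
        reverse=True,
    )


# 'bf' style: the function value is added to the score.
additive_order = rank(lambda score, fn: score + fn)
# 'boost' style: the score is multiplied by the function value.
multiplicative_order = rank(lambda score, fn: score * fn)

if __name__ == "__main__":
    print("additive:", additive_order)
    print("multiplicative:", multiplicative_order)
```

With these numbers the two approaches rank the documents in opposite order, which is why the choice between them matters when the function value should scale with relevance rather than compete with it.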
When using both( 'bf' and
-- Jack Krupansky
>
> -Original Message- From: Floyd Wu
> Sent: Monday, December 03, 2012 11:00 PM
> To: solr-user@lucene.apache.org
> Subject: Difference between 'bf' and 'boost' when using eDismax
>
>
> Hi there,
>
> I'm not sure if I under
y cache.
>
> Otis
> --
> SOLR Performance Monitoring - http://sematext.com/spm/index.html
>
>
>
>
>
> On Mon, Dec 10, 2012 at 10:11 PM, Floyd Wu wrote:
>
> > Hi There,
> > Sorry for sapmming if this question had already asked.
> >
> > Wha'
Hi there,
I have a document whose title is "20111213_solr_apache conference report".
When I use the analysis web interface to see exactly which tokens Solr produces,
the following is the result:
term text | 20111213_solr | apache | conference | report
term type |
Why 20111213_solr tokenized as and "_" char w
Hi,
As far as I know, using fl=*,score means we get all fields plus the score in the
returned search results. And if a field is stored, all its text will be returned as
part of the result.
Now I have 2x fields; some of the field names have no prefix or fixed naming
rule, so their names cannot be predicted.
I
:: http://search-lucene.com/
>
>
>
> ----- Original Message
> > From: Floyd Wu
> > To: solr-user@lucene.apache.org
> > Sent: Wed, April 13, 2011 2:34:49 PM
> > Subject: how to get lots fields this way?
> >
> > Hi,
> >
> > As I know w
If this is an edit distance implementation, what is the result when it is applied to a CJK
query? For example, "您好"~3
Floyd
2011/6/30 entdeveloper
> I'm using Solr trunk.
>
> If it's levenstein/edit distance, that's great, that's what I want. It just
> didn't seem to be officially documented anywhere so I wante
Hi all,
I need to make a date faceted query, and I tried to use facet.range but can't
get the result I need.
I want to make 4 facets like the following:
1 month, 3 months, 6 months, more than 1 year
The onlinedate field in schema.xml looks like this
I query Solr with this URL:
http://localhost:8983/solr/select
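
Because the four buckets are not evenly spaced, one option is to use one facet.query per bucket instead of facet.range. The sketch below builds such a query string, assuming the field is called onlinedate as above and that "more than 1 year" means older than one year; the q=*:* and rows=0 parameters and the function name are just placeholders:

```python
from urllib.parse import parse_qs, urlencode


def date_bucket_params(field="onlinedate"):
    """Build a query string with one facet.query per date bucket."""
    buckets = [
        f"{field}:[NOW-1MONTH TO NOW]",   # last month
        f"{field}:[NOW-3MONTHS TO NOW]",  # last 3 months
        f"{field}:[NOW-6MONTHS TO NOW]",  # last 6 months
        f"{field}:[* TO NOW-1YEAR]",      # older than 1 year
    ]
    params = [("q", "*:*"), ("rows", "0"), ("facet", "true")]
    params += [("facet.query", bucket) for bucket in buckets]
    return urlencode(params)


if __name__ == "__main__":
    print("http://localhost:8983/solr/select?" + date_bucket_params())
```

Each facet.query comes back with its own count in the facet_queries section of the response, so the buckets may overlap without any problem.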
> Tomás
> On Tue, Jul 26, 2011 at 1:23 AM, Floyd Wu wrote:
>
> > Hi all,
> >
> > I need to make date faceted query and I tried to use facet.range but
> can't
> > get result I need.
> >
> > I want to make 4 facet like following.
> >
> >
Hi Alex, I'd like to know how to go about "using Client and Server Certificates to
protect
the connection and embedding those certificates into clients".
Please kindly share your experience.
Floyd
2012/11/8 Alexandre Rafalovitch
> It is very easy to do this on Apache, but you need to be aware that
> U
Hi there,
Can anybody kindly tell me how to set up Solr to use BM25?
By the way, are there any experiments or research comparing BM25 and the classical
VSM model in terms of recall/precision?
Thanks in advance.
n implementations
> that have tested the new ranking models available in Solr.
>
> Tom
>
>
>
> On Wed, Nov 14, 2012 at 9:16 PM, Floyd Wu wrote:
>
> > Hi there,
> > Does anybody can kindly tell me how to setup solr to use BM25?
> > By the way, are there any experiment or research shows BM25 and classical
> > VSM model comparison in recall/precision rate?
> >
> > Thanks in advanced.
> >
>