Thanks for your patience and helps.
Recently, I acquired a batch of commits?? SHA data of Lucene, of which the
time span is from 2010 to 2015. In order to get original info, I tried to use
these SHA data to track commits. First, I cloned Lucene repository to my local
host, using the cmd gi
Thanks for your patience and helps.
Recently, I acquired a batch of commits?? SHA data of Lucene, of which the
time span is from 2010 to 2015. In order to get original info, I tried to use
these SHA data to track commits. First, I cloned Lucene repository to my local
host, using the cmd gi
Dear Shawn and Chris,
Thanks very much for your replies and helps.
And so sorry for my mistakes of first-time use of Mailing Lists.
On 11/9/2017 5:13 PM, Shawn wrote:
> Where did this information originate?
My SHA data come from the paper On the Naturalness of Buggy Code(Baishakhi Ray,
et al. IC
every table, I should be able to map the
column I choose from the table to solr Id, any solution will be greatly
appreciated.
`Tom
--
View this message in context:
http://lucene.472066.n3.nabble.com/dataimport-handler-tp4112830.html
Sent from the Solr - User mailing list archive at Nabble.com.
-29] DEBUG org.apache.solr.handler.dataimport.LogTransformer
û The name is
`Tom
--
View this message in context:
http://lucene.472066.n3.nabble.com/TemplateTransformer-returns-null-values-tp4114539.html
Sent from the Solr - User mailing list archive at Nabble.com.
Thanks Alexandre for quick response,
I tried both the ways but still no luck null values, anything I am doing
fundamentally wrong?
query="select DOC_IDN, BILL_IDN from document_fact" >
and
query="select DOC_IDN as id ,BILL_IDN as bill_id from document_fact" >
Hi,
I'm new to Solr, and I'm having a problem with faceting. I would really
appreciate it if you could help :)
I have a set of documents in JSON format, which I could post to my Solr
core using the post.jar tool. Each document contains two fields, namely
"startDate" and "endDate", both of which a
; Best,
> Erick
>
> On Mon, Nov 3, 2014 at 8:18 PM, Yubing (Tom) Dong 董玉冰
> wrote:
> > Hi,
> >
> > I'm new to Solr, and I'm having a problem with faceting. I would really
> > appreciate it if you could help :)
> >
> > I have a set of document
io of writing to the index to searching is usually
> hugely in favor of doing the work during indexing.
>
> Best,
> Erick
>
> On Mon, Nov 3, 2014 at 8:52 PM, Yubing (Tom) Dong 董玉冰
> wrote:
> > Hi Erik,
> >
> > Thanks for the reply! Do you mean parse and
Turns out that update processors perfectly suit me needs. I ended up using
the StatelessScriptUpdateProcessor with a simple js script :-)
On Mon Nov 03 2014 at 下午10:40:52 Yubing (Tom) Dong 董玉冰 <
tom.tung@gmail.com> wrote:
> I see. Thank you! :-)
>
> Sent from my Android p
hi folks,
i think i found a bug in the spellchecker but am not quite sure:
this is the query i send to solr:
http://lh:8983/solr/CompleteIndex/select?
&rows=0
&echoParams=all
&spellcheck=true
&spellcheck.onlyMorePopular=true
&spellcheck.extendedResults=no
&q=a+bb+ccc++
and this is the resul
same
On 22.03.2012 10:00, Markus Jelsma wrote:
Can you try spellcheck.q ?
On Thu, 22 Mar 2012 09:57:19 +0100, tom wrote:
hi folks,
i think i found a bug in the spellchecker but am not quite sure:
this is the query i send to solr:
http://lh:8983/solr/CompleteIndex/select?
&ro
so any one has a clue what's (might be) going wrong ?
or do i have to debug and myself and post a jira issue?
PS: unfortunately i cant give anyone the index for testing due to NDA.
cheers
On 22.03.2012 10:17, tom wrote:
same
On 22.03.2012 10:00, Markus Jelsma wrote:
Can yo
runs in a jetty.
- the embedded codes dont expose any of the solr servlets
note: that the slave config, if started in jetty, does proper
replication, while when embedded it doesnt.
using solr 3.5
thx
tom
ok tested it myself and a slave runnning embedded works, just not within
my application -- yet...
On 20.06.2012 18:14, tom wrote:
hi,
i was just wondering if i need to do smth special if i want to have an
embedded slave to get replication working ?
my setup is like so:
- in my clustered
hi,
i'm using the suggester with a file like so:
suggest
name="classname">org.apache.solr.spelling.suggest.Suggester
name="lookupImpl">org.apache.solr.spelling.suggest.fst.FSTLookup
content
0.05
true
100
autocomplete.d
BTW: a core unload doesnt release the lock either ;(
On 21.06.2012 14:39, tom wrote:
hi,
i'm using the suggester with a file like so:
suggest
name="classname">org.apache.solr.spelling.suggest.Suggester
name="lookupImpl">org.apache.solr
good reason for this or rather a bug?
should i move the topic to the dev list?
On 21.06.2012 14:49, tom wrote:
BTW: a core unload doesnt release the lock either ;(
On 21.06.2012 14:39, tom wrote:
hi,
i'm using the suggester with a file like so:
suggest
name=
il.com>> wrote:
On Fri, Jun 22, 2012 at 10:37 AM, tom mailto:dev.tom.men...@gmx.net>> wrote:
cross posting this issue to the dev list in the hope to get
a response here...
I think you are right. Closing the Stream / Reader is the
responsibi
If you are using another app to create the index, I think you can remove the
update servlet mapping in the web.xml.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Restrict-access-to-localhost-tp2004475p2014129.html
Sent from the Solr - User mailing list archive at Nabble.com
Is it a good rule of thumb, that when dealing with large indexes copyField
should not be used. It seems to duplicate the indexing of data.
You don't need copyField to be able to search on multiple fields. Example,
if I have two fields: title and post and I want to search on both, I could
just qu
Thanks Erick
--
View this message in context:
http://lucene.472066.n3.nabble.com/copyField-for-big-indexes-tp3275712p3275816.html
Sent from the Solr - User mailing list archive at Nabble.com.
Bill,
I was using it as a simple default search field. I realise now that's not
a good reason to use copyField. As I see it now, it should be used if you
want to search in a way that is different: use different analyzers, etc; not
for just searching on multiple fields in a single query.
Thank
10K documents. Why not just batch them?
You could read in 10K from your database, load em into an array of
SolrDocuments. and them post them all at once to the Solr server? Or do em
in 1K increments if they are really big.
--
View this message in context:
http://lucene.472066.n3.nabble.com/S
t number encountered."
which would seem to be necessary to get the group id.
Does Solr add anything to Lucene in this regard?
Thanks,
Tom
I'm not quite sure where
to configure/code this in Solr.
Thanks,
Tom
On 10/30/06, Yonik Seeley <[EMAIL PROTECTED]> wrote:
> Hi Tom, I moderated your email in... you need to subscribe to prevent
> your emails being blocked in the future.
Thanks. That's fix
viously, my
implementation won't work at the moment, since all queries will cache
the value generated using the results generated by the value of trim
on the initial query.
Any suggestions for where to go poking around to fix this vs. caching?
Thanks,
Tom
At 11:10 AM 11/8/200
At 01:35 PM 11/15/2006, you wrote:
On 11/15/06, Tom <[EMAIL PROTECTED]> wrote:
It looks like that for trimming, the places I want to modify are in
ScorePriorityQueue and FieldSortedHitQueue. When trimming, I want to
return the top item in the group that matches, whether by score or
sor
first.
Thanks,
Tom
uld it be faster to do that on a field with
fewer values (but the same number of documents)
e.g. type:[* TO *] where the type field has a small number of values.
Or does that not matter?
Thanks,
Tom
At 02:49 PM 11/21/2006, you wrote:
: > I mean is there something I can put in a solr UR
don't want to affect the score,
but I would like to have cached, I should use Filter Queries, right?
(SolrParams.FQ)
Thanks,
Tom
anything better than JConsole also welcome).
Thanks,
Tom
re monitor.xml, I get them in insertion order
in that case as well.)
Please let me know if my assumptions or my methods aren't correct.
Thanks,
Tom
0
4
10
0
on
id:[* TO *]
1
2.2
electronicsmonitor
30" TFT active matrix LCD, 2560 x 1600,
.25mm
Hi Yonik,
Thanks for the quick response.
At 07:45 AM 12/28/2006, you wrote:
On 12/27/06, Tom <[EMAIL PROTECTED]> wrote:
I'm having a problem getting boosts to work the way I think they are
supposed to.
Do you have a specific relevance problem you are trying to solve, or
just tes
At 12:03 PM 12/28/2006, you wrote:
On 12/28/06, Tom <[EMAIL PROTECTED]> wrote:
Could you index your documents in the desired order? This is the
default sort order.
I don't think I can control document order, as documents may get
edited after creation.
If not, you can add a fi
ven't really figured out Lucene scoring.
document boosts aren't maintained in the index ... they are multiplied by
the various field boosts and lengthNorms and stored on a per field basis.
Thanks! I had seen comments that the doc boost wasn't stored, but
didn't know how it worked.
Tom
Any hints gratefully accepted
Cheers
Tom
On Tue, Jan 12, 2016 at 2:32 PM, Shawn Heisey wrote:
> On 1/12/2016 6:05 AM, Tom Evans wrote:
>> Hi all, trying to move our Solr 4 setup to SolrCloud (5.4). Having
>> some problems with a DIH config that attempts to load an XML file and
>> iterate through the nodes in that
On Tue, Jan 12, 2016 at 3:00 PM, Shawn Heisey wrote:
> On 1/12/2016 7:45 AM, Tom Evans wrote:
>> That makes no sense whatsoever. DIH loads the data_import.conf from ZK
>> just fine, or is that provided to DIH from another module that does
>> know about ZK?
>
> This
them?
Cheers
Tom
vmhost there is at most one
replica. I'm assuming that a shard leader and a replica are both
treated as replicas in this scenario.
Thanks
Tom
On Mon, Feb 1, 2016 at 8:34 PM, Erick Erickson wrote:
> See the createNodeset and node parameters for the Collections API CREATE and
> ADDR
patch)?
Thanks, Tom
that specify an expensive join condition
that we specify in the fq, so that subsequent requests with the same
fq won't have to do the same expensive query, and was wondering
whether we needed to ensure that the query goes to the same node when
we move to cloud.
Cheers
Tom
day, and use the stats component to calculate
the mean average. This blog post explains it:
https://lucidworks.com/blog/2015/01/29/you-got-stats-in-my-facets/
Cheers
Tom
On Wed, Feb 10, 2016 at 12:13 PM, Markus Jelsma
wrote:
> Hi Tom - thanks. But judging from the article and SOLR-6348 faceting stats
> over ranges is not yet supported. More specifically, SOLR-6352 is what we
> would need.
>
> [1]: https://issues.apache.org/jira/browse/SOLR-63
I am having problem configuring Solr to read Nutch data or Integrate with
Nutch.
Does anyone able to get SOLR 5.4.x to work with Nutch?
I went through lot of google's article any still not able to get SOLR 5.4.1
to searching Nutch contents.
Any howto or working configuration sample that you can
Hello,
I am working on a project using Solr to search data from retrieved from
Nutch.
I have successfully integrated Nutch with Solr, and Solr is able to search
Nutch's data.
However I am having a bit of a problem. If I query Solr, it will bring back
the numfound and which document the query str
ot;name = T" or maybe "name: T". Ultimately by
searching for the string "name" I am trying to find the value of name.
Thanks for your time. I appreciate your help
-T
On Feb 18, 2016 1:18 AM, "Binoy Dalal" wrote:
> Append &fl=
>
> On Thu, 18 Feb 2016, 1
On Mon, Feb 29, 2016 at 11:43 AM, David Santamauro
wrote:
> You will have noticed below, the field definition does not contain
> multiValues=true
What version of the schema are you using? In pre 1.1 schemas,
multiValued="true" is the default if it is omitted.
Cheers
Tom
s the core.properties for each shard is
created inside the solr home.
This is obviously no good, as when releasing a new version of the solr
home, they will no longer be in the current solr home.
Cheers
Tom
Hmm, I've worked around this by setting the directory where the
indexes should live to be the actual solr home, and symlink the files
from the current release in to that directory, but it feels icky.
Any better ideas?
Cheers
Tom
On Thu, Mar 3, 2016 at 11:12 AM, Tom Evans wrote:
>
sections of
our solrconfig.xml files, and mergeFactor is not mentioned at all.
> $ ack -B 1 -A 1 '10
212- -->
> $ ack --all maxMergeDocs
> $
Any ideas?
Cheers
Tom
ncerned that this is a little
heavyweight for a status check to see whether we can direct requests
at this server or not.
Cheers
Tom
On Wed, Mar 16, 2016 at 2:14 PM, Tom Evans wrote:
> Hi all
>
> [ .. ]
>
> The option I'm trying now is to make two ping handler for skus that
> join to one of items/products, which should fail on the servers which
> do not support it, but I am concerned that this is a
On Wed, Mar 16, 2016 at 4:10 PM, Shawn Heisey wrote:
> On 3/16/2016 8:14 AM, Tom Evans wrote:
>> The problem occurs when we attempt to query a node to see if products
>> or items is active on that node. The balancer (haproxy) requests the
>> ping handler for the appropriat
use to get
both the page of results we want and a nextCursorMark for the
subsequent page?
Cheers
Tom
n a document which sorts higher than the
supplied mark appears. Seems more complex, but maybe I'm not
understanding the internals correctly.
Fortunately for us, 90% of our users prefer infinite scroll, and 97%
of them never go beyond page 3.
Cheers
Tom
me cluster. Of course, you could think of a set of
servers within a cluster as a "logical" cluster if it just serves
particular collection, but "cluster" to me would be all of the servers
within the same zookeeper tree, because that is where cluster state is
maintained.
Cheers
Tom
nd forcing a leader election also has no effect.
Any ideas? The only viable option I see is to create a new collection,
index it and then remove the old collection and alias it in.
Cheers
Tom
to move to Solr 6, as we have only just
finished validating 5.5.0 with our original queries!
Cheers
Tom
Awesome, thanks :)
On Fri, Apr 15, 2016 at 4:19 PM, Anshum Gupta wrote:
> Hi Tom,
>
> I plan on getting a release candidate out for vote by Monday. If all goes
> well, it'd be about a week from then for the official release.
>
> On Fri, Apr 15, 2016 at 6:52 AM, Tom Ev
shards. Depending on your shard/cluster topology, this can increase
performance if you are returning large amounts of data - many or large
fields or many documents.
Cheers
Tom
rds the better approach in this case.
>
> Could some one please share their views/ experience?
Try it and see - everyone's data/schemas are different and can affect
indexing speed. It certainly sounds achievable enough - presumably you
can at least produce the documents at that rate?
Cheers
Tom
Hi Solr Community
I have been trying to add user authentication to our Solr 5.3.1 RedHat install.
I’ve found some examples on user authentication on the Jetty side. But they
have failed.
Does any one have a step by step example on authentication for the admin
screen? And a core?
Thanks
Tom
. HTML interface access (example:
ajax-solr<https://github.com/evolvingweb/ajax-solr>)
Thanks
Tom LeZotte
Health I.T. - Senior Product Developer
(p) 615-875-8830
On Aug 24, 2015, at 10:05 AM, Alexandre Rafalovitch
mailto:arafa...@gmail.com>> wrote:
Thanks for the email from th
Bosco,
We use CAS for user authentication, not sure if we have Kerberos working
anywhere. Also we are not using ZooKeeper, because we are only running one
server currently.
thanks
Tom LeZotte
Health I.T. - Senior Product Developer
(p) 615-875-8830
On Aug 24, 2015, at 3:12 PM, Don Bosco
-children-with-block-join-parser
Thanks for any hints,
Tom
The question is:
Solr 5.3 supports faceting on nested documents [1], with a great tutorial
from Yonik [2].
In the tutorial example, the query to get the documents for faceting is
directly performed on the child documents:
$ curl http
HI
Solr/Tika uses the /tmp directory to process documents. At times the directory
hits 100%. This causes alarms from Nagios for us. Is there a way in Solr/Tika
to limit the amount of space used in /tmp? Value could be 80% or 570MB.
thanks
Tom LeZotte
Health I.T. - Senior Product Developer
(p
>
> and
> true
> false
>
Putting things in "invariants" overrides whatever is passed for that
parameter in the request parameters. By putting "false" in invariants, you are making it impossible
to clean + index as part of DIH, because "clean" is always false.
Cheers
Tom
e, I'd like to know the cumulative time
spent paused in GC since the last poll, and the longest GC pause that
we see. munin polls every 5 minutes, are there suitable counters
exposed by JMX that it could extract?
Thanks in advance
Tom
anyway.
So really:
1) Is it OK to turn JMX monitoring on on production systems? The
comments in solr.in.sh suggest not.
2) What JMX beans and attributes should I be using to monitor GC
pauses, particularly maximum length of a single pause in a period, and
the total length of pauses in that period?
Cheers
Tom
t you use as part
of your methodology, so that you know what domain it belongs in, but
that would just be for your benefit, Solr would not complain if you
put a "child" field in a parent or vice-versa.
Cheers
Tom
PS:
I would not use Solr 4.7 for this. Nested docs are a new-ish featu
e command line, or just a better way of handling
this that I'm not aware of?
Cheers
Tom
On Mon, Dec 14, 2015 at 1:22 PM, Shawn Heisey wrote:
> On 12/14/2015 10:49 AM, Tom Evans wrote:
>> When I tried this in SolrCloud mode, specifying
>> "-Dsolr.data.dir=/mnt/solr/" when starting each node, it worked fine
>> for the first collection, but then the
Hi,
I am running Solr 5.0.0 and have a question about proximity search and
multiValued fields.
I am indexing xml files of the following form with foundField being a field
defined as multiValued and text_en my in schema.xml.
8
"Oranges from South California - ordered"
"Green Apples - available"
t; values.
>
> Also, leave "AND" out of the query phrases - you're just trying to match
> the product name and availability.
>
>
> -- Jack Krupansky
>
> On Tue, Mar 3, 2015 at 4:51 PM, Tom Devel wrote:
>
> > Hi,
> >
> > I am running Sol
Erick,
Thanks a lot for the explanation, makes sense now.
Tom
On Tue, Mar 3, 2015 at 5:54 PM, Erick Erickson
wrote:
> bq: Does it mean that words between " symbols, such as "Orange ordered" are
> treated as a single term, with (implicitly) AND conjunction between them?
erarchy.of.fields.Description", the rest for now should
be just ignored. Is here any significance of which definition comes first?
Thanks for any hints,
Tom
gt; Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
> http://www.solr-start.com/
>
>
> On 6 March 2015 at 11:21, Tom Devel wrote:
> > Hi,
> >
> > I am running solr 5 using basic_configs and have a questions about the
> > order of defining fields
temporary files can be extracted? We
currently set:
SOLR_PID_DIR=/data/solr/run
SOLR_HOME=/data/solr/cores
SOLR_LOGS_DIR=/data/solr/logs
Cheers
Tom
On Tue, Mar 24, 2015 at 4:00 PM, Tom Evans wrote:
> Hi all
>
> We're migrating to SOLR 5 (from 4.8), and our infrastructure guys
> would prefer we installed SOLR from an RPM rather than extracting the
> tarball where we need it. They are creating the RPM file themselves,
have their own copy of the solr
install.
I think the way we will go for this is in production to run all our
solr instance as the "solr" user, who will own the files in /opt/solr,
and have their solr home directory wherever they choose. In dev, we
will just do something...
Cheers
Tom
once a
day.
When viewing the memory usage of the servers, the master server's JVM
has 8.8GB RSS, but the slave only has 1.2GB RSS.
Can someone hit me with the cluebat please? :)
Cheers
Tom
e even more in cache - we have about 35GB
of index data.
Cheers
Tom
On Tue, Apr 21, 2015 at 11:25 AM, Markus Jelsma
wrote:
> Hi - what do you see if you monitor memory over time? You should see a
> typical saw tooth.
> Markus
>
> -Original message-
>> From:Tom Evans
7;t getting any requests.
Again, sorry!
Tom
On Tue, Apr 21, 2015 at 11:54 AM, Tom Evans wrote:
> We monitor them with munin, so I have charts if attachments are
> acceptable? Having said that, they have only been running for a day
> with this memory allocation..
>
> Describing the
changes/Changes.html#v5.1.0.bug_fixes
Cheers
Tom
instances as discussed in the JIRAs? How do you usually do this?
Thanks for any help or hints.
Tom
Hello,
I have customized my Solr results so that they display only 3 fields: the
document ID, name and last_modified date. The results are in JSON.
This is a sample of my Javascript function to execute the query:
var query = "";
//set user input t
I need to change the web context and the port for a SolrCloud installation.
Example, change:
host:8080/some-api-here/
to this:
host:8983/solr/
Does anyone know how to do this with SolrCloud? There are values stored in
clusterstate.json and /leader/elect and I could change them but
that seem
(In Solr I mean)
>
> Am I on the right track or are you asking something other than how to get
> Solr on host:8983/solr ?
>
> On Wed, May 11, 2016 at 11:56 AM, Tom Gullo wrote:
>
>> I need to change the web context and the port for a SolrCloud installation.
>>
&g
That helps. I ended up updating the sole.in.sh file in /etc/default and that
was in getting picked up. Thanks
> On May 11, 2016, at 2:05 PM, Tom Gullo wrote:
>
> My Solr installation is running on Tomcat on port 8080 with a web context
> name that is different than /solr. We
from Shawn:
https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201604.mbox/%3c570d0a03.5010...@elyograg.org%3E
Cheers
Tom
created.
The snitch is specified at collection creation time, or you can use
MODIFYCOLLECTION to set it after the fact. See this wiki patch for
details:
https://cwiki.apache.org/confluence/display/solr/Rule-based+Replica+Placement
Cheers
Tom
nformative.
Start from the top page and browse away!
https://cwiki.apache.org/confluence/display/solr/Apache+Solr+Reference+Guide
Handy to keep the glossary handy for any terms that you don't recognise:
https://cwiki.apache.org/confluence/display/solr/Solr+Glossary
Cheers
Tom
Do you have to group, or can you collapse instead?
https://cwiki.apache.org/confluence/display/solr/Collapse+and+Expand+Results
Cheers
Tom
On Tue, Jun 14, 2016 at 4:57 PM, Jay Potharaju wrote:
> Any suggestions on how to handle result grouping in sharded index?
>
>
> On Mon, Jun 1
TO *] AND -ingredient_tag_id:(35223)
Is there any way I can make the query and highlighting work as
expected as part of q?
Is there any downside to putting the exclusion part in the fq in terms
of performance? We don't use score at all for our results, we always
order by other parameters.
Che
;0" (and no other message) and kept the down node
as the leader (!)
Deleting the failed collection from the failed node and re-adding it
has the same "Leader said I'm not the leader" error message.
Any other ideas?
Cheers
Tom
;leader that is not the leader" server
about 15-20 minutes ago, but we still have not had a leader election.
Cheers
Tom
On Tue, Jul 19, 2016 at 4:30 PM, Erick Erickson wrote:
> How many replicas per Solr JVM? And do you
> see any OOM errors when you bounce a server?
> And how pati
the "leader that is not the leader" shut down.
Issuing a FORCELEADER via collections API doesn't in fact force a
leader election to occur.
Is there any other way to prompt Solr to have an election?
Cheers
Tom
On Tue, Jul 19, 2016 at 5:10 PM, Tom Evans wrote:
> There are 11
ublished)'}" - I'm returned this:
{u'count': 86760, u'date.max': 146836800.0, u'date.min': 129409920.0}
What do these numbers represent - I'm guessing it is milliseconds
since epoch? In UTC?
Is there any way to control the output format or TZ?
Is there any benefit in using JSON facets to determine this, or should
I just continue using stats?
Cheers
Tom
1 - 100 of 516 matches
Mail list logo