Hello,
this is my first time writing at this mailing lost, so hello everyone.
I am having issues with synonyms.
I added the synonym to one of my field types:
|
I also added some Synonyms
Hello,
I'm pretty new to Solr, as a PHP developer.
I'm still reading the tutorials for getting started with Solr, adding and
indexing data. I'm still using the example/start.jar, as I still didn't
succeed to config a true (production-ready) Solr instance. But doesn't
matter.
As I can't deal with
Hello again,
I still trying to index a with solr cloud and dih. I can index but it seems
that indexation is done on only 1 shard. (my goal was to parallelze that to
go fast)
This my conf:
I have 2 tomcat instances,
One with zookeeper embedded in solr 4.4.0 started and 1 shard (port 8080)
The othe
You can use the dynamic fields feature of Solr to map unknown field
names to types.
For example, a dynamic field named as "*_s" i.e. any field name ending
with _s can be mapped to string and so on. In your cases, if your
field names do not follow a set pattern, then you can even specify a
dynamic
DataImportHandler does not parallelize indexing at all. It is a single
threaded indexer which runs on a single node. However, the documents
themselves are routed to the correct shard by SolrCloud. Therefore,
what you are observing on your servers is normal.
If you want to parallelize indexing then
SOLR has a nice analysis page. You can use it to get insight what is
happening after each filter is applied at index/search time
Regards
Pravesh
--
View this message in context:
http://lucene.472066.n3.nabble.com/Problem-with-Synonyms-tp4087905p4087915.html
Sent from the Solr - User mailing l
Hi jerome.dupont
please check what is the updateHandler in your solrconfig.xml
--> by
default,it is solr.NoOpDistributingUpdateProcessorFactor
db-data-config.xml
sample
2013/9/3
>
> H
The Schema REST API is a new feature and supports only adding fields
(and that too since Solr 4.4). It doesn't support modifying fields
yet.
On Tue, Sep 3, 2013 at 2:39 PM, bengates wrote:
> Hello,
>
> I'm pretty new to Solr, as a PHP developer.
> I'm still reading the tutorials for getting start
Hi,
I've setup a ZK instance and also deployed Solr in Tomcat7 on a different
instance in Amazon EC2.
Afterwards I tried starting tomcat specifying the ZK host IP, like so:
sudo service tomcat7 start -DzkHost=<>:2181 -DnumShards=3
-Dcollection.configName=myconf
-Dbootstrap_confdir=/usr/share/solr/
Hello,
Thanks for your quick reply.
This is what I feared.
Do you know if this is planned for Solr 4.5 or Solr 5.0 ?
I didn't see anything about it in the roadmap.
Thank you,
Ben
--
View this message in context:
http://lucene.472066.n3.nabble.com/Update-field-properties-via-Schema-Rest-API-
Am 03.09.2013 12:11, schrieb pravesh:
SOLR has a nice analysis page. You can use it to get insight what is
happening after each filter is applied at index/search time
Regards
Pravesh
--
View this message in context:
http://lucene.472066.n3.nabble.com/Problem-with-Synonyms-tp4087905p4087915.
Did you find any other exceptions in the logs?
When I pasted the script section of your data config into my test
setup, I got an error saying that there is an unclosed string literal
in line 6
On Tue, Sep 3, 2013 at 12:23 AM, Kunzman, Doug wrote:
> Hi -
>
> I'm new to Solr and am trying to combi
in my case - the fields with apostrophe not returned in results
When I search for -- dev it shows me following results
dev
dev's
devendra
but when I search for -- dev' (dev with apo only)
Nothing comes out as result ?
What could be the workaround ?
Thanks
Devendra
--
View this messa
Hmmm, seems like it should work.
First thing I'd try is using the admin interface and look at the analysis
page to
see how the input is tokenized both at index and search time, that's
sometimes
surprising.
Second, again using the browser, attach &debug=query to the URL. That will
echo back what t
Is editing a text file really all that onerous? You can edit the
schema.xml file with any editor you're comfortable with and
issue the core RELOAD command in the interim.
Best
Erick
On Tue, Sep 3, 2013 at 6:20 AM, bengates wrote:
> Hello,
>
> Thanks for your quick reply.
>
> This is what I fea
Please explain exactly what "but nothing
really happens" means. Do you mean that
you see the SF in the analysis page but there
are no substitutions? Or you don't get
search results? Or???
You have to reload the core after making changes
at minimum, you can restart the Solr instance
if you're paran
The SF part is in the analysis page but nothing is substituted.
I reloaded, removed and readded the core, reindexednothing worked :(
I wonder if the SF actually uses the correct file for synonyms. I have
it laying in the conf folder of the core. Is that correct?
Am 03.09.2013 13:32, schri
Hi,
We have a large, sharded SolrCloud index of >300 million documents which
we use to explore our web archives. We want to facet on fields that have
very large numbers of distinct values, e.g. host names and domain names
of pages and links. Thus, overall, we expect to have millions of
distinct
Hi Roman,
Thanks, the --additionalSolrParams was just what I wanted and works fine.
BTW, if you have some special "bug tracking forum" for the tool, I'm happy
to submit questions / bug reports there. Otherwise, this email list is ok
(for me at least).
One other thing I have noticed in the err lo
Hello Erick,
Thank you for your reply.
Unfortunately, yes it is.
I work with a company that has a catalog with many new attributes every day,
and sometimes the existing ones change. For instance, one attribute may live
with the unit for months (e.g. screen_size ="32 cm") and one day my provider
> However, the Solr instance we direct our client query to is consuming
> significantly more RAM (10GB) and is still failing after a few queries when
> it runs out of heap space. This is presumably due to the role it plays,
> aggregating the results from each shard.
That seems quite odd... Wha
Hi,
I have setup SolrCloud with tomcat. I use solr 4.1.
I have zookeeper running on 192.168.1.10.
A tomcat running solr_myidx on 192.168.1.10 on port 8080.
A tomcat running solr_myidx on 192.168.1.11 on port 8080.
My solr.xml is like this:
hostPort="8080" hostContext="solr_myidx" zkClientTi
When i try to deploy using jetty, everything works fine, and the solr
instance gets in the cloud
sudo java -Dbootstrap_confdir=./solr/collection1/conf
-Dcollection.configName=myconf -DzkHost=<>:2181 -DnumShards=3 -jar
start.jar
--
View this message in context:
http://lucene.472066.n3.nabble.co
The default facet.limit is 10, but it's set to 50 for most of the
facets. I've included the query parameters below. In case it makes any
difference, there are quite a lot of facet fields with large numbers of
terms, and the queries are being generated by the Sarnia Drupal module.
Thanks,
Andy
---
Hi Dmitry,
Thanks for the feedback. Yes, it is indeed jmeter issue (or rather, the
issue of the plugin we use to generate charts). You may want to use the
github for whatever comes next
https://github.com/romanchyla/solrjmeter/issues
Cheers,
roman
On Tue, Sep 3, 2013 at 7:54 AM, Dmitry Kan
We are harvesting and indexing bibliographic data, thus having many distinct
author names in our index. While testing Solr 4 I believe I had pushed a single
core to 100 million records (91GB of data) and everything was working fine and
fast. After adding a little more to the index, then followin
It works
I've done what you said:
_ In my request to get list of documents, I add a where clause filtering on
the select getting the documents to index:
where noticebib.numnoticebib LIKE '%${dataimporter.request.suffixeNotice}'"
_ And I called my dih on each shard with the parameter suffixeNotice
I don't know much about Tika but in the example data-config.xml that
you posted, the "xpath" attribute on the field "text" won't work
because the xpath attribute is used only by a XPathEntityProcessor.
On Thu, Aug 29, 2013 at 10:20 PM, Andreas Owen wrote:
> I want tika to only index the content i
CloudSolrServer can only be used if you are actually using SolrCloud
(i.e. a ZooKeeper aware setup). If you only have a multi-core setup,
then you can use LBHttpSolrServer.
See http://wiki.apache.org/solr/LBHttpSolrServer
On Tue, Aug 27, 2013 at 2:11 PM, Dharmendra Jaiswal
wrote:
> Hello,
>
> I
Hi,
We have a need to specify a different query analyzer depending on input
parameters dynamically.
We need this so that we can use different stopword lists at query time.
Would any one know how I might be able to achieve this in solr?
I'm aware of the solution to specify different field types,
I think I have it all sorted out. There are some weird network issues here
where my test set up is, so that may have been part of the over all issue.
Timeouts wouldn't have fixed this issue, that's for sure.
On Sat, Aug 31, 2013 at 7:17 AM, Erick Erickson wrote:
> bq: Though I am seeing some fu
Hi,
Quick question about data import handlers in Solr cloud. Does anyone use
more than one instance to support the DIH process? Or is the typical setup
to have one box setup as only the DIH and keep this responsibility outside
of the Solr cloud environment? I'm just trying to get picture of his
Interesting because I was getting the issue when I was passing the full
path (without the trailing / ) to Tomcat.
On Mon, Sep 2, 2013 at 11:34 PM, Prasi S wrote:
> The issue is resolved. I have given all the path inside tomcat as relative
> paths( solr home, solr war). That was the creating the
Your email is vague in terms of what you are actually *doing* and what
behavior you are seeing.
Providing specific details like "This is my schema.xml and this is my
solrconfig.xml; when i POST this file to this URL i get this result and i
would instead like to get this result" is useful for
Ah, thanks for the closure, it's always nice to know. I used to work
with a guy who had a list of "network fallacies", that amounted to
"you can't trust them fully"
Erick
On Tue, Sep 3, 2013 at 12:12 PM, Jared Griffith
wrote:
> I think I have it all sorted out. There are some weird network
: like to understand how the ordering is defined so that I can compute an
: integer that is sorted in the same way. For example (shard "id" << 24) |
: docid or something like that.
If you want to ensure a consistent ordering, you have to index a
(unique) value that you use as a secondary sort -
Those are the Fallacies of Distributed Computing from L. Peter Deutsch. The
first fallacy is "The network is reliable."
http://en.wikipedia.org/wiki/Fallacies_of_Distributed_Computing
wunder
On Sep 3, 2013, at 10:26 AM, Erick Erickson wrote:
> Ah, thanks for the closure, it's always nice to kn
You don't need to index fields several times, you can index is just into
one field, and use the different query analyzers just to build the query.
We're doing this for authors, for example - if query language says
"=author:einstein", the query parser knows this field should be analyzed
differently
: 2. The items "at" and "l" are not preceded by name.
you're getting back a list of items, the odd items ("at", "l") are
strings, and the even items are more complex objects associated with those
strings
: Can I interfere with the structure?
You can choose how the JSON Writer represents the i
Sounds like it would be better for you to preprocess the query in your
application layer. Your requirements seem too open-ended to "wire" into
Solr.
But, to be sure, please elaborate exactly what sort of variations you need
in query analysis.
-- Jack Krupansky
-Original Message-
Fr
Thankfully it's none of those but more than likely a bad DHCP server
(Windows) or client (or combo there of) that is causing the network to
freak out. I'll try adjusting the timeouts up to see if it will alleviate
this.
I am seeing that when I try to restart the solr instances sometimes they
seem
Was it a test instance that you created 8983 is the default port, so
possibly you started an instance before you had the ports setup properly,
and it registered in zookeeper as a valid instance. You can use the Core
API to UNLOAD it (if it is still running), if it isn't running anymore, I
have yet
Yep, that's the one, thanks...
On Tue, Sep 3, 2013 at 1:38 PM, Walter Underwood wrote:
> Those are the Fallacies of Distributed Computing from L. Peter Deutsch.
> The first fallacy is "The network is reliable."
>
> http://en.wikipedia.org/wiki/Fallacies_of_Distributed_Computing
>
> wunder
>
> On
On 9/3/2013 4:13 AM, maephisto wrote:
> I've setup a ZK instance and also deployed Solr in Tomcat7 on a different
> instance in Amazon EC2.
> Afterwards I tried starting tomcat specifying the ZK host IP, like so:
>
> sudo service tomcat7 start -DzkHost=<>:2181 -DnumShards=3
> -Dcollection.configNa
On 9/3/2013 3:59 AM, devendra W wrote:
> in my case - the fields with apostrophe not returned in results
Don't use special characters in field names. If it wouldn't work as an
variable name, function name (or other identifier) in a typical
programming language (Java, C, Perl), then it will probab
I was having problems updating SolrCloud with a large batch of records. The
records are coming in bursts with lulls between updates.
At first, I just tried large updates of 100,000 records at a time.
Eventually, this caused Solr to hang. When hung, I can still query Solr.
But I cannot do any delet
Show us your full field type with analyzer.
I suspect that the problem is that one of the index-time filters is turning
"dev's" into "devs" (WDF does that), but at query-time there is no filter
that removes a trailing apostrophe.
Use the Solr Admin UI Analysis page to see home "dev's" gets in
If you want to alter the score in a customized way based on indexed text data
on a per-value basis then index Lucene payloads, and use PayloadTermQuery.
See the javadocs for PayloadTermQuery in particular and follow the
references. This is a bit dated but read this:
http://searchhub.org/2009/08/0
On 09/03/2013 12:50 PM, Chris Hostetter wrote:
: like to understand how the ordering is defined so that I can compute an
: integer that is sorted in the same way. For example (shard "id" << 24) |
: docid or something like that.
If you want to ensure a consistent ordering, you have to index a
(u
Our index is too large to uninvert on the fly, so we've been looking
into using DocValues to keep a particular field uninverted at index
time. See http://wiki.apache.org/solr/DocValues
I don't know if this will solve your problem, but it might be worth
trying it out.
-Greg
On Tue, Sep 3, 2013
Hey guys,
I am looking into an issue we've been having with SolrCloud since the
beginning of our testing, all the way from 4.1 to 4.3 (haven't tested 4.4.0
yet). I've noticed other users with this same issue, so I'd really like to
get to the bottom of it.
Under a very, very high rate of updates (
When I have a field using CJKBigramFilter, parsed CJK chars have a different
parsedQuery than non-CJK queries.
(旧小说 is 3 chars, so 2 bigrams)
args sent in: q={!qf=bi_fld}旧小说&pf=&pf2=&pf3=
debugQuery
{!qf=bi_fld}旧小说
{!qf=bi_fld}旧小说
(+DisjunctionMaxQuerybi_fld:旧小
bi_fld:
Re the relevancy changes I note below for edismax, there are already some
issues filed:
pertaining to the difference in how the phrase queries are merged into the main
query:
See Michael Dodsworth's comment of 25/Sep/12 on this issue:
https://issues.apache.org/jira/browse/SOLR-2058 <-- ti
The query parser sees "q=foo bar" as two separate source query terms and
analyzes each separately, but "q=旧小说" is seen by the query parser as a
single source query term and then that one source query term gets tokenized
by the query term analyzer as two CJK bigrams.
Try "q=foo-bar" and you sho
54 matches
Mail list logo