On 26 October 2012 08:51, Billy Newman wrote:
> Any plans on adding update functionality to DIH?
What do you mean by "update functionality"?
Re-running an import with changed values for a
document with an existing ID will update values
in the Solr index.
If you mean adding new documents, please
Any plans on adding update functionality to DIH?
If you don't need them searchable, you don't use Solr4 with transaction log
and it is a relatively fast process (so it's not a pain to run it all again
in case of some failure), I would just commit at the end. In any other
case, I would use autocommit.
Anyway, I'm not sure if you'll see much differ
By default you get the top result from each group (if you are not sorting
on a field specifically, then you should be getting the document with the
highest score per group).
This can be changed with the "group.limit" parameter. See
http://wiki.apache.org/solr/FieldCollapsing#Request_Parameters
To
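For illustration, here is a minimal Python sketch of how the grouping parameters mentioned above could be put together on the client side. The field name "category" and the query text are made up for the example; "group", "group.field" and "group.limit" are the request parameters described on the FieldCollapsing wiki page.

```python
from urllib.parse import urlencode

def build_group_params(query, group_field, group_limit=1):
    """Return a URL-encoded query string for a grouped Solr search.

    group_limit is the number of documents returned per group
    (Solr's own default is 1, i.e. only the top document).
    """
    params = {
        "q": query,
        "group": "true",
        "group.field": group_field,   # hypothetical field name in the usage below
        "group.limit": group_limit,
    }
    return urlencode(params)

# Ask for the top 3 documents in each group instead of only the first one.
query_string = build_group_params("ipod", "category", group_limit=3)
print(query_string)
```

The resulting string would be appended after "?" to whatever select URL your installation uses.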
>
> Is there a way to set up logging to output something when segment merging
> runs?
>
I think segment merging is logged when you enable infoStream logging (you
should see it commented out in solrconfig.xml).
>
> Can segment merges be throttled?
>
You can change when and how segments are merged wit
I have problems with very low indexing speed as soon as the core size grows
over 15 GB. I suspect that it may be due to I/O-intensive segment merging.
Is there a way to set up logging to output something when segment merging
runs?
Can segment merges be throttled?
Whoops! Jack got there faster :D
Sorry for "double" posting.
On Thu, Oct 25, 2012 at 11:41 PM, Jack Krupansky wrote:
> You may want to do a copyfield and have one field that is stemmed for
> casual matches at a lower boost while one is unstemmed and boosted higher.
>
> You could also do a copyfield
First of all, if you stem at index time and not at query time, your
queries won't find anything valuable.
This is an easy approach. If you want full-term (unstemmed) matches to be
boosted, you have to query like:
?q=data management&defType=edismax&qf=*name_without_stemming^200* name^100
text ui
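A minimal Python sketch of how such a boosted "qf" value could be assembled and URL-encoded. It assumes the asterisks in the query above are only email emphasis around the new field, and it includes just the two boosted fields; the rest of the field list is cut off above and is not reproduced here.

```python
from urllib.parse import urlencode

def build_qf(fields):
    """Join (field, boost) pairs into an edismax qf value.

    A boost of None means the field is listed without a boost.
    """
    parts = []
    for name, boost in fields:
        parts.append(name if boost is None else f"{name}^{boost}")
    return " ".join(parts)

def build_edismax_params(query, fields):
    """Return the request parameters for an edismax query as a dict."""
    return {"q": query, "defType": "edismax", "qf": build_qf(fields)}

# The unstemmed copy of the field gets the higher boost.
params = build_edismax_params(
    "data management",
    [("name_without_stemming", 200), ("name", 100)],
)
print(urlencode(params))
```

urlencode takes care of escaping the spaces and the "^" characters, which is easy to get wrong when the URL is typed by hand.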
You may want to do a copyfield and have one field that is stemmed for casual
matches at a lower boost while one is unstemmed and boosted higher.
You could also do a copyfield to a non-tokenized field (TextField with
KeywordTokenizer and lower case and trim filters) and give a match in that
fie
Thanks a lot for your reply.
Consider the below query
?q=data management&defType=edismax&qf=name^100 text uid^0.3&fl=name
I am trying to boost the name as much as possible, but even then the results
come back in the order below
- MANAGER
- MANAGER
- MANAGERS
- ...AA DATA MANAGEMENT
It also provides high availability for indexing and searching.
On Thu, Oct 25, 2012 at 4:43 PM, Bill Au wrote:
> So I guess one would use SolrCloud for the same reasons as distributed
> search:
>
> When an index becomes too large to fit on a single system, or when a single
> query takes too long
So I guess one would use SolrCloud for the same reasons as distributed
search:
When an index becomes too large to fit on a single system, or when a single
query takes too long to execute.
Bill
On Thu, Oct 25, 2012 at 3:38 PM, Shawn Heisey wrote:
> On 10/25/2012 1:29 PM, Bill Au wrote:
>
>> Is
On 10/25/2012 1:29 PM, Bill Au wrote:
Is SolrCloud using distributed search behind the scenes? Does it have the
same limitations (for example, doesn't support MoreLikeThis) distributed
search has?
Yes and yes.
Dear All,
Apologies for the lengthy email.
SOLR Version: 4
I'm a newbie to SOLR and have gone through the tutorial but could not find a
solution. The requirement below doesn't seem to be impossible, but I
think I'm missing the obvious.
In my RDBMS, there is a Qualification table and an Employee table. A
Hi,
A few questions on Solr auto-suggest below.
Q1) I tried using the index-based Suggest functionality with Solr 3.6.1.
Can I combine this with file-based boosting? Currently, when I specify the
index field and the sourceLocation, the file in the source location is not
considered.
Is there any
On 10/25/2012 10:52 AM, Billy Newman wrote:
This is OK by me since I don't really want to index that field anyway,
but I am wondering if I am doing something wrong. Is there a way to
mark a column in the Data Config as 'not indexed'? Or should I put a
field in my Solr Schema and mark it as indexe
I am using columns in my DataConfig specifically to grab them later as
a template.
i.e.
Where 'stuff' is the field I want to index, but 'allStuff' is just a
temporary column used so that I can get at it via the template
transformer. 'stuff' is in my schema; 'allStuff' is not.
I get the followi
You neglected to show us your example queries and how you wanted to boost
them. The simple answer is to supply multiple query terms with increasing
level of detail (e.g., more terms in a quoted phrase) and apply a query
boost that is higher for the more exact matches.
If you don't get expected
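As a rough sketch of the approach described above, the Python helper below builds a query that contains the loose terms plus the same terms as a quoted phrase with a higher boost. The helper name and the boost value of 10 are arbitrary choices for illustration, and the sketch assumes the default query operator is OR, so the phrase clause only raises the score of exact matches.

```python
def exact_match_boost_query(text, phrase_boost=10):
    """Return a query string with the loose terms plus a boosted exact phrase.

    For a single word there is no phrase to boost, so the word is
    returned unchanged.
    """
    words = text.split()
    if len(words) < 2:
        return text.strip()
    loose = " ".join(words)
    return f'{loose} "{loose}"^{phrase_boost}'

# Documents containing the exact phrase score higher than partial matches.
print(exact_match_boost_query("account number"))
# prints: account number "account number"^10
```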
I'm running some tests on Solr 4.0 before putting it into production, and
I've just encountered an issue with hit highlighting.
I started by placing my index from 3.3 into a Solr 4.0 install. I then
edited the field definition in the schema config to match my schema from
3.3, with the addition of
Hi,
We have a name field which I am boosting using dismax parser.
I want the search to rank documents with an exact match above those with
only a partial match.
Ex:
Name:account
Name:account number
Name:account number - closed account
When I search for account number, I want solr to return the
I tried to define a suggest component as appears in Wiki.
I also defined a specific /suggest request handler.
This doesn't work in SolrCloud setup, as the query is distributed to the
default /select handler instead.
Specifically, shard handler gets default urls and other cores forward to
/select.
s
Could you provide a little more detail? I'm not sure if you are simply
describing behavior you didn't expect or like, or whether there might be an
actual problem.
-- Jack Krupansky
-Original Message-
From: sameera somisetty
Sent: Wednesday, October 24, 2012 6:47 PM
To: solr-user@luce
The problem seems to have been caused by my failure to completely remove
the existing index files when I switched the inStock field from an
indexed boolean field to externally maintained. After I removed
everything and re-indexed from scratch, the error went away.
On 10/24/2012 08:57 PM, Car
Sorry, but not currently. Great suggestion though. I didn't check to see if
there might already be a Jira for it, but I vaguely recall some discussion
or mention of it.
-- Jack Krupansky
-Original Message-
From: Radim Kolar
Sent: Wednesday, October 24, 2012 5:49 PM
To: solr-user@luce
Thanks Alexey -
Could you move this to a JIRA issue?
- Mark
On Oct 25, 2012, at 7:53 AM, AlexeyK wrote:
> setup:
> 1 node, 4 cores, 2 shards.
> 15 documents indexed.
>
> problem:
> init stage times out.
>
> probable cause:
> According to the init flow, cores are initialized one by one synch
Hi Deniz,
Haven't used geofilt query parser but with solrJ you don't need URL encoding.
Use solrQuery.set("fq","{!geofilt sfield=store}"); instead.
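For comparison, a small Python sketch of what the encoding looks like when the URL is built by hand rather than through a client library. The base URL and the "pt" and "d" values are placeholders for the example, not taken from the original question.

```python
from urllib.parse import urlencode, parse_qs

def build_geofilt_url(base_url, point, distance_km):
    """Build a select URL with a geofilt filter query.

    The local-params syntax ({! ... }) contains characters that must be
    escaped in a URL; urlencode does that for us.
    """
    params = [
        ("q", "*:*"),
        ("fq", "{!geofilt sfield=store}"),
        ("pt", point),         # placeholder "lat,lon" centre point
        ("d", distance_km),    # placeholder distance
    ]
    return base_url + "?" + urlencode(params)

url = build_geofilt_url("http://localhost:8983/solr/select", "45.15,-93.85", "5")
print(url)

# Decoding the query string gives back the raw filter text.
decoded = parse_qs(url.split("?", 1)[1])
print(decoded["fq"][0])
```

A client library such as SolrJ does this escaping internally, which is why the raw string is passed to it unencoded.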
--- On Thu, 10/25/12, deniz wrote:
> From: deniz
> Subject: org.apache.lucene.queryparser.classic.ParseException - a Bug?
> To: solr-user@lucen
Yes #5 is the same thing (sorry, I didn't read them all thoroughly). Your
description of the phrases being 'tags' suggests that you don't need term
positions for matching, and as you noted, you would get unwanted partial
matches. And, the TermQuerys would be much faster.
Peter
On Wed, Oct 24, 20
You may well have already seen this, but in case not:
http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html
FWIW,
Erick
On Wed, Oct 24, 2012 at 9:51 PM, Shawn Heisey wrote:
> On 10/24/2012 6:29 PM, Aaron Daubman wrote:
>>
>> Let me be clear that I am not interested in RAMD
Dear Mrs/Mr,
Using facet date/range on a date typed field and on a distributed search
between schema-compatible cores, the use of facet.mincount=1 returns a reduced
number of facet values (over 500 facet values expected, 5 facet values
retrieved). I wonder whether facet.mincount is supported on
setup:
1 node, 4 cores, 2 shards.
15 documents indexed.
problem:
init stage times out.
probable cause:
According to the init flow, cores are initialized one by one synchronously.
Actually, the main thread waits
ShardLeaderElectionContext.waitForReplicasToComeUp until retry threshold,
while replic
Using SolrCloud release with following configuration:
string
elevate.xml
explicit
text
elevator
Running the query
http://localhost:8080/solr/collection1/elevate?q=evelatedtext
constantly getting the following exception:
SEVERE: nu
Can I do something like this? (It fails with "fieldType: missing mandatory
attribute 'class'".)
termVectors="true" termPositions="true" termOffsets="true"/>
The idea is for one field type to extend another, to save copy and paste.