Hey everyone, in our application we are using Solr 4.6.
I had the idea to use the stats component on the score pseudo-field.
Is there a workaround for using "…stats=true&stats.field=score..."?
Thanks a lot!
--
View this message in context:
http://lucene.472066.n3.nabble.com/stats-pse-udo-field-sc
Thank you very much for your help, Ahmet.
However, the language detection is still not working. :(
My solrconfig.xml didn't contain that lst section inside the update
requestHandler.
That's the content I added:
class="solr.XmlUpdateRequestHandler">
>
> langid
>
I think you should check that your schema.xml and solrconfig.xml are both
encoded as UTF-8.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-does-not-recognize-language-tp4133711p4134643.html
Sent from the Solr - User mailing list archive at Nabble.com.
Why should this be a problem?
Both files start with
On Mon, May 5, 2014 at 11:44 AM, Frankcis wrote:
> i think you should check your scheme.xml and solrconfig.xml encoding
> format =
> utf-8。
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Solr-does-not-recogniz
Because if both files aren't encoded as UTF-8, building the index will produce
garbled text, and of course you will not get the expected result.
Hi all!
Sorry in advance if this question has already been posted, but I was unable to find it
with search engines.
Filter SpanishLightStemFilterFactory is not working properly with wildcards,
or I'm misunderstanding something. I have the field
With this type:
Hi everybody,
Can anyone give me a suitable interpretation for cat_rank in
http://people.apache.org/~hossman/ac2012eu/ slide 15?
Thanks
Hi Victor,
How do you index your documents? Your last config looks correct. However, if
you use the data import handler, for example, you need to add update.chain there too.
The same goes for the extracting request handler if you are using Solr Cell.
/home/username/data-config.xml
langid
B
Hi Roman,
What you are experiencing is OK and known. Stemming and wildcard searches
can be counterintuitive sometimes. Luckily, a remedy is available. Use the
following filters, and your wildcard searches will be happy. Please note that
this change will require a Solr restart and a re-index.
Generally, stemming filters are not supported when wildcards are present.
Only a small subset of filters work with wildcards, such as the case
conversion filters.
But you say that you are using the stemmer to remove diacritical marks...
you can/should use ASCIIFoldingFilterFactory or Mapping
SOLVED!
The first solution I tried (Ahmet's) worked fine!
Thank you!
-Original Message-
From: Jack Krupansky [mailto:j...@basetechnology.com]
Sent: Monday, May 5, 2014 13:19
To: solr-user@lucene.apache.org; rgonza...@normagricola.com
Subject: Re: Wildcard malfunctioning
I am not able to search for the data while indexing.
Indexing is done via the dataimport handler.
While searching for documents (while indexing is in progress), it
gives a broken pipe exception and won't return anything.
What would be the proper solution for this problem?
Am I missing some
How will a query like the one below get executed, and in which order?
I understand that when this query is executed fields mentioned in fieldList
will be returned. What I don't understand is how the "samplestring1" and
"samplestring2" will get searched with the query fields specified
I think I will be ab
Hi there,
I'm indexing my documents using mysolr. I mainly generate a list of JSON
objects and then run: solr.update(documents_array,'json')
On Mon, May 5, 2014 at 1:08 PM, Ahmet Arslan wrote:
> Hi Victor,
>
> How do you index your documents? Your last config looks correct. However
> for exampl
Hi All
I am completely new to Solr and hoping to understand the basics. Can one of
you help me understand what the following query does and in which order it
is executed?
I understand that when this query is executed fields mentioned in fieldList
will be returned. What I don't understand i
Read up on the edismax query parser first:
http://wiki.apache.org/solr/ExtendedDisMax
The "^" operator is known as boosting or field boosting and is used to
influence document scores for relevancy.
It has no analog in SQL.
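As a rough illustration of field boosting, the sketch below assembles the request parameters for an edismax query in Python. The helper name edismax_params is hypothetical, and the field names and boost values are placeholders; only the parameter names (q, defType, qf, fl) are standard Solr request parameters.

```python
from urllib.parse import urlencode


def edismax_params(query, boosts, fields_to_return):
    """Build Solr request parameters for an edismax query.

    boosts maps a field name to its boost; a higher boost makes matches in
    that field contribute more to the document score.
    """
    # "Field^boost" pairs, separated by spaces, form the qf parameter.
    qf = " ".join(f"{field}^{boost}" for field, boost in boosts.items())
    return {
        "q": query,
        "defType": "edismax",
        "qf": qf,
        "fl": ",".join(fields_to_return),
    }


# Placeholder field names and boosts, for illustration only.
params = edismax_params(
    "samplestring1 AND samplestring2",
    {"Exact_Field1": 1.0, "Exact_Field2": 0.9, "Field1": 0.8, "Field2": 0.7},
    ["Column1", "Column2"],
)
query_string = urlencode(params)  # ready to append to a /select URL
print(query_string)
```

The boosts only change how much each field contributes to the score; they do not change which documents match.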
-- Jack Krupansky
-Original Message-
From: nativecoder
Sent
I changed the hardcoded BlockJoinChildQParser setting to use the parent
scoring and that seems to work. So I think I got rid of the scoring issue
:).
I also voted for the issue!
I haven't found a solution for the highlighting issue yet, but I am
considering omitting highlighting for now as i
Thanks.
Basically, I'm running Solr on my localhost (my computer) and trying to access it
through the emulator in Eclipse, NOT on a physical phone.
Can it be done?
The index is built with the same version of Solr that is searching it (4.6.0),
and the config file (solrconfig.xml) and schema.xml are the same too.
The only way for me to solve this issue is to let only one process index
at a time. Wouldn't a message queue layer resolve this issue?
2014-05-04
Hi,
It's not an error. If you look at my code, there is a catch statement that
contains the "FAIL" message, and it always shows it.
On 5/5/2014 5:19 AM, Jack Krupansky wrote:
> But, you stay that you are using the stemmer to remove diacritical
> marks... you can/should use ASCIIFoldingFilterFactory or
> MappingCharFilterFactory.
I like ICUFoldingFilterFactory for this, but it does require additional
contrib jars (included in t
Is there an option in Solr (solrconfig.xml or somewhere else) to regularize
commits to the index?
I mean something like a 'sleep' between commits to the index while data
to be indexed is waiting in a stack.
2014-05-05 15:58 GMT+01:00 Hakim Benoudjit :
> The index is made with the same version of
Hi Victor,
I don't know mysolr. I assume you are using /update/json, so let's add your chain
to the defaults section.
application/json
langid
On Monday, May 5, 2014 4:06 PM, Victor Pascual
wrote:
Hi there,
I'm indexing my documents using mysolr. I mainly
I already went through the link. I understand the boosting factor for
relevancy.
query=samplestring1 AND samplestring2
defType: edismax
queryFields: Exact_Field1^1.0 Exact_Field2^0.9 Field1^0.8 Field2^0.7
fieldList: Column1, Column2
I need to understand whether the samplestring1 and sam
I don't think so. Solr excels at getting the score of single
documents, not aggregation.
It's not at all clear to me, though, that the sum of documents' scores
is a reasonable thing to sort by. Consider grouping on a very common
term. You'd never do this, but group on the elements of a text field.
On 5/5/2014 5:39 AM, Sohan Kalsariya wrote:
> I am not able to search for the data while indexing.
> Indexing is done via the dataimport handler.
> While searching for the documents (in between indexing is happening), it
> gives the broken pipe exception and wont search anything.
> What should be t
By and large, you should not be committing from the client; use the
<autoCommit> and <autoSoftCommit> options in solrconfig.xml.
See:
http://searchhub.org/2013/08/23/understanding-transaction-logs-softcommit-and-commit-in-sorlcloud/
Best,
Erick
On Mon, May 5, 2014 at 8:12 AM, Hakim Benoudjit wrote:
> Is there an option in
On 5/5/2014 9:02 AM, blach wrote:
> It's not an error if you see my code, there is a catch statement, which
> contains the "FAIL" message, it does always show it.
In your code, you are not printing the stack trace or throwing the
exception. If you want to see it in your own code, you'll need to
i
"dismax" means Disjunction Maximum, which means Lucene takes the highest
scoring clause (field), for each search term. This is effectively an OR of
the clauses.
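A small Python sketch of the "highest scoring clause wins" idea, for one search term. The per-field scores are made-up numbers, and dismax_score is a hypothetical helper rather than a Solr or Lucene API; the tie parameter mirrors the tie-breaker multiplier that lets the non-winning fields contribute a little.

```python
def dismax_score(field_scores, tie=0.0):
    """Combine per-field scores for ONE search term the way a
    disjunction-max query does: the best field wins, and the remaining
    fields contribute only through the tie-breaker multiplier."""
    if not field_scores:
        return 0.0
    best = max(field_scores)
    others = sum(field_scores) - best
    return best + tie * others


# Hypothetical scores of one term in three different fields.
scores = [0.8, 0.5, 0.1]
print(dismax_score(scores))           # only the highest-scoring field counts
print(dismax_score(scores, tie=0.1))  # other fields add a small amount
```

With tie left at 0, a document that matches the term in several fields scores the same as one that matches only in the best field.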
-- Jack Krupansky
-Original Message-
From: nativecoder
Sent: Monday, May 5, 2014 11:21 AM
To: solr-user@lucene.apache.org
S
Yes, I'm reading about SolrJ now.
I wrote this code for it, but it's the same problem; in this case the whole app
stops. This is the code:
String urlString =
"http://localhost:8983/solr";
SolrServer solr = new HttpSolrServer(urlString);
On 5/5/2014 11:05 AM, blach wrote:
> I wrote this code for it, but its the same problem, in this case all
> the app is stopping, this is the code String urlString =
> "http://localhost:8983/solr";; SolrServer solr = new
> HttpSolrServer(urlString);
>
> SolrQuery query = new SolrQuery(); query.set("
Solr 4.7.2 (and 4.6.1)
Tomcat 7.0.52
Java 1.7.0_45 (and _55)
I'm getting some really odd behavior with some XSLT documents. I've been
doing some upgrades to Java & Solr and I'm trying to narrow down where the
problems are happening.
I have a few XSLT docs that I put into the conf/xslt directory
Thanks Nicole. Leveraging dynamic field definitions is a great idea. It will probably
work for me, as I have a bunch of fields which are indexed as "String". Just
curious about the sharding: are you using SolrCloud? I thought of taking
the dedicated shard / core route , but then, as using a composite key (fo
I've tried it, and it worked: I let Solr do the commits instead of my Solr
client.
In solrconfig.xml:
autoCommit maxTime has been set to 5 minutes and autoSoftCommit maxTime to
something bigger.
Thanks a lot guys!
2014-05-05 16:30 GMT+01:00 Erick Erickson :
> You should not be committing from t
That answer helps a lot
Where would the OR clause be ?
(Exact_Field1:samplestring1 *OR* Exact_Field1:samplestring2) AND
(Exact_Field2:samplestring1 *OR* Exact_Field2:samplestring2) AND
(Field1:samplestring1 *OR* Field1:samplestring2) AND (Field2:samplestring1
*OR* Field2:samplestring2)
Please n
Shot in the dark: perhaps you have a doc w/o a value in the description
field, which means the xsl:variable's select doesn't match anything; which
perhaps means that your XSLT engine then leaves the variable undefined.
: Solr 4.7.2 (and 4.6.1)
: Tomcat 7.0.52
: Java 1.7.0_45 (and _55)
:
: I'm
I found out that "storing" documents as separate docs+id does not
help either.
You must have a completely separate collection/core to get things to work fast.
Kind regards,
Jochen
Quote from Jochen Barth :
Ok, https://wiki.apache.org/solr/SolrPerformanceFactors
states that: "Retrieving the
Thank you Shawn
I did what you told me. Now this is my code:
import org.apache.solr.client.solrj.SolrQuery;
import org.apache.solr.client.solrj.SolrServer;
//import org.apache.solr.client.solrj.SolrServer;
import org.apache.solr.client.solrj.SolrServerException;
import org.apache.solr.client.s
On 5/5/2014 12:17 PM, blach wrote:
> Thank you Shawn
>
> I did what you told me. now this is my code:
> it gives me error that org.apache.solr.client.solrj is not found
I don't know how to do classpath management in the Android environment.
You'll need to add the solrj jar to your applica
: Hi everybody
: can anyone give me a suitable interpretation for cat_rank in
: http://people.apache.org/~hossman/ac2012eu/ slide 15
Have you seen the video?
http://vimeopro.com/user11514798/apache-lucene-eurocon-2012/video/55822630
That slide starts ~ 23:00 and i go through a descriptio
Checked that first -- it's a test site with a small sample size. The field
is set in all of the items. And refreshing the query a few times can yield
either result (with/without the error).
I'm reverting back to an old version of my stack (my code, plus tomcat &
solr), I'll step through my previ
I have included the reference to this library correctly, but it is still giving
me the same error.
feeling
Take a look through the article I linked, 5 minutes may be an issue
since the transaction log will hold all 5 minutes worth of input. In
batch processes this can be quite a bit of data. Worse, when a Solr
instance terminates unexpectedly, the entire transaction log can be
replayed.
Consider settin
As per the stemming docs (
https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#Stemming ), I want
to score the original term higher than the stemmed version by adding:
to a field type that is already created (with Stemming). I have 100M documents
in this index, and it gets slo
The full details are farther down in the stack...
: null:org.apache.solr.common.SolrException: SolrCore 'master' is not
: available due to init failure: Error initializing QueryElevationComponent.
...
: Caused by: org.apache.solr.common.SolrException: Error initializing
: QueryElevationCo
How could Solr accomplish an end-user behavior like a strict search?
Let’s say an end-user decides to use quotation marks in their keywords to
provide specificity in their search results.
Current:
If you were to query: your future, then 10 results would return and print to
the page.
Expected:
Hello,
I have a weird relevancy requirement. We search news content hence
chronology is very important and also relevancy, although both are mutually
exclusive. For example, if the search terms are - malaysia airline crash
blackbox - my requirements are as follows
docs containing all word
I haven't personally used this technique, but I gather that the intent is
that the unstemmed term will have a lower term frequency (more unique) than
the stemmed term which may generate the same stemmed term from a number of
different source terms.
To answer your question, no, you don't need a
The recip function query is the proper way to boost by reverse chronological
order, but you may have to play around with the boost factor so that date
does not completely overwhelm the natural relevancy.
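To get a feel for how such a boost decays with age, here is a Python sketch of the arithmetic behind recip(x, m, a, b), which evaluates to a / (m * x + b). The constants m=3.16e-11, a=1, b=1 are assumptions (a commonly quoted example where x is the document age in milliseconds, so a one-year-old document gets roughly half the boost of a new one); they are not values from this thread, and recency_boost is a hypothetical helper.

```python
def recip(x, m, a, b):
    """Same arithmetic as Solr's recip(x, m, a, b): a / (m * x + b)."""
    return a / (m * x + b)


MS_PER_DAY = 24 * 60 * 60 * 1000


def recency_boost(age_days, m=3.16e-11, a=1.0, b=1.0):
    """Boost for a document that is age_days old (age passed in as ms)."""
    return recip(age_days * MS_PER_DAY, m, a, b)


# Print how the boost falls off as documents get older.
for age in (0, 30, 365, 3650):
    print(age, round(recency_boost(age), 3))
```

Raising a and b together flattens the curve, which is one way to keep the date from overwhelming the text relevancy score.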
Use the debugQuery=true parameter and look at the "explain" section to see
what the docume
Hi Ravi,
Regarding recency please see :
http://www.slideshare.net/lucenerevolution/potter-timothy-boosting-documents-in-solr
Regarding "docs containing all words", there is a function query that elevates
those docs to the top. Search past posts on the mailing list.
Ahmet
On Tuesday, May 6, 2014 1
Hi Reyes,
I think your question is not clear.
Please see : https://wiki.apache.org/solr/UsingMailingLists
Ahmet
On Tuesday, May 6, 2014 12:23 AM, "Reyes, Mark" wrote:
How could Solr accomplish an end-user behavior like a strict search?
Let’s say an end-user decides to use quotation marks i
Hi,
I am trying to plot a non-date field by time in order to draw a histogram
showing its evolution during the week.
For example, if I have a tweet index:
Tweet:
date
retweetCount
3 tweets indexed:
Tweet | Date  | Retweet
A     | 01/01 | 100
B     | 01/01 | 100
C     | 01/02 | 100
If I
Okay, let's try it this way…
CURRENTLY:
Step 1: Type, your future into the search bar.
Step 2: 10 search results return.
I'D LIKE TO SEE THIS:
Step 1: Type, "your future" into the search bar.
Step 2: 1 search result returns.
Can this be accomplished through the Solr UI?
Thanks,
Mark
On 5/5/14
The term "strict search" is not in the Lucene/Solr nomenclature - it could
mean any number of things.
It sounds as if maybe you want to do a phrase search, looking for an exact
phrase - yes, you can do that by enclosing the phrase in quotes.
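A minimal Python sketch of the difference between the two requests, assuming the search box text is passed straight into the q parameter. The helper phrase_query is hypothetical; it wraps the text in double quotes and backslash-escapes any quotes or backslashes already in it.

```python
from urllib.parse import urlencode


def phrase_query(text):
    """Wrap user text in double quotes so the query parser treats it as one
    phrase. Embedded backslashes and double quotes are backslash-escaped."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


# Without quotes the terms are searched individually.
loose = urlencode({"q": "your future"})
# With quotes the terms must appear together as a phrase.
strict = urlencode({"q": phrase_query("your future")})
print(loose)
print(strict)
```

The quoted form is what typically narrows ten loose matches down to the documents that contain the exact phrase.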
-- Jack Krupansky
-Original Message-
From
I'm using Solr to create an image search functionality that allows users to
search for an existing image in the site to add to new content. A given piece
of content has a field that can store multiple images, so I will need to use a
multi-value Solr field to store image data. Currently, I'm sto
: My understanding is that DynamicField can do something like
: FOO_BAR_TEXT_* but what I really need is *_TEXT_* as I might have
: FOO_BAR_TEXT_1 but I also might have WIDGET_BAR_TEXT_2. Both of those
: field names need to map to a field type of 'fullText'.
I'm pretty sure you can get what you
My schema.xml:
id
name
update docs:
"docs": [
{
"name": "苹果4s",
"type": "手机",
Alexandre, you could use something like
http://blog.sematext.com/2012/09/25/new-tool-jmxc-jmx-console/ to quickly
dump everything out of JMX and see if there is anything there Solr Admin UI
doesn't expose. I think you'll find there is more in JMX than Solr Admin
UI shows.
Otis
--
Performance Moni
Thanks Otis,
JMXC looks interesting, though I cannot seem to find the "Open Source"
section on your website it used to link to.
Regards,
Alex.
Personal website: http://www.outerthoughts.com/
Current project: http://www.solr-start.com/ - Accelerating your Solr proficiency
On Tue, May 6, 2014
If you are looking for that level of understanding, you are best off
enabling the debug flag. Then you will get a full breakdown of what
matched which field and why, including scores, preferences, etc.
Possibly with debug.explain.structured enabled:
http://wiki.apache.org/solr/CommonQueryParameters#d
You can have two parallel multi-value fields and as long as you don't
introduce null/empty values, they will be kept together. However, for
recent Solr (4.7? certainly 4.8), you may want to look at parent/child
entries and join/parent/child queries.
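A short Python sketch of the parallel-field approach on the client side: values at the same position in the two multi-valued fields belong together. The field names image_url and image_caption and the helper pair_parallel_fields are hypothetical.

```python
def pair_parallel_fields(doc, first, second):
    """Zip two parallel multi-valued fields of a document back into
    (first, second) pairs. Raises if the lists differ in length, which
    would mean a value was dropped and positions no longer line up."""
    a, b = doc.get(first, []), doc.get(second, [])
    if len(a) != len(b):
        raise ValueError("parallel fields are out of step")
    return list(zip(a, b))


# Hypothetical document with two images stored in parallel fields.
doc = {
    "id": "content-1",
    "image_url": ["/img/a.png", "/img/b.png"],
    "image_caption": ["First image", "Second image"],
}
pairs = pair_parallel_fields(doc, "image_url", "image_caption")
print(pairs)
```

The length check is the reason empty values are a problem: one skipped caption would silently shift every later caption onto the wrong image.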
Regards,
Alex.
Personal website: http://www.oute
Hi iorixxx, I'm Frankcis, not Victor. Did you send this to the wrong email address?
2014-05-05 23:20 GMT+08:00 iorixxx [via Lucene] <
ml-node+s472066n4134713...@n3.nabble.com>:
> Hi Victor,
>
> I don't know mysolr, I assume you are using /update/json, lets add your
> chain to defaults section.
>
>
>
>
You can do phrase search explicitly with quotes. Or you could look at
something like Term query parser:
https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-TermQueryParser
You can also enable autoGeneratePhraseQueries on the field type to try
the phrase queries, but that's
Can you sort by score, then date? Assuming similar articles will get
same score (may need to discount frequency/length).
There is also QueryRescore API introduced in Lucene 4.8 that might be
relevant. Though I have no idea how that would get exposed in Solr.
Regards,
Alex.
Personal website: ht
Hmmm, I _think_ pivot faceting works here. One dimension would be day
and the other retweet count. The response will have the number of
retweets per day, you'd have to sum them up I suppose.
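The summing step could look roughly like this in Python. It assumes the pivot section of the response is a list of entries with "value", "count" and a nested "pivot" list, and that a field holding the date truncated to the day exists (called day here, which is a hypothetical name); the sample data mirrors the three tweets from the question.

```python
def retweets_per_day(pivot_entries):
    """Sum retweet counts per day from a day,retweetCount pivot.

    Each inner entry says: `count` tweets had a retweet count of `value`,
    so it contributes value * count to that day's total.
    """
    totals = {}
    for day in pivot_entries:
        totals[day["value"]] = sum(
            inner["value"] * inner["count"] for inner in day.get("pivot", [])
        )
    return totals


# Assumed response shape for the pivot "day,retweetCount".
sample_pivot = [
    {"field": "day", "value": "01/01", "count": 2,
     "pivot": [{"field": "retweetCount", "value": 100, "count": 2}]},
    {"field": "day", "value": "01/02", "count": 1,
     "pivot": [{"field": "retweetCount", "value": 100, "count": 1}]},
]
print(retweets_per_day(sample_pivot))
```

Note that the inner pivot has one bucket per distinct retweet count, so this can get large if the counts vary a lot.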
Best,
Erick
On Mon, May 5, 2014 at 3:18 PM, Romain wrote:
> Hi,
>
> I am trying to plot a non date field
I mark all the filters that support wildcards with (multi) on my list:
http://www.solr-start.com/info/analyzers/ . I use the actual interface
markers to derive that list, so it should be the most up to date.
Regards,
Alex.
Personal website: http://www.outerthoughts.com/
Current project: http://www.sol
You haven't answered _why_ this is a good idea. I'm having a hard
time understanding what would be _useful_ about sorting this way. Just
because the sum of scores in a group is greater than the sum of scores
in another says _nothing_ about how relevant any of the docs in the group
are relative to e
Thank you, Erick, you're right: the maxScore of the documents within each group is
more effective than the sum of scores in a group, especially in use cases
like the one you describe (group 1 could have 10M documents all with a score of
.01 and group 2 could have 1 document with a score of 1,000 and group
Frankly, I really don't know how to make that happen. I took a quick
look at the function query stuff (I don't have them all memorized yet)
and I just can't seem to make them bend that way.
I can imagine writing custom code to make it work but I don't really
know how much effort would be involved
We are using Solr to index PDF documents, but in some cases the PDFs
are scanned documents with no text to extract and index.
Is there a plugin or module for Solr that we can integrate so that it would
extract the text via OCR and then index it?
Thanks in advance
Chandan Tamraka
Thank you, Erick, you're a good man.
This is the client requirement:
In the forum, there is a lot of discussion content under different
subjects. Searching for a keyword
returns results where the content or the subject matches the
query; these documents are then grouped by subject,
Nothing I am aware of for Solr directly. You may have better luck
chasing this on the Tika mailing list, as that's what Solr uses under
the covers to index PDFs otherwise. Doing a quick search for Tika and OCR
brings up a number of links.
Regards,
Alex.
Personal website: http://www.outerthoughts.com/
Cur
The dates won't match unless you truncate all of them to the day. But then if
you want to have slots of 15 minutes it won't work, as you would need to
truncate the dates to every 15 minutes in the index.
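To make the slot idea concrete, here is a small client-side Python sketch (not a Solr feature) that truncates timestamps to the start of their 15-minute slot and sums a second field per slot. The sample timestamps and the helper names slot_start and sum_by_slot are hypothetical, and the truncation only behaves as intended for slot sizes that divide 60.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def slot_start(ts, minutes=15):
    """Truncate a datetime down to the start of its N-minute slot.
    Only meaningful when `minutes` divides 60 evenly."""
    return ts - timedelta(
        minutes=ts.minute % minutes,
        seconds=ts.second,
        microseconds=ts.microsecond,
    )


def sum_by_slot(tweets, minutes=15):
    """Sum the retweet counts of (timestamp, retweets) pairs per slot."""
    totals = defaultdict(int)
    for ts, retweets in tweets:
        totals[slot_start(ts, minutes)] += retweets
    return dict(totals)


# Hypothetical sample: two tweets in the 10:00 slot, one in the 10:15 slot.
tweets = [
    (datetime(2014, 5, 5, 10, 1), 100),
    (datetime(2014, 5, 5, 10, 14), 100),
    (datetime(2014, 5, 5, 10, 16), 100),
]
print(sum_by_slot(tweets))
```

Doing this in the client keeps the index unchanged, at the cost of fetching every matching document rather than a pre-aggregated result.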
In ES, they have 1 field to make the slots and 1 field to insert into the
bucket, e.g.:
{
"query" :
Thanks a lot Shawn for the help!
We have given Solr a dedicated server, and the RAM size is 650 MB.
This didn't happen when I was doing it locally.
I have seen the same problem in the Sphinx framework, but it was solved using
a feature called "rotate",
and we were able to search the QUERY w
75 matches