Hi
I have already configured the Tomcat instance as per the link
http://wiki.apache.org/solr/SolrTomcat for the URI Charset Config
The necessary updates have been made in Tomcat's conf/server.xml with
URIEncoding="UTF-8".
Thank you for your reply.
Sanjailal KP
--
On 5/22/12, Lance Norskog wrote:
>
There are many steps that can go wrong. Your platform should have
UTF-8 as its default encoding. Windows and Mac OS don't do this. I had
to configure Chrome to use UTF-8 as its default display encoding.
Also, if you use Tomcat, it has to be configured for UTF-8:
http://wiki.apache.org/solr/Solr
Hi,
Thank you so much for replying.
The MySQL database server is running on a Fedora Core 12 Machine with Hindi
Language Support enabled. Details of the database are - ENGINE=MyISAM and
DEFAULT CHARSET=utf8
Data is imported using the Solr DataImportHandler (mysql jdbc driver).
In the schema.xm
You should define your search first.
If the site is www.google.com, how do you want to match it: full-string
matching or partial matching? E.g., should "google" match? If it
should, you should write your own analyzer for this field.
On Tue, May 22, 2012 at 2:03 PM, Shameema Umer wrote:
> Sorry,
> Please
Sorry,
Please let me know how I can search by site name using the Solr query syntax.
My results should show title, url and content.
Title and content are being searched even though the
content.
I need the url or site name too. Please help.
Thanks in advance.
On Tue, May 22, 2012 at 11:05 AM, ketan kor
On 22 May 2012 05:12, sudarshan wrote:
[...]
> class="solr.CSVRequestHandler"
> startup="lazy" />
[...]
> Response:
>
>
>
> Error 400 Unexpected character 'b' (code 98) in prolog; expected
> '<'
> at [row,col {unknown-source}]: [1,1]
>
>
> HTTP ERROR 400
: Interesting. Even though omitTf=true would give strict enforcement,
: wouldn't it affect the relevancy? I am wondering whether the ordering
: amongst the three-word matches would not be as good as it would be when we
: have omitNorms=true&omitTf=true. Do you have an idea?
It will *absolutely*
: No, it's not just one single query, rather, as I've mentioned before, it's
: combination of searches with result count for each combination. Explained
: in detail below:
: 1) ("SQL Server" OR SQL)
: 2) ("Visual Basic" OR VB.NET)
: 3) (Java AND JavaScript)
: 4) ("SQL Server" OR SQL) AND ("Visual
Interesting. Even though omitTf=true would give strict enforcement,
wouldn't it affect the relevancy? I am wondering whether the ordering
amongst the three-word matches would not be as good as it would be when we
have omitNorms=true&omitTf=true. Do you have an idea?
On Mon, May 21, 2012 at 8:51
: - /suggest?q=suggest_terms:lap*&fq=type:P&fq=(-type:B)
: numFound=1
: doc, so adding a doc will also fulfill right?
: /suggest?q=suggest_terms:lap*&fq=type:P&fq=(-type:B OR name:aa)
: numFound=0
:
: is there a logical explanation??
http://www.lucidimagination.com/blog/2011/12/28/why-not-and-o
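A rough way to picture the numFound=0 result quoted above, assuming the usual Lucene behaviour that a prohibited clause only removes documents which some other clause has already selected. This is a toy model in Python, not Solr code; the sample document and the helper names (`matches`, `DOCS`, and so on) are hypothetical and exist only for illustration.

```python
def matches(docs, should=(), must_not=()):
    """Toy model of a boolean query: keep documents selected by at least one
    optional clause, then drop any that hit a prohibited clause."""
    selected = [d for d in docs if any(clause(d) for clause in should)]
    return [d for d in selected if not any(clause(d) for clause in must_not)]


# Hypothetical index content: one document of type P, no document named "aa".
DOCS = [{"type": "P", "name": "laptop"}]


def is_type_b(doc):
    return doc["type"] == "B"


def is_name_aa(doc):
    return doc["name"] == "aa"


def match_all(doc):
    return True


def not_type_b(doc):
    # Models the sub-query (*:* -type:B): match everything, then subtract type B.
    return bool(matches([doc], should=[match_all], must_not=[is_type_b]))


# fq=(-type:B): assumed here to be handled as "everything except type:B".
only_negative = matches(DOCS, should=[match_all], must_not=[is_type_b])

# fq=(-type:B OR name:aa): the only selecting clause is name:aa, which matches
# nothing, so the prohibited clause has nothing left to subtract from.
mixed = matches(DOCS, should=[is_name_aa], must_not=[is_type_b])

# fq=((*:* -type:B) OR name:aa): the negative clause gets its own match-all.
fixed = matches(DOCS, should=[not_type_b, is_name_aa])

print(len(only_negative), len(mixed), len(fixed))
```

If this model holds, rewriting the filter as fq=((*:* -type:B) OR name:aa) may give the result the poster expected.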
Why do you want to control what gets indexed into a core and then
knowing what core to search? That's the kind of "knowing" that SolrCloud
solves. In SolrCloud, it handles the distribution of documents across
shards and retrieves them regardless of which node is searched from.
That is the point of
The php.net plugin is the best. SolrPHPClient is missing several features.
Sent from my Mobile device
720-256-8076
On May 21, 2012, at 6:35 AM, Tolga wrote:
> Hi,
>
> Can you recommend a good PHP UI to search? Is SolrPHPClient good?
: I want strict enforcement so that, in the case of a 3-word search, those
: results that match all 3 terms are presented ahead of those that match
: 2 terms when I set mm=2.
:
: I have seen quite a few cases where results that match 2 out of 3
: words appear ahead of those matching
Specifically if I'm doing a query using the solr mlt handler
(http://wiki.apache.org/solr/MoreLikeThisHandler) and stream.body to supply the
source doc is there any way to boost result documents based on document age?
I already know how to do that for a regular query using dismax
(http://wiki.
Hi Guys,
I use the following commands to start SolrCloud, according to the SolrCloud wiki.
yydzero:example bjcoe$ java -Dbootstrap_confdir=./solr/conf
-Dcollection.configName=myconf -DzkRun -DnumShards=2 -jar start.jar
yydzero:example2 bjcoe$ java -Djetty.port=7574 -DzkHost=localhost:9983 -jar
start.jar
: I should clarify the error a bit. When I make a select request on my first
: core (called core0) using the wt=json parameter I get a 400 response with
: the explanation "undefined field: gid". The field gid is not defined in the
: schema.xml file of my first core. But, it is defined in the schem
My favourite php library is solarium. Everything OOP. I've tried a few.
http://www.solarium-project.org/
Sent from my iPhone
On 21/05/2012, at 6:44 PM, Johannes Goll wrote:
> yes, I am using this library and it works perfectly so far. If
> something does not work you can just modify it
> http
: Subject: Date format in the schema.xml
: References: <1336981696.60953.yahoomailclas...@web121705.mail.ne1.yahoo.com>
: In-Reply-To: <1336981696.60953.yahoomailclas...@web121705.mail.ne1.yahoo.com>
https://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists
When starting
: Subject: Newbie with Carrot2?
: References: <35E48F3294A0416A8F476E9C173321F3@msrvcn04>
: In-Reply-To: <35E48F3294A0416A8F476E9C173321F3@msrvcn04>
https://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists
When starting a new discussion on a mailing list, please do not
yes, I am using this library and it works perfectly so far. If
something does not work you can just modify it
http://code.google.com/p/solr-php-client/
Johannes
2012/5/21 Tolga :
> Hi,
>
> Can you recommend a good PHP UI to search? Is SolrPHPClient good?
Hi,
Using curl -u will only attempt to log in to Jetty/Solr, which is not password
protected I assume. What you really would like is for the HTTP call which Solr
does based on stream.url to attempt a login. Such functionality is not
implemented as far as I know. You may try the syntax
stream.u
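One possible workaround, sketched under assumptions (it is not necessarily what the truncated reply above goes on to suggest): download the protected document on the client side and send the bytes to the extract handler as the request body, so that Solr never has to log in anywhere. The sketch assumes the remote site uses HTTP Basic authentication; the function names are hypothetical, and only the Solr URL and parameters come from the original post.

```python
import base64
import urllib.parse
import urllib.request

# Extract handler URL as given in the original post.
SOLR_EXTRACT = "http://localhost:8983/solr/update/extract"


def basic_auth_header(username, password):
    """Return the value of an HTTP Basic Authorization header."""
    raw = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


def build_extract_url(doc_id, commit=True):
    """Build the /update/extract URL with literal.id and commit parameters."""
    params = {"literal.id": doc_id, "commit": "true" if commit else "false"}
    return SOLR_EXTRACT + "?" + urllib.parse.urlencode(params)


def fetch_protected(url, username, password):
    """Download the document ourselves (assumption: the site uses Basic auth)."""
    request = urllib.request.Request(url)
    request.add_header("Authorization", basic_auth_header(username, password))
    with urllib.request.urlopen(request) as response:
        content_type = response.headers.get("Content-Type", "application/octet-stream")
        return response.read(), content_type


def post_to_solr(doc_id, body, content_type):
    """Send the downloaded bytes to Solr as the raw request body."""
    request = urllib.request.Request(build_extract_url(doc_id), data=body, method="POST")
    request.add_header("Content-Type", content_type)
    with urllib.request.urlopen(request) as response:
        return response.status


# Usage (needs a running Solr and a reachable site, so it is left commented out):
# body, ctype = fetch_protected("http://somewebsite.com/docs/DOC2609", "username", "password")
# post_to_solr("doc900", body, ctype)
```

If the site uses a form-based login rather than Basic authentication, the fetch step would need to be replaced with whatever that login requires.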
Hi,
I want to index emails using Solr. I put the user name, password, and hostname
in data-config.xml under the mail folder. This is a valid email, but when I open
the URL http://localhost:8983/solr/mail/dataimport?command=full-import it
said cannot access mail/dataimporter reason: no found. But when I run
I want to index an HTTP document that is password protected.
It has a username login.
I tried doing this:
curl -u username:password
"http://localhost:8983/solr/update/extract?literal.id=doc900&commit=true" -F
stream.url=http://somewebsite.com/docs/DOC2609
but it just indexes the login
I am having a situation where I need to join 3 tables to pull the required
information. Can anyone throw me some ideas!!!
select A.sid, B.cid, C.NAME
from table1 A, table2 B, table3 C
where A.sid= C.sid
and
A.oid = B.oid
and C.typeid = 5
and C.flag = 0
and B.cid= 1000;
Can you please provide sch
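The query as posted is already a valid three-table join. For readers who find the comma-style FROM clause hard to follow, here is the same join written with explicit JOIN ... ON clauses and run against an in-memory SQLite database. The column types and the sample rows are made up purely for illustration; only the table names, column names, and filter values come from the post.

```python
import sqlite3

# Same join as in the post, with the join conditions moved into ON clauses.
QUERY = """
SELECT A.sid, B.cid, C.NAME
FROM table1 A
JOIN table3 C ON A.sid = C.sid
JOIN table2 B ON A.oid = B.oid
WHERE C.typeid = 5
  AND C.flag = 0
  AND B.cid = 1000
"""


def demo():
    """Create hypothetical tables, load two sample rows each, run the join."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE table1 (sid INTEGER, oid INTEGER);
        CREATE TABLE table2 (oid INTEGER, cid INTEGER);
        CREATE TABLE table3 (sid INTEGER, NAME TEXT, typeid INTEGER, flag INTEGER);
        INSERT INTO table1 VALUES (1, 10);
        INSERT INTO table1 VALUES (2, 20);
        INSERT INTO table2 VALUES (10, 1000);
        INSERT INTO table2 VALUES (20, 2000);
        INSERT INTO table3 VALUES (1, 'alpha', 5, 0);
        INSERT INTO table3 VALUES (2, 'beta', 5, 0);
        """
    )
    rows = conn.execute(QUERY).fetchall()
    conn.close()
    return rows


print(demo())
```

Only the first sample row survives the B.cid = 1000 filter, which is the same behaviour the original WHERE-clause form would give.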
Hi,
I believe, in your "colored" fragmentsBuilder definition, you have not
mentioned anything in your pre and post tags and that may be the reason
that you are getting snippets of text, without highlighting.
Please refer to http://wiki.apache.org/solr/HighlightingParameters and check
the "hl.fragment
: Is there a way to not only get the number of times a term appears for
: a particular field (faceting) as well as the number of documents that
: were associated with a particular term? So for instance if I had the
: following docs
Nope... faceting is associated with _sets_ of documents, so ther
For the fragListBuilder
it's
fragment builder is
70
0.5
[-\w ,/\n\"']{20,200}
Thanks!
--
View this message in context:
http://lucene.472066.n3.nabble.
Hello,
Was wondering how to access the cluster labels, and docs(ids) via SolrJ?
I have added the following:
query.setParam("q", userQuery);
query.setParam("clustering", true);
query.setParam("qt", "/core2/clustering");
query.setParam("carrot.title", "title");
But how to access
Hi,
Can you please provide the definitions of the following 3 objects from your
solrconfig.xml ?
simple
colored
regex
For example,
the "simple" hl.fragListBuilder should be defined as mentioned below in
your solrconfig.xml
On Mon, May 21, 2012 at 2:06 PM, 12rad wrote:
> The field I am trying
: I am using solr 1.3 with jdk 1.5.0_14 and weblogic 10MP1 application server
: on Solaris. I use embedded solr server. More details :
FWIW: Solr 1.3 is *REALLY* old ... do not be surprised if much of the info
you are given (or read) doesn't apply.
: - some mail threads on this forum seem to ind
Ok, it seems that a maven dependency to zookeeper version 3.3 broke this.
Now it connects to the zk instance.
Thanks.
On Mon, May 21, 2012 at 5:31 PM, Daniel Brügge <
daniel.brue...@googlemail.com> wrote:
> Thanks for your feedback. I don't know.
>
> I've tried just now with the newest trunk ver
Hi,
my use case here is to search all the incoming documents for certain
predetermined combinations of words. So what I am doing here is to
create a batch of x docs according to their creation date, index them,
commit them, and search them with the (predetermined) query.
My question is, if I have t
Parvin,
What you are looking for is already available in the "bleeding edge",
unreleased version of Solr, which will become version 4.0 sometime later this
year. You can download it at [1] and test it out. The feature is called
SolrCloud [2] and it replaces the old replication mechanism in 1.x
The field I am trying to highlight is stored.
In the searchHandler I've set the parameters as follows:
on
text
5
1000
51
true
regex
simple
colored
1000
true
true
true
I still don't see any highl
Usually I would recommend trying to index all languages into one Solr core.
The determining factor for me is how much "overlap" there is in fields for
each language, i.e. how many common fields for each language. For example
if you have 60 common fields to all languages, but only 8 fields that are
What OS? I was just trying trunk and looking at that view on Chrome on OSX and
Linux and did not see an issue.
On May 21, 2012, at 1:15 PM, Phil Hoy wrote:
> After further investigation I have found that it is not a problem on firefox,
> only chrome and IE.
>
> Phil
>
> -Original Message
After further investigation I have found that it is not a problem on firefox,
only chrome and IE.
Phil
-Original Message-
Sent: 21 May 2012 18:05
To: solr-user@lucene.apache.org
Subject: trunk cloud ui not working
Hi,
I am running from the trunk and the localhost:8983/solr/#/~cloud pa
I think there is a way in sunspot to give an explicit name to a field so
that sunspot doesn't generate for field
names. I think it is the ":as" function, such as:
string :name, :as => :name_s
So, you can then refer to "name" in your ruby code and "name_s" will be the
field name in Solr.
--
On May 21, 2012, at 12:10 PM, Mark Miller wrote:
> I think the reason that you see a multiple values error when you try the
> other order is because of the lack of a document clone (the other issue I
> mentioned a few emails back). Addressing that won't solve your issue though
I take that back
https://issues.apache.org/jira/browse/SOLR-3473
-Original message-
> From:Mark Miller
> Sent: Mon 21-May-2012 18:11
> To: solr-user@lucene.apache.org
> Subject: Re: SolrCloud deduplication
>
> Looking again at the SignatureUpdateProcessor code, I think that indeed this
> won't currently
Looking again at the SignatureUpdateProcessor code, I think that indeed this
won't currently work with distrib updates. Could you file a JIRA issue for
that? The problem is that we convert update commands into solr documents - and
that can cause a loss of info if an update proc modifies the upda
Thanks all for the explanations.
Anderson
2012/5/21 Jack Krupansky
> And, generally when I see a field that has values like ".2231-7", it
> should be a "string" field rather than tokenized text. As a string, you can
> then do straight wildcards without surprises.
>
>
> -- Jack Krupansky
> -
Thanks for your feedback. I don't know.
I've tried just now with the newest trunk version and the embedded ZK on
port 9983.
In the logs of the zk-solr it shows:
*INFO: Accepted socket connection from /XXX.XXX.XXX.XXX:1055*
*May 21, 2012 3:27:34 PM org.apache.zookeeper.server.NIOServerCnxn doIO*
And, generally when I see a field that has values like ".2231-7", it
should be a "string" field rather than tokenized text. As a string, you can
then do straight wildcards without surprises.
-- Jack Krupansky
-Original Message-
From: Jack Krupansky
Sent: Monday, May 21, 2012 11:2
I changed the field type of the field to the following:
As you see, I just kept the WhitespaceTokenizerFactory. That works. Now I
can find it using "*2231?7", "*2231*7", "*2231-7",
"*2231*",".2231-7".
As far as I can see, with this tokenizer the text was not split. Is that
Before Solr 3.6, which added MultiTermAwareComponent for analyzers, the
presence of a wildcard completely short-circuited (prevented) the query-time
analysis, so you have to manually emulate all steps of the query analyzer
yourself if you want to do a wildcard. Even with 3.6, not all filters are
Hi.
In debug mode, the generated query was:
field:*2231-7
field:*2231-7
field:*2231-7
field:*2231-7
The analysis of indexing the text .2231-7 produces this result:
Index Analyzer .22317 .22317 .22317 .22317 #1;1322.
#1;7 .22317
And for search fo
Hi again,
It seemed to work fine but in the end duplicates are not overwritten. We first
run the SignatureProcessor and then the DistributedProcessor. If we do it the
other way around the digest field receives multiple values and throws errors.
Is there anything else we can do or another patch
Changing my field type to string for my uniquekey field solved the problem.
Thanks to Jack and Erik for the fix!
On May 18, 2012, at 5:33 PM, Jack Krupansky wrote:
> Typically the uniqueKey field is a "string" field type (your schema uses
> "text_general"), although I don't think it is supposed
Hi,
SOLR-2822 seems to work just fine as long as the SignatureProcessor precedes
the DistributedProcessor in the update chain.
Thanks,
Markus
-Original message-
> From:Mark Miller
> Sent: Fri 18-May-2012 16:05
> To: solr-user@lucene.apache.org; Markus Jelsma
> Subject: Re: SolrCl
My problem was gone. Thanks Staszek and Dawid!
koji
--
Query Log Visualizer for Apache Solr
http://soleami.com/
(12/05/21 18:11), Stanislaw Osinski wrote:
Hi Koji,
Dawid came up with a simple fix for this, it's committed to trunk and 3.6
branch.
Staszek
Have you looked at DataStax Enterprise?
On May 21, 2012 12:25 AM, "Parvin Gasimzade"
wrote:
> Hi,
>
> I am using solr with replication. I have one master that indexes data and
> two slaves which pulls index from master and responds to the queries.
>
> My question is, how can i create fault tolera
Ok, thanks a bunch!
I think the URLs are set up properly but we have sort of made our own
solrconfig files so it's probably the file handler then.
I will look into that, but I'm 99.999% sure that this was my problem.
Again, thank you for the quick reply!
-Original Message-
From: Erik Ha
We intend to have separate, language specific search UI.
At the moment we like solution with separate cores more because it is more
flexible. But as a rule flexibility costs in terms of performance and we would
like to know that price.
Jack, what did you mean by 'Managing a bunch of small and t
Quoting from the new trunk example schema:
"field names should consist of alphanumeric or underscore characters only
and
not start with a digit. This is not currently strictly enforced,
but other field names will not have first class support from all
components
and back compat
Yeah, a bug report would be good. But really this is a Sunspot bug report.
Field names should NOT have :'s in them. Field names should stick to standard
Java identifier rules, otherwise it's escaping madness.
You could try something like this as a workaround:
bq=_val_:"dynamicfield\:val
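The workaround above relies on escaping the colon with a backslash. A small helper for doing that programmatically might look like the sketch below. The character list is an assumption, loosely modelled on what SolrJ's ClientUtils.escapeQueryChars escapes; it is not an official definition, and the function name is hypothetical.

```python
# Characters assumed to have special meaning in the Lucene/Solr query syntax.
LUCENE_SPECIAL_CHARS = set('\\+-!():^[]"{}~*?|&;/')


def escape_query_chars(text):
    """Prefix every special character (and any whitespace) with a backslash."""
    escaped = []
    for ch in text:
        if ch in LUCENE_SPECIAL_CHARS or ch.isspace():
            escaped.append("\\")
        escaped.append(ch)
    return "".join(escaped)


# A Sunspot-style dynamic field name containing a colon.
print(escape_query_chars("dynamicfield:value1"))
```

Escaping helps on the query side only; sticking to field names without colons, as recommended above, avoids the problem altogether.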
Hi,
Can you recommend a good PHP UI to search? Is SolrPHPClient good?
On May 21, 2012, at 08:11 , Aleksander Akerø wrote:
> The CSS files from the browse GUI in Solr 3.6 do not seem to work properly
> when Solr is deployed with multiple cores, and I can’t figure out how to
> solve this. I know this has been an issue in Solr, but I thought it was
> fixed in the newe
Hey,
I'm running solr (3.5.0.2011.11.30.16.37.06) and have encountered what
I think is a bug with the boost function (bf) parameter.
I've used sunspot (for use of solr with rails) which allows managing
dynamic fields, which by default creates fields like
"dynamicfield:value1","dynamicfiel
Hi
The CSS files from the browse GUI in Solr 3.6 do not seem to work properly
when Solr is deployed with multiple cores, and I can't figure out how to
solve this. I know this has been an issue in Solr, but I thought it was
fixed in the newer versions.
Any answers or pointers on how to get
Hi Tomas,
My queries are complex: I am faceting on many fields and using highlighting,
boosts, etc. in the same query.
Autowarming takes a very long time, so I have removed it.
--
View this message in context:
http://lucene.472066.n3.nabble.com/problem-in-replication-tp39846
Is it possible that your text editor/display does not support UTF-8
encoding?
Assuming the data is properly encoded, do you have the encoding="UTF-8"
attribute in your DIH dataSource tag?
-- Jack Krupansky
-Original Message-
From: KP Sanjailal
Sent: Monday, May 21, 2012 7:37 AM
To:
Solr appears to force your UniqueKey field to be required even though you
don't have an explicit required="true" attribute.
As a debugging aid, try adding default="missing" to your "id" field
definition and then you can query on "id:missing" and see what data is being
indexed without an id. Bu
Hi,
Thank you so much for replying.
The MySQL database server is running on a Fedora Core 12 Machine with Hindi
Language Support enabled. Details of the database are - ENGINE=MyISAM and
DEFAULT CHARSET=utf8
Data is imported using the Solr DataImportHandler (mysql jdbc driver).
In the schema.xm
Hello all,
I am having a problem integrating UIMA into Solr.
I followed the steps below, provided in the README file shipped along with
UIMA, to integrate it into Solr.
Step1.
I set tags in solrconfig.xml appropriately to point to the jar files.
Step2.
modified my "schema.xml" adding the fi
Take a look at the "/browse" request handler in the example solrconfig.xml
and compare how it does highlighting to what you are doing. There are a lot
of little details, so maybe even one might be missing.
Also, you can only highlight stored fields, so make sure that "text" is
stored. In the S
> text:abstract&hl=true&hl.fl=text&f.text.hl.snippets=2&f.text.hl.fragsize=200&debugQuery=true
Three things to check:
1-) Check that your text field is declared as suitable for highlighting.
http://wiki.apache.org/solr/FieldOptionsByUseCase
2-) Increase hl.maxAnalyzedChars=Integer.MAX
3-) Increase Inte
I've never used nutch. Maybe someone else can help, or you can ask on
the nutch mailing list.
Greetings,
Kuli
Am 21.05.2012 12:50, schrieb Tolga:
Yes.
On 5/21/12 1:49 PM, Michael Kuhlmann wrote:
Am 21.05.2012 12:40, schrieb Tolga:
How do I verify it exists? I've been crawling the same site
Yes.
On 5/21/12 1:49 PM, Michael Kuhlmann wrote:
Am 21.05.2012 12:40, schrieb Tolga:
How do I verify it exists? I've been crawling the same site and it
wasn't giving an error on Thursday.
It depends on what you're doing.
Are you using nutch?
-Kuli
Am 21.05.2012 12:40, schrieb Tolga:
How do I verify it exists? I've been crawling the same site and it
wasn't giving an error on Thursday.
It depends on what you're doing.
Are you using nutch?
-Kuli
How do I verify it exists? I've been crawling the same site and it
wasn't giving an error on Thursday.
Regards,
On 5/21/12 1:20 PM, Michael Kuhlmann wrote:
Am 21.05.2012 12:07, schrieb Tolga:
Hi,
I am getting this error:
[doc=null] missing required field: id
[...]
I've got this entry in
Am 21.05.2012 12:07, schrieb Tolga:
Hi,
I am getting this error:
[doc=null] missing required field: id
[...]
I've got this entry in schema.xml:
What to do?
Simply make sure that every document you're sending to Solr contains
this id field.
I assume it's declared as your unique id fiel
I found this:
https://issues.apache.org/jira/browse/LUCENE-2236
So, it seems this feature is not supported in Solr 1.4 at all. Is there
any possible work around? If not, I'll have to consider splitting my
schema into two which will be quite a big change :(
- Ravish
On Mon, May 21, 2012 at 11:
Hi,
I am getting this error:
[doc=null] missing required field: id
request: http://localhost:8983/solr/update?wt=javabin&version=2
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrSer
Ahh, this is because I have to override DefaultSimilarity to turn off
tf/idf scoring? But this will apply to all the fields and general search
on text fields as well? Is there a way to apply custom similarity to
specific field types or fields only? Is there no way of turning TF/IDF off
without t
Hi Koji,
Dawid came up with a simple fix for this, it's committed to trunk and 3.6
branch.
Staszek
On Sun, May 20, 2012 at 5:15 PM, Koji Sekiguchi wrote:
> Hi Staszek,
>
> Thank you for the fix so quickly!
>
> As a trial, I set:
>
> org.apache.**
> solr.handler.clustering.**carrot2.**LuceneCar
Hi Parvin,
A fault-tolerant architecture is something you need to decide on based on your
requirements. At some point, manual
intervention may be required to recover from a crash. You need to see what percentage of
fault tolerance you can support. It certainly may not be 100. We could handle
sit