On Wed, Aug 26, 2009 at 10:24 AM, Bernadette Houghton <
bernadette.hough...@deakin.edu.au> wrote:
> We have an encoding problem with our solr application. That is, non-ASCII
> chars displaying fine in SOLR, but in googledegook in our application .
>
> Our tomcat server.xml file already contains UR
Hi Shalin, stupid question - I'm an apache/solr newbie - but how do I access
the JVM???
Regards
Bern
-Original Message-
From: Shalin Shekhar Mangar [mailto:shalinman...@gmail.com]
Sent: Wednesday, 26 August 2009 5:10 PM
To: solr-user@lucene.apache.org
Subject: Re: encoding problem
On
On Wed, Aug 26, 2009 at 12:42 PM, Bernadette Houghton <
bernadette.hough...@deakin.edu.au> wrote:
> Hi Shalin, stupid question - I'm an apache/solr newbie - but how do I
> access the JVM???
>
When you execute the java executable, just add -Dfile.encoding=UTF-8 as a
command line argument to the ex
On Tue, Aug 25, 2009 at 10:40 AM, bhaskar chandrasekar wrote:
> Hi,
>
> Can any one helpe me with the below scenario?.
>
> Scenario 1:
>
> Assume that I give Google as input string
> i am using Carrot with Solr
> Carrot is for front end display purpose
It seems like Carrot is the one making the
Thanks for your quick reply, Shalin.
Tomcat is running on my Windows machine, but does not appear in Windows
Services (as I was expecting it should ... am I wrong?). I'm running it from a
startup.bat on my desktop - see below. Do I add the Dfile line to the
startup.bat?
SOLR is part of the rep
On Tue, Aug 25, 2009 at 4:24 AM, Joe Calderon wrote:
> hello *, im currently faceting on a shingled field to obtain popular
> phrases and its working well, however ide like to limit the number of
> shingles that get created, the solr.ShingleFilterFactory supports
> maxShingleSize, can it be made t
On Wed, Aug 26, 2009 at 12:52 PM, Bernadette Houghton <
bernadette.hough...@deakin.edu.au> wrote:
> Thanks for your quick reply, Shalin.
>
> Tomcat is running on my Windows machine, but does not appear in Windows
> Services (as I was expecting it should ... am I wrong?). I'm running it from
> a st
Thanks for that.
it works now ;-)
Erik Hatcher-4 wrote:
>
>
> On Aug 25, 2009, at 6:35 PM, Britske wrote:
>> Moreover, I can't seem to find the actual code in FacetComponent or
>> anywhere
>> else for that matter where the {!ex}-param case is treated. I assume
>> it's in
>> FacetComponen
check this http://wiki.apache.org/solr/CoreAdmin
when you create a core you are allowed to use the same instance dir as
the old core just ensure that you give a different datadir
On Wed, Aug 26, 2009 at 3:05 PM, pavan kumar
donepudi wrote:
> Paul,
> Can you please guide me on which option i need
Do you mean the schema or the solrconfig.xml?
The request handler is configured in the solrconfig.xml and you can find
out more about this particular configuration in
http://wiki.apache.org/solr/DisMaxRequestHandler?highlight=(CategorySolrRequestHandler)|((CategorySolrRequestHandler)).
To u
These are the reasons why we are thinking on splitting and index via multi-core:
First of all all, we have an index of news which size is about 9G. As
we will keep aggregating news forever and ever and let users do free
text search on our system, we think that it will be easier for IT
crowd to man
Hi.
When indexing the string "Günther" with
HTMLStripWhitespaceTokenizerFactory (in analysis.jsp), I get two tokens,
"Gü" and "nther".
Is this a bug, or am I doing something wrong?
(Using a Solr nightly from 2009-05-29)
Anders.
For the installation of apache solr integration module in Drupal we need to
install solr.
The must do thing is we need to change the solr schema.xml and configure.xml
files with the files in apache solr integration module.
can any body explain the reason behind this change.
--
View this mess
If you are complaining about Web Application (other than SOLR) (probably
behind-the Apache HTTPD) having encoding problem - try to troubleshoot it
with Mozilla Firefox + Live Http Headers plugin.
Look at "Content-Encoding" HTTP response headers, and don't forget about
tag inside HTML...
-Fuad
I haven't been able to find what makes a function query count as a match
when used a part of a boolean query with Occur.MUST.
A Term query is simple, if the term is not found, it doesn't count as a
match. What's the equivalent for a function query? A score of zero (or less
than zero, as implied by
Hi Anders,
Sorry, I don't know this is a bug or a feature, but
I'd like to show an alternate way if you'd like.
In Solr trunk, HTMLStripWhitespaceTokenizerFactory is
marked as deprecated. Instead, HTMLStripCharFilterFactory and
an arbitrary TokenizerFactory are encouraged to use.
And I'd recomme
Hello all,
When I start up Solr from the example directory using start.jar, it seems to
start up, but when I go to the localhost admin url
(http://localhost:8983/solr/admin) I get a 404 (See message appended below).
Has the url for the Solr admin changed?
Tom
Tom Burton-West
---
Here
Hello!
Try running ant example and then run Solr.
--
Regards,
Rafał Kuć
> Hello all,
> When I start up Solr from the example directory using start.jar, it
> seems to start up, but when I go to the localhost admin url
> (http://localhost:8983/solr/admin) I get a 404 (See message appended
>
The servlet container (resin) where i deploy solr shows :
ERROR: transport error 202: bind failed: Address already in
use
ERROR: JDWP Transport dt_socket failed to initialize,
TRANSPORT_INIT(510)
JDWP exit error AGENT_ERROR_TRANSPORT_INIT(197): No transports
initialized
[../../../src/share/back/
Is Solr like a RDBMS in that I can have multiple programs querying and
updating the index at once, and everybody else will see the updates
after a commit, or do I have to something explicit to see others
updates? Does it matter whether they're using the web interface,
SolrJ with a
CommonsHttpSolrS
Once a commit occurs, all data added before it (by any & all clients) becomes
visible to all searches henceforth.
The "web interface" has direct access to Solr, and SolrJ remotely accesses that
Solr.
SolrEmbeddedSolrServer is something that few people should actually use. It's
mostly for embe
JDPA/JDWP are for remote debugging of SUN JVM...
It shouldn't be SOLR related... check configs of Resin...
-Fuad
http://www.tokenizer.org
-Original Message-
From: Licinio Fernández Maurelo [mailto:licinio.fernan...@gmail.com]
Sent: August-26-09 12:49 PM
To: solr-user@lucene.apache.org
Hi,
Can any one help me with the below scenario?.
Scenario 1:
Assume that I give Google as input string
i am using Carrot with Solr
Carrot is for front end display purpose
the issue is
Assuming i give "BHASKAR" as input string
It should give me search results pertaining to BHASKAR only.
I have the same situation now.
If I don't want to use http connection, so I need to use EmbeddedSolrServer
that what I think I need correct?
We have Master/slaves solr, the applications use slaves for search. The Master
only taking the new index from Database and slaves will pull the new index
You could implement a Data Import Handler "EntityProcessor". There are at
least 5 implementations I can see for you to learn from that come with Solr.
If Solr truly doesn't need to be up and running as a server to serve any
queries, then EmbeddedSolrServer will be fine.
~ David Smiley
Author
See my response to Paul Tomblin. You could use the existing DataImportHandler
"SqlEntityProcessor" for DB access. The DIH framework is fairly extensible.
BTW, I wouldn't immediately dismiss using HTTP to give data to Solr just
because you believe it will be slow without having tried it. Using
> I don't want or try not to use http connection from Database to Solr
Master because of network latency( very slow).
"network latency" does not play any role here; throughput is more important.
With separate SOLR instance on a separate box, and with separate java
application (SOLR-bridge) queryin
>
> Is Solr like a RDBMS in that I can have multiple programs querying and
> updating the index at once, and everybody else will see the updates after a
> commit, or do I have to something explicit to see others updates?
>
Yes, everyone gets to search on an existing index unless writes to the index
Hi Everyone,
When trying to utilize the new HTTP based replication built into Solr 1.4 I
encounter a problem. When I view the replication admin page on the slave all
of the master values are null i.e. Replicatable Index Version:null,
Generation: null | Latest Index Version:null, Generation: null.
Thanks.
The issue we have actually, it could be firewall issue more likely than network
latency, that's why we try to avoid to use http connection.
Fixing the firewall is not an option right now.
We have around 3 millions docs to load from DB to Solr master( first initial
load only) and subseque
You could have used your previous thread itself (
http://www.lucidimagination.com/search/document/31c1ebcedd4442b/exact_pattern_search_in_solr),
Bhaskar.
In your scenario one, you need an exact token match, right? You are getting
expected results if your field type is "text". Look for the
"WordDel
You'll probably want to call Solr commit, however you'll want to
call IW.flush underneath (via a new Solr commit flag?).
Yes, the Solr caches would be somewhat useless if you're calling
Solr commit/flush rapidly. See SOLR-1308 on improving caches for
NRT.
On Tue, Aug 25, 2009 at 7:22 PM, KaktuCha
Do you have firewall between DB and possible SOLR-Master instance? Do you
have firewall between Client application and DB? Such configuration is
strange... by default firewalls allow access to port 80, try to set port 80
for SOLR-Tomcat and/or configure AJP mapping for front-end HTTPD which you
mig
I have one quick question...
If in solrconfig.xml, if it says ...
${solr.abortOnConfigurationError:false}
does it mean defaults to false if it is not set
as system property?
Thanks,
Dharmveer
--
View this message in context:
http://www.nabble.com/%24%7Bsolr.abortOnConfigurationError%3Afalse
On Aug 26, 2009, at 3:33 PM, djain101 wrote:
I have one quick question...
If in solrconfig.xml, if it says ...
${solr.abortOnConfigurationError:false}abortOnConfigurationError>
does it mean defaults to false if it is
not set
as system property?
correct
I'm trying to figure out if Solr is the right solution for a problem I'm
facing. I have 2 data entities: P(arent) & C(hild). P contains up to 100
instances of C. I need to expose an interface that searches attributes of
entity C, but displays them grouped by parent entity, P. I need to include
fac
>then 2) get all P's by ID, including facet counts, etc.
>The problem I face with this solution is that I can have many matching P's
(10,000+), so my second query will have many (10,000+) constraints.
SOLR can automatically provide you P's with Counts, and it will be
_unique_...
Even if cardinal
We already opened port 80 from solr to DB so that's not the issue, but
httpd(port 80) is very flaky if there is firewall between Solr and DB.
We have Solr master/slaves env, client access the search thru slaves( master
only accept the new index from DB and slaves will pull the new indexes from
With this configuration probably preferred method is to run standalone Java
application on same box as DB, or very close to DB (in same network
segment).
HTTP is not a bottleneck; main bottleneck is
indexing/committing/merging/optimizing in SOLR...
Just as a sample, if you submit to SOLR batch o
On Wed, Aug 26, 2009 at 11:27 AM, Christophe
Biocca wrote:
> I haven't been able to find what makes a function query count as a match
> when used a part of a boolean query with Occur.MUST.
A function query matches all non-deleted documents.
-Yonik
http://www.lucidimagination.com
>I just worried that http will be a bottle neck, that's why I prefer JDBC
connection method.
- JDBC is a library for Java Application; it connects to Database; it uses
proprietary protocol provided by DB vendor in most cases, and specific port
number
- SolrJ is a library for Java Application; it c
No, we don't want to put at the same box as Database box.
Agree, that indexing/committing/merging and optimizing is the bottle neck.
I think it worths to try SolrJ with CommmonsHttpSolrServer option for now and
let's see what happened to load 3 millions docs.
Thanks
Francis
-Original Mess
Thanks for the response.
It's interesting because when I run jconsole all I can see is one
ReplicationHandler jmx mbean. It looks like it is defaulting to the first slice
it finds on its path. Is there anyway to have multiple replication handlers or
at least obtain replication on a per "slice"
The class SortableFloatFieldSource cannot be accessed from outside its
package. So it can't be used as part of a FunctionQuery.
Is there a workaround to this, or should I roll my own? Will it be fixed in
1.4?
Thanks for the response.
I will try CommonsHttpSolrServer for now.
Francis
-Original Message-
From: Fuad Efendi [mailto:f...@efendi.ca]
Sent: Wednesday, August 26, 2009 1:34 PM
To: solr-user@lucene.apache.org
Subject: RE: SolrJ and Solr web simultaneously?
With this configuration proba
While testing my code I discovered that my copyField with PatternTokenize
does not do what I want. This is what I am indexing into Solr:
2.0|Solr In Action
My copyField is simply:
field titleRaw is of type title_raw:
For my example
Hi,
I have a situation where a particular kind of document can be categorized in
different ways, and depending on the categories it is in it will have
different fields that describe it (in practice the number of fields will be
fairly small, but whatever). These documents will each have a full-text
Hi,
I am attempting to perform a faceted distributed search with manual
sorting of the value of a facet. Is this something which is possible
through a Solr query or would I be better off inserting a manual
weighting field and sort by that?
To clarify - I am performing a distributed search over
Hello,
I'm having an issue getting the master to replicate its index to the slave.
Below you will find my configuration settings. Here is what is happening: I can
access the replication dashboard for both the slave and master and I can
successfully execute HTTP commands against both of t
SortableFloatField works in function queries... it's just that
everyone goes through SortableFloatField.getValueSource() to create
them. Will that work for you?
-Yonik
http://www.lucidimagination.com
On Wed, Aug 26, 2009 at 6:23 PM, Christophe
Biocca wrote:
> The class SortableFloatFieldSource
>
> Will Solr sort by fields that are unindexed?
>
Unfortunately, No.
Cheers
Avlesh
On Thu, Aug 27, 2009 at 4:03 AM, Isaac Foster wrote:
> Hi,
>
> I have a situation where a particular kind of document can be categorized
> in
> different ways, and depending on the categories it is in it will hav
Is it also the case that it will not narrow by them?
Isaac
On Wed, Aug 26, 2009 at 8:59 PM, Avlesh Singh wrote:
> >
> > Will Solr sort by fields that are unindexed?
> >
> Unfortunately, No.
>
> Cheers
> Avlesh
>
> On Thu, Aug 27, 2009 at 4:03 AM, Isaac Foster >wrote:
>
> > Hi,
> >
> > I have a
>
> Is it also the case that it will not narrow by them?
If "narrowing" means faceting, then again a no.
Cheers
Avlesh
On Thu, Aug 27, 2009 at 6:36 AM, Isaac Foster wrote:
> Is it also the case that it will not narrow by them?
>
> Isaac
>
> On Wed, Aug 26, 2009 at 8:59 PM, Avlesh Singh wrote:
Hello,
My apologies, but there was a mix-up reserving our meeting location,
and we don't have access to it.
I'm very sorry, and beer is on me next month. Promise :)
Sent from my Internets
On Aug 25, 2009, at 4:21 PM, Bradford Stephens > wrote:
Hey there,
Apologies for this not going out
With a relational database, the approach that has been working for us
and many customers is to first give DataImportHandler a go. It's
powerful and fast. 3M docs should index in about an hour or less, I'd
speculate. But using DIH does require making access from Solr to the
DB server soli
While Andrzej's talk will focus on things at the Lucene layer, I'm
sure there'll be some great tips and tricks useful to Solrians too.
Andrzej is one of the sharpest folks I've met, and he's also a very
impressive presenter. Tune in if you can.
Erik
Begin forwarded message:
Fr
Solr sorts on indexed fields only, currently. And only a single value
per document per sort field (careful with analyzed fields, and no
multiValued fields).
Unwise and impossible - of course this depends on the scale you're
speaking of. How many documents? What types of fields? How sma
Hi,
When Solr retrives records based on a input match , it gives total count of
records.
Say for Ex , it displays like : 1 out of 20,000 for the particular search
string.
How the total count of records are fetched in Solr , does it refer any Schema
or XML file?.
Regards
Bhaskar
>
> How the total count of records are fetched in Solr , does it refer any
> Schema or XML file?.
>
Sorry, but I did not get you. What does that mean? The total count is not
stored anywhere; it is computed based on how many documents you have in your
index matching the query.
Cheers
Avlesh
On Thu
I am wondering... are new SOLR filtering features faster than standard
Lucene queries like
{query} AND {filter}???
Why can't we improve Lucene then?
Fuad
P.S.
https://issues.apache.org/jira/browse/SOLR-1169
https://issues.apache.org/jira/browse/SOLR-1179
-Original Message-
Frankly, I never tried any DIH... probably it is the best option for this
specific case (they have Java developer) - but one should be knowledgeable
enough to design SOLR schema... And I noticed here (and also at HBase
mailing list) many first-time users are still thinking in terms of
Relational-DB
Hey Guys,
Ok, I found this:
Troubleshooting Errors
It's possible that you get an error related to the following:
SEVERE: Exception starting filter SolrRequestFilter
java.lang.NoClassDefFoundError: Could not initialize class
org.apache.solr.core.SolrConfig
at
org.apache.solr.servlet.Solr
Looks like you totally ignored my previous post...
> Who is vendor of this "openjdk-1.6.0.0"? Who is vendor of JVM which this
JDK
> runs on?
>... such installs for Java are totally mess, you
may have incompatible Servlet API loaded by bootstrap classloader before
Tomcat classes
First of al
The log messages are shown when you hit the admin page. So on't worry
about that. Keep a minimal configuration of Replication. All you need
is masterUrl and pollInterval.
On Thu, Aug 27, 2009 at 5:52 AM, J G wrote:
>
>
>
>
>
>
>
> Hello,
>
> I'm having an issue getting the master to replicate it
There is no hard limit. It is going to be decided by your h/w . You
will be limited by the no:of files that can be kept open by your
system.
On Thu, Aug 27, 2009 at 1:06 AM, djain101 wrote:
>
> Hi,
>
> Is there any maximum limit on the number of cores one solr webapp can have
> without compromisin
On Wed, Aug 26, 2009 at 11:53 PM, Ron Ellis wrote:
> Hi Everyone,
>
> When trying to utilize the new HTTP based replication built into Solr 1.4 I
> encounter a problem. When I view the replication admin page on the slave
> all
> of the master values are null i.e. Replicatable Index Version:null,
Hi,
In Schema.xml file,I am not able ot find splitOnCaseChange="1".
I am not looking for case sensitive search.
Let me know what file you are refering to?.
I am looking for exact match search only
Moreover for scenario 2 the KeywordTokenizerFactory
and EdgeNGramFilterFactory refers which link
On Thu, Aug 27, 2009 at 1:05 AM, Ryan McKinley wrote:
>
> On Aug 26, 2009, at 3:33 PM, djain101 wrote:
>
>
>> I have one quick question...
>>
>> If in solrconfig.xml, if it says ...
>>
>>
>> ${solr.abortOnConfigurationError:false}
>>
>> does it mean defaults to false if it is not
>> set
>> as sy
68 matches
Mail list logo