Hi,
We are using solr-server for a large data-set. We need some changes in
solr schema.xml (datatype change from integer to sint for few fields). It
turns out that the two datatypes (integer and sint) are incompatible and
hence we need to re-index SOLR.
My question is:
Is there any way by wh
Hello,
We are trying to get Solr to work for a really huge parts database. Details
of the database
- 55 million parts
- Totally 3700 properties (facets). But each record will not have value for
all properties.
- Most of these facets are defined as dynamic fields within the Solr Index
We were getti
Thanks for the revert Noble. A few questions are still open:
1. Can I pass parameters to DIH and be able to use them inside the
"query" attribute of an entity inside the data-config file?
2. Can I use the same data-import-handler in someway so that indexing can
be carried out in parall
On Thu, Jul 30, 2009 at 9:45 PM, Andrew Clegg wrote:
>
>
> Erik Hatcher wrote:
>>
>>
>> On Jul 30, 2009, at 11:54 AM, Andrew Clegg wrote:
>>> >> url="${domain.pdb_code}-noatom.xml" processor="XPathEntityProcessor"
>>> forEach="/">
>>> >> xpath="//*[local-name()='structCate
On Fri, Jul 31, 2009 at 11:11 AM, Avlesh Singh wrote:
> I am using Solr 1.3 and have a few questions regarding DIH:
>
> 1. Can I pass parameters to DIH and be able to use them inside the
> "query" attribute of an entity inside the data-config file?
> 2. I am indexing some 2 million database r
What field type are you using? What kind of filters have you applied on the
field?
The easiest way to make it work it to use a "string" field.
Cheers
Avlesh
On Fri, Jul 31, 2009 at 11:09 AM, Sushan Rungta wrote:
> I tried this but this didn't worked...
>
> Regards,
> Sushan
>
> At 12:37 AM 7/30
I am using Solr 1.3 and have a few questions regarding DIH:
1. Can I pass parameters to DIH and be able to use them inside the
"query" attribute of an entity inside the data-config file?
2. I am indexing some 2 million database records using DIH with 4-5
nested entities (just one level
I tried this but this didn't worked...
Regards,
Sushan
At 12:37 AM 7/30/2009, Avlesh Singh wrote:
You may index your data using a delimiter, like $my-field-content$. While
searching, perform a phrase query with the leading and trailing "$" appended
to the query string.
Cheers
Avlesh
On Wed, J
Check the system request handler: http://localhost:8983/solr/admin/system
Should look something like this:
1.3.0.2009.07.28.10.39.42
1.4-dev 797693M - jayhill - 2009-07-28
10:39:42
2.9-dev
2.9-dev 794238 - 2009-07-15 18:05:08
-Jay
On Thu, Jul 30, 2009 at 10:32 AM, Walter Underwood wrote:
> I
Hi,
I have a field in my schema specified using
Where "sint" is specified as follows (the default from schema.xml)
When I do a facet on this field using sort=index I always get the values back
in lexicographic order. Eg: adding this to a query string...
facet=true&facet.field=wordCount&
On Thu, Jul 30, 2009 at 6:34 PM, Bill Au wrote:
> FYI, it took me a while to discover that SolrJ by default uses a GET request
> for
> query, which uses ISO-8859-1.
That depends on the servlet container. SolrJ GET requests are sent in
UTF-8. Some servlet containers such as Tomcat need extra
co
Thanks, Robert. That's exactly what my problem was. Things work find after
I make sure that all my processing (index and query) are using UTF-8. FYI,
it took me a while to discover that SolrJ by default uses a GET request for
query, which uses ISO-8859-1. I had to explicitly use a POST to do qu
On Fri, Jul 31, 2009 at 1:43 AM, ahammad wrote:
> From what I can gather, it is not finding the data and/or column, and thus
> cannot populate the required field. However, the data is there, which I was
> able to prove outside of Solr.
>
> Is there a way to generate more descriptive logs for this
On Thu, Jul 30, 2009 at 4:18 PM, Erik Hatcher wrote:
>
> On Jul 30, 2009, at 3:32 PM, Stephen Duncan Jr wrote:
>
> What's the effect of showItems attribute on the fieldValueCache in Solr
>> 1.4?
>>
>
> Just outputs details of the last accessed items from the cache in the stats
> display.
>
>
As far as our NOC guys know the machines are approximately the same, aside
from the OS. The Leopard machine is running the default 1.5 JVM.
And it's possible that some other application or config issues is to blame.
Nobody's "blaming" the OS or Lucene, we're just asking around.
Searches on Googl
On Jul 30, 2009, at 3:32 PM, Stephen Duncan Jr wrote:
What's the effect of showItems attribute on the fieldValueCache in
Solr 1.4?
Just outputs details of the last accessed items from the cache in the
stats display.
Erik
if (showItems != 0) {
Map items = cache.getLates
Hello all,
I've been having this issue for a while now. I am indexing a Sybase
database. Everything is fantastic, except that there is 1 column that I can
never get back. I don't have direct database access via Sybase client, but I
was able to extract the data using some Java code.
The field is
Hello,
I'm a new user of solr but I have worked a bit with Lucene before. I get some
out of memory exception when optimizing the index through Solr and I would like
to find out why.
However, the only message I get on standard output is:
Jul 30, 2009 9:20:22 PM org.apache.solr.common.SolrExceptio
What's the effect of showItems attribute on the fieldValueCache in Solr 1.4?
--
Stephen Duncan Jr
www.stephenduncanjr.com
I recommend, in this case, that you use Solr's autocommit feature (see
solrconfig.xml) rather than having your indexing clients issue their
own commits. Overlapped searcher warming is just going to be too much
of a hit on RAM, and generally unnecessary with autocommit.
Erik
On Jul
Hi All,
I'm planning to have a certain number of processes posting
independently in a solr instance.
This instance will solely act as a master instance. No clients queries on it.
Is there a problem if i set maxWarmingSearchers to something like 30 or 40?
Also, how do I disable the cache warmi
: I don't know if it does the same from everyone, but when I use the
: reply function of my mail agent, it sets the recipient to the user who
: sent the message, and not the mailing list.
:
: So it's quite annoying cause I have to change the recipient each time
: I reply to someone on the list.
Hi Erik,
Thanks for the tip. H, well that's a good point, or maybe I will
just do the word filtering upfront and store it separately now that I
think about it more.
Darren
On Thu, 2009-07-30 at 13:05 -0400, Erik Hatcher wrote:
> On Jul 30, 2009, at 1:00 PM, Shalin Shekhar Mangar wrote:
>
>
check:
https://issues.apache.org/jira/browse/SOLR-945
this will not likely make it into 1.4
On Jul 30, 2009, at 1:41 PM, Jérôme Etévé wrote:
Hi,
Nope, I'm not using solrj (my client code is in Perl), and I'm with
solr 1.3.
J.
2009/7/30 Shalin Shekhar Mangar :
On Thu, Jul 30, 2009 at 8
2009/7/30 Erik Hatcher :
>
> On Jul 30, 2009, at 1:44 PM, Jérôme Etévé wrote:
>
>> Hi all,
>>
>> I don't know if it does the same from everyone, but when I use the
>> reply function of my mail agent, it sets the recipient to the user who
>> sent the message, and not the mailing list.
>>
>> So it's
On Jul 30, 2009, at 1:44 PM, Jérôme Etévé wrote:
Hi all,
I don't know if it does the same from everyone, but when I use the
reply function of my mail agent, it sets the recipient to the user who
sent the message, and not the mailing list.
So it's quite annoying cause I have to change the reci
Hi all,
I don't know if it does the same from everyone, but when I use the
reply function of my mail agent, it sets the recipient to the user who
sent the message, and not the mailing list.
So it's quite annoying cause I have to change the recipient each time
I reply to someone on the list.
Can
Hi,
Nope, I'm not using solrj (my client code is in Perl), and I'm with solr 1.3.
J.
2009/7/30 Shalin Shekhar Mangar :
> On Thu, Jul 30, 2009 at 8:31 PM, Jérôme Etévé
> wrote:
>>
>> Hi All,
>>
>> I'm wondering if it's possible to post documents to solr in JSON format.
>>
>> JSON is much fast
I think the properties page in the admin UI lists the Lucene version,
but I don't have a live server to check that on at this instant.
wunder
On Jul 30, 2009, at 10:26 AM, Chris Hostetter wrote:
: > i want to get the lucene index format version from solr web app
(as
: the Luke request h
: > i want to get the lucene index format version from solr web app (as
: the Luke request handler writes it out:
:
:indexInfo.add("version", reader.getVersion());
that's the index version (as in "i have added docs to the index, so the
version number has changed") the question is about the
On Thu, Jul 30, 2009 at 10:35 PM, Erik Hatcher
wrote:
>
> On Jul 30, 2009, at 1:00 PM, Shalin Shekhar Mangar wrote:
>
> On Thu, Jul 30, 2009 at 9:53 PM, wrote:
>>
>> Hi,
>>> I am exploring the faceted search results of Solr. My query is like this.
>>>
>>>
>>>
>>> http://localhost:8983/solr/sele
On Jul 30, 2009, at 1:00 PM, Shalin Shekhar Mangar wrote:
On Thu, Jul 30, 2009 at 9:53 PM, wrote:
Hi,
I am exploring the faceted search results of Solr. My query is like
this.
http://localhost:8983/solr/select?q=*:*&facet=true&facet.field=text&facet.limit=500&facet.prefix=wick
If I don
On Thu, Jul 30, 2009 at 9:53 PM, wrote:
> Hi,
> I am exploring the faceted search results of Solr. My query is like this.
>
>
> http://localhost:8983/solr/select?q=*:*&facet=true&facet.field=text&facet.limit=500&facet.prefix=wick
>
> If I don't use the prefix, I get back totals for words like 1,
On Thu, Jul 30, 2009 at 8:31 PM, Jérôme Etévé wrote:
> Hi All,
>
> I'm wondering if it's possible to post documents to solr in JSON format.
>
> JSON is much faster than XML to get the queries results, so I think
> it'd be great to be able to post data in JSON to speed up the indexing
> and lower
Thanks for the reply;
I had thought the solution would be altering the XML.
Ensdorf Ken wrote:
>
>> The problem is that the indexed form of this XML is flattened so the
>>
>> entity has 2 garage names, 2 min values and 2 max values, but the
>> grouping
>> between the garage name and it's min
Hoss I see what you mean. I am trying to implement a CustomUpdateProcessor
checking out here:
http://wiki.apache.org/solr/UpdateRequestProcessor
What is confusing me now is that I have to implement my logic in
processComit as you said:
>>you'll still need the "double commit" (once so you can see
It's very easy to write your own entity processor. At least, that is my
experience with extending the SQLEntityProcessor to my needs. So, maybe
you'd be better off subclassing the xpath processor and handling the
xpath in a way you can keep your configuration straight forward.
Andrew Clegg sc
On Jul 30, 2009, at 12:19 PM, Andrew Clegg wrote:
Don't worry -- your hints put me on the right track :-)
I got it working with:
Now, to get it to ignore missing files without an error... Hmm...
onError="skip" or abort, or continue
Erik
Hi,
I am exploring the faceted search results of Solr. My query is like this.
http://localhost:8983/solr/select?q=*:*&facet=true&facet.field=text&facet.limit=500&facet.prefix=wick
If I don't use the prefix, I get back totals for words like 1,a,of,2,3,4.
1 letter/number occurrences in my documen
Chantal Ackermann wrote:
>
>
> my experience with XPathEntityProcessor is non-existent. ;-)
>
>
Don't worry -- your hints put me on the right track :-)
I got it working with:
Now, to get it to ignore missing files without an error... Hmm...
Che
Erik Hatcher wrote:
>
>
> On Jul 30, 2009, at 11:54 AM, Andrew Clegg wrote:
>>> url="${domain.pdb_code}-noatom.xml" processor="XPathEntityProcessor"
>> forEach="/">
>>> xpath="//*[local-name()='structCategory']/*[local-name()='struct']/
>> *[local-name()='title']"
>
Right, I figured that that's how it should have sorted... which is why
I did a range from 0 to 200
That should have worked for my example, but it removed things over
200, which using lexical sorting seems to still be invalid.
What's left are things like: 998914
Now, obviously that is exp
Hi Andrew,
my experience with XPathEntityProcessor is non-existent. ;-)
Just after a quick look at the method that throws the exception:
private void addField0(String xpath, String name, boolean multiValued,
boolean isRecord) {
List paths = new
LinkedList(Arrays.
On Jul 30, 2009, at 11:54 AM, Andrew Clegg wrote:
xpath="//*[local-name()='structCategory']/*[local-name()='struct']/
*[local-name()='title']"
/>
The XPathEntityProcessor doesn't support that fancy of an xpath - it
supports only a limited subset. Try /structCate
Chantal Ackermann wrote:
>
> Hi Andrew,
>
> your inner entity uses an XML type datasource. The default entity
> processor is the SQL one, however.
>
> For your inner entity, you have to specify the correct entity processor
> explicitly. You do that by adding the attribute "processor", and th
On Jul 30, 2009, at 9:19 AM, Licinio Fernández Maurelo wrote:
i want to get the lucene index format version from solr web app (as
luke do), i've tried looking for the info at luke handler response,
but i havn't found this info
the Luke request handler writes it out:
indexInfo.add("version
Look again at ExtractingRequestHandler.
I havn't looked at what post.jar does internally, but it probably
doesn't work with ExtractingRequestHandler unless you can send other
params as well. I would use curl as the examples in the doc for
ExtractingRequestHandler does. Or figure out if post.ja
I am trying to post a Word document using the Solr post.jar file. When
I attempt this, using a command line interface, I get a fatal error.
I have looked at the following resources:
Solr.com: Tutorial, Docs, FAQ, & ExtractingRequestHandler.
As near as I can tell, I have all the files in the pro
Hi Andrew,
your inner entity uses an XML type datasource. The default entity
processor is the SQL one, however.
For your inner entity, you have to specify the correct entity processor
explicitly. You do that by adding the attribute "processor", and the
value is the classname of the processor
Hi All,
I'm wondering if it's possible to post documents to solr in JSON format.
JSON is much faster than XML to get the queries results, so I think
it'd be great to be able to post data in JSON to speed up the indexing
and lower the network load.
All the best !
Jerome Eteve.
--
Jerome Eteve
>
> instead they should be sent to separate fields
> author_fname
> author_lname
> author_email
>
or, a dynamic field called author_* (I am assuming all of the author fields
to be of the same type).
And if you use SolrJ, you can transform this info into a data structure like
"Map authorInfo", whe
First of all, apologies if you get this twice. I posted it by email an hour
ago but it hasn't appeared in any of the archives, so I'm worried it's got
junked somewhere.
I'm trying to use a DataImportHandler to merge some data from a database
with some other fields from a collection of XML files,
On Jul 30, 2009, at 9:44 AM, Reece wrote:
Hello everyone :)
I was trying to purge out older things.. in this case of a certain
type of document that had an ID lower than 200. So I posted this:
id:[0 TO 200] AND type:I
Now, I have only 49 type "I" items total in my index (shown by
/s
> The problem is that the indexed form of this XML is flattened so the
>
> entity has 2 garage names, 2 min values and 2 max values, but the
> grouping
> between the garage name and it's min and max values is lost. The
> danger is
> that we end up doing a comparison of the min-of-the-mins and the
> Hey Ken,
> Thanks for your reply.
> When I wrote '5|6' I ment that this is a multiValued field with two
> values
> '5' and '6', rather than the literal string '5|6' (and any Tokenizer).
> Does
> your reply still holds? That is, are multiValued fields dependent on
> the
> notion of tokenization t
Grant, thanks for the reply. We tested our requirement against 1.4-dev and
were able to achieve what we wanted. The site we're rebuilding has low
traffic, so we're going to run with 1.4-dev.
Cheers.
- Original Message -
From: "Grant Ingersoll"
To:
Sent: Thursday, July 30, 2009 8:05
Hello everyone :)
I was trying to purge out older things.. in this case of a certain
type of document that had an ID lower than 200. So I posted this:
id:[0 TO 200] AND type:I
Now, I have only 49 type "I" items total in my index (shown by
/solr/select?q=type:I), when there should be num
i want to get the lucene index format version from solr web app (as
luke do), i've tried looking for the info at luke handler response,
but i havn't found this info
--
Lici
Yes, I get that.
The problem arises when you have multiple authors. How can I know which
first name goes with which user id etc...
Cheers
Noble Paul നോബിള് नोब्ळ्-2 wrote:
>
> apparently all the dat ais going to one field 'author'
>
> instead they should be sent to separate fields
> author
Hi,
I have a set of XML data that holds Minimum and Maximum values and I need to
be able to do specific range queries against them.
(Note that this is a contrived example, and that in reality the garage would
probably hold all the individual prices of all its cars, but this is
analogous to the p
apparently all the dat ais going to one field 'author'
instead they should be sent to separate fields
author_fname
author_lname
author_email
so you would get details like
John
Doe
j...@doe.com
On Wed, Jul 29, 2009 at 7:39 PM, ahammad wrote:
>
> Hi all,
>
> Not sure how good my
On Jul 29, 2009, at 2:38 PM, Mike wrote:
Hi,
We're using Lucid Imagination's LucidWorks Solr 1.3 and we have a
requirement to implement multiple-select faceting where the facet
cells show up as checkboxes and despite checked options, all of the
options continue to persist with counts. Th
On 30 Jul 2009, at 11:17, Jörg Agatz wrote:
It sounds like you're not using 'entities' for your '&' characters
(ampersands) in your XML.
These should be converted to "&" This should look familiar if
you've
ever written any HTML.
I dont understand this
i musst change even & to & ?
Yes, '&'
On Jul 30, 2009, at 6:17 AM, Jörg Agatz wrote:
Also, i use the Comandline tool "java .jar post.jar xyz.xml"
i donkt know what you are mean with
It sounds like you're not using 'entities' for your '&' characters
(ampersands) in your XML.
These should be converted to "&" This should look famili
Also, i use the Comandline tool "java .jar post.jar xyz.xml"
i donkt know what you are mean with
It sounds like you're not using 'entities' for your '&' characters
(ampersands) in your XML.
These should be converted to "&" This should look familiar if you've
ever written any HTML.
I dont understa
perfect!
That resolved my issue.
BTW. This was my first posting on this list.
I must say that the responses were quick and to the point!!! Good community
help!
On Thu, Jul 30, 2009 at 10:58 AM, AHMET ARSLAN wrote:
>
> > How can I tell Solr to skip unwanted fields and only index
> > the fields t
> How can I tell Solr to skip unwanted fields and only index
> the fields that I have declared in my schema.xml?
More precisely: (taken from schema.xml)
Indeed, or enclose the text in CDATA tags which should work as well.
On Thu, 2009-07-30 at 09:52 +0100, Toby Cole wrote:
> Any chance of getting that stack trace as more than one line? :)
> Also, where are you posting your documents from? (e.g. Java, PHP,
> command line etc).
>
> It sound
Edwin Stauthamer wrote:
Hi,
I want to index a perfectly good solr XML-file into an Solr/Lucene instance.
The problem is that the XML has many fields that I don't want to be indexed.
I tried to index the file but Solr gives me an error because the XML
contains fields that I have not declared in
Any chance of getting that stack trace as more than one line? :)
Also, where are you posting your documents from? (e.g. Java, PHP,
command line etc).
It sounds like you're not using 'entities' for your '&' characters
(ampersands) in your XML.
These should be converted to "&" This should look
: I want Solr to index / store only a few fields from the XML-file to be
: indexed and skip all the other fields.
I think Dynamic fields [1] can help you.
[1]
http://wiki.apache.org/solr/SchemaXml#head-82dba16404c8e3318021320638b669b3a6d780d0
I don't think there is a way to do that.
On Thu, Jul 30, 2009 at 1:39 PM, Edwin
Stauthamer wrote:
> Hi,
>
> I want to index a perfectly good solr XML-file into an Solr/Lucene instance.
> The problem is that the XML has many fields that I don't want to be indexed.
>
> I tried to index the file but
Hi,
I want to index a perfectly good solr XML-file into an Solr/Lucene instance.
The problem is that the XML has many fields that I don't want to be indexed.
I tried to index the file but Solr gives me an error because the XML
contains fields that I have not declared in my schema.xml
How can I t
Sorry, schema.xml file is here in this mail...
noor wrote:
FYI
Attached schema.xml file.
And the add doc xml snippets are,
501
ESQ.VISION.A72
201
CpuLoopEnd Process=$Z4B1 CpuPin=0,992
Program=\VEGAS.$SYSTEM.SYS00.MEASFH Terminal=\VEGAS.$TSPM.#TERM
CpuBusy=
FYI
Attached schema.xml file.
And the add doc xml snippets are,
501
ESQ.VISION.A72
201
CpuLoopEnd Process=$Z4B1 CpuPin=0,992
Program=\VEGAS.$SYSTEM.SYS00.MEASFH Terminal=\VEGAS.$TSPM.#TERM
CpuBusy=0 MemPage=24 User=50,10
\VEGAS.$QQDS
PLGOVNP
Good Morning SolR :-) its morning in Germany!
i have a Problem, with the Indexing...
I often become an Error.
I think it is because in the XML stand this Character "&"
I need the Character, what happens?
SimplePostTool: FATAL: Solr returned an error:
comctcwstxexcWstxLazyExcept
On Thu, Jul 30, 2009 at 4:52 AM, Jason Rutherglen <
jason.rutherg...@gmail.com> wrote:
> I created an issue and have added some notes
> https://issues.apache.org/jira/browse/SOLR-1316
>
Also see https://issues.apache.org/jira/browse/SOLR-706
--
Regards,
Shalin Shekhar Mangar.
77 matches
Mail list logo