you feel are lacking? Or is it just across the board? Wikis
>> are both good and bad for documentation, IMO.
>>
>> -Grant
>
>
--
==
Fergus McMenemieEmail:fer...@twig.me.uk
Techmore Li
; name="id"/>
>> >
>> >
>> >
>> >
>> >
>> >
>>
>> keep the field as follows
>> > column="TmpCourseId" name="CourseId"
>> template="Course:${Course.CourseId}" name="id"/>
>>
>>
>>
>>
>> --
>> -
>> Noble Paul | Principal Engineer| AOL | http://aol.com
>>
--
===
Fergus McMenemie Email:fer...@twig.me.uk
Techmore Ltd Phone:(UK) 07721 376021
Unix/Mac/Intranets Analyst Programmer
===
>Hi,
>
>Please find the schema file attached. Please let me know what I am doing wrong.
>
>Regards
>Chaitali
>
>--- On Wed, 10/14/09, Fergus McMenemie wrote:
>
>
>From: Fergus McMenemie
>Subject: Re: Error when indexing XML files
>To: solr-user@lucene.
Hi,
Please find the schema file attached. Please let me know what I am doing wrong.
Regards
Chaitali
--- On Wed, 10/14/09, Fergus McMenemie wrote:
From: Fergus McMenemie
Subject: Re: Error when indexing XML files
To: solr-user@lucene.apache.org
Date: Wednesday, October 14, 2009, 2:25 AM
d to delete these rows using
>> DIH? In other words, where/how do I specify this?
>>
>>
>The $deleteDocByQuery is for deleting Solr documents by a Solr query and not
>DB rows.
>
>--
>Regards,
>Shalin Shekhar Mangar.
>Hi,
>
>I am trying to index XML files using SolrJ. The original XML file contains
>nested elements. For example, the following is the snippet of the XML file.
>
>
> SOMETHING
> SOME_OTHER_THING
>
>
>I have added the elements "name" and "facility" in Schema.xml file to make
>these e
>On Thu, Oct 1, 2009 at 7:59 PM, Claudio Martella > wrote:
>
>>
>> About the copyField issue in general: as it copies the content to the
>> other field, what is the point of defining analyzers for the destination
>> field? The source is already analyzed, so I guess that the RESULT of the
>> analysis i
>Fergus McMenemie wrote:
>>> Fergus McMenemie wrote:
>>>> Hi all,
>>>>
>>>> I am attempting to test some changes I made to my DIH based
>>>> indexing process. The changes only affect the way I
>>>> describe my fields in
>Fergus McMenemie wrote:
>> Hi all,
>>
>> I am attempting to test some changes I made to my DIH based
>> indexing process. The changes only affect the way I
>> describe my fields in data-config.xml, there should be no
>> changes to the way the data is in
Hi all,
I am attempting to test some changes I made to my DIH based
indexing process. The changes only affect the way I
describe my fields in data-config.xml, there should be no
changes to the way the data is indexed or stored.
As a QA check I wanted to compare the results from
indexing the
am the only one who knows
>it. I would love to have more eyes on that.
>
>>I would like to open a JIRA for improving XPathRecordReader.
>Please go ahead. You can paste the contents of this mail in the list.
>There may be others with similar ideas.
>
>Noble.
fferedReader.readLine(Unknown Source)
>at
>org.apache.solr.handler.dataimport.LineEntityProcessor.nextRow(LineEn
>tityProcessor.java:109)
>... 8 more
>
>
>
>Note that my input files have 53812 lines, which is the same as the document
>number that I'm choking on
maths to the transformers and
I think we will have a Turing-complete language :-)
fergus.
>Thanks,
>Grant
>>> >> >
>>> >> > - category: Category 2; id: 4; author: Author 4
>>> >> >
>>> >> >
>>> >> >
>>> >> > Any ideas on how I can get to a parent node from within a child during
>>> >> > data import? If it can't be done, what do you suggest would be the best
>>> >> > way so I can keep using the DataImportHandler... would XSLT be a good
>>> >> > idea to 'flatten out' the structure a bit?
>>> >> >
>>> >> >
>>> >> >
>>> >> > Thanks
>>> >> >
>>> >> >
>>> >> >
>>> >> > This is what my XML document looks like:
>>> >> >
>>> >> >
>>> >> >
>>> >> > Category 1
>>> >> >
>>> >> > 1
>>> >> > Author 1
>>> >> >
>>> >> >
>>> >> > 2
>>> >> > Author 2
>>> >> >
>>> >> >
>>> >> >
>>> >> > Category 2
>>> >> >
>>> >> > 3
>>> >> > Author 3
>>> >> >
>>> >> >
>>> >> > 4
>>> >> > Author 4
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> > And this is what my dataConfig looks like:
>>> >> >
>>> >> >
>>> >> >
>>> >> > >> >> > url="http://localhost:9080/data/20090817070752.xml";
>>> >> > processor="XPathEntityProcessor" forEach="/document/category/item"
>>> >> > transformer="DateFormatTransformer" stream="true"
>>> >> > dataSource="dataSource">
>>> >> > >> >> > commonField="true" />
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> > This is how I have specified my schema
>>> >> >
>>> >> > >> >> > required="true" />
>>> >> >
>>> >> >
>>> >> >
>>> >> >
>>> >> > id
>>> >> > id
>>> >> >
/20090817070752.xml";
>> > processor="XPathEntityProcessor" forEach="/document/category/item"
>> > transformer="DateFormatTransformer" stream="true" dataSource="dataSource">
>> >> > commonField="true" />
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> > This is how I have specified my schema
>> >
>> > > > required="true" />
>> >
>> >
>> >
>> >
>> > id
>> > id
>> >
>> >
>> >
>> >
>> >
>> >
>>
>>
>>
>> --
>> -
>> Noble Paul | Principal Engineer| AOL | http://aol.com
>
.
>>
>> If I remove the 2 document elements and wrap both entity sets in just
>> one document tag, then both sets get indexed, which seemingly achieves
>> my goal. This just doesn't make sense from my understanding of how DIH
>> works. My 2 content types are indeed separate so
>On Mon, Sep 7, 2009 at 5:58 PM, Fergus McMenemie wrote:
>
>> >This testcase is quite independent of anything in Solr. It is a
>> >standalone utility and the only dependency is stax.
>> >disclaimer (I run these testcases from IntelliJ and the command line)
>>
nd play with XPathRecordReader.java
other than
ant -Dtestcase=TestXPathRecordReader test
which takes 8 secs to run here? I am not using XPathRecordReader
outside of DIH, but am looking to see how I would add support for
xpaths such as //a.
Fergus.
>
>On Mon, Sep 7, 2009 at 3:26 PM, Fergus McMenemie w
But when you want to run testcases... you are doing
that from the command line? Are you only using the IDE as an editor?
>Regards
>Rajan
>
>On Mon, Sep 7, 2009 at 3:26 PM, Fergus McMenemie wrote:
>
>> Hello all,
>>
>> I would appreciate help from somebody who h
.
PS: I am a total NetBeans newbie.
tored="true" multiValued="false" termVectors="false"
>> alias="source.date"/>
>>
>> is there any jira issue related?
>>
>> Thx
>>
>> --
>> Lici
>>
; http://www.nabble.com/Using-Multiple-fields-in-UniqueKey-tp24476088p24476088.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
given starting point.
If you have another file, listing the files to be indexed, then
I would use "LineEntityProcessor" to process that list.
One or other of the above would locate file to be indexed and
would pass the filename to XPathEntityProcessor with useSolrAddSchema
set t
>Shalin Shekhar Mangar wrote:
>> On Thu, Jul 2, 2009 at 11:08 PM, Mark Miller wrote:
>>
>>
>>> It looks like DIH implements its own subset of the Xpath spec.
>>>
>>
>>
>> Right, DIH has a streaming implementation supporting a subset of XPath only.
>> The supported things are in the wiki ex
e author and title parse out fine: > > xpath="/book/title"/>
>> >
>> > But I can't get at the data inside the tags. I want to get all
>> > non-markup text inside the body tag with something like this:
>> >
>> >
>> >
>FWIW
>I strongly agree with your sentiments, Manual.
>One of the neat maven features that isn't well known is just being able to do
>"mvn jetty:run" and have Jetty load up right away (no creating of a web-app
>directory or packaging of a war or anything like that).
>What I hate about ant based p
>Fergus McMenemie schrieb:
>
>> The article could explain the difference between fq= and
>> facet.query= and when you should use one in preference to
>> the other.
>
>My understanding is that while these query modifiers rely on the
>same impleme
cet.query= and when you should use one in preference to
the other.
Regards Fergus.
>On Tue, Jun 9, 2009 at 7:25 PM, Michael Ludwig wrote:
>>
>> A filter query is cached, which means that it is the more useful the
>> more often it is repeated. We know how often certain queries arise, or
>> at least have the means to collect that data - so we know what might be
>> candidates for
I had also been wondering about this, but was too lazy/busy to post a
question. Now that it is resolved, it would help a lot if you could
post an example of how you invoked enableRemoteStreaming for your
document(s).
Rgds Fergus.
>Thanks for the quick response, Grant.
>
>
>
>We tried it and it s
>Generally a good idea, but be prepared to entertain requests that should
>also ask you to be able to perform the query using those aliases. I mean
>when you talk about something "similar" to aliases in SQL, those aliases can
>be used in SQL scripts in the where clause too.
>
>Cheers
>Avlesh
I am
>> * SolrPlugins
>> * SolrRelevancyCookbook
>> * LargeIndexes - Covers how to design and operate a very large
>> Solr index.
>> + *
>> [http://oshyn.com/_bpost_1906/Implementing_Search_Suggest_with_Apache_Solr_(Part1)
>>
>> Search Suggest Functionality]
>> +
>>
ill return. So where the
>following would fail:-
>
>
>
>This can be used instead:-
>
>sourceColName="fileAbsolutePath" />
>
>So I guess we have the best of both worlds!
Regards Fergus.
>Something that would be interesting is to share solr configs for
>various types of indexing tasks. From a solr configuration aimed at
>indexing web pages to one doing large amounts of text to one that
>indexes specific structured data. I could see those being posted on
>the wiki and help
efficient if SOLR can return the result as above.
>
>Thanks!
>
>--
>Jeffrey Gelens Buyways B.V. Tel. 050 853 6600
>Webengineer Friesestraatweg 215c Fax. 050 853 6601
>http://www.buyways.nl 9743 AD Groningen KvK 01074105
sponse I am getting
>>
>> -
>> 0
>> 0
>> 0
>> 2009-05-15 11:58:00
>> Indexing completed. Added/Updated: 0 documents. Deleted 0
>> documents.
>> 2009-05-15 11:58:00
>> 2009-05-15 11:58:00
>> 0:0:0.172
>>
>> This response format is experimental. It is likely to
>> change in the future.
>>
>>
>>
>> Am I missing anything here, or is a particular format required for the
>> input XML? Please help me resolve this.
>>
>> Thanks and regards,
>> Jay
>>
field is assumed to be called "id".
3) the entity needs to have transformer="RegexTransformer"
>
>2009/5/13 Fergus McMenemie :
>>>Hi
>>>
>>>Is it possible, through dataimport handler to remove an existing
>>>document from the Solr index
>I was wondering if there is an interest in a UK (South East) solr user
>group meeting
>
>Please let me know if you are interested. I am happy to organize.
>
>Regards,
>
>Colin
Yes, very interested. I am in Lincolnshire.
esn't get removed.
>
>Regards
>Andrew
Yes but only in the latest trunk. If your "active" field is false
do you want to see the document deleted? Do you have another field
which is a unique ID for the document?
Fergus
taimport?command=full-import
command?
able within each search result.
>
> Otis
>--
>Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
>- Original Message
>> From: Fergus McMenemie
>> To: solr-user@lucene.apache.org
>> Sent: Wednesday, April 15, 2009 11:18:21 AM
>> Subject:
ing the duplicate
checking with allowDups=true can save lots of time"
Fergus.
>On Apr 15, 2009, at 11:18 AM, Fergus McMenemie wrote:
>
>> Hi,
>>
>> Having all kinds of fun with distributed search using shards:-)
>>
>> I have 30K documents indexed using DIH into an index. Another
>> index contain documents indexed using solr-cell.
source document can be linked to, and to do so I think I need to
know which shard a particular result came from. Is this a FAQ?
Regards
>On Apr 2, 2009, at 9:23 AM, Fergus McMenemie wrote:
>
>> Grant,
>>
>>
>>
>>> I should note, however, that the speed difference you are seeing may
>>> not be as pronounced as it appears. If I recall during ApacheCon, I
>>> commented
What about my xml file, and
>txt file?
>
>Thank you,
>Alex
>
>
>On Tue, Apr 14, 2009 at 12:37 AM, Alejandro Gonzalez <
>alejandrogonzalezd...@gmail.com> wrote:
>
>> You should construct the XML containing the fields defined in your
>> schema.xml and give them the values from the text files. For example, if you
>> have a schema defining two fields, "title" and "text", you should construct
>> an XML file with a field "title" and its value and another called "text"
>> containing the body of your doc. Then you can post it to the Solr instance
>> you have deployed and make a commit, and it's done. It's possible to
>> construct an XML file defining more than just one doc:
>>
>>
>>
>>
>> "doc1 title"
>> "doc1 text"
>>
>> .
>> .
>> .
>>
>> "docn title"
>> "docn text"
>>
>>
>>
>>
>>
>> 2009/4/14 Noble Paul
>>
>> > what is the content of your text file?
>> > Solr does not directly index files
>> > --Noble
>> >
>> > On Tue, Apr 14, 2009 at 3:54 AM, Alex Vu wrote:
>> > > Hi all,
>> > >
>> > > Currently I wrote an xml file and schema.xml file. What is the next
>> step
>> > to
>> > > index a txt file? Where should I put my txt file I want to index?
>> > >
>> > > thank you,
>> > > Alex V.
>> > >
>> >
>> >
>> >
>> > --
>> > --Noble Paul
>> >
>>
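The quoted advice above (one <doc> per text file, with "title" and "text" fields) can be sketched in Python. This is only an illustration: the field names come from the quoted example, the helper name build_add_xml is made up here, and the payload uses the standard Solr <add><doc><field name="..."> update format.

```python
import xml.etree.ElementTree as ET


def build_add_xml(docs):
    """Build a Solr <add> update message from a list of {field: value} dicts.

    build_add_xml is a hypothetical helper name; each dict becomes one <doc>.
    """
    add = ET.Element("add")
    for fields in docs:
        doc = ET.SubElement(add, "doc")
        for name, value in fields.items():
            field = ET.SubElement(doc, "field", name=name)
            field.text = value  # ElementTree escapes &, < and > on output
    return ET.tostring(add, encoding="unicode")


# Field names "title" and "text" follow the quoted example; they must
# match whatever is actually declared in schema.xml.
docs = [
    {"title": "doc1 title", "text": "doc1 text"},
    {"title": "docn title", "text": "docn text"},
]
xml_payload = build_add_xml(docs)
print(xml_payload)
```

The resulting string would then be posted to the Solr update handler and followed by a commit, as the quoted message describes.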
>On Apr 6, 2009, at 10:16 AM, Fergus McMenemie wrote:
>
>> Hmmm,
>>
>> Not sure how this all hangs together. But editing my solrconfig.xml
>> as follows
>> sorted the problem:-
>>
>>> multipartUploadLimitInKB="2048" />
>
andardHostValve.invoke(StandardHostValve.java:128)
>> at
>> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
>> at
>> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
>> at
>> org.apache.cat
>Hi Fergus,
>
>On Tue, Apr 07, 2009 at 05:06:23PM +0100, Fergus McMenemie wrote:
>> >Thank you much Fergus,
>> >
>> >I was considering implementing a database which would hold a path name
>> >and an MD5 sum of each file.
>> Snap. That is close
ves the same. Although attempting
to use /*/ fails. Another lesson learnt!
#! /usr/local/bin/perl
use strict;
my($s)="cat mat rat hat";
my($c)=0;
print " a-match", ++$c, "='$1'\n" while( $s =~ m/(at)/g );
$c=0;
print " b-match", ++$c, "=
also dump all checksums and pathnames from solr if/when you wanted
to validate your folder structure and or indexes.
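A rough sketch of that validation step, in Python. It assumes each Solr document stores the file's pathname and MD5 checksum, and that those pairs have already been dumped into a dict; the function names and the sample paths are invented for illustration.

```python
import hashlib


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, read in chunks to bound memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_to_reindex(current, indexed):
    """Paths that are new, or whose checksum differs from the indexed one.

    Both arguments map pathname -> MD5 hex digest (an assumed layout).
    """
    return sorted(p for p, checksum in current.items()
                  if indexed.get(p) != checksum)


def files_to_delete(current, indexed):
    """Paths still in the index but no longer present on disk."""
    return sorted(p for p in indexed if p not in current)
```

In use, `current` would be built by walking the folder and calling md5_of_file on each file, and `indexed` from the pathname/checksum dump; unchanged files appear in neither result list and can be skipped.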
>Regards,
>Veselin K
>
>On Tue, Apr 07, 2009 at 09:01:31AM +0100, Fergus McMenemie wrote:
>> Veselin,
>>
>> Well, as far as solr is concerned, there is tw
>> >
>> > In case all my files are in one folder which is scanned frequently, is
>> > there a Solr feature of checking and skipping a file if it has already
>> > been indexed
>> > and not changed since?
>> >
>> >
>> > Th
>> they need to have the same solr.xml (for multicore etc). We don't want
>> to replicate the indexes also (we got very light search traffic, but
>> very high indexing traffic) so they need to use the same index.
>>
>>
>> Thanks,
>> -vivek
>>
atalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
>
>Although the PDF is big, it contains very little text; it is a map.
>
> "java -jar solr/lib/tika-0.3.jar -g" appears to have no bother with it.
>
>Fergus...
>--
>
>===
t&entity=jc
See the docs at:-
http://wiki.apache.org/solr/DataImportHandler#head-1582242c1bfc1f3e89f4025bf2055791848acefb
Fergus.
Engineer, Zappos.com
>jnewb...@zappos.com - 702-943-7562
Ditto!
As best I understand, you somehow need to arrange for each different
combination of colour, size and width to be indexed as a separate Solr
document.
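As a minimal sketch of that idea, in Python: one product record is expanded into one document per colour/size/width combination. The field names, the id scheme and the sample product are all assumptions made for illustration, not anything from the original thread.

```python
def variant_docs(product_id, name, combinations):
    """Return one Solr-style document (a dict) per available variant.

    combinations is an iterable of (colour, size, width) tuples; the field
    names and the composite id format are hypothetical.
    """
    docs = []
    for colour, size, width in combinations:
        docs.append({
            # A composite id keeps each variant unique under the uniqueKey.
            "id": f"{product_id}-{colour}-{size}-{width}",
            "product_id": product_id,
            "name": name,
            "colour": colour,
            "size": size,
            "width": width,
        })
    return docs


available = [("red", "9", "M"), ("red", "10", "M"), ("blue", "9", "W")]
docs = variant_docs("shoe42", "Trail Runner", available)
for doc in docs:
    print(doc["id"])
```

Keeping the shared product_id on every variant lets a query on colour, size and width match only real combinations, while results can still be grouped back to the parent product on the client side.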
s for all the help.
Fergus.
th it.
Fergus...
>On Apr 2, 2009, at 4:26 AM, Fergus McMenemie wrote:
>> I cant get ExtractingRequestHandler to work with tomcat. Using the
>> latest version from svn and then a "make clean dist" and copying the
>> war file to a clean tomcat does not work.
>
>make?! :)
Oops!
.create(RequestHandlers.java:154)
> at
> org.apache.solr.core.RequestHandlers$1.create(RequestHandlers.java:163)
Any ideas?
>On Apr 1, 2009, at 9:39 AM, Fergus McMenemie wrote:
>
>> Grant,
>>
>> Redoing the work with your patch applied does not seem to
>
>>
>> make a difference! Is this the expected result?
>
>No, I didn't expect Solr 1095 to fix the problem. Overwrit
2m49.997s
m -rf solr/data" before tomcat is launched.
So I do not understand how the above helps. UNLESS there are duplicate
gaz entries.
>In the meantime, I'm trying to see if I can pinpoint down a specific
>change and see if there is anything that might help it perform better.
>
ents.
>>
>> Any idea why this DIH instruction would see constant value appear twice??
>>
. I think commons-csv was upgraded
>>>> 2. The CSV loader stuff was refactored to share common code
>>>>
>>>> I'm still investigating.
>>>>
>>>> -Grant
>>>
>>> --
>>> Grant Ingersoll
>>
(ThreadPoolExecutor.java:907)
>>> at java.lang.Thread.run(Thread.java:637)
>>>
>>> I see two things in CHANGES.txt that might apply, but I'm not sure:
>>> 1. I think commons-csv was upgraded
>>> 2. The CSV loader stuff was refactored to s
Hello,
Due to limitations with the way my content is organised and DIH I have
to add "-imgCaption:[* TO *]" to some of my queries. I discovered the
name="appends" functionality tucked away inside solrconfig.xml. This
looks like a very useful feature, and I created a new requestHandler to deal
with my pr
H, my tuppence worth!
IMHO I do not think this should be built into solr. Doing it properly
leads to all kinds of nasty platform dependent issues... will we then
want to add notification features on success/failure? via email?
Ideally, all the scheduled activities on a system should be ce
>I am looking for an implementation of this DIH feature: It also takes in a
>properties file for the data source configuration
>(http://issues.apache.org/jira/browse/SOLR-469)
>
>I want to externalize the data source parameters like driver, url, user and
>password to a property file outside Solr. My
t; ": "" at line 1, column 21. Was expecting one of:
... ... ... "+" ... "-" ... "(" ... "*" ... "^" ...
... ... ... ... ... "[" ... "{" ...
...
My encoding did not work! Help!
:17 AM, Fergus McMenemie wrote:
>
>> Hello,
>>
>> Can anybody describe the intended purpose, or provide a
>> few examples, of how the DIH entity= command option works.
>>
>> Am I supposed to build a data-conf.xml file which contains
>> many different al
>
>> Erik
>>
>>
>>On Feb 13, 2009, at 8:17 AM, Fergus McMenemie wrote:
>>
>>> Paul,
>>>
>>> Following up your Usenet suggestion:
>>>
>>> >> ignoreMissingVariables="true"/>
>>>
>
ements are:
>> 1) read a file line by line
>> 2) filter out lines (include or exclude) based on a regex
>> 3) extract parts (named parts) from the line using another regex
>>
>> Noble
>>
>>
>> On Tue, Mar 10, 2009 at 1:50 AM, Fergus McMenemie
>>
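The three requirements listed above (read line by line, filter by regex, extract named parts with another regex) can be sketched in Python. This is not the DIH implementation; the function name, its parameters and the sample manifest format are assumptions for illustration only.

```python
import re


def process_lines(lines, extract, include=None, exclude=None):
    """Yield a dict of named parts for each line that passes the filters.

    extract must use named groups, e.g. (?P<path>...). include and exclude
    are optional regexes; all names here are hypothetical.
    """
    extract_re = re.compile(extract)
    include_re = re.compile(include) if include else None
    exclude_re = re.compile(exclude) if exclude else None
    for line in lines:
        line = line.rstrip("\n")
        if include_re and not include_re.search(line):
            continue  # an include filter is set and the line does not match
        if exclude_re and exclude_re.search(line):
            continue  # the line matches the exclude filter
        match = extract_re.search(line)
        if match:
            yield match.groupdict()


# Hypothetical manifest format: "<action> <path>", with "#" comment lines.
manifest = [
    "# generated manifest\n",
    "add /data/a.xml\n",
    "\n",
    "delete /data/b.xml\n",
]
rows = list(process_lines(
    manifest,
    extract=r"^(?P<action>\w+)\s+(?P<path>\S+)$",
    exclude=r"^#",
))
print(rows)
```

Any open file object can be passed as `lines`, so the input is still consumed one line at a time rather than loaded whole.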
>Le 09-mars-09 à 22:29, Fergus McMenemie a écrit :
>>> how would I implement entity-processor if I were able to get the list
>>> of recently changed documents of our sites?
>>
>> H, this sounds like a job for my manifestEnityProcessor
>> see if you can
manifestEnityProcessor
see if you can find the thread titled:-
"a new DIH manifestEnityProcessor"
is your list of changed documents a list of additions and
updates only, or does it contain deletes as well?
Fergus.
>
>On Mon, Mar 9, 2009 at 10:44 PM, Fergus McMenemie wrote:
>>>Manifest processing has a very limited use case. Why can't it be
>>>processed using a PlainTextEntityProcessor, with a Transformer written to
>>>read lines using regex?
>>>
>> Ehmmm Ok. T
or crawlers
where we had to.
Fergus
>
>--Noble
>
>On Mon, Mar 9, 2009 at 8:30 PM, Fergus McMenemie wrote:
>> Hello,
>>
>> I have almost finished a new DIH EntityProcessor which
>> I am calling the manifestEnityProcessor. It is designed
>> around the idea tha
?
Suggestions for a different name?
Suggestions on how to do the delete bitty from within an entity?
Regards Fergus.
want into the XML doc itself where your stylesheet
>has access to it.
>
>
>-Hoss
Doh! of course.
Thanks.
I have created SOLR-1033 in JIRA to address this issue.
At 13:32 + 21/2/09, Fergus McMenemie wrote:
>>On Mon, Feb 16, 2009 at 3:22 PM, Fergus McMenemie wrote:
>>>
>>> 2) Having used TemplateTransformer to assign a value to an
>>> entity column th
>On Mon, Feb 16, 2009 at 3:22 PM, Fergus McMenemie wrote:
>>
>> 2) Having used TemplateTransformer to assign a value to an
>> entity column that column cannot be used in other
>> TemplateTransformer operations. In my project I am
>> attempting to r
Regards Fergus.
g files does though.
>
> Erik
>
>
>On Feb 13, 2009, at 8:17 AM, Fergus McMenemie wrote:
>
>> Paul,
>>
>> Following up your Usenet suggestion:
>>
>> > ignoreMissingVariables="true"/>
>>
>> and to add more to what I was thinking...
>
/>
>> > xpath="/record/mediaBlock/mediaObject/@vurl" />
>>
>>The trouble is that vurl is only defined as a child of "/record/mediaBlock"
>>so my attempt to create id, the uniqueKey fails for the parent document
>>"/record"
>>
>>I am hacking around with "TemplateTransformer.java" to sort this but was
>>wondering if there was a good reason for this behavior.
>>
/mediaBlock/mediaObject/@vurl" />
>
>The trouble is that vurl is only defined as a child of "/record/mediaBlock"
>so my attempt to create id, the uniqueKey fails for the parent document
>"/record"
>
>I am hacking around with "TemplateTransformer.j
sort this but was
wondering if there was a good reason for this behavior.
Regards.
mediaBlock/mediaObject/@vurl" />
> xpath="/record/mediaBlock/caption" />
>
>Is it OK to have an xpath expression within forEach which is a child
>of another of the forEach xpath expressions?
>
Yes. It works fine, duplicate "uniqueKey"s were making it a
.
Is it OK to have an xpath expression within forEach which is a child
of another of the forEach xpath expressions?
Or.. is there a better way of doing this?
Regards
Thanks,
That fixed it.
>On Wed, Feb 11, 2009 at 4:19 PM, Fergus McMenemie wrote:
>
>
>> java.lang.NoSuchFieldError: docCount
>>at
>> org.apache.solr.handler.dataimport.SolrWriter.getDocCount(SolrWr
st" is fine. Removing the javascript contrib directory
allows the "ant dist" to complete and I have a usable war file. However
I suspect this may not represent best practice; however "ant test" is still
fine.
What does removal of this contrib function lose me? I was wondering if
it went with the DIH ScriptTransformer?
Regards Fergus.
olrIndexSearcher
Regards to all.
up. I suspect that the addition of //para would
cover many of the use cases, and what was left could be covered
by a preceding XSLT transform.
Thanks Shalin,
Using the following appears to work properly!
Regards Fergus
>On Wed, Feb 4, 2009 at 1:35 AM, Fergus McMenemie wrote:
>
>> > dataSource="myfilereader"
>> processor="XPathEntityProcessor"
>> url
upported.
>
> dateTimeFormat="MMdd" />
>
>
>On Mon, Feb 2, 2009 at 9:24 AM, Noble Paul <
>noble.p...@gmail.com> wrote:
>
>> this patch must help
>>
>> On Mon, Feb 2, 2009 at 10:49 PM, Shalin Shekhar Mangar
>> wrote:
>>
rds Fergus.
d -- " + fList);
Assert.assertEquals(3, fList.size());
}
Regards Fergus.
>On Mon, Feb 2, 2009 at 2:36 AM, Fergus McMenemie wrote:
>
>> Hello
>>
>> I have been trying to find out why DIH in FileListEntityProcessor
>> mode did not appear to be recursing into
iles(aFile, files);
+ return;
+}
long sz = aFile.length();
Date lastModified = new Date(aFile.lastModified());
if (biggerThan != -1 && sz <= biggerThan)
>
>I could work around the problem by creating SOLR fields like
>"home_address_street" and "office_address_street" and do some xpath
>mapping. However I don't want to do it as we can have multiple
>'other' addresses. Also I have other fields whose type is not easily
>distinguished like address.
>
>As I mentioned being new to SOLR I might have completely goofed on a
>way to set it up - much appreciate any direction on it. I am using
>SOLR 1.3
>
>Regards,
>Guna