Help creating schema for indexable document

2009-08-07 Thread rossputin
Hi Guys. I am struggling to create a schema with a determinist content model for a set of documents I want to index. My indexable documents will look something like: 1 code1 code2 mycategory My service will be mission critical and will accept batch imports from a potent

Additional metadata when using Solr Cell

2009-05-14 Thread rossputin
Hi. I am indexing a PDF document with the ExtractingRequestHandler. My curl post has a URL like: ../solr/update/extract?ext.idx.attr=true&ext.def.fl=text&ext.literal.id=123&ext.literal.author=Somebody Sure enough I see in the server logs: params={ext.def.fl=text&ext.literal.id=123&ext.idx.att

Re: Additional metadata when using Solr Cell

2009-05-14 Thread rossputin
There is no reference to the author field I am trying to set.. I am using the latest nightly download. -- Ross Grant Ingersoll-6 wrote: > > what does /admin/luke show for fields and terms in the fields? > > On May 14, 2009, at 10:03 AM, rossputin wrote: > >> >>

Re: Additional metadata when using Solr Cell

2009-05-14 Thread rossputin
There is now, thanks for your help. On the same topic.. is there a best practice for modifying schema, in a future-proof way ? -- Ross Grant Ingersoll-6 wrote: > > Do you have an author field in your schema? > > On May 14, 2009, at 10:31 AM, rossputin wrote: > >> >

highlight results from pdf search

2009-05-30 Thread rossputin
Hi. I have some PDF documents indexed through solr cell. My highlighting queries work fine on standard xml doc types, eg the samples. I would now like to highlight some queries on a PDF document. Currently for my simple examples I am just indexing a PDF, providing an id, and an arbitrary ext.l

Solr document structure for preserving version information

2009-06-05 Thread rossputin
Hi Guys. This is a schema design question I suppose. I would like to store a series of version elements comprising of two attributes, 'updated' (a date) and 'reason' (just a simple string). I aim to produce xml based on a search which would look something like: So I realise I could

posting binary file and metadata in two separate documents

2009-07-07 Thread rossputin
Hi. I am currently using Solr Cell to extract content from binary files, and I am passing along some additional metadata with ext.literal params. Sample below: curl "http://localhost:8983/solr/update/extract?ext.literal.id=2&ext.literal.some_code1=code1&ext.literal.some_code2=code2&ext.idx.attr=

Re: posting binary file and metadata in two separate documents

2009-07-10 Thread rossputin
and there will be many of them. Thanks in advance for your help, regards, Ross rossputin wrote: > > Hi. > > I am currently using Solr Cell to extract content from binary files, and I > am passing along some additional metadata with ext.literal params. Sample > below: > &

Re: posting binary file and metadata in two separate documents

2009-07-17 Thread rossputin
Hi. Thanks for your reply, shame nobody has already implemented the multiple 'ContentStreams' idea :-) With regards to posting in a form, I had considered that, but unfortunately there can be an arbitrary number of 'ext.literals', so it would be difficult to build a form which would handle all ca

post error - ERROR:unknown field 'title'

2009-07-20 Thread rossputin
Hi guys. I have two different solr versions as I am evaluating nightly builds. On a more recent one.. I think 15th July I am getting the following error : ERROR:unknown field 'title' I am posting to 'solr/update/extract' with the following: curl "http://localhost:8983/solr/update/extract?ext.