Store and retrieve an xml sequence without losing the markup

2012-12-20 Thread Modou DIA
Hi everybody,

I'm new to Solr, but in the past I worked with Lucene and with another
solution similar to Solr.
I'm working with Solr 4.0. I use SolrJ to embed a Solr server in
a Cocoon 2.1 application.

I want to know if it's possible to store (without indexing) a field
containing an XML sequence. I mean a field that can hold XML data in
the index without losing the XPath information.

For example, this is a document to index:


  
id_1
testing

  
testing
  

  
...


As you can see, the field named subdoc contains an XML sequence.

So, when I query the index, I want to retrieve the data in subdoc
with its XML markup preserved.

Thank you for your help.
-- 
--
| Modou DIA
| modo...@gmail.com
--


Re: Store and retrieve an xml sequence without losing the markup

2012-12-21 Thread Modou DIA
The document is stored normally, but when I query the index, all XML
markup is stripped from the response.
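
The thread does not establish where the markup is lost. One possibility,
assuming the document reaches Solr as an XML update message, is that the
child elements are read as part of the message instead of as field text.
The sketch below uses only the JDK's XML parser (no Solr or SolrJ calls)
to show what a parser sees in the subdoc field when the payload is sent
raw, entity-escaped, or wrapped in CDATA as suggested in the reply quoted
below. The class XmlFieldPayload and its methods are hypothetical helpers,
and how Solr itself reacts to unescaped child elements is not shown here.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical helper for illustration; not part of Solr or SolrJ.
final class XmlFieldPayload {

    /** Escapes the characters that would otherwise be read as markup. */
    static String escape(String raw) {
        return raw.replace("&", "&amp;")
                  .replace("<", "&lt;")
                  .replace(">", "&gt;");
    }

    /** Wraps text in CDATA, splitting any "]]>" so the section cannot end early. */
    static String cdata(String raw) {
        return "<![CDATA[" + raw.replace("]]>", "]]]]><![CDATA[>") + "]]>";
    }

    /** Builds a one-document message in the shape of Solr's XML update format. */
    static String updateMessage(String id, String subdocPayload) {
        return "<add><doc>"
             + "<field name=\"id\">" + escape(id) + "</field>"
             + "<field name=\"subdoc\">" + subdocPayload + "</field>"
             + "</doc></add>";
    }

    /** Returns the text content a generic XML parser sees in the second field (subdoc). */
    static String parsedSubdoc(String message) {
        try {
            Document dom = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(message)));
            return dom.getElementsByTagName("field").item(1).getTextContent();
        } catch (Exception e) {
            throw new IllegalStateException("message is not well-formed XML", e);
        }
    }

    public static void main(String[] args) {
        String xml = "<a><b>testing</b></a>"; // made-up sample fragment
        System.out.println(parsedSubdoc(updateMessage("id_1", xml)));          // tags are not part of the text
        System.out.println(parsedSubdoc(updateMessage("id_1", escape(xml))));  // text equals the fragment
        System.out.println(parsedSubdoc(updateMessage("id_1", cdata(xml))));   // text equals the fragment
    }
}
```

If the field value is instead passed as a plain Java string through SolrJ,
no manual escaping should be needed on the way in, and the place to look
is whatever renders the response.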

For example, when I store this field:


  


I get this kind of response when I query the index:


2012/12/20 Alexandre Rafalovitch:
> What happens if you just supply it as CDATA into a string field? Store, no
> index, probably compressed and lazy.
>
> Regards,
> Alex



-- 
--
| Modou DIA
| modo...@gmail.com
--


Re: Store and retrieve an xml sequence without losing the markup

2012-12-21 Thread Modou DIA
I am working with an XML format named EAD (Encoded Archival
Description). That format needs to be precompiled before it can be
digested by Solr.

So I just need to store the original XML file in my index.

Maybe I need to work with DIH?
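
One way to read the advice quoted below (flatten before indexing, keep the
original for display) is to derive a plain-text value for searching and
carry the untouched XML string alongside it for a stored-only field. The
sketch below does this with the JDK parser only. The class EadFlattener,
the field name "text", and the sample record are assumptions made for
illustration; only "id_1" and "subdoc" come from this thread, and no Solr
or SolrJ API is called.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Hypothetical helper for illustration; not part of Solr or SolrJ.
final class EadFlattener {

    /** Joins every non-empty text node with single spaces, dropping all markup. */
    static String flattenText(String xml) {
        try {
            Document dom = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
            StringBuilder out = new StringBuilder();
            collect(dom.getDocumentElement(), out);
            return out.toString();
        } catch (Exception e) {
            throw new IllegalArgumentException("input is not well-formed XML", e);
        }
    }

    /** Depth-first walk that appends text and CDATA nodes in document order. */
    private static void collect(Node node, StringBuilder out) {
        short type = node.getNodeType();
        if (type == Node.TEXT_NODE || type == Node.CDATA_SECTION_NODE) {
            String text = node.getNodeValue().trim().replaceAll("\\s+", " ");
            if (!text.isEmpty()) {
                if (out.length() > 0) {
                    out.append(' ');
                }
                out.append(text);
            }
        }
        for (Node child = node.getFirstChild(); child != null; child = child.getNextSibling()) {
            collect(child, out);
        }
    }

    /** Pairs the flattened text (to index) with the original XML (to store). */
    static Map<String, String> toFields(String id, String originalXml) {
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("id", id);
        fields.put("text", flattenText(originalXml)); // assumed searchable field
        fields.put("subdoc", originalXml);            // original markup, unchanged
        return fields;
    }

    public static void main(String[] args) {
        // Made-up minimal record using EAD element names.
        String ead = "<ead><eadheader><eadid>id_1</eadid></eadheader>"
                   + "<archdesc><did><unittitle>testing</unittitle></did></archdesc></ead>";
        toFields("id_1", ead).forEach((name, value) -> System.out.println(name + " = " + value));
    }
}
```

With SolrJ, each map entry could then be copied into the document being
added. Real EAD files often declare a DTD, which this sketch makes no
attempt to resolve or bypass.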

2012/12/20 Upayavira:
> Solr does not support nested structures. You need to flatten your data
> before indexing. You can store data in the way you did to be returned to
> your users, but you will not be able to search within the XML as XML.
>
> If you can explain the problem you are trying to solve, maybe folks here
> can help you find an alternative way of getting there.
>
> Upayavira



-- 
--
| Modou DIA
| modo...@gmail.com
--