Re: Delete document

2008-03-07 Thread Maximilian Hütter
Feng Gao schrieb:
> Try this: 05991
> 
> As I remember, we only can delete one doc once, but I am not sure. Give that
> a try later.
> 
> Feng
> 
Yes, you can only delete one document by id at once
The command posted before won't work anyway, as ...
is already one command and you can't have multiple command in one solr
request.
So you could also send multiple solr-requests, each containing:
0286-14582373
assuming that the "id" field is really your unique-key for the documents.
What doesn't work is this:

0286-14582371
0286-14582372
0286-14582373


Could please some one add this to Solr.

You could try:
id:[0286-14582371 TO 0286-14582373]
It would delete these documents: 0286-14582373, 0286-14582372,
0286-14582371 assuming that the range query works correctly on that field.

Best regards,

Max

-- 
Maximilian Hütter
blue elephant systems GmbH
Wollgrasweg 49
D-70599 Stuttgart

Tel:  (+49) 0711 - 45 10 17 578
Fax:  (+49) 0711 - 45 10 17 573
e-mail :  [EMAIL PROTECTED]
Sitz   :  Stuttgart, Amtsgericht Stuttgart, HRB 24106
Geschäftsführer:  Joachim Hörnle, Thomas Gentsch, Holger Dietrich


Admin ping

2008-03-07 Thread Doug Steigerwald

Came in this morning to find some alerts that the admin interface has basically 
died.  Everything
was fine until about 4am.  No updates or queries going on at that time (this is a QA machine). 
Anyone know why it might die like this?


Solr 1.3 trunk build from Jan 23rd, 4GB heap size, 4x3.2GHz Xeon, 8GB RAM 
total, RHEL 5.1, 64bit.

Mar 7, 2008 5:42:46 AM org.apache.solr.common.SolrException log
SEVERE: org.apache.jasper.JasperException: PWC6117: File "/admin/ping.jsp" not 
found
at 
org.apache.jasper.compiler.DefaultErrorHandler.jspError(DefaultErrorHandler.java:60)
at 
org.apache.jasper.compiler.ErrorDispatcher.dispatch(ErrorDispatcher.java:346)
at 
org.apache.jasper.compiler.ErrorDispatcher.jspError(ErrorDispatcher.java:140)
at org.apache.jasper.compiler.JspUtil.getInputStream(JspUtil.java:881)
at 
org.apache.jasper.xmlparser.XMLEncodingDetector.getEncoding(XMLEncodingDetector.java:114)
at
org.apache.jasper.compiler.ParserController.determineSyntaxAndEncoding(ParserController.java:347)
at 
org.apache.jasper.compiler.ParserController.doParse(ParserController.java:181)
at 
org.apache.jasper.compiler.ParserController.parse(ParserController.java:111)
at org.apache.jasper.compiler.Compiler.generateJava(Compiler.java:169)
at org.apache.jasper.compiler.Compiler.compile(Compiler.java:387)
at 
org.apache.jasper.JspCompilationContext.compile(JspCompilationContext.java:579)
at 
org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:344)
at 
org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:464)
at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:358)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)

This happened a few weeks ago, but someone just restarted Solr to get the admin interface back. 
They said that updates and queries were still working fine.


Thanks.
Doug


Re: Admin ping

2008-03-07 Thread Yonik Seeley
Jetty unpacks the war in a sub-dir of /tmp by default (which can get
cleaned by other things).

https://issues.apache.org/jira/browse/SOLR-118

devs: any reason we shouldn't create example/work and make sure it
gets in the distributions?

-Yonik

On Fri, Mar 7, 2008 at 7:19 AM, Doug Steigerwald
<[EMAIL PROTECTED]> wrote:
> Came in this morning to find some alerts that the admin interface has 
> basically died.  Everything
>  was fine until about 4am.  No updates or queries going on at that time (this 
> is a QA machine).
>  Anyone know why it might die like this?
>
>  Solr 1.3 trunk build from Jan 23rd, 4GB heap size, 4x3.2GHz Xeon, 8GB RAM 
> total, RHEL 5.1, 64bit.
>
>  Mar 7, 2008 5:42:46 AM org.apache.solr.common.SolrException log
>  SEVERE: org.apache.jasper.JasperException: PWC6117: File "/admin/ping.jsp" 
> not found
>  at 
> org.apache.jasper.compiler.DefaultErrorHandler.jspError(DefaultErrorHandler.java:60)
>  at 
> org.apache.jasper.compiler.ErrorDispatcher.dispatch(ErrorDispatcher.java:346)
>  at 
> org.apache.jasper.compiler.ErrorDispatcher.jspError(ErrorDispatcher.java:140)
>  at 
> org.apache.jasper.compiler.JspUtil.getInputStream(JspUtil.java:881)
>  at 
> org.apache.jasper.xmlparser.XMLEncodingDetector.getEncoding(XMLEncodingDetector.java:114)
>  at
>  
> org.apache.jasper.compiler.ParserController.determineSyntaxAndEncoding(ParserController.java:347)
>  at 
> org.apache.jasper.compiler.ParserController.doParse(ParserController.java:181)
>  at 
> org.apache.jasper.compiler.ParserController.parse(ParserController.java:111)
>  at 
> org.apache.jasper.compiler.Compiler.generateJava(Compiler.java:169)
>  at org.apache.jasper.compiler.Compiler.compile(Compiler.java:387)
>  at 
> org.apache.jasper.JspCompilationContext.compile(JspCompilationContext.java:579)
>  at 
> org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:344)
>  at 
> org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:464)
>  at org.apache.jasper.servlet.JspServlet.service(JspServlet.java:358)
>  at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
>
>  This happened a few weeks ago, but someone just restarted Solr to get the 
> admin interface back.
>  They said that updates and queries were still working fine.
>
>  Thanks.
>  Doug
>


Re: Admin ping

2008-03-07 Thread Benson Margulies
Suggestion,

another ant target that creates an example dir outside of the tree?

I was a little bit surprised by the following scenario:

1) svn co
2) ant example
3) edit schema.xml
4) svn st

In the future, I'll run cp -r before I start messing with the example.

I'm +1 for the work directory.

I'd also like to understand how the classpaths are managed. How does
solr/lib get into the classpath structure? Is it in the system classpath for
jetty or the webapp class path for the solr webapp?



On Fri, Mar 7, 2008 at 8:58 AM, Yonik Seeley <[EMAIL PROTECTED]> wrote:

> Jetty unpacks the war in a sub-dir of /tmp by default (which can get
> cleaned by other things).
>
> https://issues.apache.org/jira/browse/SOLR-118
>
> devs: any reason we shouldn't create example/work and make sure it
> gets in the distributions?
>
> -Yonik
>
> On Fri, Mar 7, 2008 at 7:19 AM, Doug Steigerwald
> <[EMAIL PROTECTED]> wrote:
> > Came in this morning to find some alerts that the admin interface has
> basically died.  Everything
> >  was fine until about 4am.  No updates or queries going on at that time
> (this is a QA machine).
> >  Anyone know why it might die like this?
> >
> >  Solr 1.3 trunk build from Jan 23rd, 4GB heap size, 4x3.2GHz Xeon, 8GB
> RAM total, RHEL 5.1, 64bit.
> >
> >  Mar 7, 2008 5:42:46 AM org.apache.solr.common.SolrException log
> >  SEVERE: org.apache.jasper.JasperException: PWC6117: File
> "/admin/ping.jsp" not found
> >  at org.apache.jasper.compiler.DefaultErrorHandler.jspError(
> DefaultErrorHandler.java:60)
> >  at org.apache.jasper.compiler.ErrorDispatcher.dispatch(
> ErrorDispatcher.java:346)
> >  at org.apache.jasper.compiler.ErrorDispatcher.jspError(
> ErrorDispatcher.java:140)
> >  at org.apache.jasper.compiler.JspUtil.getInputStream(
> JspUtil.java:881)
> >  at org.apache.jasper.xmlparser.XMLEncodingDetector.getEncoding(
> XMLEncodingDetector.java:114)
> >  at
> >  org.apache.jasper.compiler.ParserController.determineSyntaxAndEncoding(
> ParserController.java:347)
> >  at org.apache.jasper.compiler.ParserController.doParse(
> ParserController.java:181)
> >  at org.apache.jasper.compiler.ParserController.parse(
> ParserController.java:111)
> >  at org.apache.jasper.compiler.Compiler.generateJava(
> Compiler.java:169)
> >  at org.apache.jasper.compiler.Compiler.compile(Compiler.java
> :387)
> >  at org.apache.jasper.JspCompilationContext.compile(
> JspCompilationContext.java:579)
> >  at org.apache.jasper.servlet.JspServletWrapper.service(
> JspServletWrapper.java:344)
> >  at org.apache.jasper.servlet.JspServlet.serviceJspFile(
> JspServlet.java:464)
> >  at org.apache.jasper.servlet.JspServlet.service(JspServlet.java
> :358)
> >  at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
> >
> >  This happened a few weeks ago, but someone just restarted Solr to get
> the admin interface back.
> >  They said that updates and queries were still working fine.
> >
> >  Thanks.
> >  Doug
> >
>


Question about facet in solr

2008-03-07 Thread émile coué

Hi,


I am new in solr, and I use the SolrJ Client,

If I stores into solr doc1 is doc2 documents, having for the fields champX
respectively the values valeurA, valeurB.

When I use facet on the champX field, it returned the following info:

value (2)

valeura (1)

valeurb (1)

a (1)

b (1)

Thing I want not, I would like :

valeura (1)

valeurb (1)

An explanation please
-- 
View this message in context: 
http://www.nabble.com/Question-about-facet-in-solr-tp15893883p15893883.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Question about facet in solr

2008-03-07 Thread Erik Hatcher
My hunch is you want to set that to a "string" field so it is not  
tokenized - whereas it looks like you have a tokenized field with  
word delimiter and stemming.   I'm not sure how "value" got in there  
(is this French stemming?).  With "string" type, case will be exact  
in your facets, not all lowercased as you desired.


Erik


On Mar 7, 2008, at 10:04 AM, émile coué wrote:



Hi,


I am new in solr, and I use the SolrJ Client,

If I stores into solr doc1 is doc2 documents, having for the fields  
champX

respectively the values valeurA, valeurB.

When I use facet on the champX field, it returned the following info:

value (2)

valeura (1)

valeurb (1)

a (1)

b (1)

Thing I want not, I would like :

valeura (1)

valeurb (1)

An explanation please
--
View this message in context: http://www.nabble.com/Question-about- 
facet-in-solr-tp15893883p15893883.html

Sent from the Solr - User mailing list archive at Nabble.com.




Re: Question about facet in solr

2008-03-07 Thread émile coué

Hi,

Can you give me more explanation, what must I do, I'm new in solr?

Thanks



Erik Hatcher wrote:
> 
> My hunch is you want to set that to a "string" field so it is not  
> tokenized - whereas it looks like you have a tokenized field with  
> word delimiter and stemming.   I'm not sure how "value" got in there  
> (is this French stemming?).  With "string" type, case will be exact  
> in your facets, not all lowercased as you desired.
> 
>   Erik
> 
> 
> On Mar 7, 2008, at 10:04 AM, émile coué wrote:
> 
>>
>> Hi,
>>
>>
>> I am new in solr, and I use the SolrJ Client,
>>
>> If I stores into solr doc1 is doc2 documents, having for the fields  
>> champX
>> respectively the values valeurA, valeurB.
>>
>> When I use facet on the champX field, it returned the following info:
>>
>> value (2)
>>
>> valeura (1)
>>
>> valeurb (1)
>>
>> a (1)
>>
>> b (1)
>>
>> Thing I want not, I would like :
>>
>> valeura (1)
>>
>> valeurb (1)
>>
>> An explanation please
>> -- 
>> View this message in context: http://www.nabble.com/Question-about- 
>> facet-in-solr-tp15893883p15893883.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
> 
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Question-about-facet-in-solr-tp15893883p15900449.html
Sent from the Solr - User mailing list archive at Nabble.com.



Illegal xml/html character; unicode problems near solr

2008-03-07 Thread Peter Cline

Hi all,

I'm new to the list, but I've been struggling with this problem for some 
time. I'm getting Illegal xml/html character errors and I'm trying to 
track down the source. The characters in question seem to be in the 
128-159 (decimal) range, which is illegal in XML. The characters are 
mostly diacritics and other types of accents.


The original data is encoded in UTF-8. I have verified that the data 
doesn't contain any of these characters prior to indexing, and when I 
get the records in question back in a list of results, they display 
fine. The problem arises when the characters occur in a facet value and 
I try to pass it through the URL.


As an example, consider a facet value:
Brasseur de Bourbourg, abb%C3%A9, 1814-1874, former owner

The %C3%A9 is an e with a diacritic, so roughly abbe'.

The following is a snippet of a link to use a facet:
search-faceted.html?q=[* TO 
*]&facet=true&rows=25&fq=name_facet:"Brasseur de 
Bourbourg, abb%C3%A9, 1814-1874, former owner""


These characters are correctly specified. When it returns, I get an 
illegal character error. Examining the XML, I get an fq value of:

name_facet:"Brasseur de Bourbourg, abbé, 1814-1874, former owner"

I'm not sure how that will display in the email, but in short, it's not 
what I put in. Further, it's not legal html and things break.


Does anyone have any thoughts about this? I apologize if this has been 
asked somewhere in the past, but I did some digging and couldn't come up 
with anything. I welcome any input.


Regards,

Peter


Peter Cline, Digital Library Applications Programmer
University of Pennsylvania Library
email: pcline at pobox dot upenn dot edu


Re: Composite key for uniqueKeyId

2008-03-07 Thread Jon Baer

Hi Norberto,

This sounds exactly what Im looking to do, do you have an example?

(Keep in mind Im using data-config.xml - DataImporter)

Im interested in merging different types of content in, ie:

NEWS12345
VIDEO12345

So Id like to end up w/ different keys per type if possible.

Thanks.

- Jon

On Mar 6, 2008, at 11:21 PM, Norberto Meijome wrote:


On Thu, 6 Mar 2008 11:33:38 -0500
Jon Baer <[EMAIL PROTECTED]> wrote:


Im interested to know if composite keys are now possible or if there
is anything to copyField I can use to get composite keys working for
my doc ids?


FWIW, we just do this @ doc generation time - grab several fields,  
massage them into shape, normalise, assign to docID

B
_
{Beto|Norberto|Numard} Meijome

...using the internet as it was originally intended... for the  
further research of pornography and pipebombs.


I speak for myself, not my employer. Contents may be hot. Slippery  
when wet. Reading disclaimers makes you go blind. Writing them is  
worse. You have been Warned.




Re: Illegal xml/html character; unicode problems near solr

2008-03-07 Thread Yonik Seeley
On Fri, Mar 7, 2008 at 12:30 PM, Peter Cline <[EMAIL PROTECTED]> wrote:
>  The following is a snippet of a link to use a facet:
>  search-faceted.html?q=[* TO
>  *]&facet=true&rows=25&fq=name_facet:"Brasseur de
>  Bourbourg, abb%C3%A9, 1814-1874, former owner""
>
>  These characters are correctly specified. When it returns, I get an
>  illegal character error. Examining the XML, I get an fq value of:
>  name_facet:"Brasseur de Bourbourg, abbÃÂ(c), 1814-1874, former owner"

Is this bad XML part of the responseHeader (parameters that are simply
being echoed back)?
If so, it's most likely the config on whatever servlet container you
are using... you need to configure it to accept UTF-8 URLs rather than
latin-1 (Tomcat defaults to the old-style latin-1 AFAIK)

-Yonik


RE: Illegal xml/html character; unicode problems near solr

2008-03-07 Thread nicolas . dessaigne
I think Tomcat defaults to the operating system default, e.g. cp1252 on a
classic windows.

You need to add an attribute URIEncoding="UTF-8" to the Connector you use in
the server.xml conf.

Nicolas

-Message d'origine-
De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] De la part de Yonik Seeley
Envoyé : vendredi 7 mars 2008 18:53
À : solr-user@lucene.apache.org
Objet : Re: Illegal xml/html character; unicode problems near solr

On Fri, Mar 7, 2008 at 12:30 PM, Peter Cline <[EMAIL PROTECTED]> wrote:
>  The following is a snippet of a link to use a facet:
>  search-faceted.html?q=[* TO
>  *]&facet=true&rows=25&fq=name_facet:"Brasseur de
>  Bourbourg, abb%C3%A9, 1814-1874, former owner""
>
>  These characters are correctly specified. When it returns, I get an
>  illegal character error. Examining the XML, I get an fq value of:
>  name_facet:"Brasseur de Bourbourg, abbÃÂ(c), 1814-1874, former owner"

Is this bad XML part of the responseHeader (parameters that are simply
being echoed back)?
If so, it's most likely the config on whatever servlet container you
are using... you need to configure it to accept UTF-8 URLs rather than
latin-1 (Tomcat defaults to the old-style latin-1 AFAIK)

-Yonik


Re: mixing solr date and magic date words

2008-03-07 Thread Chris Hostetter
: I would like to send this query to Solr:
: 
: date:[2008-02-02T16:45:58Z-90DAY TO 2008-02-02T16:45:58Z-3DAY]

the trunk has code to support date math expressions on arbitrary dates (in 
solr 1.2 date math only worked relative to "NOW") but there is currently a 
bug in which milliseconds (which are suppose to be optional) are mandatory 
when dealing with date math ... it will be fixed by 1.3, but if you want 
to use the trunk now you would have to write that as...

   date:[2008-02-02T16:45:58.000Z-90DAY TO 2008-02-02T16:45:58.000Z-3DAY]



-Hoss



Re: What is default Date time format in Solr

2008-03-07 Thread Chris Hostetter

: I heard Solr Date time format is 24 hours.

that is correct.

: emf.artist:[2007-12-31T22:20:00Z TO  2007-12-31T22:39:00Z]
: 
: I am not able to get the content what I expected.
: 
: But, I tried with following query:-
: 
: emf.artist:[2007-12-31T10:20:00Z TO  2007-12-31T10:39:00Z]

Is your emf.artist field stored?  
If so what value do you see in the field when you do that second query and 
get the results you are looking for?  if they don't match what you think 
they should be, then the code you have reading dates from your index and 
writing them to Solr isn't doing what you think it's doing.




-Hoss



Re: Admin ping

2008-03-07 Thread Chris Hostetter

: another ant target that creates an example dir outside of the tree?

The example is just an example - most of our users will never run "ant 
example" anyway (it's done before the release is packaged up)

making "ant example" copy the example configs to ./build/example and the 
only packaging thta might be cleaner then what we currently do ... but 
it's really just a question of what makes life easier for devs at that 
point.

: I'd also like to understand how the classpaths are managed. How does
: solr/lib get into the classpath structure? Is it in the system classpath for
: jetty or the webapp class path for the solr webapp?

It's a custom classloader (a child of the solr webapp class loader) that 
Solr uses anytime it needs to instantiate new instances.




-Hoss



Re: Illegal xml/html character; unicode problems near solr

2008-03-07 Thread Peter Cline

Nicolas and Yonik,

Thank you both for your excellent responses--this fixed my problem.  Now 
it's time to go back and remove all the hacks I was using to pin this 
thing together without proper utf-8 support. 


Thanks again,
Peter

[EMAIL PROTECTED] wrote:

I think Tomcat defaults to the operating system default, e.g. cp1252 on a
classic windows.

You need to add an attribute URIEncoding="UTF-8" to the Connector you use in
the server.xml conf.

Nicolas

-Message d'origine-
De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] De la part de Yonik Seeley
Envoyé : vendredi 7 mars 2008 18:53
À : solr-user@lucene.apache.org
Objet : Re: Illegal xml/html character; unicode problems near solr

On Fri, Mar 7, 2008 at 12:30 PM, Peter Cline <[EMAIL PROTECTED]> wrote:
  

 The following is a snippet of a link to use a facet:
 search-faceted.html?q=[* TO
 *]&facet=true&rows=25&fq=name_facet:"Brasseur de
 Bourbourg, abb%C3%A9, 1814-1874, former owner""

 These characters are correctly specified. When it returns, I get an
 illegal character error. Examining the XML, I get an fq value of:
 name_facet:"Brasseur de Bourbourg, abbÃÂ(c), 1814-1874, former owner"



Is this bad XML part of the responseHeader (parameters that are simply
being echoed back)?
If so, it's most likely the config on whatever servlet container you
are using... you need to configure it to accept UTF-8 URLs rather than
latin-1 (Tomcat defaults to the old-style latin-1 AFAIK)

-Yonik

  


Re: Problem with html code inside xml

2008-03-07 Thread Latj


When I use HTML::Entities to encode my text, I get this error:

SEVERE: org.xmlpull.v1.XmlPullParserException: could not resolve entity
named 'para'

Its complaining about finding:   ¶   in my text. Anyone know why this
is a problem?





Jérôme Etévé-2 wrote:
> 
> If I understand, you want to keep the raw html code in solr like that
> (in your posting xml file):
> 
> 
>   
> 
> 
> I think you should encode your content to protect these xml entities:
> <  ->  <
>> -> >
> " -> "
> & -> &
> 
> If you use perl, have a look at HTML::Entities.
> 
> 
> On 9/25/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
>> Hello,
>>
>> I've got some problem with html code who is embedded in xml file:
>>
>> Sample source .
>>
>> 
>> 
>> 
>>  Les débats
>> 
>> 
>> Le premier tour des élections fédérales se
>> déroulera le 21
>> octobre prochain. D'ici là, La 1ère vous propose plusieurs rendez-
>> vous, dont plusieurs grands débats à l'enseigne de Forums.
>> 
>> 
>> 
>> 
>> my para textehere
>> 
>> 
>> Vous trouverez sur cette page toutes les
>> dates et les heures de
>> ces différents rendez-vous ainsi que le nom et les partis des
>> débatteurs. De plus, vous pourrez également écouter ou réécouter
>> l'ensemble de ces émissions.
>> 
>> 
>> 
>> -
>> When a make a query on solr I've got something like that in the
>> source code of the xml result:
>>
>> http://www.w3.org/1999/xhtml";>
>> <
>> div
>> class
>> =
>> "paragraph"
>> >
>> <
>> div
>> class
>> =
>> "paragraphTitle"
>> />
>> −
>> <
>> ...
>>
>> It is not exactly what I want. I want to keep the html tags, that all
>> without formatting.
>>
>> So the br tags and a tags are well formed in xml and json result, but
>> the div tags are not kept.
>> -
>> In the schema.xml I've got this for the html content
>>
>> 
>>
>>   > stored="true" multiValued="true"/>
>>
>> -
>>
>> Any help would be appreciate.
>>
>> Thanks in advance.
>>
>> S. Christin
>>
>>
>>
>>
>>
>>
> 
> 
> -- 
> Jerome Eteve.
> [EMAIL PROTECTED]
> http://jerome.eteve.free.fr/
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Problem-with-html-code-inside-xml-tp12877194p15907551.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Problem with html code inside xml

2008-03-07 Thread Reece
Just use cdata to have the parser ignore the html characters.

http://www.w3schools.com/xml/xml_cdata.asp

-Reece



On Fri, Mar 7, 2008 at 5:11 PM, Latj <[EMAIL PROTECTED]> wrote:
>
>
>  When I use HTML::Entities to encode my text, I get this error:
>
>  SEVERE: org.xmlpull.v1.XmlPullParserException: could not resolve entity
>  named 'para'
>
>  Its complaining about finding:   ¶   in my text. Anyone know why this
>  is a problem?
>
>
>
>
>
>  Jérôme Etévé-2 wrote:
>  >
>  > If I understand, you want to keep the raw html code in solr like that
>  > (in your posting xml file):
>  >
>  > 
>  >   
>  > 
>  >
>  > I think you should encode your content to protect these xml entities:
>  > <  ->  <
>  >> -> >
>  > " -> "
>  > & -> &
>  >
>  > If you use perl, have a look at HTML::Entities.
>  >
>  >
>  > On 9/25/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
>  >> Hello,
>  >>
>  >> I've got some problem with html code who is embedded in xml file:
>  >>
>  >> Sample source .
>  >>
>  >> 
>  >> 
>  >> 
>  >>  Les débats
>  >> 
>  >> 
>  >> Le premier tour des élections fédérales se
>  >> déroulera le 21
>  >> octobre prochain. D'ici là, La 1ère vous propose plusieurs rendez-
>  >> vous, dont plusieurs grands débats à l'enseigne de Forums.
>  >> 
>  >> 
>  >> 
>  >> 
>  >> my para textehere
>  >> 
>  >> 
>  >> Vous trouverez sur cette page toutes les
>  >> dates et les heures de
>  >> ces différents rendez-vous ainsi que le nom et les partis des
>  >> débatteurs. De plus, vous pourrez également écouter ou réécouter
>  >> l'ensemble de ces émissions.
>  >> 
>  >> 
>  >> 
>  >> -
>  >> When a make a query on solr I've got something like that in the
>  >> source code of the xml result:
>  >>
>  >> http://www.w3.org/1999/xhtml";>
>  >> <
>  >> div
>  >> class
>  >> =
>  >> "paragraph"
>  >> >
>  >> <
>  >> div
>  >> class
>  >> =
>  >> "paragraphTitle"
>  >> />
>  >> −
>  >> <
>  >> ...
>  >>
>  >> It is not exactly what I want. I want to keep the html tags, that all
>  >> without formatting.
>  >>
>  >> So the br tags and a tags are well formed in xml and json result, but
>  >> the div tags are not kept.
>  >> -
>  >> In the schema.xml I've got this for the html content
>  >>
>  >> 
>  >>
>  >> >> stored="true" multiValued="true"/>
>  >>
>  >> -
>  >>
>  >> Any help would be appreciate.
>  >>
>  >> Thanks in advance.
>  >>
>  >> S. Christin
>  >>
>  >>
>  >>
>  >>
>  >>
>  >>
>  >
>  >
>  > --
>  > Jerome Eteve.
>  > [EMAIL PROTECTED]
>  > http://jerome.eteve.free.fr/
>  >
>  >
>
>  --
>  View this message in context: 
> http://www.nabble.com/Problem-with-html-code-inside-xml-tp12877194p15907551.html
>  Sent from the Solr - User mailing list archive at Nabble.com.
>
>


Re: Problem with html code inside xml

2008-03-07 Thread Yonik Seeley
On Fri, Mar 7, 2008 at 5:11 PM, Latj <[EMAIL PROTECTED]> wrote:
>  When I use HTML::Entities to encode my text, I get this error:
>
>  SEVERE: org.xmlpull.v1.XmlPullParserException: could not resolve entity
>  named 'para'
>
>  Its complaining about finding:   ¶   in my text. Anyone know why this
>  is a problem?

¶ is an HTML entity, not standard in XML.

-Yonik


Re: Composite key for uniqueKeyId

2008-03-07 Thread Chris Hostetter

I believe Norberto ment he was handling it in his update client code -- 
before sending the docs to Solr.

Something that *seems* possible but I've never actaully tried is writting 
a "ConcatTokenFilterFactory" that queues up all the tokens and joins 
them together (using some confiured string, defaulting to "")  then you 
could in theory do something like this...


  


  

...

compositeKey
...


...

that *might* work ... but things would be a little weird when viewing your 
results (compositeKey would have to be multivalued, and it would return as 
an array)


-Hoss



Solr-J problem

2008-03-07 Thread oleg_gnatovskiy

Hello. I just started using solrJ recently and ran into a problem. I execute
the following line after creating a SolrQuery: SolrDocumentList solrResults
= engine.query(solrQuery).getResults();. solrResults.size() is always 10,
while solrResults.getNumFound() varies based on the query. My question is,
how do I get access to the entire result set? Why do I only get a list of
the first 10? Any help would be greatly appreciated.
-- 
View this message in context: 
http://www.nabble.com/Solr-J-problem-tp15910308p15910308.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Question about facet in solr

2008-03-07 Thread Erik Hatcher
In schema.xml, set champX to be "string" rather than, probably,  
"text".  Reindex your docs.  Now how do your facets come out?


Erik


On Mar 7, 2008, at 11:42 AM, émile coué wrote:



Hi,

Can you give me more explanation, what must I do, I'm new in solr?

Thanks



Erik Hatcher wrote:


My hunch is you want to set that to a "string" field so it is not
tokenized - whereas it looks like you have a tokenized field with
word delimiter and stemming.   I'm not sure how "value" got in there
(is this French stemming?).  With "string" type, case will be exact
in your facets, not all lowercased as you desired.

Erik


On Mar 7, 2008, at 10:04 AM, émile coué wrote:



Hi,


I am new in solr, and I use the SolrJ Client,

If I stores into solr doc1 is doc2 documents, having for the fields
champX
respectively the values valeurA, valeurB.

When I use facet on the champX field, it returned the following  
info:


value (2)

valeura (1)

valeurb (1)

a (1)

b (1)

Thing I want not, I would like :

valeura (1)

valeurb (1)

An explanation please
--
View this message in context: http://www.nabble.com/Question-about-
facet-in-solr-tp15893883p15893883.html
Sent from the Solr - User mailing list archive at Nabble.com.






--
View this message in context: http://www.nabble.com/Question-about- 
facet-in-solr-tp15893883p15900449.html

Sent from the Solr - User mailing list archive at Nabble.com.




Re: Composite key for uniqueKeyId

2008-03-07 Thread Jon Baer
That definitely sounds like the proper way to go + will try.  Im not  
too concerned w/ my keys coming back just that I can't seem to run the  
DataImportHandler w/o one.


I was able to temporarily get around it by returning it in the entity  
query.  Ie:



  


BTW, the DataImportHandler seems to still be a "patch", is there an  
estimation of if/when it will appear in trunk?


Thanks!

- Jon

On Mar 7, 2008, at 8:59 PM, Chris Hostetter wrote:



I believe Norberto ment he was handling it in his update client code  
--

before sending the docs to Solr.

Something that *seems* possible but I've never actaully tried is  
writting

a "ConcatTokenFilterFactory" that queues up all the tokens and joins
them together (using some confiured string, defaulting to "")  then  
you

could in theory do something like this...

   omitNorms="true">

 
   
   
 
   
   ...
   
   compositeKey
   ...
   
   
   ...

that *might* work ... but things would be a little weird when  
viewing your
results (compositeKey would have to be multivalued, and it would  
return as

an array)


-Hoss





Re: Solr-J problem

2008-03-07 Thread Otis Gospodnetic
I don't have the sources in front of me, but isn't there a setRows(int) method 
that you can call before running the query?

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: oleg_gnatovskiy <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, March 7, 2008 9:07:18 PM
Subject: Solr-J problem


Hello. I just started using solrJ recently and ran into a problem. I execute
the following line after creating a SolrQuery: SolrDocumentList solrResults
= engine.query(solrQuery).getResults();. solrResults.size() is always 10,
while solrResults.getNumFound() varies based on the query. My question is,
how do I get access to the entire result set? Why do I only get a list of
the first 10? Any help would be greatly appreciated.
-- 
View this message in context: 
http://www.nabble.com/Solr-J-problem-tp15910308p15910308.html
Sent from the Solr - User mailing list archive at Nabble.com.






Re: ranking on Multivalued fields

2008-03-07 Thread Otis Gospodnetic
Umar,

I'm not sure what you mean by a "subfield", can you explain please?
 
As for your second question, just add category:X to your query and you'll get 
matches ordered/ranked by score by default.

Otis

--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

- Original Message 
From: Umar Shah <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, March 7, 2008 1:17:35 AM
Subject: ranking on Multivalued fields

Hi,

I have a problem where i want to rank multivalued fields

suppose a multivalued field "category" having associated subfield "score".
First Is it possible to have a subfield in the mutlivalued field?
Second I want to get the documents ranked with the highest score say for the
category:X

thanks
Umar Shah





Re: Composite key for uniqueKeyId

2008-03-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
Good to hear that people are using DatImportHandler
In a couple of days, we are giving another patch which is cleared by
our QA  with
better error handling, messaging and a lot of new features.

A committer will have to decide on when it is good enough to be committed
--Noble

On Sat, Mar 8, 2008 at 10:11 AM, Jon Baer <[EMAIL PROTECTED]> wrote:
> That definitely sounds like the proper way to go + will try.  Im not
>  too concerned w/ my keys coming back just that I can't seem to run the
>  DataImportHandler w/o one.
>
>  I was able to temporarily get around it by returning it in the entity
>  query.  Ie:
>
>  
>
>  
>
>  BTW, the DataImportHandler seems to still be a "patch", is there an
>  estimation of if/when it will appear in trunk?
>
>  Thanks!
>
>  - Jon
>
>
>
>  On Mar 7, 2008, at 8:59 PM, Chris Hostetter wrote:
>
>  >
>  > I believe Norberto ment he was handling it in his update client code
>  > --
>  > before sending the docs to Solr.
>  >
>  > Something that *seems* possible but I've never actaully tried is
>  > writting
>  > a "ConcatTokenFilterFactory" that queues up all the tokens and joins
>  > them together (using some confiured string, defaulting to "")  then
>  > you
>  > could in theory do something like this...
>  >
>  >  > omitNorms="true">
>  >  
>  >
>  >
>  >  
>  >
>  >...
>  >
>  >compositeKey
>  >...
>  >
>  >
>  >...
>  >
>  > that *might* work ... but things would be a little weird when
>  > viewing your
>  > results (compositeKey would have to be multivalued, and it would
>  > return as
>  > an array)
>  >
>  >
>  > -Hoss
>  >
>
>



-- 
--Noble Paul


Re: Composite key for uniqueKeyId

2008-03-07 Thread Vijay Rao
I am also looking forward to get this checked into the trunk.

Will there be a patch with Solr1.2 support?
Cheers
Vijay

On Sat, Mar 8, 2008 at 10:11 AM, Jon Baer <[EMAIL PROTECTED]> wrote:

> That definitely sounds like the proper way to go + will try.  Im not
> too concerned w/ my keys coming back just that I can't seem to run the
> DataImportHandler w/o one.
>
> I was able to temporarily get around it by returning it in the entity
> query.  Ie:
>
> 
>   
> 
>
> BTW, the DataImportHandler seems to still be a "patch", is there an
> estimation of if/when it will appear in trunk?
>
> Thanks!
>
> - Jon
>
> On Mar 7, 2008, at 8:59 PM, Chris Hostetter wrote:
>
> >
> > I believe Norberto ment he was handling it in his update client code
> > --
> > before sending the docs to Solr.
> >
> > Something that *seems* possible but I've never actaully tried is
> > writting
> > a "ConcatTokenFilterFactory" that queues up all the tokens and joins
> > them together (using some confiured string, defaulting to "")  then
> > you
> > could in theory do something like this...
> >
> > > omitNorms="true">
> >  
> >
> >
> >  
> >
> >...
> >
> >compositeKey
> >...
> >
> >
> >...
> >
> > that *might* work ... but things would be a little weird when
> > viewing your
> > results (compositeKey would have to be multivalued, and it would
> > return as
> > an array)
> >
> >
> > -Hoss
> >
>
>