Re: Stemming numbers

2012-01-12 Thread Chris Hostetter
: We've had some issues with people searching for a document with the : search term '200 movies'. The document is actually title 'two hundred : movies'. : : Do we need to add every number to our synonyms dictionary to : accomplish this? Is it best done at index or search time? if all you care a

Re: Solr 3.3 crashes after ~18 hours?

2012-01-12 Thread cowwoc
I believe this issue is related to this Jetty bug report: https://bugs.eclipse.org/bugs/show_bug.cgi?id=357318 Gili -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-3-3-crashes-after-18-hours-tp3218496p3655937.html Sent from the Solr - User mailing list archive at Nabble.

Re: Question about updating index with custom field types

2012-01-12 Thread 罗赛
Hi Sylvain, I'm very sorry that I could not help you for I'm also doing pure English project... Erick, Thanks for your approach, I'll try it. Luo Sai On Wed, Jan 11, 2012 at 10:08 PM, Erick Erickson wrote: > I'm not sure what custom field types have to do with XML here. > Somewhere, you hav

Re: Relevancy and random sorting

2012-01-12 Thread Chris Hostetter
: We have a listing aggregator that gets product listings from a lot of : different sites and since they are added in batches, sometimes you see a : lot of pages from the same source (site). We are working on some changes to : shift things around and reduce this "blocking" effect, so we can presen

Re: SolrException: Invalid Date String:'oracle.sql.TIMESTAMP

2012-01-12 Thread yunfei wu
I guess you probably run into the issue between different date value format in your oracle db and in solr field. Solr only expects XML date value in UTC format - http://lucene.apache.org/solr/api/org/apache/solr/schema/DateField.html. You might need to consider DateFormatTransformer - http://wiki.

can solr automatically search for different punctuation of a word

2012-01-12 Thread alxsss
Hello, I would like to know if solr has a functionality to automatically search for a different punctuation of a word. For example if I if a user searches for a word Uber, and stemmer is german lang, then solr looks for both Uber and Über, like in synonyms. Is it possible to give a file with

replication failure, logs or notice?

2012-01-12 Thread Jonathan Rochkind
I think maybe my Solr 1.4 replications have been failing for quite some time, without me realizing it, possibly due to lack of disk space to replicate some large segments. Where would I look to see if a replication failed? Just the standard solr log? What would I look for? There's no facili

Re: SpatialSearch, geofilt and documents missing a value in sfield

2012-01-12 Thread Smiley, David W.
Hi Tanguy, On Jan 11, 2012, at 6:14 AM, Tanguy Moal wrote: > Dear ML, > > I'm performing some developments relying on spatial capabilities of solr. > > I'm using Solr 3.5, have been reading > http://wiki.apache.org/solr/SpatialSearch#Spatial_Query_Parameters and have > the basic behaviours I

Re: a way to marshall xml doc into a SolrInputDocument

2012-01-12 Thread jmuguruza
even if they could (not sure if they could be done there, as they involve properly formatting some fields so dates are in correct format etc, and maybe the format is checked first) I would prefer to do it in the solrj side as the code will be much simpler for me. thanks -- View this message in co

Re: a way to marshall xml doc into a SolrInputDocument

2012-01-12 Thread Tomás Fernández Löbbe
Can those modifications be made on the server side? If so, you could create an UpdateRequestProcessor. See http://wiki.apache.org/solr/UpdateRequestProcessor On Thu, Jan 12, 2012 at 5:19 PM, jmuguruza wrote: > If I have individual files in the expected Solr format (having just ONE doc > per file

a way to marshall xml doc into a SolrInputDocument

2012-01-12 Thread jmuguruza
If I have individual files in the expected Solr format (having just ONE doc per file): GB18030TEST Test with some GB18030 encoded characters No accents here ÕâÊÇÒ»¸ö¹¦ÄÜ 0 Is not there a way to easily marshal that file into a SolrInputDocument? Do I have to do the pars

RE: Determining which shard is failing using partialResults / some other technique?

2012-01-12 Thread Gilles Comeau
Hi all, Is there at least a way to print out which shard is being called in the logging and maybe logging a failure? INFO: [master] webapp=/solr path=/select params={facet=true&facet.mincount=1&facet.sort=count&q=(content_1500_chars:( ("allied+irish+banks"+OR+"+aib+")+AND+NOT+(bluray+OR+"R

RE: SolrException: Invalid Date String:'oracle.sql.TIMESTAMP

2012-01-12 Thread Colin Bennett
Hi, It looks like a date formatting issue, the Solr date field expects something like 1995-12-31T23:59:59.999Z See http://lucene.apache.org/solr/api/org/apache/solr/schema/DateField.html The data import handler does have a date transformer to convert dates http://wiki.apache.org/solr/DataImport

SolrException: Invalid Date String:'oracle.sql.TIMESTAMP

2012-01-12 Thread Joey Grimm
Hi, I am trying to use a dataImportHandler to import data from an oracle DB. It works for non-date fields but is throwing an exception once I included the MODIFIEDDATE field (oracle.timestamp field). Can anyone see what I'm doing wrong here? Thanks. schema.xml db-data-config.xml

Re: Search Specific Boosting

2012-01-12 Thread Brett
Hi Erick, Yeah, I've reviewed the debug output and can't make sense of why they are scoring the same. I have double checked that they are being indexed with different boost values for the search field. I've also increased the factors trying to get them be more granular so instead of boosting

Re: Relevancy and random sorting

2012-01-12 Thread Ahmet Arslan
> This document already has a field that indicates the source > (site). > The issue we are trying to solve is when we list all > documents without any > specific criteria. Since we bring the most recent ones and > the ones that > contains images, we end up having a lot of listings from a > single s

Re: Relevancy and random sorting

2012-01-12 Thread Alexandre Rocco
Michael, We are using the random sorting in combination with date and other fields but I am trying to change this to affect the ranking instead of sorting directly. That way we can also use other useful tweaks on the rank itself. Alexandre On Thu, Jan 12, 2012 at 11:46 AM, Michael Kuhlmann wrot

FacetComponent: suppress original query

2012-01-12 Thread Dmitry Kan
Hello list, I need to split the incoming original facet query into a list of sub-queries. The logic is done and each sub-query gets added into outgoing queue with rb.addRequest(), where rb is instance of ResponseBuilder. In the logs I see that along with the sub-queries the original query gets sub

Re: Highlighting issue with PlainTextEntityProcessor.

2012-01-12 Thread meghana
Hi Erik.. Thanks for your reply. And yes data was on index. but i found the problem , the problem was not of PlainTextEntityProcessor. highlighting was returning in multivalued field and in non-multivalued field there was less highlight. so i thought problem may be in PlainTextEntityProcessor. B

Re: Relevancy and random sorting

2012-01-12 Thread Michael Kuhlmann
Does the random sort function help you here? http://lucene.apache.org/solr/api/org/apache/solr/schema/RandomSortField.html However, you will get some very old listings then, if it's okay for you. -Kuli Am 12.01.2012 14:38, schrieb Alexandre Rocco: Erick, This document already has a field tha

Re: Relevancy and random sorting

2012-01-12 Thread Alexandre Rocco
Erick, This document already has a field that indicates the source (site). The issue we are trying to solve is when we list all documents without any specific criteria. Since we bring the most recent ones and the ones that contains images, we end up having a lot of listings from a single site, sin

Not able to see output in XML output

2012-01-12 Thread rajalapati
Hi, In my SOLR, I have a query based data-config written and was able to manage below steps but i was not able to see the output 1) Register Data Import Request handler in Solr-config.xml 2) Modify Data-Config.xml for the appropriate query to get data imported from which includes making use of

Re: Large data set or data corpus

2012-01-12 Thread jmuguruza
http://www.data.gov/ has lots of datasets available for free -- View this message in context: http://lucene.472066.n3.nabble.com/Large-data-set-or-data-corpus-tp3650316p3653154.html Sent from the Solr - User mailing list archive at Nabble.com.

Restricting access to shards / collections with SolrCloud

2012-01-12 Thread Jaran Nilsen
Hi. We're currently looking at SolrCloud to improve management of our Solr cluster. There is one use case which I am wondering if SolrCloud provide any support for out of the box, or if our best bet is to stick with our current solution. The use case is: We have a large number of shards, using t

Re: catchall field minus one field

2012-01-12 Thread elisabeth benoit
thanks a lot for your advice, I'll try that. Best regards, Elisabeth 2012/1/11 Erick Erickson > Hmmm, Once the data is included in the catch-all, it's indistinguishable > from > all the rest of the data, so I don't see how you could do this. A clause > like: > -excludeField:[* TO *] would exclu