Accessing multiple datasource from an entity

2018-09-05 Thread Monique Monteiro
Hi all,

I need to add a deltaQuery to a DIH configuration, but it should reference
a different datasource.  Example:






*
*  ...*

Is it possible?

Thanks in advance.

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


Silent error from DataImportHandler

2018-09-10 Thread Monique Monteiro
Hi all,

I have a data import handler configured with an Oracle SQL query which
works like a charm.  However, when I have the same query configured in
Solr's data import handler, nothing happens, and it returns:



"*Total Requests made to DataSource*": "1",

"*Total Rows Fetched*": "0",

"*Total Documents Processed*": "0",

"*Total Documents Skipped*": "0",

"Full Dump Started": "2018-09-06 18:15:59", "Full Import failed": "2018-09-06
18:16:02"

Has anyone any ideas about what may be happening?  Is there any log file
which can tell the error?

Thanks in advance,

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


Re: Silent error from DataImportHandler

2018-09-10 Thread Monique Monteiro
Hi Andrea,

Solr console doesn't return a very different information even with debug
mode enabled:

[image: image.png]

On Mon, Sep 10, 2018 at 12:00 PM Andrea Gazzarini 
wrote:

> You can check the solr.log or the solr-console.log. Another option is to
> activate the debug mode in the Solr console before running the data import.
>
> Andrea
>
> On 10/09/2018 16:57, Monique Monteiro wrote:
> > Hi all,
> >
> > I have a data import handler configured with an Oracle SQL query which
> > works like a charm.  However, when I have the same query configured in
> > Solr's data import handler, nothing happens, and it returns:
> >
> >
> >
> >  "*Total Requests made to DataSource*": "1",
> >
> >  "*Total Rows Fetched*": "0",
> >
> >  "*Total Documents Processed*": "0",
> >
> >  "*Total Documents Skipped*": "0",
> >
> > "Full Dump Started": "2018-09-06 18:15:59", "Full Import failed":
> "2018-09-06
> > 18:16:02"
> >
> > Has anyone any ideas about what may be happening?  Is there any log file
> > which can tell the error?
> >
> > Thanks in advance,
> >
>
>

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


Re: Silent error from DataImportHandler

2018-09-10 Thread Monique Monteiro
Text:

{ "responseHeader": { "status": 0, "QTime": 463 }, "initArgs": [ "defaults",
[ "config", "data-cnpj-config.xml" ] ], "command": "full-import", "mode":
"debug", "documents": [], "verbose-output": [], "status": "idle", "
importResponse": "", "statusMessages": { "Time Elapsed": "0:0:0.432", "Total
Requests made to DataSource": "1", "Total Rows Fetched": "0", "Total
Documents Processed": "0", "Total Documents Skipped": "0", "Full Dump
Started": "2018-09-10 12:04:03", "Full Import failed": "2018-09-10 12:04:04"
} }

On Mon, Sep 10, 2018 at 12:08 PM Andrea Gazzarini 
wrote:

> Hi Monique,
> I think you cannot attach files / images, please post, if available, the
> url of the image or a text description.
>
> Andrea
>
> On 10/09/2018 17:05, Monique Monteiro wrote:
> > Hi Andrea,
> >
> > Solr console doesn't return a very different information even with
> > debug mode enabled:
> >
> > image.png
> >
> > On Mon, Sep 10, 2018 at 12:00 PM Andrea Gazzarini
> > mailto:a.gazzar...@sease.io>> wrote:
> >
> > You can check the solr.log or the solr-console.log. Another option
> > is to
> > activate the debug mode in the Solr console before running the
> > data import.
> >
> > Andrea
> >
> > On 10/09/2018 16:57, Monique Monteiro wrote:
> > > Hi all,
> > >
> > > I have a data import handler configured with an Oracle SQL query
> > which
> > > works like a charm.  However, when I have the same query
> > configured in
> > > Solr's data import handler, nothing happens, and it returns:
> > >
> > >
> > >
> > >  "*Total Requests made to DataSource*": "1",
> > >
> > >  "*Total Rows Fetched*": "0",
> > >
> > >  "*Total Documents Processed*": "0",
> > >
> > >  "*Total Documents Skipped*": "0",
> > >
> > > "Full Dump Started": "2018-09-06 18:15:59", "Full Import
> > failed": "2018-09-06
> > > 18:16:02"
> > >
> > > Has anyone any ideas about what may be happening?  Is there any
> > log file
> > > which can tell the error?
> > >
> > > Thanks in advance,
> > >
> >
> >
> >
> > --
> > Monique Monteiro
> > Twitter: http://twitter.com/monilouise
>
>

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


Re: Silent error from DataImportHandler

2018-09-10 Thread Monique Monteiro
This is shown in the section "Raw Debug-Response".

On Mon, Sep 10, 2018 at 12:20 PM Andrea Gazzarini 
wrote:

> Hi Monique, this is the output; when you check the debug checkbox
> another section is printed
>
> Andrea
>
> On 10/09/2018 17:19, Monique Monteiro wrote:
> > Text:
> >
> > { "responseHeader": { "status": 0, "QTime": 463 }, "initArgs": [
> "defaults",
> > [ "config", "data-cnpj-config.xml" ] ], "command": "full-import", "mode":
> > "debug", "documents": [], "verbose-output": [], "status": "idle","
> > importResponse": "", "statusMessages": { "Time Elapsed": "0:0:0.432",
> "Total
> > Requests made to DataSource": "1", "Total Rows Fetched": "0", "Total
> > Documents Processed": "0", "Total Documents Skipped": "0", "Full Dump
> > Started": "2018-09-10 12:04:03", "Full Import failed": "2018-09-10
> 12:04:04"
> > } }
> >
> > On Mon, Sep 10, 2018 at 12:08 PM Andrea Gazzarini 
> > wrote:
> >
> >> Hi Monique,
> >> I think you cannot attach files / images, please post, if available, the
> >> url of the image or a text description.
> >>
> >> Andrea
> >>
> >> On 10/09/2018 17:05, Monique Monteiro wrote:
> >>> Hi Andrea,
> >>>
> >>> Solr console doesn't return a very different information even with
> >>> debug mode enabled:
> >>>
> >>> image.png
> >>>
> >>> On Mon, Sep 10, 2018 at 12:00 PM Andrea Gazzarini
> >>> mailto:a.gazzar...@sease.io>> wrote:
> >>>
> >>>  You can check the solr.log or the solr-console.log. Another option
> >>>  is to
> >>>  activate the debug mode in the Solr console before running the
> >>>  data import.
> >>>
> >>>  Andrea
> >>>
> >>>  On 10/09/2018 16:57, Monique Monteiro wrote:
> >>>  > Hi all,
> >>>  >
> >>>  > I have a data import handler configured with an Oracle SQL query
> >>>  which
> >>>  > works like a charm.  However, when I have the same query
> >>>  configured in
> >>>  > Solr's data import handler, nothing happens, and it returns:
> >>>  >
> >>>  >
> >>>  >
> >>>  >  "*Total Requests made to DataSource*": "1",
> >>>  >
> >>>  >  "*Total Rows Fetched*": "0",
> >>>  >
> >>>  >  "*Total Documents Processed*": "0",
> >>>  >
> >>>  >  "*Total Documents Skipped*": "0",
> >>>  >
> >>>  > "Full Dump Started": "2018-09-06 18:15:59", "Full Import
> >>>  failed": "2018-09-06
> >>>  > 18:16:02"
> >>>  >
> >>>  > Has anyone any ideas about what may be happening?  Is there any
> >>>  log file
> >>>  > which can tell the error?
> >>>  >
> >>>  > Thanks in advance,
> >>>  >
> >>>
> >>>
> >>>
> >>> --
> >>> Monique Monteiro
> >>> Twitter: http://twitter.com/monilouise
> >>
>
>

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


Re: Silent error from DataImportHandler

2018-09-10 Thread Monique Monteiro
Hi Andrea,

In fact, I had to add *logLevel="debug"* to the DIH configuration.  Just
checking "Debug" on console is not enough.  I also checked "Verbose". Now,
a database exception (locked account is shown).

Thanks!
Monique


On Mon, Sep 10, 2018 at 12:26 PM Andrea Gazzarini 
wrote:

> I cannot give you detailed instructions as I don't have in front of me a
> Solr console with the dataimport enabled, but I remember that there's a
> detailed section which reports a lot of information.
>
> In the meantime: shooting in the dark, if the query is working and
> "total rows fetched" = 0, then the query is not working for some reason:
> can you check if some clause in your SQL include < or > ? They need to
> be escaped (< >)
>
> Andrea
>
>
> On 10/09/2018 17:22, Monique Monteiro wrote:
> > This is shown in the section "Raw Debug-Response".
> >
> > On Mon, Sep 10, 2018 at 12:20 PM Andrea Gazzarini 
> > wrote:
> >
> >> Hi Monique, this is the output; when you check the debug checkbox
> >> another section is printed
> >>
> >> Andrea
> >>
> >> On 10/09/2018 17:19, Monique Monteiro wrote:
> >>> Text:
> >>>
> >>> { "responseHeader": { "status": 0, "QTime": 463 }, "initArgs": [
> >> "defaults",
> >>> [ "config", "data-cnpj-config.xml" ] ], "command": "full-import",
> "mode":
> >>> "debug", "documents": [], "verbose-output": [], "status": "idle","
> >>> importResponse": "", "statusMessages": { "Time Elapsed": "0:0:0.432",
> >> "Total
> >>> Requests made to DataSource": "1", "Total Rows Fetched": "0", "Total
> >>> Documents Processed": "0", "Total Documents Skipped": "0", "Full Dump
> >>> Started": "2018-09-10 12:04:03", "Full Import failed": "2018-09-10
> >> 12:04:04"
> >>> } }
> >>>
> >>> On Mon, Sep 10, 2018 at 12:08 PM Andrea Gazzarini <
> a.gazzar...@sease.io>
> >>> wrote:
> >>>
> >>>> Hi Monique,
> >>>> I think you cannot attach files / images, please post, if available,
> the
> >>>> url of the image or a text description.
> >>>>
> >>>> Andrea
> >>>>
> >>>> On 10/09/2018 17:05, Monique Monteiro wrote:
> >>>>> Hi Andrea,
> >>>>>
> >>>>> Solr console doesn't return a very different information even with
> >>>>> debug mode enabled:
> >>>>>
> >>>>> image.png
> >>>>>
> >>>>> On Mon, Sep 10, 2018 at 12:00 PM Andrea Gazzarini
> >>>>> mailto:a.gazzar...@sease.io>> wrote:
> >>>>>
> >>>>>   You can check the solr.log or the solr-console.log. Another
> option
> >>>>>   is to
> >>>>>   activate the debug mode in the Solr console before running the
> >>>>>   data import.
> >>>>>
> >>>>>   Andrea
> >>>>>
> >>>>>   On 10/09/2018 16:57, Monique Monteiro wrote:
> >>>>>   > Hi all,
> >>>>>   >
> >>>>>   > I have a data import handler configured with an Oracle SQL
> query
> >>>>>   which
> >>>>>   > works like a charm.  However, when I have the same query
> >>>>>   configured in
> >>>>>   > Solr's data import handler, nothing happens, and it returns:
> >>>>>   >
> >>>>>   >
> >>>>>   >
> >>>>>   >  "*Total Requests made to DataSource*": "1",
> >>>>>   >
> >>>>>   >  "*Total Rows Fetched*": "0",
> >>>>>   >
> >>>>>   >  "*Total Documents Processed*": "0",
> >>>>>   >
> >>>>>   >  "*Total Documents Skipped*": "0",
> >>>>>   >
> >>>>>   > "Full Dump Started": "2018-09-06 18:15:59", "Full Import
> >>>>>   failed": "2018-09-06
> >>>>>   > 18:16:02"
> >>>>>   >
> >>>>>   > Has anyone any ideas about what may be happening?  Is there
> any
> >>>>>   log file
> >>>>>   > which can tell the error?
> >>>>>   >
> >>>>>   > Thanks in advance,
> >>>>>   >
> >>>>>
> >>>>>
> >>>>>
> >>>>> --
> >>>>> Monique Monteiro
> >>>>> Twitter: http://twitter.com/monilouise
> >>
>
>

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


by: java.util.zip.DataFormatException: invalid distance too far back reported by Solr API

2019-01-30 Thread Monique Monteiro
Hi all,

I'm writing a Python routine to upload thousands of PDF files to Solr, and
after trying to upload some files, Solr reports the following error in a
HTTP 500 response:

"by: java.util.zip.DataFormatException: invalid distance too far back"

Does anyone have any idea about how to overcome this?

Thanks in advance,
Monique Monteiro


Re: by: java.util.zip.DataFormatException: invalid distance too far back reported by Solr API

2019-02-04 Thread Monique Monteiro
Hi all,

In fact, moving the parsing to the client solved the problem!

Thanks!
Monique

On Thu, Jan 31, 2019 at 8:25 AM Jan Høydahl  wrote:

> Hi
>
> This is Apache Tika that cannot parse a zip file or possibly a zip
> formatted office file.
> You have to post the full stack trace (which you'll find in the solr.log
> on server side)
> if you want help in locating the source of the issue, you may be able to
> configure Tika
>
> Have you tried to specify ignoreTikaException=true on the request? See
> https://lucene.apache.org/solr/guide/7_6/uploading-data-with-solr-cell-using-apache-tika.html
>
> At the end of the day it would be a much better architecture to parse the
> PDFs using plain standalone TikaServer and then construct a Solr Document
> in your Python code which is then posted to Solr. Reason is you have much
> better control over parse errors and how to map metadata to your schema
> fields. Also you don't want to overload Solr with all this work, it can
> even crash the whole Solr server if some parser crashes or gets stuck in an
> infinite loop.
>
> --
> Jan Høydahl, search solution architect
> Cominvent AS - www.cominvent.com
>
> > 30. jan. 2019 kl. 20:49 skrev Monique Monteiro  >:
> >
> > Hi all,
> >
> > I'm writing a Python routine to upload thousands of PDF files to Solr,
> and
> > after trying to upload some files, Solr reports the following error in a
> > HTTP 500 response:
> >
> > "by: java.util.zip.DataFormatException: invalid distance too far back"
> >
> > Does anyone have any idea about how to overcome this?
> >
> > Thanks in advance,
> > Monique Monteiro
>
>

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


Unsubscribing

2019-03-18 Thread Monique Monteiro
How can I unsubscribe from this list?

Thanks in advance,

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


MoreLikeThis in Solr 7.3.1

2018-06-19 Thread Monique Monteiro
Hi all,

I'm trying to access /mlt in Solr, but the index returns HTTP 404 error.

I've already configured the following:


   - /solr-7.3.1/server/solr/configsets/_default/conf/solrconfig.xml:

  **
**
*  _text_*
**
*  *

 AND

  **
**
*list *
* *
*  *

But none of this made "http://localhost:8983/solr/**/mlt?q=*:*
return anything other than 404.

Has anyone any idea about what may be happening?

Thanks in advance,

-- 
Monique Monteiro


Re: MoreLikeThis in Solr 7.3.1

2018-06-19 Thread Monique Monteiro
Hi Anshum,

I'm using SolrCloud, but both instances are on the same Solr installation
(it's just for test purposes), so I suppose they share configuration in
solr-7.3.1/server/solr/configsets/_default/conf/solrconfig.xml.

So should I recreate the collection ?

Thanks,
Monique

On Tue, Jun 19, 2018 at 4:41 PM Anshum Gupta  wrote:

> Hi Monique,
>
> Is this standalone Solr or SolrCloud ? If it is cloud, then you’d have to
> make sure that you uploaded the right config and collection should also be
> reloaded if you enabled it after creating the collection.
>
> Also, did you check the MLT Query parser that does the same thing but
> doesn’t require registering of the handler etc. You can find it’s
> documentation here:
> https://lucene.apache.org/solr/guide/7_3/other-parsers.html#more-like-this-query-parser
>
> * *Anshum
>
>
> On Jun 19, 2018, at 11:00 AM, Monique Monteiro 
> wrote:
>
> Hi all,
>
> I'm trying to access /mlt in Solr, but the index returns HTTP 404 error.
>
> I've already configured the following:
>
>
>   - /solr-7.3.1/server/solr/configsets/_default/conf/solrconfig.xml:
>
>  * path="/update/**,/query,/select,/tvrh,/elevate,/spell,/browse,/mlt">*
> **
> *  _text_*
> **
> *  *
>
> AND
>
>  **
> **
> *list *
> * *
> *  *
>
> But none of this made "http://localhost:8983/solr/**/mlt?q=*:*
> return anything other than 404.
>
> Has anyone any idea about what may be happening?
>
> Thanks in advance,
>
> --
> Monique Monteiro
>
>
>

-- 
Monique Monteiro
Blog: http://moniquelouise.spaces.live.com/
Twitter: http://twitter.com/monilouise


Re: MoreLikeThis in Solr 7.3.1

2018-06-19 Thread Monique Monteiro
I reloaded the collection with the command:

http://localhost:8983/solr/admin/collections?action=RELOAD&name=documentos_ce

But stil the same problem...

On Tue, Jun 19, 2018 at 4:48 PM Monique Monteiro 
wrote:

> Hi Anshum,
>
> I'm using SolrCloud, but both instances are on the same Solr installation
> (it's just for test purposes), so I suppose they share configuration in
> solr-7.3.1/server/solr/configsets/_default/conf/solrconfig.xml.
>
> So should I recreate the collection ?
>
> Thanks,
> Monique
>
> On Tue, Jun 19, 2018 at 4:41 PM Anshum Gupta  wrote:
>
>> Hi Monique,
>>
>> Is this standalone Solr or SolrCloud ? If it is cloud, then you’d have to
>> make sure that you uploaded the right config and collection should also be
>> reloaded if you enabled it after creating the collection.
>>
>> Also, did you check the MLT Query parser that does the same thing but
>> doesn’t require registering of the handler etc. You can find it’s
>> documentation here:
>> https://lucene.apache.org/solr/guide/7_3/other-parsers.html#more-like-this-query-parser
>>
>> * *Anshum
>>
>>
>> On Jun 19, 2018, at 11:00 AM, Monique Monteiro 
>> wrote:
>>
>> Hi all,
>>
>> I'm trying to access /mlt in Solr, but the index returns HTTP 404 error.
>>
>> I've already configured the following:
>>
>>
>>   - /solr-7.3.1/server/solr/configsets/_default/conf/solrconfig.xml:
>>
>>  *> path="/update/**,/query,/select,/tvrh,/elevate,/spell,/browse,/mlt">*
>> **
>> *  _text_*
>> **
>> *  *
>>
>> AND
>>
>>  **
>> *    *
>> *list *
>> * *
>> *  *
>>
>> But none of this made "http://localhost:8983/solr/*> name>*/mlt?q=*:*
>> return anything other than 404.
>>
>> Has anyone any idea about what may be happening?
>>
>> Thanks in advance,
>>
>> --
>> Monique Monteiro
>>
>>
>>
>
> --
> Monique Monteiro
> Blog: http://moniquelouise.spaces.live.com/
> Twitter: http://twitter.com/monilouise
>


-- 
Monique Monteiro
Blog: http://moniquelouise.spaces.live.com/
Twitter: http://twitter.com/monilouise


Re: MoreLikeThis in Solr 7.3.1

2018-06-20 Thread Monique Monteiro
Hi Anshum,

Thanks! By using Zookeeper CLI I managed to update the configs.

On Tue, Jun 19, 2018 at 6:29 PM Anshum Gupta  wrote:

> That explains it :)
>
> I assume you did make those changes on disk and did not upload the updated
> configset to zookeeper.
>
> SolrCloud instances use the configset from zk, so all changed files would
> have to be uploaded to zk.
>
> You can re-uplaod the configset using the zkcli.sh script that comes with
> Solr (or some other utility) :
> https://lucene.apache.org/solr/guide/7_3/command-line-utilities.html#using-solr-s-zookeeper-cli
>
> You can also use this script:
> https://lucene.apache.org/solr/guide/7_3/using-zookeeper-to-manage-configuration-files.html#uploading-configuration-files-using-bin-solr-or-solrj
>
> Here’s the config set API that can also be used to accomplish the same:
> https://lucene.apache.org/solr/guide/7_3/configsets-api.html#configsets-api-entry-points
>
> Whatever mechanism you choose to upload the updated config, you should be
> able to see the latest config @ the Solr admin UI (assuming you have access
> to that) by cloud > tree > configs > 
>
>
> * *Anshum
>
>
> On Jun 19, 2018, at 2:08 PM, Monique Monteiro 
> wrote:
>
> I reloaded the collection with the command:
>
>
> http://localhost:8983/solr/admin/collections?action=RELOAD&name=documentos_ce
>
> But stil the same problem...
>
> On Tue, Jun 19, 2018 at 4:48 PM Monique Monteiro  >
> wrote:
>
> Hi Anshum,
>
> I'm using SolrCloud, but both instances are on the same Solr installation
> (it's just for test purposes), so I suppose they share configuration in
> solr-7.3.1/server/solr/configsets/_default/conf/solrconfig.xml.
>
> So should I recreate the collection ?
>
> Thanks,
> Monique
>
> On Tue, Jun 19, 2018 at 4:41 PM Anshum Gupta  wrote:
>
> Hi Monique,
>
> Is this standalone Solr or SolrCloud ? If it is cloud, then you’d have to
> make sure that you uploaded the right config and collection should also be
> reloaded if you enabled it after creating the collection.
>
> Also, did you check the MLT Query parser that does the same thing but
> doesn’t require registering of the handler etc. You can find it’s
> documentation here:
>
> https://lucene.apache.org/solr/guide/7_3/other-parsers.html#more-like-this-query-parser
>
> * *Anshum
>
>
> On Jun 19, 2018, at 11:00 AM, Monique Monteiro 
> wrote:
>
> Hi all,
>
> I'm trying to access /mlt in Solr, but the index returns HTTP 404 error.
>
> I've already configured the following:
>
>
>  - /solr-7.3.1/server/solr/configsets/_default/conf/solrconfig.xml:
>
> * path="/update/**,/query,/select,/tvrh,/elevate,/spell,/browse,/mlt">*
> *    *
> *  _text_*
> *    *
> *  *
>
> AND
>
> **
> **
> *list *
> * *
> *  *
>
> But none of this made "http://localhost:8983/solr/* name>*/mlt?q=*:*
> return anything other than 404.
>
> Has anyone any idea about what may be happening?
>
> Thanks in advance,
>
> --
> Monique Monteiro
>
>
>
>
> --
> Monique Monteiro
> Blog: http://moniquelouise.spaces.live.com/
> Twitter: http://twitter.com/monilouise
>
>
>
> --
> Monique Monteiro
> Blog: http://moniquelouise.spaces.live.com/
> Twitter: http://twitter.com/monilouise
>
>
>

-- 
Monique Monteiro
Blog: http://moniquelouise.spaces.live.com/
Twitter: http://twitter.com/monilouise


Problem with fuzzy search and accentuation

2018-08-03 Thread Monique Monteiro
Hi all,

I'm having a problem when I search for a word with some non-ASCII
characters in combination with fuzzy search.

For example, if I type 'administração' or 'contratação' (both words end
with 'ção'), the search results are returned correctly.  However, if I type
'administração~', no result is returned.  For other terms, I haven't found
any problem.

My Solr version is  6.6.3.

Has anyone any idea about what may cause this issue?

Thanks in advance.

-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


Re: Problem with fuzzy search and accentuation

2018-08-03 Thread Monique Monteiro
By adding debug=true, I get the following:


   - administração (correct result):

"debug":{
"rawquerystring":"administração",
"querystring":"administração",
"parsedquery":"text:administr",
"parsedquery_toString":"text:administr",
"QParser":"LuceneQParser"}}


   - administração~ (incorrect behaviour, no results):

"debug":{
"rawquerystring":"administração~",
"querystring":"administração~",
"parsedquery":"text:administração~2",
"parsedquery_toString":"text:administração~2",
"QParser":"LuceneQParser"}}


   - tribunal (correct result):

"debug":{
"rawquerystring":"tribunal",
"querystring":"tribunal",
"parsedquery":"text:tribunal",
"parsedquery_toString":"text:tribunal",
"QParser":"LuceneQParser"}}


   - tribubal (correct result, no accents):

 "debug":{
"rawquerystring":"tribubal~",
"querystring":"tribubal~",
"parsedquery":"text:tribubal~2",
"parsedquery_toString":"text:tribubal~2",
"QParser":"LuceneQParser"}}

On Fri, Aug 3, 2018 at 3:26 PM Erick Erickson 
wrote:

> What does adding &debug=query show you the parsed query is in the two
> cases?
>
> My guess is that accent folding is kicking in one case but not the
> other, but that's
> a blind guess.
>
>
>
> On Fri, Aug 3, 2018 at 11:19 AM, Monique Monteiro
>  wrote:
> > Hi all,
> >
> > I'm having a problem when I search for a word with some non-ASCII
> > characters in combination with fuzzy search.
> >
> > For example, if I type 'administração' or 'contratação' (both words end
> > with 'ção'), the search results are returned correctly.  However, if I
> type
> > 'administração~', no result is returned.  For other terms, I haven't
> found
> > any problem.
> >
> > My Solr version is  6.6.3.
> >
> > Has anyone any idea about what may cause this issue?
> >
> > Thanks in advance.
> >
> > --
> > Monique Monteiro
> > Twitter: http://twitter.com/monilouise
>


-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise


Re: Problem with fuzzy search and accentuation

2018-08-07 Thread Monique Monteiro
Hi Erick,

In fact, stemming was the culprit for the problem.

Thanks!
Monique Monteiro

On Fri, Aug 3, 2018 at 3:45 PM Erick Erickson 
wrote:

> Stemming is getting in the way here. You could probably use copyField
> to a field that doesn't stem and fuzzy search against that field
> rather than the stemmed one.
>
> Best,
> Erick
>
> On Fri, Aug 3, 2018 at 11:31 AM, Monique Monteiro
>  wrote:
> > By adding debug=true, I get the following:
> >
> >
> >- administração (correct result):
> >
> > "debug":{
> > "rawquerystring":"administração",
> > "querystring":"administração",
> > "parsedquery":"text:administr",
> > "parsedquery_toString":"text:administr",
> > "QParser":"LuceneQParser"}}
> >
> >
> >- administração~ (incorrect behaviour, no results):
> >
> > "debug":{
> > "rawquerystring":"administração~",
> > "querystring":"administração~",
> > "parsedquery":"text:administração~2",
> > "parsedquery_toString":"text:administração~2",
> > "QParser":"LuceneQParser"}}
> >
> >
> >- tribunal (correct result):
> >
> > "debug":{
> > "rawquerystring":"tribunal",
> > "querystring":"tribunal",
> > "parsedquery":"text:tribunal",
> > "parsedquery_toString":"text:tribunal",
> > "QParser":"LuceneQParser"}}
> >
> >
> >- tribubal (correct result, no accents):
> >
> >  "debug":{
> > "rawquerystring":"tribubal~",
> > "querystring":"tribubal~",
> > "parsedquery":"text:tribubal~2",
> > "parsedquery_toString":"text:tribubal~2",
> > "QParser":"LuceneQParser"}}
> >
> > On Fri, Aug 3, 2018 at 3:26 PM Erick Erickson 
> > wrote:
> >
> >> What does adding &debug=query show you the parsed query is in the two
> >> cases?
> >>
> >> My guess is that accent folding is kicking in one case but not the
> >> other, but that's
> >> a blind guess.
> >>
> >>
> >>
> >> On Fri, Aug 3, 2018 at 11:19 AM, Monique Monteiro
> >>  wrote:
> >> > Hi all,
> >> >
> >> > I'm having a problem when I search for a word with some non-ASCII
> >> > characters in combination with fuzzy search.
> >> >
> >> > For example, if I type 'administração' or 'contratação' (both words
> end
> >> > with 'ção'), the search results are returned correctly.  However, if I
> >> type
> >> > 'administração~', no result is returned.  For other terms, I haven't
> >> found
> >> > any problem.
> >> >
> >> > My Solr version is  6.6.3.
> >> >
> >> > Has anyone any idea about what may cause this issue?
> >> >
> >> > Thanks in advance.
> >> >
> >> > --
> >> > Monique Monteiro
> >> > Twitter: http://twitter.com/monilouise
> >>
> >
> >
> > --
> > Monique Monteiro
> > Twitter: http://twitter.com/monilouise
>


-- 
Monique Monteiro
Twitter: http://twitter.com/monilouise