Re: URL Case Sensitive/Insensitive

2018-12-11 Thread Walter Underwood
ilto:erickerick...@gmail.com] > Sent: Tuesday, December 11, 2018 12:41 PM > To: solr-user > Subject: Re: URL Case Sensitive/Insensitive > > What do you mean by "url case"? No, I'm not being snarky. > > The value returned in a doc is very different than

RE: URL Case Sensitive/Insensitive

2018-12-11 Thread Moyer, Brett
#x27;s! Brett Moyer    -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Tuesday, December 11, 2018 12:41 PM To: solr-user Subject: Re: URL Case Sensitive/Insensitive What do you mean by "url case"? No, I'm not being snarky. The value

Re: URL Case Sensitive/Insensitive

2018-12-11 Thread Toke Eskildsen
Moyer, Brett wrote: > What is the best practice on URL case? I work with web archiving and URL-normalisation is quite a tricky thing. The software we use is https://github.com/ukwa/webarchive-discovery and in there a lot of energy has been spend on the subject. Long story short, we index 2 for

Re: URL Case Sensitive/Insensitive

2018-12-11 Thread Erick Erickson
What do you mean by "url case"? No, I'm not being snarky. The value returned in a doc is very different than the value searched. The stored data is the original input without going through any filters. If you mean the value _returned_ by Solr from a stored field, then the case is exactly what

URL Case Sensitive/Insensitive

2018-12-11 Thread Moyer, Brett
Hello, I'm new to Solr been using it for a few months. A recent question came up from our business partners about URL casing. Previously their URLs were upper case, they made a change and now all lower. Both pages/URLs are still accessible so there are duplicates in Solr. They are requesting all