Re: Using jetty's GzipFilter in the example solr.war

2010-11-13 Thread Kiwi de coder
Try putting your filter at the top of web.xml (instead of in the middle or at the bottom). I tried this over the last few days and it is only a simple solution (I'm not sure whether the spec requires the filter to be on top or whether this is a bug). Hope this helps. kiwi On Sat, Nov 13, 2010 at 8:11 PM, Jay Luker wrote: > Hi, > > I thought I'd try turning on gzip compressi

Re: DIH for multilingual index & multiValued field?

2010-11-13 Thread Ken Stanley
On Sat, Nov 13, 2010 at 5:59 PM, Ken Stanley wrote: >   CREATE TABLE documents ( >       id INT NOT NULL AUTO_INCREMENT, >       language_code CHAR(2), >       tags CHAR(30), >       text TEXT, >       PRIMARY KEY (id) >   ); I apologize, but I couldn't leave the typo in my last post without a fo

Re: DIH for multilingual index & multiValued field?

2010-11-13 Thread Ken Stanley
On Sat, Nov 13, 2010 at 4:56 PM, Ahmet Arslan wrote: > For (1) you probably need to write a custom transformer. Something like: > public Object transformRow(Map row)     { > String language_code = row.get("language_code"); > String text = row.get("text"); > if("en".equals(language_code)) >       r

Re: DIH for multilingual index & multiValued field?

2010-11-13 Thread Ahmet Arslan
For (1) you probably need to write a custom transformer. Something like: public Object transformRow(Map row) { String language_code = row.get("language_code"); String text = row.get("text"); if("en".equals(language_code)) row.put("text_en", text); else if("fr".equals(language_code))
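
A minimal, self-contained sketch of the transformer idea above. It assumes the DIH convention described on the DIHCustomTransformer wiki page, where a transformer can be a plain class exposing a `transformRow(Map<String, Object> row)` method. The class name `LanguageFieldTransformer` and the `text_fr` target field are assumptions made for illustration; only `language_code`, `text`, and `text_en` appear in the thread.

```java
import java.util.Map;

public class LanguageFieldTransformer {

    /**
     * Copies the "text" column into a language-specific field chosen by
     * the row's "language_code" column, then returns the same row.
     */
    public Object transformRow(Map<String, Object> row) {
        Object languageCode = row.get("language_code");
        Object text = row.get("text");

        // Nothing to route if the row carries no text.
        if (text == null) {
            return row;
        }

        if ("en".equals(languageCode)) {
            row.put("text_en", text);
        } else if ("fr".equals(languageCode)) {
            // Assumption: the schema defines a matching "text_fr" field.
            row.put("text_fr", text);
        }
        // Rows with any other language code are passed through unchanged.
        return row;
    }
}
```

Rows are typed as `Map<String, Object>` here because the raw `Map` in the quoted snippet would need casts before `row.get(...)` could be assigned to a `String`.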

Re: DIH for multilingual index & multiValued field?

2010-11-13 Thread Imran
I think a custom transformer would be of help in these scenarios http://wiki.apache.org/solr/DIHCustomTransformer Cheers -- Imran On Sat, Nov 13, 2010 at 8:55 PM, Andy wrote: > I have a MySQL table: > >CREATE TABLE documents ( >id IN

DIH for multilingual index & multiValued field?

2010-11-13 Thread Andy
I have a MySQL table: CREATE TABLE documents ( id INT NOT NULL AUTO_INCREMENT, language_code CHAR(2), tags CHAR(30), text TEXT, PRIMARY KEY (id) ); I have 2 questions about Solr DIH: 1) The "language_code" field indicates what language the "text" f

RE: IndexableBinaryStringTools (was FieldCache)

2010-11-13 Thread Steven A Rowe
On 11/13/2010 at 2:04 PM, Yonik Seeley wrote: > On Sat, Nov 13, 2010 at 1:50 PM, Steven A Rowe wrote: > > Looks to me like the returned value is in a Solr-internal form of XML > > character escaping: \u0000 is represented as "#0;" and \u0008 is > > represented as "#8;".  (The escaping code is in > >

Re: IndexableBinaryStringTools (was FieldCache)

2010-11-13 Thread Yonik Seeley
On Sat, Nov 13, 2010 at 1:50 PM, Steven A Rowe wrote: > Looks to me like the returned value is in a Solr-internal form of XML > character escaping: \u0000 is represented as "#0;" and \u0008 is represented > as "#8;".  (The escaping code is in > solr/src/java/org/apache/common/util/XML.java.) Y

RE: IndexableBinaryStringTools (was FieldCache)

2010-11-13 Thread Steven A Rowe
Hi Mathias, > > > I assume that the char[] returned from > > > IndexableBinaryStringTools.encode is encoded in UTF-8 again > > > and then stored. At some point the information is lost and > > > cannot be recovered. > > > > Can you give an example? This should not happen. > > My character array r

Re: facetting when using field collapsing

2010-11-13 Thread Yonik Seeley
On Sat, Nov 13, 2010 at 10:46 AM, Lukas Kahwe Smith wrote: > > On 13.11.2010, at 10:30, Yonik Seeley wrote: > >> On Wed, Nov 10, 2010 at 9:12 AM, Lukas Kahwe Smith >> wrote: >>> The above wiki page seems to be out of date. Reading the comments in >>> https://issues.apache.org/jira/browse/SOLR-2

Re: facetting when using field collapsing

2010-11-13 Thread Lukas Kahwe Smith
On 13.11.2010, at 10:30, Yonik Seeley wrote: > On Wed, Nov 10, 2010 at 9:12 AM, Lukas Kahwe Smith > wrote: >> The above wiki page seems to be out of date. Reading the comments in >> https://issues.apache.org/jira/browse/SOLR-236 it seems like "group" should >> be replaced with "collapse". >

Re: facetting when using field collapsing

2010-11-13 Thread Yonik Seeley
On Wed, Nov 10, 2010 at 9:12 AM, Lukas Kahwe Smith wrote: > The above wiki page seems to be out of date. Reading the comments in > https://issues.apache.org/jira/browse/SOLR-236 it seems like "group" should > be replaced with "collapse". The Wiki page is not expansive, but I've tried to make it

Re: full text search in multiple fields

2010-11-13 Thread Ahmet Arslan
> I checked the url: http://wiki.apache.org/solr/DisMaxQParserPlugin > > When I execute this url on my local machine: > http://localhost:8983/solr/select/?q=video&qt=defType=dismax&qf=features^20.0+text^0.3 > > I get the error: unknown handler: defType=dismax > > So where can I download that han
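
A short sketch of why the quoted URL fails. In `qt=defType=dismax`, everything after the first `=` is read as the value of `qt`, so Solr looks for a request handler literally named `defType=dismax`, which matches the error message. The code below builds the request with `defType` as its own parameter instead. That this is what the poster intended is an assumption, and the class and method names are hypothetical; the host, path, query, and `qf` value are taken from the quoted URL.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.LinkedHashMap;
import java.util.Map;

public class DismaxUrlExample {

    /** URL-encodes one parameter name or value as UTF-8. */
    public static String encode(String value) {
        try {
            return URLEncoder.encode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always available on a compliant JVM.
            throw new IllegalStateException(e);
        }
    }

    /** Joins the parameters into a query string, one name=value pair each. */
    public static String buildQueryUrl(String baseUrl, Map<String, String> params) {
        StringBuilder url = new StringBuilder(baseUrl).append('?');
        boolean first = true;
        for (Map.Entry<String, String> param : params.entrySet()) {
            if (!first) {
                url.append('&');
            }
            url.append(encode(param.getKey())).append('=').append(encode(param.getValue()));
            first = false;
        }
        return url.toString();
    }

    /** The query from the thread, with defType passed as a separate parameter. */
    public static String dismaxUrl() {
        Map<String, String> params = new LinkedHashMap<>();
        params.put("q", "video");
        params.put("defType", "dismax");              // selects the query parser
        params.put("qf", "features^20.0 text^0.3");   // field boosts, space-separated
        return buildQueryUrl("http://localhost:8983/solr/select", params);
    }
}
```

Encoding each value separately also takes care of the `^` and space characters in `qf`, which are easy to get wrong when the URL is typed by hand.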

Re: Looking for help with Solr implementation

2010-11-13 Thread Jean-Sebastien Vachon
Yes we did. Sorry for this. We both made the same error replying to the mailing list. - Original Message - From: "Thumuluri, Sai" To: Sent: Saturday, November 13, 2010 8:41 AM Subject: RE: Looking for help with Solr implementation Please refrain from using this mailing group for solicit

RE: Looking for help with Solr implementation

2010-11-13 Thread Thumuluri, Sai
Please refrain from using this mailing group for soliciting, and take it offline. -Original Message- From: AC [mailto:acanuc...@yahoo.com] Sent: Sat 11/13/2010 1:12 AM To: solr-user@lucene.apache.org Subject: Re: Looking for help with Solr implementation Hey Jean-Sebastien, Thanks for the rep

RE: Searching problem

2010-11-13 Thread Steven A Rowe
Hi Riz, You likely have some form of stemming in your indexing analysis chain - this may cause "panasonic", e.g., to be indexed as "panason". (The remainder of this email assumes that this is true.) When you search for "panasonic", presumably with the same stemming filter in your query analys
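
A toy illustration of the stemming explanation above, not Solr or Lucene code. It assumes, as the message does, that a stemmer runs at index time, and additionally assumes that prefix (wildcard) queries are compared against indexed terms without being stemmed. `toyStem` is a hypothetical stand-in that only strips a trailing "ic"; a real stemming filter applies far more rules.

```java
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

public class StemmedPrefixExample {

    /** Toy stemmer: lowercases the token and strips a trailing "ic". */
    public static String toyStem(String token) {
        String lower = token.toLowerCase();
        return lower.endsWith("ic") ? lower.substring(0, lower.length() - 2) : lower;
    }

    /** Index-time analysis: every token is stored in its stemmed form. */
    public static Set<String> index(List<String> tokens) {
        Set<String> terms = new TreeSet<>();
        for (String token : tokens) {
            terms.add(toyStem(token));
        }
        return terms;
    }

    /** Plain term query: the query text is stemmed the same way as the index. */
    public static boolean matchesTerm(Set<String> terms, String query) {
        return terms.contains(toyStem(query));
    }

    /** Prefix query: the raw prefix is compared to indexed terms, unstemmed. */
    public static boolean matchesPrefix(Set<String> terms, String prefix) {
        for (String term : terms) {
            if (term.startsWith(prefix)) {
                return true;
            }
        }
        return false;
    }
}
```

With "Panasonic" indexed as "panason", a term query for "panasonic" is stemmed and matches, while the prefix "panasonic" is longer than the stored term and cannot match; a shorter prefix such as "panas" still does.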

Re: filtering or getting accurate crawling results

2010-11-13 Thread Erick Erickson
My Nutch knowledge is pretty sparse; you'd probably get more knowledgeable answers on the Nutch mailing list. Best Erick On Fri, Nov 12, 2010 at 11:52 PM, Dennis Gearon wrote: > Actually, can Nutch be used for SCRAPING, not crawling? > > I don't just want the url, I want the data assigned to

Using jetty's GzipFilter in the example solr.war

2010-11-13 Thread Jay Luker
Hi, I thought I'd try turning on gzip compression but I can't seem to get jetty's GzipFilter to actually compress my responses. I unpacked the example solr.war and tried adding variations of the following to the web.xml (and then rejar-ed), but as far as I can tell, jetty isn't actually compressin

Re: How to Facet on a price range

2010-11-13 Thread Govind Kanshi
Kudos to Jan's pre-compute option and gwk's range facet answer. On Wed, Nov 10, 2010 at 2:52 PM, Geert-Jan Brits wrote: > Ah I see: like you said it's part of the facet range implementation. > Frontend is already working, just need the 'update-on-slide' behavior. > > Thanks > Geert-Jan > > 2010/

Re: A Newbie Question

2010-11-13 Thread Govind Kanshi
Another point of view you might want to think about: what kind of search do you want? Just plain full-text search, or is there something more to those text files? Are they grouped in folders? Do the folders imply a certain kind of grouping/hierarchy/tagging? I recently was trying to help somebody who had files

Re: Searching problem

2010-11-13 Thread Govind Kanshi
You should spend some time on http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters On Sat, Nov 13, 2010 at 10:42 AM, M.Rizwan wrote: > Hi All, > > Do you have any idea why a Solr search for "panasonic*" (without > quotes) does not match "panasonic"? > If we search "panasonic" it match

Re: full text search in multiple fields

2010-11-13 Thread PeterKerk
All helpful responses, so thank you for that. I checked the url: http://wiki.apache.org/solr/DisMaxQParserPlugin When I execute this url on my local machine: http://localhost:8983/solr/select/?q=video&qt=defType=dismax&qf=features^20.0+text^0.3 I get the error: unknown handler: defType=dismax