Re: Rounding errors with SOLR score

2014-03-21 Thread Raymond Wiker
Are you sure that SOLR is rounding incorrectly, and not simply differently from what you expect? I was surprised myself at some of the rounding behaviour I saw with SOLR, but according to http://en.wikipedia.org/wiki/Rounding , the results were valid (just not the round-up-from-half that I naively
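A minimal sketch of the difference between two common rounding rules, in Python. The message does not say which rule SOLR applied, so this only illustrates how a result can be valid and still not be the round-up-from-half one; the sample values are made up.

```python
from decimal import Decimal, ROUND_HALF_EVEN, ROUND_HALF_UP

def round_to_int(value: str, mode: str) -> int:
    """Round a decimal string to an integer using the given rounding mode."""
    return int(Decimal(value).quantize(Decimal("1"), rounding=mode))

# Illustrative values only; each one sits exactly on a .5 boundary.
samples = ["0.5", "1.5", "2.5"]
half_up = [round_to_int(s, ROUND_HALF_UP) for s in samples]
half_even = [round_to_int(s, ROUND_HALF_EVEN) for s in samples]
```

The two lists differ for 0.5 and 2.5, which is the kind of difference that looks like an error if you expect round-half-up.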

Re: regex in Solr Query

2014-03-13 Thread Raymond Wiker
Regular expressions are a text-matching mechanism, so you shouldn't expect to be able to use them on numeric data. If your timestamps are of the form you indicate, you should be able to filter on pubdateraw:[2005 TO 2005]. On Thu, Mar 13, 2014 at 11:45 AM, Priti Solanki wrote: > Hi, > > > I

Re: Curl : shell script : The requested resource is not available. update/extract !

2014-03-10 Thread Raymond Wiker
"literal.id" should contain a unique identifier for each document (assuming that the unique identifier field in your solr schema is called "id"); see http://wiki.apache.org/solr/ExtractingRequestHandler . I'm guessing that the url for the ExtractingRequestHandler is incorrect, or maybe you haven't
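A rough sketch of what an update/extract URL with a "literal.id" parameter could look like, built in Python. The host, port, core name ("collection1") and document id are assumptions for illustration, not values from the thread; the "commit" parameter is optional.

```python
from urllib.parse import urlencode

def extract_url(base: str, core: str, doc_id: str) -> str:
    """Build an update/extract URL that passes the unique key as literal.id."""
    # "core" is a hypothetical core/collection name; adjust to your setup.
    params = urlencode({"literal.id": doc_id, "commit": "true"})
    return f"{base}/{core}/update/extract?{params}"

url = extract_url("http://localhost:8983/solr", "collection1", "doc-1")
```

The file itself would then be sent as the request body, e.g. with curl or any HTTP client.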

Re: range types in SOLR

2014-03-01 Thread Raymond Wiker
Looks like you might be able to use sub-documents (or whatever it is called in SOLR) for this; create the parent document without any dates, and a child document for each date range. On 01 Mar 2014, at 19:41 , Thomas Scheffler wrote: > Am 01.03.14 18:24, schrieb Erick Erickson: >> I'm not cle

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-25 Thread Raymond Wiker
A few things: 1) If your database uses a BLOB, you should not use clobtransformer; FieldStreamDataSource should be sufficient. 2) In a previous message, it showed that the converted/extracted document was empty (except for an html boilerplate wrapper). This was using the configuration I suggested

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Raymond Wiker
B type, then indexing is done smoothly. > But, I am trying to index binary data which are stored in DB in BLOB type. > > Need help. > > Thanking you, > Chandan > > > > -Original Message- > From: Raymond Wiker [mailto:rwi...@gmail.com] > Sent: Monday, Febru

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Raymond Wiker
ot; > format="text" > > > > > > > > > > > > > > This is looks like similar to your configuration. But when xml data are in > BLOB in database, indexing is done. But, when binary data are in

Re: Can not index raw binary data stored in Database in BLOB format.

2014-02-24 Thread Raymond Wiker
I've done something like this; the key was to use a FieldStreamDataSource to read from the BLOB field. Something like then ... On Mon, Feb 24, 2014 at 11:04 AM, Chandan khatua wrote: > Hi Gora ! > > Your concern was "What is the t

Re: query parameters

2014-02-18 Thread Raymond Wiker
That could be because the second condition does not do what you think it does... have you tried running the second condition separately? You may have to add a "base term" to the second condition, like what you have for the "bq" parameter in your config file; i.e, something like (*:* -organisation

Re: how to write an efficient query with a subquery to restrict the search space?

2014-01-23 Thread Raymond Wiker
Maybe you could move (field2:val2 or field4:val4) into a filter? E.g, q=(field1:val1 OR field2:val2 OR field3:val3 OR field4:val4)&fq=(field2:val2 OR field4:val4) If I have this correctly, the fq part should be evaluated first, and may even be found in the filter cache. On Thu, Jan 23, 2014 at
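A small sketch of the suggestion above: the restricting clause goes into "fq" while the main query stays in "q". The field names and values are the placeholders from the message; nothing is sent to a server here.

```python
from urllib.parse import urlencode, parse_qs

def build_query(main_query: str, filter_query: str) -> str:
    """Return an encoded query string with the restriction moved into fq."""
    return urlencode([("q", main_query), ("fq", filter_query)])

query_string = build_query(
    "(field1:val1 OR field2:val2 OR field3:val3 OR field4:val4)",
    "(field2:val2 OR field4:val4)",
)
# Decode again to check what the server would see.
decoded = parse_qs(query_string)
```

Keeping the filter text identical between requests is what gives the filter cache a chance to be reused.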

Re: Solr middle-ware?

2014-01-21 Thread Raymond Wiker
Speaking for myself, I avoid using "client apis" like SolrNet, SolrJ and FAST DSAPI for the simple reason that I feel that the abstractions they offer are so thin that I may just as well talk directly to the HTTP interface. Doing that also lets me build web applications that maintain their own stat

Re: Solr middle-ware?

2014-01-21 Thread Raymond Wiker
We're using Apache with mod_auth_sspi, mod_rewrite and mod_proxy to handle authentication and (limited) parameter validation. On the inside, we have a wrapper process that builds filters for document-level security based on the user's identity/identities and groups, does some more parameter validat

Re: Query by range of price

2014-01-20 Thread Raymond Wiker
Followup: I *think* something like this should work: $results = $solr->search($query, $start, $rows, array('sort' => 'price_min asc,update_date desc', 'facet.query' => 'price_min:[* TO 1300]')); On Mon, Jan 20, 2014 at 11:05 AM, Raymond Wiker

Re: Query by range of price

2014-01-20 Thread Raymond Wiker
That's exactly what I would expect from url-encoding '&'. So, the thing that you're doing works as it should, but you're probably doing something that you should not do (in this case, urlencode). I have not used SolrPHPClient myself, but from the example at http://code.google.com/p/solr-php-client

Re: Query by range of price

2014-01-16 Thread Raymond Wiker
What Erick is saying is that the facet.query seen by solr is price_min:[*+TO+1300] rather than price_min:[* TO 1300]. Having done this sort of thing myself, my guess is that you're probably doing one urlencode operation more than you should be (on the facet.query value). On Fri, Jan 17, 2014 at
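A minimal Python sketch of how an extra encoding step can produce exactly this symptom. It assumes the application replaces spaces with "+" by hand and the client library then encodes the value again; whether that is what happened in the original code is a guess.

```python
from urllib.parse import quote_plus, unquote_plus

raw = "price_min:[* TO 1300]"

# Correct: encode exactly once, the server decodes exactly once.
encoded_once = quote_plus(raw)
seen_single = unquote_plus(encoded_once)

# Assumed mistake: spaces are "+"-coded by hand, then the value is encoded again.
pre_encoded = raw.replace(" ", "+")
encoded_twice = quote_plus(pre_encoded)
seen_double = unquote_plus(encoded_twice)
```

After one round of decoding, the doubly encoded value still contains literal "+" characters, so the range is no longer parsed as intended.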

Re: Query by range of price

2014-01-16 Thread Raymond Wiker
To start with, you have "+"-coded spaces in the range part, but the sort parameter has an unencoded space character. Not sure if this is the reason that it fails, but it is certainly a reason to look closer at how you encode your queries... On 16 Jan 2014, at 12:29 , rachun wrote: > Hi Gurus

Re: SOLR Security - Displaying endpoints to public

2014-01-07 Thread Raymond Wiker
Indeed it is - but you'll also need mod_proxy ("just" rewriting will not be sufficient). On Tue, Jan 7, 2014 at 3:42 AM, Otis Gospodnetic wrote: > Apache url_rewrite can help with this and it's only a few minutes to set > up. > > Otis > -- > Performance Monitoring * Log Analytics * Search Analy

Re: SOLR Security - Displaying endpoints to public

2014-01-06 Thread Raymond Wiker
On 06 Jan 2014, at 19:37 , Shawn Heisey wrote: > On 1/6/2014 11:18 AM, Shawn Heisey wrote: >> Even if you disable admin handlers so that it's impossible to gather full >> information about your schema and other settings, generating legitimate >> queries is probably enough for an attacker to ge

Re: Solr hanging when extracting a some broken .doc files

2013-12-19 Thread Raymond Wiker
On Thu, Dec 19, 2013 at 10:01 AM, Charlie Hull wrote: > On 18/12/2013 09:03, Alexandre Rafalovitch wrote: > >> Charlie, >> >> Does it mean you are talking to it from a client program? Or are you >> running Tika in a listen/server mode and build some adapters for standard >> Solr processes? >> > >

Re: Storing MYSQL DATETIME field in solr as String

2013-12-17 Thread Raymond Wiker
On 17 Dec 2013, at 17:30 , manju16832003 wrote: > Hi Raymond, > You mean during the DIH execution? Yes; as part of the sql statement.

Re: Storing MYSQL DATETIME field in solr as String

2013-12-16 Thread Raymond Wiker
To me, the obvious way of doing this would be to CAST the DATETIME to CHAR(n), or (probably better) use DATE_FORMAT(). On Tue, Dec 17, 2013 at 5:21 AM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > I'm sorry. I thought you wanted to parse a date stored as string into > a java.util.Dat

Re: [ContentStreamUpdateRequest] POST Http Request header

2013-12-13 Thread Raymond Wiker
I think you're wrong about this; both the "literal.*" parameters and the file will be sent as part of the body, as "multipart/form-data". It is possible that whatever library you're using is placing the "literal.*" parameters in the url, but that is by no means necessary (or even a good idea).

Re: Indexing on plain text and binary data in a single HTTP POST request

2013-12-10 Thread Raymond Wiker
I would index all attachments separately, but with some sort of reference back to the mail message. That way, I could use the update handler for the text and metadata of the mail message, and the update/extract handler for the binary attachment(s) and a restricted set of metadata (file name, co

Re: Indexing on plain text and binary data in a single HTTP POST request

2013-12-09 Thread Raymond Wiker
On 09 Dec 2013, at 17:20 , neerajp wrote: > > 2) Your binary content is encoded in some way inside XML, right? Not just > random binary, which would make it invalid XML? Like base64 or something? > > [Neeraj]: I want to use random binary(*not base64 encoded*) in some of the > XML fields insi

Re: Upgrading Solr cluster without downtime

2013-12-01 Thread Raymond Wiker
We have been doing exactly that through several versions of Solr: we unpack the new version on one set of replicas, install empty directories for the core(s) we want to use, and create empty core.properties files in these. Then, we start the new replicas, using a (stand-alone) zookeeper for the

Re: Periodic Slowness on Solr Cloud

2013-11-22 Thread Raymond Wiker
You mentioned earlier that you are not setting -Xms/-Xmx; the values actually in use would then depend on the Java version, whether you're running 32- or 64-bit Java, whether Java thinks your machines are "servers", and whether you have specified the "-server" flag – and possibly a few other thi

Re: Huge Response Time

2013-11-07 Thread Raymond Wiker
A few options: 1) Check what the response times are if you return only a small number of fields from the query (e.g, just the "id" field). If the response times improve greatly, you are probably returning some very long fields, and you may be able to drop some of these from the query result. 2

Re: get min-max prices as facets

2013-11-06 Thread Raymond Wiker
As far as I understand, the filter terms (fq parameters) should be automatically cached. On 06 Nov 2013, at 19:08 , PeterKerk wrote: > Hi Raymond, > > Thanks. > This caching of the main query you mention, is that done automatically or do > I have to do something for that? > > > > -- > View

Re: get min-max prices as facets

2013-11-06 Thread Raymond Wiker
One way of getting the min/max values is to fire off two additional queries, using the current query and filter params, a field list ("fl") containing just the field you want to find min/max for, asking for a single hit (rows=1), and sorting by that field - asc in one query, desc in the other. T
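The two extra queries described above could be built roughly like this in Python. The query, filter and field name ("price") are placeholders, and the function only produces query strings; it does not talk to a server.

```python
from urllib.parse import urlencode

def min_max_queries(q, fqs, field):
    """Return (min_query, max_query): same q/fq, one hit, sorted asc and desc."""
    def one(direction):
        params = [("q", q)] + [("fq", fq) for fq in fqs]
        params += [("fl", field), ("rows", "1"), ("sort", f"{field} {direction}")]
        return urlencode(params)
    return one("asc"), one("desc")

# Hypothetical inputs for illustration.
min_qs, max_qs = min_max_queries("*:*", ["category:books"], "price")
```

The single hit from the ascending query carries the minimum value, and the one from the descending query carries the maximum.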

Re: Facet question: Getting only the matched value from multivalued field

2013-11-05 Thread Raymond Wiker
We have a somewhat similar case; what we will do is to have one analysed field in conjunction with a string field (possibly with case folding). That way, we can use the original field values for displaying as facets, but also allow searches for parts of the facet values. On Tue, Nov 5, 2013 at 1:

Re: Performance of "rows" and "start" parameters

2013-11-05 Thread Raymond Wiker
Are you restricting the set of fields that you return from the queries? If not, it could be that you are returning fields that are potentially very large, and may affect query performance that way. On Tue, Nov 5, 2013 at 11:38 AM, michael.boom wrote: > Thank you! > > I suspect that maybe my box

Re: AJAX Solr returning the default wildcard *:* and not what I query

2013-10-31 Thread Raymond Wiker
The parameters indicate a jQuery.ajax call with result type "jsonp" - a

Re: Solr Update URI is not found

2013-10-27 Thread Raymond Wiker
On 28 Oct 2013, at 01:19 , Bayu Widyasanyata wrote: > request: http://localhost:8080/solr/update?wt=javabin&version=2 I think this url is incorrect: there should be a core name between "solr" and "update".

Re: securing Solr Admin

2013-10-22 Thread Raymond Wiker
On Oct 22, 2013, at 19:29 , Shawn Heisey wrote: > On 10/22/2013 8:09 AM, Raheel Hasan wrote: >> This sounds like trouble. >> >> I have used Solr in my script (php) such that I curl it for query (using >> "solr/automata/select?q="). If I make it completely off-public, how will my >> own site acces

Re: securing Solr Admin

2013-10-22 Thread Raymond Wiker
On Oct 22, 2013, at 15:32 , Raheel Hasan wrote: > Hi, > > I want to know how to secure the admin section. The site " > http://wiki.apache.org/solr/SolrSecurity" has a lot of stuff, but I want to > put htaccess based restriction. > > Can anyone tell me where to place the htaccess? > > I am usin

Re: DIH with SolrCloud

2013-10-08 Thread Raymond Wiker
It looks like your select statement does not return any rows... have you verified it with some sort of SQL client? On Tue, Oct 8, 2013 at 8:57 AM, Prasi S wrote: > Hi , > I have setup solrcloud with solr4.4. The cloud has 2 tomcat instances with > separate zookeeper. > > i execute the below co

Re: Prevent public access to Solr Admin Page

2013-09-26 Thread Raymond Wiker
On Sep 26, 2013, at 11:13 , uwe72 wrote: > Hi there, > > how can i prevent that everybody who knows the URL of our solr admin page, > has the right to access it? > > Thanks in advance! > Uwe I'd restrict access to the jetty server to localhost, and use an Apache httpd instance (or some other

Re: how can I use DataImportHandler on multiple MySQL databases with the same schema?

2013-09-17 Thread Raymond Wiker
You can also define multiple dataimporthandlers in solrconfig.xml, each with their own data-config. On Wed, Sep 18, 2013 at 7:45 AM, Alexandre Rafalovitch wrote: > You can create multiple entities in DIH definition and they will all run. > Means duplicating the mapping definition apart from data

Re: Get the commit time of a document in Solr

2013-09-12 Thread Raymond Wiker
On Sep 12, 2013, at 20:55 , phanichaitanya wrote: > Apologies again. But here is another try : > > I want to make sure that documents that are indexed are committed in say an > hour. I agree that if you pass commitWithIn params and the like will make > sure of that based on the time configuration

Re: DataImportHandler oddity

2013-09-12 Thread Raymond Wiker
w.com/questions/4168494/resultset-getstring-on-varchar2-column-returns-empty-string > > I don't think this is specific to DataImportHandler. > > > On Thu, Sep 12, 2013 at 12:43 PM, Raymond Wiker wrote: > > Followup: I just tried modifying the select with > > >

Re: DataImportHandler oddity

2013-09-12 Thread Raymond Wiker
e gave me this: http://bugs.caucho.com/view.php?id=4224 (via http://forum.caucho.com/showthread.php?t=27574). On Thu, Sep 12, 2013 at 8:25 AM, Raymond Wiker wrote: > I'm trying to index a view in an Oracle database, and have come across > some strange behaviour: all the VARCHAR2 fie

DataImportHandler oddity

2013-09-11 Thread Raymond Wiker
I'm trying to index a view in an Oracle database, and have come across some strange behaviour: all the VARCHAR2 fields are being returned as empty strings; this also applies to a datetime field converted to a string via TO_CHAR, and the url field built by concatenating two constant strings and a nu

Re: Problem parsing suggest response

2013-09-02 Thread Raymond Wiker
On Sep 2, 2013, at 17:39 , Mysurf Mail wrote: > Hi, > I am having problems parsing suggest json response in c#. > Here is an example > > { > > - responseHeader: > { > - status: 0, > - QTime: 1 > }, > - spellcheck: > { > - suggestions: > [ > - "at", >

Re: Schema

2013-08-22 Thread Raymond Wiker
On Aug 22, 2013, at 19:53 , Kamaljeet Kaur wrote: > On Thu, Aug 22, 2013 at 10:56 PM, SolrLover [via Lucene] > wrote: >> >> Now use DIH to get the data from MYSQL database in to SOLR.. >> >> http://wiki.apache.org/solr/DataImportHandler > > > These are for versions 1.3, 1.4, 3.6 or 4.0. > Why

Re: Negation words

2013-08-19 Thread Raymond Wiker
wheezed AND NOT "not wheezed" or +wheezed -"not wheezed" perhaps? Note: this assumes that you meant to search with the keyword "wheezed" and not "wheeze". On Mon, Aug 19, 2013 at 2:38 PM, venkatesham.gu...@igate.com < venkatesham.gu...@igate.com> wrote: > I am searching with a keyword and if

Re: Facet field display name

2013-08-12 Thread Raymond Wiker
I'm guessing that this will only be a partial solution, as you'll want to use the "real" field name for filtering, and the encoding problems will still have to be solved (since there will be a need to support the national characters in the field values as well as the field names...) Personally, I

Re: SOLR OR query, want 1 of the 2 results

2013-08-12 Thread Raymond Wiker
It will probably have better performance than having a "plan b" query that executes if the first query fails... On Mon, Aug 12, 2013 at 2:27 PM, PoM wrote: > That would actually be a decent solution, although it isn't the best i will > try if it gives any performance issues > > > > -- > View th

Re: Is there a way to store binary data (byte[]) in DocValues?

2013-08-12 Thread Raymond Wiker
base64-encode the binary data? That will give you strings, at the expense of some storage overhead. On Mon, Aug 12, 2013 at 2:38 PM, Mathias Lux wrote: > Hi! > > I'm basically searching for a method to put byte[] data into Lucene > DocValues of type BINARY (see [1]). Currently only primitives an
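A tiny Python illustration of the base64 round trip and its storage overhead. The byte values are arbitrary; how the resulting string is then stored in the index is outside this sketch.

```python
import base64

payload = bytes([0, 255, 16, 128])  # arbitrary example bytes
as_text = base64.b64encode(payload).decode("ascii")  # plain ASCII string
restored = base64.b64decode(as_text)                 # original bytes again
```

Every 3 input bytes become 4 output characters (with padding at the end), which is the overhead mentioned above.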

Re: Question about filter query: "half" of my index is slower than the other?

2013-08-09 Thread Raymond Wiker
On Aug 9, 2013, at 17:36 , Neal Ensor wrote: > So, I have an oddball question I have been battling with in the last day or > two. > > I have an 8 million document solr index, roughly divided down the middle by > an identifying "product" value, one of two distinct values. The documents > in both

Re: Solr4.4 DIH Headache

2013-08-08 Thread Raymond Wiker
On Aug 8, 2013, at 15:57 , Spadez wrote: > Hi, > > QUESTION 1 > > > First things first, for the dataimport handler. Is it correct that when I > visit it from the admin panel it takes me to this URL: > > *http://x.com:8080/solr/#/collection1/dataimport//dataimport > * > When I visit it on this

Re: Enabling DIH breaks Solr4.4

2013-08-08 Thread Raymond Wiker
I think the problem is that you have the wrong name for the jar file: you have apache-solr-dataimporthandler instead of simply solr-dataimporthandler. In my solrconfig.xml, I have --- which may or may not work for you.

Re: Document generation from database and partially from other source for the same item

2013-08-07 Thread Raymond Wiker
Assuming that you're doing this in a Windows environment, you could define your spreadsheet as an ODBC data source and define a datasource for it in DIH. Then, you would extract the main documents from your database, and the keywords from the ODBC datasource layered on top of your spreadsheet. No

Re: DIH Problem: create multiple docs from a single entity

2013-08-07 Thread Raymond Wiker
On Aug 7, 2013, at 18:10 , Lee Carroll wrote: > Hi > > I've 2 tables with the following data > > table 1 > id treatment_list > 1 a,b > 2 b,c > > table 2 > treatment id, name > a name1 > b name 2 > c name 3 > > Using DIH can you create an index

Re: Solr doesn't make indexes for all the enteries

2013-08-07 Thread Raymond Wiker
You're explicitly asking for only 10 search results - that's what the "rows=10" parameter does. If you want to see all results, you can either increase "rows", or run multiple queries, increasing "start" each time. On Wed, Aug 7, 2013 at 12:21 PM, Kamaljeet Kaur wrote: > Hello, > I am a newbi
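A short sketch of the "multiple queries" option, in Python. Solr's paging parameter is named "start"; the query and the total hit count used here are made-up values (in practice the total would come from numFound in the first response).

```python
from urllib.parse import urlencode

def page_params(query: str, total: int, rows: int = 10):
    """Yield one encoded query string per page, stepping start by rows."""
    for start in range(0, total, rows):
        yield urlencode({"q": query, "start": start, "rows": rows})

pages = list(page_params("*:*", total=25, rows=10))
```

With 25 hits and rows=10 this gives three requests, with start set to 0, 10 and 20.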

Re: external zookeeper with SolrCloud

2013-08-07 Thread Raymond Wiker
You said earlier that you had 6 zookeeper instances, but the zkHost param only shows 5 instances... is that correct? On Tue, Aug 6, 2013 at 11:23 PM, Joshi, Shital wrote: > Machines are definitely up. Solr4 node and zookeeper instance share the > machine. We're using -DzkHost=zk1,zk2,zk3,zk4,zk

Re: Knowing what field caused the retrival of the document

2013-08-06 Thread Raymond Wiker
user@lucene.apache.org >> Subject: Re: Knowing what field caused the retrival of the document >> >> But what if this for multiple words ? >> I am guessing solr knows why the document is there since I get to see the >> paragraph in the highlight.(hl) section. >&g

Re: Transform data at index time: country -> continent

2013-08-06 Thread Raymond Wiker
Another option might be to use a pre-existing web service... it should be relatively easy to add that to your dataimporthandler configuration (if you're using DIH, that is :-) A quick google search gave me http://www.geonames.org; see http://www.geonames.org/export/ for API information. On Tue,

Re: Knowing what field caused the retrival of the document

2013-08-06 Thread Raymond Wiker
If you were searching for single words (terms), you could use the 'tf' function, by adding something like matchesinname:tf(name, "whatever") to the 'fl' parameter - if the 'name' field contains "whatever", the (result) field 'matchesinname' will be 1. On Tue, Aug 6, 2013 at 10:24 AM, Mysurf M
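Something like the following Python snippet could build the 'fl' value described above. The alias, field and term are the ones from the message; the "id" field and the query are assumptions added for illustration.

```python
from urllib.parse import urlencode, parse_qs

def fl_with_tf(base_fields, alias, field, term):
    """Append an aliased tf() function call to a list of returned fields."""
    return ",".join(base_fields + [f'{alias}:tf({field},"{term}")'])

fl = fl_with_tf(["id"], "matchesinname", "name", "whatever")
query_string = urlencode({"q": "name:whatever", "fl": fl})
# Decode again to check that the value survives URL encoding unchanged.
decoded_fl = parse_qs(query_string)["fl"][0]
```

One such aliased entry would be needed for each field/term pair you want to check.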

Re: Invalid UTF-8 character 0xfffe during shard update

2013-08-05 Thread Raymond Wiker
ico Chiacchiaretta < federico.c...@gmail.com> wrote: > No, the content has no XML tags included (hope I understood what you were > asking here). > > Federico > > > 2013/8/5 Raymond Wiker > > > On Aug 5, 2013, at 20:12 , Federico Chiacchiaretta < > > federico.

Re: Invalid UTF-8 character 0xfffe during shard update

2013-08-05 Thread Raymond Wiker
On Aug 5, 2013, at 20:12 , Federico Chiacchiaretta wrote: > Hi Raymond, > I agree with you, 0xfffe is a special character, that is why I was asking > how it's handled in solr. > In my document, 0xfffe does not appear at the beginning, it's in the > content. > > Just an update about testing I'm d

Re: Invalid UTF-8 character 0xfffe during shard update

2013-08-05 Thread Raymond Wiker
I think #xfffe is special; it is used as a "byte order mark" to identify the encoding used. In that case, it should only appear at the beginning of the document. Sent from my iPhone On 5 Aug 2013, at 17:19, Federico Chiacchiaretta wrote: > Hi Shawn, > thanks for your answer. > From the docs
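A quick Python check of how the byte order mark relates to 0xfffe. The mark itself is the code point U+FEFF; 0xFFFE is what its two UTF-16 bytes look like when read in the opposite byte order, and U+FFFE on its own is a noncharacter in Unicode. How Solr or the XML parser in the thread treats it is not shown here.

```python
bom = "\ufeff"

utf16_le = bom.encode("utf-16-le")  # b"\xff\xfe"
utf16_be = bom.encode("utf-16-be")  # b"\xfe\xff"
utf8 = bom.encode("utf-8")          # b"\xef\xbb\xbf"

# Reading the little-endian bytes as big-endian gives 0xFFFE.
misread = int.from_bytes(utf16_le, "big")
```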

Re: Transform data at index time: country -> continent

2013-08-05 Thread Raymond Wiker
Don't know about "best practice", but to me, the obvious solution would be to have a database table holding the relationships between countries and continents, and using a join to get the continent. On Mon, Aug 5, 2013 at 9:59 AM, Christian Köhler - ZFMK wrote: > Hi, > > I am indexing data from

Re: Solr round ratings to nearest integer value

2013-08-04 Thread Raymond Wiker
That query searches for ratings in the range 2.0 to 2.5, which does not include 1.75. The fl setting only affects the returned value, and not the search. If you want to search for ratings that would round to 2, your query should be something like fq=[1.5 TO 2.5] Note: the exact boundaries should
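A possible way to generate such a range filter in Python. It assumes the field is called "ratings", that halves round up, and that the Solr version accepts a mixed inclusive/exclusive range (the closing "}"); the message is cut off before it says how the boundaries should be handled, so treat the boundary choice as an assumption.

```python
def rounds_to_filter(field: str, target: int) -> str:
    """Filter for values that would round (half up) to the given integer."""
    low = target - 0.5   # included in the range
    high = target + 0.5  # excluded, since it would round to target + 1
    return f"{field}:[{low} TO {high}}}"

fq = rounds_to_filter("ratings", 2)
```

A rating of 1.75 falls inside this range, while 2.5 does not.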

Re: DataImportHandler, BlobTransformer, FieldReaderDataSource and TikaEntityExtractor

2013-08-02 Thread Raymond Wiker
's no BlobTransformer in DataImportHandler. You'll have to write one. > Also, you'd probably need to write a FieldInputStreamDataSource instead of > FieldReaderDataSource. > > > On Tue, Jul 30, 2013 at 12:30 PM, Raymond Wiker wrote: > > > I have a case where I

Re: Solr round ratings to nearest integer value

2013-08-01 Thread Raymond Wiker
On Aug 1, 2013, at 17:36 , Thyagaraj wrote: > Hello Raymond Wiker, > > I just want to change the value while returning from the index (i.e, at > query time). I tried the one like below but it's not working and showing > same fraction value instead any nearest integer, >

Re: Solr round ratings to nearest integer value

2013-08-01 Thread Raymond Wiker
You don't say whether you want to change the value in the index, or the value that you return from the index (i.e, at query time). If you want to change the value that you place in the index, you could modify your SQL query to use the ROUND() function on the "ratings" value. For Oracle, you could

Re: Email regular expression.

2013-07-30 Thread Raymond Wiker
On Jul 30, 2013, at 22:05 , Luis Cappa Banda wrote: > Anyway, I still need to do a query like the following to retrieve those > documents with at least one E-mail detected: > > http://localhost:8080/mysolr/select?q=emails:[* TO > *]&start=0&rows=10&sort=mydate desc Can't you just use emails:* ?

DataImportHandler, BlobTransformer, FieldReaderDataSource and TikaEntityExtractor

2013-07-30 Thread Raymond Wiker
I have a case where I want to index documents and metadata content from a database. The metadata is not a problem, but it does not appear that I can handle the document content (held as BLOBs in the database) with out-of-the-box SOLR 4.4 functionality. I was hoping to be able to solve this by doin

Re: Shows different result with using 'and' and 'AND'

2013-07-25 Thread Raymond Wiker
The query syntax is case sensitive; "and" is treated as a search term and not as an operator. On Thu, Jul 25, 2013 at 1:00 PM, Payal.Mulani < payal.mul...@highqsolutions.com> wrote: > Hi, > > I am using solr14 and when I search with 'and' the it searches the > documents > containing 'and' as a t

Re: filter query result by user

2013-07-23 Thread Raymond Wiker
Simple: the field needs to be "indexed" in order to search (or filter) on it. On Tue, Jul 23, 2013 at 3:26 PM, Mysurf Mail wrote: > I want to restrict the returned results to be only the documents that were > created by the user. > I then load to the index the createdBy attribute and set it to

Re: How to improve the Solr "OR" query performance

2013-07-03 Thread Raymond Wiker
On Jul 3, 2013, at 05:48 , huasanyelao wrote: > Nowdays, I've got a urgent task to improve the "OR" query performance with > solr. > I have deployed 9 shards with solr-cloud in two server(each server : 16 > cores, 32G RAM). > The total document count: 60,000,000, total index size : 9G. > Accord

Re: Need help on Solr

2013-06-20 Thread Raymond Wiker
On Jun 20, 2013, at 18:26 , Abhishek Bansal wrote: > Yeah I know, out of the box there is one id field. I removed it from > schema.xml > > I have also added below code to automatically generate an ID. > > multiValued="false"/> > > Is that a valid configuration for an id field (assuming t

Re: Not getting results when searching a term from Solr Admin

2013-06-12 Thread Raymond Wiker
It looks like I haven't paid sufficient attention to your earlier messages - sorry. It is quite clear that "content" contains bing, and you should have gotten results back if the default search field was content. It could be that your solrconfig.xml file sets df to a field that does not contain "b

Re: Not getting results when searching a term from Solr Admin

2013-06-12 Thread Raymond Wiker
Hmmm, did you restart SOLR after changing the schema? And did you try searching for content:bing (alternatively, setting the df parameter to "content" (without quotes))? On Wed, Jun 12, 2013 at 12:12 PM, coderslay wrote: > Apologies Raymond for the Name. > > I have tried doing that also and st

Re: Not getting results when searching a term from Solr Admin

2013-06-12 Thread Raymond Wiker
I'm not Jack, but... ... locate the line in schema.xml that says content and replace "content" with "text". You may also have to edit solrconfig.xml if the request handler defines the parameter "df" - this, too, should point to your default field. On Wed, Jun 12, 2013 at 10:44 AM, coderslay w

Re: Not getting results when searching a term from Solr Admin

2013-06-12 Thread Raymond Wiker
There's your problem, then - you have "content" as the default search field, but your copyField nodes treat "text" as the default search field. If you change the default search field to text, you should be able to search for "bing"; otherwise, you'll need to use something like "content:bing". On

Re: Not getting results when searching a term from Solr Admin

2013-06-12 Thread Raymond Wiker
It appears that the word "bing" appears in the title; is the title field copied into the default search field (assuming that you even have a default search field)? If not, you need to somehow specify the field(s) that you want to search in. On Wed, Jun 12, 2013 at 7:52 AM, coderslay wrote: > Hi

Re: LIMIT on number of OR in fq

2013-06-10 Thread Raymond Wiker
A better option would be to use POST instead of GET. On Mon, Jun 10, 2013 at 8:50 AM, Aloke Ghoshal wrote: > True, the container's request header size limit must be the reason then. > Try: > > http://serverfault.com/questions/136249/how-do-we-increase-the-maximum-allowed-http-get-query-length-i
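A sketch of sending the same parameters as a POST body instead of in the URL, using only the Python standard library. The URL, core name and id values are made up, and the request is only constructed here, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

def build_post_request(select_url: str, q: str, fq_terms) -> Request:
    """Put q and a long OR-ed fq into a form-encoded POST body."""
    fq = " OR ".join(fq_terms)
    body = urlencode({"q": q, "fq": fq}).encode("utf-8")
    headers = {"Content-Type": "application/x-www-form-urlencoded"}
    # Passing data makes urllib issue a POST rather than a GET.
    return Request(select_url, data=body, headers=headers)

# Hypothetical endpoint and a deliberately long list of OR terms.
request = build_post_request(
    "http://localhost:8983/solr/collection1/select",
    "*:*",
    [f"id:{n}" for n in range(1000)],
)
```

Calling urllib.request.urlopen(request) would send it; the URL stays short no matter how many OR terms are in the body.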

Re: Download CSV, Strange thing is happening !!

2013-06-06 Thread Raymond Wiker
I think you'd be better off using field names that look like Java identifiers - e.g, mark10 instead of 10th_mark. Actually, let me rephrase that: you SHOULD be using field names that look like Java identifiers - less headache, all round. On Thu, Jun 6, 2013 at 4:01 PM, anurag.jain wrote: > I h

Re: data-import problem

2013-06-05 Thread Raymond Wiker
On Jun 5, 2013, at 20:39 , Stavros Delisavas wrote: > Thanks so far. > > This change makes Solr work over the title-entries too, yay! Unfortunatly > they don't get processed(skipped rows). In my log it says > "missing required field id" for every entry. > > I checked my schema.xml. In there "id

Re: Receiving unexpected Faceting results.

2013-06-05 Thread Raymond Wiker
3) Use the parameter facet.prefix, e.g, facet.prefix=dotan-. Note: this particular case will not work if the field you're faceting on is tokenised (with "-" being used as a token separator). 4) Use the parameter facet.mincount - looks like you want to set it to 1, instead of the default which is
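Options 3) and 4) could be combined in one request along these lines, shown here as a Python snippet that builds the query string. The facet field name ("tags") and the main query are assumptions; the prefix is the one from the message.

```python
from urllib.parse import urlencode

params = [
    ("q", "*:*"),
    ("rows", "0"),               # only the facet counts are of interest
    ("facet", "true"),
    ("facet.field", "tags"),     # hypothetical field name
    ("facet.prefix", "dotan-"),  # keep only facet values with this prefix
    ("facet.mincount", "1"),     # drop facet values with no matching documents
]
query_string = urlencode(params)
```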

Re: Not able to search Spanish word with ascent in solr

2013-05-29 Thread Raymond Wiker
On May 29, 2013, at 18:09 , jignesh wrote: > Solr returning error 500, when i post data with ascent chars... > > Any solution for that? The solution probably involves using the correct encoding, and ensuring that the HTTP request sets the appropriate header values accordingly. In other words,

Re: fq & facet on double and non-indexed field

2013-05-23 Thread Raymond Wiker
On May 23, 2013, at 14:25 , gpssolr2020 wrote: > Thanks Erick.. > > > i hope we cant do q also on non-indexed field. > > Whats is the difference between q and fq other than cache . > > > > Thanks. How do you expect to search on a field that is non-indexed (and thus non-searchable)?

Re: Adding filed in Schema.xml

2013-05-20 Thread Raymond Wiker
On May 20, 2013, at 05:05 , Kamal Palei wrote: > I have put the code to add these fields in document object and index it. > I have not deleted whole indexed data and reindex it. But I expect whatever > new documents are added, for those documents these two fields salary and > experience should be

Re: Solr Licensing (Sizzle)

2013-05-13 Thread Raymond Wiker
On May 13, 2013, at 14:15 , Polhodzik Peter (ext) wrote: > In the source code of Apache Solr 4.2.0 there is an unclear license reference > in > > · \solr-4.2.0\solr\webapp\web\js\lib\jquery-1.7.2.min.js > and > · \solr-4.2.0\solr\webapp\web\js\require.jstxt > > Can you please

Re: What Happens to Consistency if I kill a Leader and Startup it again?

2013-05-02 Thread Raymond Wiker
If you're using zookeeper, this should not be allowed to happen (I think). On Thu, May 2, 2013 at 2:12 PM, Furkan KAMACI wrote: > Thanks for the answer. This is what I try to say: > > time = t > Node A (Leader): version is 100 > Node B (Replica): version is 90 > > time = t+1 > Node A (Killing):

Re: Unsubscribing from JIRA

2013-05-01 Thread Raymond Wiker
On May 1, 2013, at 19:07 , johnmu...@aol.com wrote: > Are you saying because I'm subscribed to dev, which I'm, is why I'm getting > JIRA mails too, and the only way I can stop JIRA mails is to unsubscribe from > dev? I don't think so. I'm subscribed to other projects, both dev and user, > and

Re: Re: Shard update error when using DIH

2013-04-29 Thread Raymond Wiker
You have --- I think this needs to be "long".

Re: Shard update error when using DIH

2013-04-29 Thread Raymond Wiker
On Mon, Apr 29, 2013 at 10:01 AM, heaven wrote: > Hi, seems like I have exactly the same error: > > Apr 28, 2013 11:41:57 PM org.apache.solr.common.SolrException log > SEVERE: null:java.lang.UnsupportedOperationException > at > > org.apache.lucene.queries.function.FunctionValues.longVal(F

Re: SimplePostTool: WARNING: IOException while reading response: java.io.FileNotFoundException

2013-04-26 Thread Raymond Wiker
On Fri, Apr 26, 2013 at 2:45 PM, Furkan KAMACI wrote: > > I use that command to post: > java -Durl=http://localhost:8983/solr/update/extract -Dauto -jar post.jar > 523387.pdf > I think you need to have the collection name in the url... something like http://localhost:8983/solr/mycollection/update
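A small sketch of the URL change suggested above, assuming the handler path from the quoted command (/update/extract) stays the same and only the collection name is inserted after /solr/. The helper name with_collection is hypothetical, and "mycollection" is the placeholder used in the reply.

```python
from urllib.parse import urlsplit, urlunsplit

def with_collection(url, collection):
    """Insert a collection name right after the /solr/ prefix of a Solr URL."""
    parts = urlsplit(url)
    prefix = "/solr/"
    if not parts.path.startswith(prefix):
        raise ValueError("expected a path starting with /solr/")
    rest = parts.path[len(prefix):]  # e.g. "update/extract"
    new_path = prefix + collection + "/" + rest
    return urlunsplit(parts._replace(path=new_path))

new_url = with_collection("http://localhost:8983/solr/update/extract", "mycollection")
print(new_url)  # http://localhost:8983/solr/mycollection/update/extract
```

The resulting value could then be passed to post.jar through -Durl=... in place of the original URL.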

Re: Document is missing mandatory uniqueKey field: id for Solr PDF indexing

2013-04-26 Thread Raymond Wiker
You could start by doing java -jar post.jar -help --- the 7th example shows exactly what you need to do to add a document id. On Fri, Apr 26, 2013 at 11:30 AM, Furkan KAMACI wrote: > I use Solr 4.2.1 and these are my fields: > > multiValued="false" /> > > > > > multiValued="true"/> > > stored=
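For illustration, a sketch of how a document id might be supplied to the extract handler as a request parameter. It assumes the uniqueKey field is called "id" (as in the thread subject) and that the id is passed as literal.id. The base URL and the id value "doc-523387" are made up for the example.

```python
from urllib.parse import urlencode

def extract_url(base_url, doc_id):
    """Append a literal.id parameter so the extracted document gets a unique key."""
    # literal.<field> sets a field value directly; here the field is assumed to be "id".
    query = urlencode({"literal.id": doc_id})
    return base_url + "?" + query

url = extract_url("http://localhost:8983/solr/update/extract", "doc-523387")
print(url)  # http://localhost:8983/solr/update/extract?literal.id=doc-523387
```

With post.jar, the same parameter would go through its params option rather than being appended by hand; the help output mentioned above is the place to check the exact form.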

Re: Pros and cons of using RAID or different RAIDS?

2013-04-19 Thread Raymond Wiker
On Apr 20, 2013, at 05:03 , Otis Gospodnetic wrote: > Yeah, but as far as I know, there is nothing Solr-specific about that. > > See http://www.acnc.com/raid > There's a hw-specific dimension to this, too: for my company's enterprise search solution, we had to replace our initial RAID setup (R

Re: is phrase search possible in solr

2013-04-19 Thread Raymond Wiker
On Apr 19, 2013, at 16:59 , vicky desai wrote: > I want to do a phrase search in solr without analyzers being applied to it > eg - If I search for *"DelhiDareDevil"* (i.e - with inverted commas)it > should search the exact text and not apply any analyzers or tokenizers on > this field > However i

Re: first time with new keyword, solr take to much time to give the result

2013-04-16 Thread Raymond Wiker
On Tue, Apr 16, 2013 at 3:13 PM, Montu v Boda wrote: > hi > > we are trying to return 10,000 rows > > it is necessary to return 1 rows because from that 1, we are pick > only top 100 record based on the user permission and permission is stored > in > database not on solr. > > and if we tr

Re: Basic auth on SolrCloud /admin/* calls

2013-04-11 Thread Raymond Wiker
On Apr 11, 2013, at 17:12 , adfel70 wrote: > Hi > I need to implement security in solr as follows: > 1. prevent unauthorized users from accessing to solr admin pages. > 2. prevent unauthorized users from performing solr operations - both /admin > and /update. > > > Is the conclusion of this thre

Re: Indexed data not searchable

2013-04-10 Thread Raymond Wiker
On Wed, Apr 10, 2013 at 10:35 AM, Max Bo wrote: > Just for information: I indicate that the problem occurs when I try to add > the fields, created, last_modified, issued (all three have the type date) > and the field rightsholder. > > Maybe it is helpful! > From the example you sent earlier, it

Re: Field exist in schema.xml but returns

2013-04-08 Thread Raymond Wiker
You have misspelt the tag name in the field definition... you have "fiald" instead of "field". On Tue, Apr 9, 2013 at 7:43 AM, deniz wrote: > hi all, I am using solrcloud and running some simple test queries... > though i > am getting a undefined field error for a field that I have in my schema

Re: Moving from SOLR3.6 to SOLR4.0 - Last remaining warnings

2013-04-08 Thread Raymond Wiker
On Apr 8, 2013, at 19:09 , "Jack Krupansky" wrote: > "replace with UpdateRequestHandler" > > Just compare your solrconfig to the new one and consider updating yours and > using the newer Solr update API that automatically uses the content type to > internally dispatch to the proper update handl