Re: appear garbled when I use DIH from oracle database

2012-06-12 Thread Lance Norskog
You need to unpack the GBK encoding into Unicode strings. This might be an Oracle function in the SQL query. On Fri, Jun 8, 2012 at 3:16 AM, 涂小刚 wrote: > Hello: >  when I use DIH from oracle database,it appears garbled,why?     ps:my > oracle database is  GBK encoding with chinese. > how can I so

Re: Indexing Multiple Datasources

2012-06-12 Thread Lance Norskog
Are you trying to do a JOIN on these two tables inside the DIH? On Tue, Jun 12, 2012 at 8:35 PM, Gora Mohanty wrote: > On 11 June 2012 21:29, Kay wrote: >> Hello, >> >> We have 2 MS SQL Server Databases which we wanted to index .But most of the >> columns in the Databases have the same names. Fo

Re: edismax and untokenized field

2012-06-12 Thread Afroz Ahmad
In the example above your schema is applying the tokenizers and filter only during index time. For your query terms to also pass through the same pipeline you need to modify the field type and add a section. I believe this should fix your problem. Thanks Afroz :

Re: Unexpected DIH behavior for onError attribute

2012-06-12 Thread Gora Mohanty
On 13 June 2012 01:17, Pranav Prakash wrote: > It seems that upon setting onError=skip, the DIH does not proceed to next > records in the db, and only unto those entries which were prior to an > error-causing record are being updated/added. [...] Please show us your DIH configuration file, rememb

Re: Indexing Multiple Datasources

2012-06-12 Thread Gora Mohanty
On 11 June 2012 21:29, Kay wrote: > Hello, > > We have 2 MS SQL Server Databases which we wanted to index .But most of the > columns in the Databases have the same names. For e.g. Both the DB’s have > the columns First name ,Last name ,etc. It is not clear how you want to handle this: Should the

Re: Different sort for each facet

2012-06-12 Thread Jack Krupansky
f.people.facet.sort=count should work. Make sure you don't have a conflicting setting for that same field and attribute. Does the "people" facet sort by count correctly with f.sort=index? What are the attributes and field type for the "people" field? -- Jack Krupansky -Original Message-

Interleaving Results from Sub-Queries

2012-06-12 Thread Andrew Morrison
I'm working on creating a Query that interleaves the results of a set of sub-queries and was hoping I could get some input on the design. The general idea is that if given Query q1 and Query q2 I'd like to add them to a parent Query q0 so that the when q0 is scored, the order of results in score

Re: Indexing Multiple Datasources

2012-06-12 Thread Jack Krupansky
I believe that it will run them sequentially. The second would start only after the first finishes Did you give both "entity" names in your Solr request (two "entity" options with the two top-level entity names)? Although, if you specified no "entity" names names in the request DIH should r

Re: Promote Ranking based on Usage

2012-06-12 Thread Jack Krupansky
Just for reference you should start by reviewing Lucid's "click scoring framework": http://lucidworks.lucidimagination.com/display/lweug/Click+Scoring+Relevance+Framework To do all of that yourself is a major undertaking, but maybe you could identify a simpler subset that does just enough to sa

Re: Unexpected DIH behavior for onError attribute

2012-06-12 Thread Jack Krupansky
Make sure you have the onError=skip on the proper entity. -- Jack Krupansky -Original Message- From: Pranav Prakash Sent: Tuesday, June 12, 2012 3:47 PM To: solr-user@lucene.apache.org Subject: Unexpected DIH behavior for onError attribute It seems that upon setting onError=skip,

Re: Indexing Data option for subdirectories?

2012-06-12 Thread Erik Hatcher
If they aren't Solr XML format, but you can write an XSLT to transform it to Solr XML, you can use this: Erik On Jun 12, 2012, at 15:20 , Jack Krupansky wrote: > There isn't a recursion option for post.jar (I did check.) > > Mayb

Re: Indexing Data option for subdirectories?

2012-06-12 Thread Gora Mohanty
On 13 June 2012 00:50, Jack Krupansky wrote: > There isn't a recursion option for post.jar (I did check.) > > Maybe your best bet is the "find" shell command. This may not be 100% > correct, but something like: > >   find /data -name '*.xml' -exec java -jar post.jar {} [...] The above should end

Deduplication in MLT

2012-06-12 Thread Pranav Prakash
I have an implementation of Deduplication as mentioned at http://wiki.apache.org/solr/Deduplication. It is helpful in grouping search results. I would like to achieve the same functionality in my MLT queries, where the result set should include grouped documents. What is a good way to do the same?

Promote Ranking based on Usage

2012-06-12 Thread jliz2803
Hi we have just started using Solr at our company. We have Solr setup and are using C# to make communicate with it. The user will perform a search then make a selection from the search results. We want to promote documents based on how often the user selects them. I was wondering if someone cou

Re: Indexing Multiple Datasources

2012-06-12 Thread Kay
Thanks for the reply jack! We tried giving each data source a name (For e.g. ds-1,ds-2 etc) but what it does is when we checked the log it establishes connection with the first data source and indexed while the second DB is getting ignored. Yes! what we wanted to try is in our system we have many

Re: Indexing Data option for subdirectories?

2012-06-12 Thread Jack Krupansky
There isn't a recursion option for post.jar (I did check.) Maybe your best bet is the "find" shell command. This may not be 100% correct, but something like: find /data -name '*.xml' -exec java -jar post.jar {} This is assuming that these are pre-formatted Solr XML update files with "" an

Re: solr nested multivalued fields

2012-06-12 Thread jerome
Thanks, From all the material i have looked at and searched I am inclined to believe that those are indeed my options, any others are still welcome... -- View this message in context: http://lucene.472066.n3.nabble.com/solr-nested-multivalued-fields-tp3989114p3989260.html Sent from the Solr - Use

Re: I need help on escaping the special char.

2012-06-12 Thread Jack Krupansky
See: http://lucene.472066.n3.nabble.com/index-special-characters-solr-td3987157.html Basically, list the special characters in a text file with the "types" attribute and map them to type "ALPHA". -- Jack Krupansky -Original Message- From: Prachi Phatak Sent: Tuesday, June 12, 2012

I need help on escaping the special char.

2012-06-12 Thread Prachi Phatak
I tried WordDelimiterFactory with types option. It doesn't seem working. How can I escape i.e.+ - && || ! ( ) { } [ ] ^ " ~ * ? : \ in configuration. Prachi.

Re: Sharding in SolrCloud

2012-06-12 Thread Mark Miller
On Jun 12, 2012, at 3:39 AM, lenz...@gfi.ihk.de wrote: > Hello, > > we tested SolrCloud in a setup with one collection, two shards and one > replica per shard and it works quite fine with some example data. > Now, we plan to set up our own collection and determine in how many shards > we shou

Re: deploy a brand new index in solrcloud

2012-06-12 Thread Mark Miller
On Jun 10, 2012, at 2:56 AM, Anatoli Matuskova wrote: > I've thought in setting replication in solrCloud: > http://www.searchworkings.org/forum/-/message_boards/view_message/339527#_19_message_339527 > What I don't know is if while replication is being handled, the replica > slaves (that are not

Re: Solr PHP highload search

2012-06-12 Thread Jack Krupansky
Add "&debugQuery=true" to your query and look at the "timing" section that comes back with the response to see q breakdown of Qtime. It should offer some insight into which search component(s) are taking the most time. That might point you in the right direction for improvements. Also, see how

Re: SolrJ dependencies

2012-06-12 Thread Sami Siren
On Tue, Jun 12, 2012 at 4:22 PM, Thijs wrote: > Hi > I just checked out and build solr&lucene from branches/lucene_4x > > I wanted to upgrade my custom client to this new version (using solrj). > So I copied lucene/solr/dist/apache-solr-solrj-4.0-SNAPSHOT.jar & >  lucene/solr/dist/apache-solr-core

Re: Issues with whitespace tokenization in QueryParser

2012-06-12 Thread John Berryman
Robert Muir told me that there is somewhat of a workaround for this. For defType=lucene. Just escape every whitespace with a slash. So instead of *new dress shoes* search for *new\ dress\ shoes*. Of course you lose the ability to use normal lucene syntax. I was hoping that this workaround would al

Re: SolrJ dependencies

2012-06-12 Thread Jack Krupansky
Maybe the migration from the "EOL" Commons HTTP Client to HTTP Components has something to do with this. The wiki probably needs Solr release-specific instructions. And maybe the lib folder is not quite right. You can read about the migration here: https://issues.apache.org/jira/browse/SOLR-202

Different sort for each facet

2012-06-12 Thread Christopher Gross
In Solr 3.4, is there a way I can sort two facets differently in the same query? If I have: http://mysolrsrvr/solr/select?q=*:*&facet=true&facet.field=people&facet.field=category is there a way that I can sort people by the count and category by the name all in one query? Or do I need to do tha

Re: solr nested multivalued fields

2012-06-12 Thread Jack Krupansky
Maybe "Result Grouping/Field Collapsing" might work for you: http://wiki.apache.org/solr/FieldCollapsing Otherwise, multivalued string fields, with first and last name combined into one string might be the best you can do. -- Jack Krupansky -Original Message- From: jerome Sent: Tues

RE: PageRanking with DIH

2012-06-12 Thread Dyer, James
To boost a document with DIH, see this section about "$docBoost" in the wiki here: http://wiki.apache.org/solr/DataImportHandler#Special_Commands. If you're using a RDBMS for source data, your query would have something like this in it: "select PAGE_RANK as '$docBoost', ... from ... etc" If yo

SolrJ dependencies

2012-06-12 Thread Thijs
Hi I just checked out and build solr&lucene from branches/lucene_4x I wanted to upgrade my custom client to this new version (using solrj). So I copied lucene/solr/dist/apache-solr-solrj-4.0-SNAPSHOT.jar & lucene/solr/dist/apache-solr-core-4.0-SNAPSHOT.jar to my project and I updated the other

solr nested multivalued fields

2012-06-12 Thread jerome
I would like to produce the following result in a Solr search result but not sure it is possible to do? (Using Solr 3.6) John Darby Sue Berger However, i cant seem to manage getting this Tree like structre in my results.

Re: Changing Index directory?

2012-06-12 Thread jamel essoussi
you can use the following configuration: ${solr.data.dir:/opt/data/solr/}/core_name in solrconfig.xml --> you should in this case specify the followinf JVM option : -Dsolr.data.dir=(your path here) --> /opt/data/solr/core_name : the default value 2012/6/12 Jack Krupansky > It is "dataDir" in s

Re: Changing Index directory?

2012-06-12 Thread Jack Krupansky
It is "dataDir" in solrconfig.xml: http://wiki.apache.org/solr/SolrConfigXml#dataDir_parameter -- Jack Krupansky -Original Message- From: Bruno Mannina Sent: Tuesday, June 12, 2012 2:54 AM To: solr-user@lucene.apache.org Subject: Re: Changing Index directory? Le 12/06/2012 08:49, Bru

Re: Exception when optimizing index

2012-06-12 Thread Jack Krupansky
It's good to know that the situation is reproducible. Maybe you could do a couple of smaller tests, such as running CheckIndex after loading only 10%, 25%, and 50% of the data to see if the problem occurs with less data or is dependent on a much higher document count. And also check for any e

Re: PageRanking with DIH

2012-06-12 Thread Gora Mohanty
On 12 June 2012 13:51, vineet yadav wrote: > Hi Gora, > Thanks for reply. > I have computed pagerank offline for document set dump.  I ideally > want to use pagerank and solr relevency score together in formula to > sort search solr result.  I have already looked at > http://wiki.apache.org/solr/S

Re: PageRanking with DIH

2012-06-12 Thread vineet yadav
Hi Gora, Thanks for reply. I have computed pagerank offline for document set dump. I ideally want to use pagerank and solr relevency score together in formula to sort search solr result. I have already looked at http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_increase_the_score_for_specifi

Re: what's better for in memory searching?

2012-06-12 Thread Mikhail Khludnev
If I get it right, it's kind of per process setting swappiness. On Tue, Jun 12, 2012 at 3:57 AM, Li Li wrote: > is this method equivalent to set vm.swappiness which is global? > or it can set the swappiness for jvm process? > > On Tue, Jun 12, 2012 at 5:11 AM, Mikhail Khludnev > wrote: > > Poin

Re: PageRanking with DIH

2012-06-12 Thread Gora Mohanty
On 12 June 2012 13:04, vineet yadav wrote: > Hi, > I have indexed documents and computed pagerank for documents. I want > to update pagerank for indexed document and sort solr search result > with pagerank. Your question is not entirely clear: What is "pagerank" in this case? A custom score that

Sharding in SolrCloud

2012-06-12 Thread Lenzner
Hello, we tested SolrCloud in a setup with one collection, two shards and one replica per shard and it works quite fine with some example data. Now, we plan to set up our own collection and determine in how many shards we should devide it. We can estimate quite exactly the size of the collecti

PageRanking with DIH

2012-06-12 Thread vineet yadav
Hi, I have indexed documents and computed pagerank for documents. I want to update pagerank for indexed document and sort solr search result with pagerank. I did some research and found that index time boost can be used, but I don't know how to use it. Can I boost document at index time with DIH

Indexing Data option for subdirectories?

2012-06-12 Thread Bruno Mannina
Dear All, Is exist a way to index data under sub-directories directly? I have several files under sub-directories like: /data/A/001/*.xml /data/A/002/*.xml /data/A/003/*.xml /data/A/004/*.xml ... /data/B/001/*.xml ... /data/Z/999/*.xml I would like to index directly with *i.e. java -jar post.