Re: [MASSMAIL]Weighting of prominent text in HTML

2015-01-26 Thread Dan Davis
Helps lots. Thanks, Jorge Luis. Good point about different fields - I'll just put the h1 and h2 (however deep I want to go) into fields, and we can sort out weighting and whether we want it later with edismax. The blogs on adding plugins for that sort of thing look straightforward. On Mon, J
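
The approach described above (heading text in its own fields, with weighting decided later at query time via edismax) can be sketched as follows. This is only an illustration: the field names `h1` and `h2` follow the message, while the `body` field and all boost values are assumptions chosen for the example.

```python
from urllib.parse import urlencode

def edismax_params(user_query, boosts):
    """Build edismax request parameters with per-field boosts in qf."""
    # qf takes space-separated "field^boost" entries.
    qf = " ".join(f"{field}^{boost}" for field, boost in boosts.items())
    return {"defType": "edismax", "q": user_query, "qf": qf}

# Hypothetical fields and boost values; tune or drop them as needed.
params = edismax_params("solr recovery", {"h1": 3.0, "h2": 2.0, "body": 1.0})
query_string = urlencode(params)
```

Because the boosts live in the request rather than in the index, they can be changed without re-indexing.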

Re: Solr Recovery process

2015-01-26 Thread Ramkumar R. Aiyengar
https://issues.apache.org/jira/browse/SOLR-6359 has a patch that allows this to be configured, but it has not gone in yet. Note that the current design of the UpdateLog causes it to be less efficient if the number is bumped up too much, but it is certainly worth experimenting with. On 22 Jan 2015 02:47,

SOS-help: How to store solr index data in hbase table???

2015-01-26 Thread zhangjianad
Hi all, I currently store Solr index data on local disk. I want to store the Solr index data in an HBase table. How do I configure this? Any tips? Does anyone know about this? Thanks! Jan

SuggestStopFilter not usable in Solr 4.10.x?

2015-01-26 Thread Clemens Wyss DEV
https://issues.apache.org/jira/browse/LUCENE-5820 Due to the missing factory the SuggestStopFilter is not "usable" before Solr/Lucene 5, right? Any plan on when Solr 5 will appear? How can I get hold of Solr/Lucene 5?

Re: SOS-help: How to store solr index data in hbase table???

2015-01-26 Thread Shawn Heisey
On 1/26/2015 2:56 AM, zhangjia...@dcits.com wrote: > Now I store solr index data on local disk. I want store solr index > data in hbase table, how to configure ? tips , any guys known about > this??? I have no idea how you would do that. You *can* store your indexes in HDFS storage, but tha

query time join (stored or indexed value field?)

2015-01-26 Thread Alvaro Cabrerizo
Hi, Is the query time join using stored data or indexed data from the fields set in "from" and "to"? (For example, the facet feature makes the count based on the indexed data) I've made a small example (using tokenizers, stopwords...) and it seems that the join

Re: query time join (stored or indexed value field?)

2015-01-26 Thread Mikhail Khludnev
indexed for sure, and/or docValues. not stored for sure. On Mon, Jan 26, 2015 at 3:44 PM, Alvaro Cabrerizo wrote: > Hi, > > Is the time join query using stored > data > or indexed data from the fields set in "from" and "to"? (For example, the > facet feature ma

Re: Sorting on a computed value

2015-01-26 Thread Shawn Heisey
On 1/25/2015 4:13 PM, tedsolr wrote: > I'll bet some super user has figured this out. How can I perform a sort on a > single computed field? I have a QParserPlugin that is collapsing docs based > on data from multiple fields. I am summing the values from one numerical > field 'X'. I was going to us

Re: SOS-help: How to store solr index data in hbase table???

2015-01-26 Thread Dmitry Kan
A bit of googling reveals this article on integrating HBase and Lucene, for example: http://www.infoq.com/articles/LuceneHbase The article references this code: https://github.com/akkumar/hbasene It does not look like it is under active development, but it might be worth exploring. Dmitry On Mon, J

Re: Indexed epoch time in Solr

2015-01-26 Thread Jim . Musil
If you are using the DataImportHandler, you can leverage one of the transformers, such as the DateFormatTransformer: http://wiki.apache.org/solr/DataImportHandler#DateFormatTransformer If you are updating documents directly you can define a regex transformation in your schema.xml: https://wiki.a

Re: Need Help with custom ZIPURLDataSource class

2015-01-26 Thread Dan Davis
I have seen such errors by looking under Logging in the Solr Admin UI. There is also the LogTransformer for Data Import Handler. However, it is a design choice in Data Import Handler to skip fields not in the schema. I would suggest you always use Debug and Verbose to do the first couple of docu

Re: Need help importing data

2015-01-26 Thread Dan Davis
Glad it worked out. On Fri, Jan 23, 2015 at 9:50 PM, Carl Roberts wrote: > NVM > > I figured this out. The problem was this: pk="link" in > rss-dat.config.xml but unique id not link in schema.xml - it is id. > > From rss-data-config.xml: > > *pk="link"* > url="https://nvd.nist.gov/feeds/xml/c

Re: Indexed epoch time in Solr

2015-01-26 Thread Dan Davis
I think copying to a new Solr date field is your best bet, because then you have the flexibility to do date range facets in the future. If you can re-index, and are using Data Import Handler, Jim Musil's suggestion is just right. If you can re-index, and are not using Data Import Handler: - T
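
For the case of copying the epoch value into a new Solr date field at index time, a client-side conversion might look like the sketch below. It assumes the stored epoch value is in seconds (divide by 1000 first if it is in milliseconds); the function name is made up for this example. Solr date fields accept UTC timestamps of the form `YYYY-MM-DDThh:mm:ssZ`.

```python
from datetime import datetime, timezone

def epoch_to_solr_date(epoch_seconds):
    """Convert epoch seconds to the UTC ISO-8601 form Solr date fields accept."""
    # Assumption: the input is in seconds, not milliseconds.
    dt = datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)
    return dt.strftime("%Y-%m-%dT%H:%M:%SZ")
```

For example, `epoch_to_solr_date(1422230400)` gives `2015-01-26T00:00:00Z`.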

Re: SuggestStopFilter not usable in Solr 4.10.x?

2015-01-26 Thread Alexandre Rafalovitch
RC1 of Solr 5 should be out very soon (days). But you can always download the latest source from svn_5_0 branch (remember to use shallow copy if using git) and build it yourself ('ant package' inside 'solr' directory). It's not terribly hard. Regards, Alex. Sign up for my Solr resources new

Re: Solr Recovery process

2015-01-26 Thread Nishanth S
Thank you Ram. On Mon, Jan 26, 2015 at 1:49 AM, Ramkumar R. Aiyengar < andyetitmo...@gmail.com> wrote: > https://issues.apache.org/jira/browse/SOLR-6359 has a patch which allows > this to be configured, it has not gone in as yet. > > Note that the current design of the UpdateLog causes it to be l

REMINDER: ApacheCon 2015 Call For Papers Ends This Week (February 1st)

2015-01-26 Thread Chris Hostetter
(cross posted, please confine replies to general@lucene) ApacheCon 2015 Will be in Austin Texas April 13-17. http://apachecon.com/ The Call For Papers is currently open, but it ends 2015-02-01 (11:55PM GMT-0600) https://events.linuxfoundation.org/events/apachecon-north-america/progra

solrcloud shard splitting with lock type native

2015-01-26 Thread calin.grecu
Hi there, Shard splitting seems to fail if the lock type is native. Here is my config setting: native 1000 Shard splitting works if I set the lock type to single or none. However, after splitting, I am not able to set the lock type back to native, which is the default. Here

Re: Sorting on a computed value

2015-01-26 Thread tedsolr
That's an interesting link, Shawn, especially since it mentions the possibility of sorting on pseudo-fields. My delegating collector computes the custom stats and stores them in the request context. I have a doc transformer that then grabs the stats for each doc and inserts the data in the output.

SimplePostTool with extracted Outlook messages

2015-01-26 Thread Mark
I'm looking to index some extracted Outlook messages (*.msg). I notice msg isn't one of the default types, so I tried the following: java -classpath dist/solr-core-4.10.3.jar -Dtype=application/vnd.ms-outlook org.apache.solr.util.SimplePostTool C:/temp/samplemsg/*.msg That didn't work. However

Re: replicas goes in recovery mode right after update

2015-01-26 Thread Vijay Sekhri
bytesUsed=4455794, segment=_5r5, aborting=false, numDocsInRAM=24222, deleteQueue=DWDQ: [ generation: 1 ]] 14:16:49,322 INFO [org.apache.solr.update.LoggingInfoStream] (recoveryExecutor-7-thread-1) [DWPT][recoveryExecutor-7-thread-1]: flush postings as segment _5r5 numDocs=24222 ==> gc.20150126

solr cloud replicas goes in recovery mode after update

2015-01-26 Thread Vijay Sekhri
Hi Erick, The older message seems to have been deleted, so I am sending a new one: http://osdir.com/ml/solr-user.lucene.apache.org/2015-01/msg00773.html In the solr.xml file I had the zk timeout set to ${zkClientTimeout:45}. One thing that made it a bit better now is the zk tick time and syncLimit settin

Re: Sorting on a computed value

2015-01-26 Thread Mikhail Khludnev
I'm sorry for spoiling, but it's a fabulous FakeScorer pattern in Lucene. e.g. look at https://github.com/apache/lucene-solr/blob/trunk/lucene/grouping/src/java/org/apache/lucene/search/grouping/BlockGroupingCollector.java#L355 when your delegating collector is provided by scorer via setScorer(), i

Re: SimplePostTool with extracted Outlook messages

2015-01-26 Thread Alexandre Rafalovitch
Seems like an apples-to-oranges comparison here. I would try giving an explicit end point (.../extract), a single message, and a literal id for the SimplePostTool and seeing whether that works. Not providing an ID could definitely be an issue. I would also specifically look on the server side in the

Re: SimplePostTool with extracted Outlook messages

2015-01-26 Thread Mark
A little further. This fails: java -classpath dist/solr-core-4.10.3.jar -Dtype=application/vnd.ms-outlook org.apache.solr.util.SimplePostTool C:/temp/samplemsg/*.msg With: SimplePostTool: WARNING: IOException while reading response: java.io.IOException: Server returned HTTP response code: 415 fo

How to implement Auto complete, suggestion client side

2015-01-26 Thread Olivier Austina
Hi All, I would say I am new to web technology. I would like to implement auto complete/suggestion in the user search box as the user types in the search box (like Google, for example). I am using Solr as the database. Basically I am familiar with Solr and I can formulate suggestion queries. But now

Re: SimplePostTool with extracted Outlook messages

2015-01-26 Thread Alexandre Rafalovitch
Well, you are NOT posting to the same URL. On 26 January 2015 at 17:00, Mark wrote: > http://localhost:8983/solr/update

Showing distance in results

2015-01-26 Thread vit
I have Solr 4.2. I need to calculate the distance between a point (0, 0) and the lat/lng in each document. I do this: http://:9081/solr/collection1/select?q={!func}dist(2, lat, lng, 0, 0)&wt=xml&indent=true It works fine but does not show the distance. Please help.

Re: replicas goes in recovery mode right after update

2015-01-26 Thread Erick Erickson
nfoStream] > (recoveryExecutor-7-thread-1) [DW][recoveryExecutor-7-thread-1]: > startFullFlush > 14:16:49,284 INFO [org.apache.solr.update.LoggingInfoStream] > (recoveryExecutor-7-thread-1) [DW][recoveryExecutor-7-thread-1]: > anyChanges? numDocsInRam=24222 deletes=true h

Re: SimplePostTool with extracted Outlook messages

2015-01-26 Thread Mark
Fantastic - that explains it. Adding -Durl="http://localhost:8983/solr/update/extract?commit=true&overwrite=true" gets me a little further: POSTing file 6252671B765A1748992DF1A6403BDF81A4A22E00.msg SimplePostTool: WARNING: Solr returned an error #400 (Bad Request) for url: http://localh
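
Earlier in this thread it was suggested to post to the extract end point with a literal id. A sketch of building such a URL is shown below; whether a missing id is actually the cause of the 400 error cannot be told from the excerpt. The helper name is hypothetical, and using the file name stem as the id is an assumption. The `literal.<fieldname>` request parameter is how the extracting handler takes a fixed field value.

```python
from urllib.parse import urlencode

def extract_url(base_url, doc_id, commit=True):
    """Build an /update/extract URL that supplies the document id explicitly."""
    params = {
        "literal.id": doc_id,  # sets the "id" field of the extracted document
        "commit": "true" if commit else "false",
    }
    return f"{base_url}/update/extract?{urlencode(params)}"

# Assumption: the .msg file name stem is used as the unique id.
url = extract_url("http://localhost:8983/solr",
                  "6252671B765A1748992DF1A6403BDF81A4A22E00")
```

The resulting string could be passed to SimplePostTool through `-Durl=...` when posting a single file.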

Re: Showing distance in results

2015-01-26 Thread Erick Erickson
A very small bit of Googling yields: https://wiki.apache.org/solr/SpatialSearch Best, Erick On Mon, Jan 26, 2015 at 2:18 PM, vit wrote: > I have Solr 4.2 > I need to calculate the distance between a point (0, 0) and lat lng in each > document. I do this > http://:9081/solr/collection1/select?q=
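
For the question quoted above, one way to get the computed value into the response is to request the function as a pseudo-field in `fl`, which Solr 4.x supports. The sketch below only builds the request parameters; the alias `distance` and the helper name are assumptions, while the field names `lat` and `lng` come from the original query. Note that `dist(2, ...)` is a plain Euclidean distance, not a great-circle distance.

```python
from urllib.parse import urlencode

def distance_query_params(lat_field="lat", lng_field="lng", origin=(0, 0)):
    """Build select parameters that also return the distance for each document."""
    func = f"dist(2,{lat_field},{lng_field},{origin[0]},{origin[1]})"
    return {
        "q": "{!func}" + func,         # score by the function, as in the question
        "fl": f"*,distance:{func}",    # alias:function pseudo-field in the field list
        "wt": "xml",
        "indent": "true",
    }

params = distance_query_params()
query_string = urlencode(params)
```

Since the main query is itself a function query, asking for `fl=*,score` would be another way to see the same number.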

Re: solr cloud replicas goes in recovery mode after update

2015-01-26 Thread Mark Miller
bq. Is this the correct approach ? It works, but it might not be ideal. Recent versions of ZooKeeper have an alternate config for this max limit though, and it is preferable to use that. See maxSessionTimeout in http://zookeeper.apache.org/doc/r3.3.1/zookeeperAdmin.html - Mark On Mon Jan 26 201
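
The interaction between the tick time and the session timeout can be sketched as below. By default ZooKeeper limits a client's requested session timeout to between 2 and 20 times `tickTime`, and `minSessionTimeout` / `maxSessionTimeout` override those bounds. The function name and the numbers in the usage note are illustrative assumptions, not values taken from this thread.

```python
def negotiated_session_timeout(requested_ms, tick_time_ms, min_ms=None, max_ms=None):
    """Return the session timeout the server would grant, in milliseconds."""
    # Defaults: minSessionTimeout = 2 * tickTime, maxSessionTimeout = 20 * tickTime.
    lower = 2 * tick_time_ms if min_ms is None else min_ms
    upper = 20 * tick_time_ms if max_ms is None else max_ms
    return max(lower, min(requested_ms, upper))
```

With a hypothetical `tickTime` of 2000 ms, a client asking for 45000 ms would be granted only 40000 ms; setting `maxSessionTimeout` to 60000 would let the full 45000 ms through without raising the tick time.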

Re: replicas goes in recovery mode right after update

2015-01-26 Thread Shawn Heisey
On 1/26/2015 2:26 PM, Vijay Sekhri wrote: > Hi Erick, > In solr.xml file I had zk timeout set to/ name="zkClientTimeout">${zkClientTimeout:45}/ > One thing that made a it a bit better now is the zk tick time and > syncLimit settings. I set it to a higher value as below. This may not > be advis

Re: How to implement Auto complete, suggestion client side

2015-01-26 Thread Alexandre Rafalovitch
You've got a lot of options depending on what you want. But since you seem to just want _an_ example, you can use mine from http://www.solr-start.com/javadoc/solr-lucene/index.html (gray search box there). You can see the source for the test screen (using Spring Boot and Spring Data Solr as a midd

An interesting approach to grouping

2015-01-26 Thread Ryan Josal
I have an index of products, and these products have a "category" which we can say for now is a good approximation of its location in the store. I'm investigating altering the ordering of the results so that the categories aren't interlaced as much... so that the results are a little bit more grou

Re: How to implement Auto complete, suggestion client side

2015-01-26 Thread Dan Davis
Cannot get any easier than jquery-ui's autocomplete widget - http://jqueryui.com/autocomplete/ Basically, you set some classes and implement a javascript that calls the server to get the autocomplete data. I never would expose Solr to browsers, so I would have the AJAX call go to a php script (o
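
The server-side piece of that arrangement (the browser calls a script, and the script calls Solr) might look like the sketch below, written in Python rather than PHP. It assumes a `/terms` request handler backed by Solr's TermsComponent is configured, that the indexed terms are lowercased, and that a field named `suggest_text` exists; the base URL, field name, and helper name are all hypothetical. The jQuery UI widget sends the typed text in a `term` request parameter.

```python
import re
from urllib.parse import urlencode

MAX_PREFIX_LEN = 50

def build_terms_request(term,
                        solr_base="http://localhost:8983/solr/collection1",
                        field="suggest_text", limit=10):
    """Turn the browser's 'term' value into a Solr terms request URL, or None."""
    # Drop everything except word characters, whitespace, and hyphens, so that
    # raw user input is never forwarded to Solr unchecked.
    prefix = re.sub(r"[^\w\s-]", "", term).strip().lower()[:MAX_PREFIX_LEN]
    if not prefix:
        return None
    params = {
        "terms": "true",
        "terms.fl": field,
        "terms.prefix": prefix,
        "terms.limit": limit,
        "wt": "json",
    }
    return f"{solr_base}/terms?{urlencode(params)}"
```

The script would fetch this URL, reduce the response to a JSON array of strings, and return that to the widget, so Solr itself is never exposed to the browser.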

Solr admin Url issues

2015-01-26 Thread Summer Shire
Hi All, Running solr (4.7.2) locally and hitting the admin page like this works just fine http://localhost:8983/solr/ # But on my deployment server my path is http://example.org/jetty/MyApp/1/solr/#

Re: Solr admin Url issues

2015-01-26 Thread Dan Davis
Is Jetty actually running on port 80? Do you have Apache2 reverse proxy in front? On Mon, Jan 26, 2015 at 11:02 PM, Summer Shire wrote: > Hi All, > > Running solr (4.7.2) locally and hitting the admin page like this works > just fine http://localhost:8983/solr/ #

Re: Solr admin Url issues

2015-01-26 Thread Summer Shire
Jetty is not running on port 80; it is running on ports that I defined for my instances in sequence. And no, I do not have an Apache2 reverse proxy in front :( > On Jan 26, 2015, at 8:18 PM, Dan Davis wrote: > > Is Jetty actually running on port 80? Do you have Apache2 reverse proxy > in front?

Re: replicas goes in recovery mode right after update

2015-01-26 Thread Vijay Sekhri
Hi Shawn, Erick, So it turned out that once we increased our indexing rate to the original full indexing rate, the replicas went back into recovery no matter what the zk timeout setting was. Initially we thought that increasing the timeout was helping, but apparently not. We just decreased indexing ra

Re: replicas goes in recovery mode right after update

2015-01-26 Thread Vijay Sekhri
Hi Shawn, Erick, From another replica, right after the same error, it seems the leader initiates the recovery of the replicas. This one has slightly different log information than the other one that went into recovery. I am not sure if this helps in diagnosing. Caused by: java.io.IOException: JBWEB0020

Re: Solr admin Url issues

2015-01-26 Thread Shawn Heisey
On 1/26/2015 9:02 PM, Summer Shire wrote: > Running solr (4.7.2) locally and hitting the admin page like this works just > fine http://localhost:8983/solr/ # > > > But on my deployment server my path is > http://example.org/jetty/MyA

Re: replicas goes in recovery mode right after update

2015-01-26 Thread Shawn Heisey
On 1/26/2015 9:34 PM, Vijay Sekhri wrote: > Hi Shawn, Erick > So it turned out that once we increased our indexing rate to the original > full indexing rate the replicas went back into recovery no matter what the > zk timeout setting was. Initially we though that increasing the timeout is > helpin

Want multiple df field on suggestion component.

2015-01-26 Thread Nitin Solanki
Hi, I have created 2 fields, "ngram" and "count". ngram => Stores 1 to 5 grams of words or phrases. count => Stores the frequency of each ngram. I am applying the df field on ngram in the Suggestion component. When I get the suggestion for a misspelled word, word and freq are coming, but now I need to use count f