Re: Minimum word length for stemming

2013-01-31 Thread Jan Høydahl
sis#Customizing_Stemming and the KeywordMarkerFilterFactory which lets you list a bunch of words you do not want the stemmers to touch. I guess you could easily implement your own TokenLengthMarkerFilterFactory which keeps words from being stemmed based on length. -- Jan Høydahl, search solution architect Cominv

Re: Anyone else see this error when running unit tests?

2013-02-01 Thread Jan Morlock
such a dependency inside the pom.xml, it should appear before lucene-core.jar: org.apache.lucene lucene-test-framework 4.1.0 test In my case, this fixed the problem. Best regards Jan -- View this message in context: http://lucene.472066.n3.nabble.com/Anyone-else-see-this-error-when-running-unit

Re: Solr 4 - Missing Field Collapsing Components

2013-02-04 Thread Jan Høydahl
Hi, Field collapsing is built-in and is also called Result Grouping: http://wiki.apache.org/solr/FieldCollapsing You simply enable it with ...&group=true&group.field=myfield If that does not work for you, please respond with detailed error messages so we can help you further. -- Jan

Re: Multi-threaded post.jar?

2013-02-04 Thread Jan Høydahl
from scratch is something to discuss. Please bring on your ideas of how you plan to extend it, perhaps even simplifying the code in the process? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 3. feb. 2013 kl. 17:19 skrev Upayavira

Re: Multi-threaded post.jar?

2013-02-05 Thread Jan Høydahl
pache/solr/util/SimplePostTool.java java -cp . org.apache.solr.util.SimplePostTool -h That's just how I've been thinking so far though. If other committers are happy with abandoning the simple-ness and instead create a best-practices based feature-rich tool with dependencies, then I&#x

Re: Multi-threaded post.jar?

2013-02-06 Thread Jan Høydahl
With dependencies I meant external jar dependencies. Perhaps extensions could have deps while leaving the "core" compilable without? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 5. feb. 2013 kl. 17:10 skrev Upayavi

Re: Multi-select faceting is not working when facet fields are configured in default request handler.

2013-02-07 Thread Jan Høydahl
If you want to override facet.field through the query, you have to override ALL facet.field's defined as default in reqeust handler, else those other facets are gone. You say "But it's not working." without specifying WHAT is not working. -- Jan Høydahl, search solution ar

Re: SolrCloud new zookeper node on different ip/ replicate between two clasters

2013-02-07 Thread Jan Høydahl
You should run replicated ZK: http://zookeeper.apache.org/doc/trunk/zookeeperStarted.html#sc_RunningReplicatedZooKeeper Give Solr the list of all ZK's and you're good to go -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.c

Re: Crawl Anywhere -

2013-02-11 Thread Jan Høydahl
Have a look at Nutch2, it is decoupled from HDFS and can store docs in e.g. HBase or other NoSql store. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 11. feb. 2013 kl. 06:16 skrev SivaKarthik : > Dear Erick, > Thanks

Re: Advanced Search Option in Solr corresponding to DtSearch options

2013-02-24 Thread Jan Høydahl
Hi, Have you considered contributing the deep position info patch back to Apache? It could be useful to enhance highlighting code, external in-doc highlighters etc. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 7. feb. 2013 kl

Re: [ANN] vifun: tool to help visually tweak Solr boosting

2013-02-25 Thread Jan Høydahl
m, so that you can point it to the "/browse" handler directly? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 23. feb. 2013 kl. 15:12 skrev jmlucjav : > Hi, > > I have built a small tool to help me tweak some p

Re: solr search integration

2013-02-25 Thread Jan Høydahl
Have you tried one of the extensions out there, such as https://code.google.com/p/magento-community-edition-solr/ ? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 25. feb. 2013 kl. 14:12 skrev Rohan Thakur : > hi all >

Max Score Query parser?

2013-02-25 Thread Jan Høydahl
max across multiple fields, not across multiple clauses. Perhaps something like this exists somewhere? If not, any pointers for where to start, since I'm not intimately familiar with the Scorer/Weight APIs? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.co

Re: Introducing Solrstrap: A blazing fast tool for querying Solr in a Googleish fashion

2013-02-25 Thread Jan Høydahl
framework? Plans for layout in general? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 17. feb. 2013 kl. 17:13 skrev Fergus McDowall : > Jack > > That sounds entirely reasonable- yes, it probably would make sense. > > Fe

Re: [ANN] vifun: tool to help visually tweak Solr boosting

2013-02-25 Thread Jan Høydahl
Hi, I actually tried ../griffonw run-app but it says "griffon-app does not appear to be part of a Griffon application." I installed griffon and tried again "griffon run-app" inside of griffon-app, but same error. -- Jan Høydahl, search solution architect Cominvent AS - ww

Re: Max Score Query parser?

2013-02-26 Thread Jan Høydahl
Hi, Thanks, that seems to be the quickest way. But I did not get the part with building a DisjunctionMaxQuery from the clauses. I would need to keep it as a BooleanQuery, wouldn't I, and compare the weights from each clause and nullify all but the max weight clause? -- Jan Høydahl, s

Re: Max Score Query parser?

2013-02-26 Thread Jan Høydahl
Good suggestion. That would work for simple queries, but won't work in this case, because the score is deep down a complex query tree and should not always be applied on root level, only if other boolean conditions nearby match. -- Jan Høydahl, search solution architect Cominve

Re: [ANN] vifun: tool to help visually tweak Solr boosting

2013-02-26 Thread Jan Høydahl
Still problems running from source, see https://github.com/jmlucjav/vifun/issues/27 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 26. feb. 2013 kl. 09:42 skrev jmlucjav : > To anyone that has tested this and is having an er

Re: Get search results in the order of fields names searched

2013-02-26 Thread Jan Høydahl
Check out dismax (http://wiki.apache.org/solr/ExtendedDisMax) q="John Hopkins"&defType=edismax&qf=Author^1000 Editors^500 Raw_text^1 It's not strictly layered, but by playing with the numbers you can achieve that effect -- Jan Høydahl, search solution architect Cominvent

Re: Poll: SolrCloud vs. Master-Slave usage

2013-02-26 Thread Jan Høydahl
where master/slave replication is used (auto configured from ZK), and where search load balancer is aware of only querying the slaves, and DistributedUpdateProcessor is aware of not pushing updates to the slaves, since this will happen by pull. -- Jan Høydahl, search solution architect Cominve

Re: Unexpected '-' Results

2013-02-26 Thread Jan Høydahl
https://issues.apache.org/jira/browse/SOLR-2649 here. Compare the results of 3) with the result when you put -working as a filter instead: +(((text:medical) (text:retired))~2) -working -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 26. feb. 20

Re: Unexpected '-' Results

2013-02-27 Thread Jan Høydahl
Well, as long as you know that introducing operators in your query will fallback to a pure boolean SHOULD query, you can do the workaround in your client query generating code, such as prefixing non-prefixed words with a "+". -- Jan Høydahl, search solution architect Co

Re: Unexpected '-' Results

2013-02-27 Thread Jan Høydahl
Or you can put that energy or money into contributing to solving SOLR-2649 :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 27. feb. 2013 kl. 17:36 skrev z...@navigo.com: > Yes... we were trying to avoid getting between solr

Re: query builder for solr UI?

2013-02-27 Thread Jan Høydahl
Hi, Can you be more specific on what query you want to build an what you expect end users to enter into that/those boxes? Why are you not just using eDisMax? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 27. feb. 2013 kl. 22:05

Re: Problems with Solr 3.6 and Magento

2013-02-27 Thread Jan Høydahl
Please direct your questions to Magento Inc or whoever maintains the integration. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 28. feb. 2013 kl. 00:22 skrev dboychuck : > I noticed that Magento is using the overwritePend

Re: query builder for solr UI?

2013-02-28 Thread Jan Høydahl
Hi, Have you tried edismax across your original (not text copyfield) fiels? If no, try it. If yes, which of your expectations did it not satisfy? Why would you want to "build" a query yourself, when Solr's queryParser is made to do just that for you from the input query string?

Re: query builder for solr UI?

2013-02-28 Thread Jan Høydahl
input for +foo is sent to Solr as q=%2Bfoo, since the plus otherwise would be a space :) So simply urlencode the whole user input when constructing your URL. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 28. feb. 2013 kl. 15:46 skrev

Re: Solr 3.6 - Out Of Memory Exception

2013-02-28 Thread Jan Høydahl
memory you have. Also, you should read this blog to try optimize your HW resources http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html My gut feel is that you still need to allocate more than 4G for Sold, until you get rid of all OOMs. -- Jan Høydahl, search solution architect

Re: Language Identification and Stemming

2013-03-01 Thread Jan Høydahl
topwords etc. Q2. You setup update.chain in your request handler and that's it. It is not possible to return to the client the detected language or any other response from the UpdateProcessors. You'll need to fetch the indexed document. -- Jan Høydahl, search solution arch

Re: Language Identification and Stemming

2013-03-02 Thread Jan Høydahl
be better off trying to guess the input language by means of IP detection, browser headers, statistical analysis or simply asking the user. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 1. mars 2013 kl. 23:47 skrev vybe3142 : &

Re: solr cloud index size is too big

2013-03-04 Thread Jan Høydahl
Can you tell whether it's the "index" folder that is that large or is it including the "tlog" transaction log folder? If you have a huge transaction log, you need to start sending hard commits more often during indexing to flush the tlogs. -- Jan Høydahl, search soluti

Re: How should I configure Solr to support multi-word synonyms?

2013-03-04 Thread Jan Høydahl
Hi, I have been using this plugin with success: https://github.com/healthonnet/hon-lucene-synonyms While it gives you multi-word synonyms, you lose the ability to have different synonym dictionaries per field. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr

Re: Solr 4.1 on tomcat with log4j and multiple solr webapps

2013-03-05 Thread Jan Høydahl
could put a unique log4j.properties file in /lib/ and refer to it as a tag in solrconfig, but I can't guarantee that it will be visible to log4j from there? Let us know what worked. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtrain

Re: Dynamic schema design: feedback requested

2013-03-06 Thread Jan Høydahl
. Then SolrCloud can have a compat mode serializing this info as the old familiar files for those who need an export to plain singlenode or the opposite. Perhaps we can use ZK to keep N revisions too, so you could roll back a series of changes? -- Jan Høydahl, search solution architect Cominvent

Re: Search a folder with File name and retrieve all the files matched

2013-03-08 Thread Jan Høydahl
an achieve that through adding a new field "filename" with a copyField resourcename->filename and a custom fieldType for filename with a PatternReplaceFilterFactory to remove the path. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtra

Re: Search a folder with File name and retrieve all the files matched

2013-03-09 Thread Jan Høydahl
ve it in the ID already. See any downsides? Please just go ahead with whatever you think best :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 9. mars 2013 kl. 04:35 skrev Erik Hatcher : > Thanks, Jan, for making the post tool do

Re: Bitwise operation

2013-03-20 Thread Jan Høydahl
hanges several times a day? If not, you should be fine! -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 20. mars 2013 kl. 10:01 skrev Christopher ARZUR : > Hello and thank you for your answers. > I'll try to explain

Re: Bitwise operation

2013-03-20 Thread Jan Høydahl
index the documents, since they still belong to the group, but instead you change the search filter to match the new group ACL. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 20. mars 2013 kl. 11:38 skrev Christopher ARZUR : >

Re: Keeping query index until a complete reindex is done

2013-03-20 Thread Jan Høydahl
://wiki.apache.org/solr/CoreAdmin Unload/delete the old index -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com 20. mars 2013 kl. 12:59 skrev Silvio Hermann : > Hi there > > > > is solr capable of keeping a query index u

Re: hot deploy of newer version of solr schema in production

2012-02-01 Thread Jan Høydahl
e and frontend changes to take place at the same time as a major search engine upgrade, so it's not always as easy as switching the backend, you need to synchronize things. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 1.

Re: StopFilterFactory and "qf" containing some fields that use it and some that do not

2012-02-01 Thread Jan Høydahl
perhaps it would be a nice way to circumvent this problem in a > future release... I created a JIRA issue to investigate if it is possible to implement this. See https://issues.apache.org/jira/browse/SOLR-3085 -- Jan Høydahl, search solution architect Cominvent AS - www.cominve

Re: Copy Index from Solr 3.2.0 to Solr 3.4.0

2012-02-04 Thread Jan Høydahl
grade, previous versions of Solr will no longer be able to read your indices. In a master/slave configuration, all searchers/slaves should be upgraded before the master. If the master were to be updated first, the older searchers would not be able to read the new index format. -- Jan Hø

Re: Help:Solr can't put all pdf files into index

2012-02-09 Thread Jan Høydahl
Hi, Are you 100% sure that the filename is globally unique, since you use it as the uniqueKey? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. feb. 2012, at 08:30, 荣康 wrote: > Hey , > I am using solr as my search eng

Re: Range facet - Count in facet menu != Count in search results

2012-02-09 Thread Jan Høydahl
Hi, If you use trunk (4.0) version, you can say fq=price:[10 TO 20} and have the upper bound be exclusive. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 10. feb. 2012, at 00:58, Yuhao wrote: > I've cha

Re: Solr / Tika Integration

2012-02-10 Thread Jan Høydahl
I think you need to control the parameter "enableAutoSpace" in PDFBox. There's a JIRA for it, but it depends on some Tika1.1 stuff as far I can understand https://issues.apache.org/jira/browse/SOLR-2930 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.co

Re: solr search speed is so slow.

2012-02-11 Thread Jan Høydahl
not binary) on average? Also, newer versions of Solr may have optimizations for faster highlighting.. Another thing from your reqHandler config. You use "maxAnalyzedChars=-1". The correct should be hl.maxAnalyzedChars and this will only work for the original highlighter, not fvh. --

Re: Setting up logging for a Solr project that isn't in tomcat/webapps/solr

2012-02-11 Thread Jan Høydahl
You can unpack your war (jar -xvf solr.war), change logging.properties and then pack it again (jar -cvf solr.war) You can also try to specify a new folder in a tag in solrconfig.xml and put your prop file there. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr

Re: Do we need reindexing from solr 1.4.1 to 3.5.0?

2012-02-13 Thread Jan Høydahl
Hi, If you keep the old schema and solrconfig, you should be able to simply upgrade your solr.war. Solr 3.5 will read the old index format and write the new one. Once you've upgraded the index will not be readable by 1.4.1. Test this on a test environment to verify. -- Jan Høydahl, s

Re: Solr binary response for C#?

2012-02-15 Thread Jan Høydahl
Hi, I just created a JIRA to investigate an Avro based serialization format for Solr: https://issues.apache.org/jira/browse/SOLR-3135 You're welcome to contribute. Guess we'll first need to define schemas, then create an AvroResponseWriter and then support in the C# Solr client. -- J

Re: Stemming and accents (HunspellStemFilterFactory)

2012-02-15 Thread Jan Høydahl
ss to apply when parsing the dictionary into memory. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 14. feb. 2012, at 16:27, Chantal Ackermann wrote: > Hi Bráulio, > > I don't know about HunspellStemFilterFac

Re: Semantic autocomplete with Solr

2012-02-15 Thread Jan Høydahl
Check out http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/ You can feed it anything, such as a log of previous searches, or a pre-computed dictionary of "item" + "color" combinations that exist in your DB etc. -- Jan Høydahl, search solution ar

Re: Solr edismax clarification

2012-02-17 Thread Jan Høydahl
Please provide your full query, including your "qf" parameter and all other request parameters, and also the relevant fields/field-types from schema. Do you use stopwords? Can you also add "debugQuery=true" and paste in the parsedQuery? -- Jan Høydahl, search solution ar

Re: Date search by specific month and day

2012-03-05 Thread Jan Høydahl
quot; it would individually store multiple versions in the index, perhaps using poly-field? It could work exactly like DateField for full date input, but also allow queries like myDate:2012, myDate:2012-03, myDate:2012-03-05, myDate:[1991 TO 2012] etc. -- Jan Høydahl, search solution archite

Re: How to define a multivalued string type "langid.langsField" in solrconfig.xml

2012-03-05 Thread Jan Høydahl
without also doing mapping. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 27. feb. 2012, at 05:09, bing wrote: > Hi, all, > > I am using tika language detection. It is said that, if "langid.langsField" >

Re: errata for solr tutorial

2012-03-05 Thread Jan Høydahl
Hi, Thanks for reporting. This is fixed now on the staging site, will be set live soon. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 1. mars 2012, at 16:50, Nicolai Scheer wrote: > Hi! > > Having just worked th

Re: How to Index Custom XML structure

2012-03-09 Thread Jan Høydahl
You could setup a ManifoldCF job to fetch the XMLs and then setup a new SolrOutputConnection for /solr/update/xslt?tr=myStyleSheet.xsl where myStyleSheet.xsl is the stylesheet to use for that kind of XML. See http://wiki.apache.org/solr/XsltUpdateRequestHandler -- Jan Høydahl, search solution

Re: Solr 4.0

2012-03-12 Thread Jan Høydahl
Hi Robert, See http://wiki.apache.org/solr/Solr4.0 The developer community is working towards a 4.0-Alpha release expected in a few months, however no dates are fixed. Many already use a snapshot version of TRUNK. You are free to do so, at your own risk. -- Jan Høydahl, search solution

Re: solr 3.5 and indexing performance

2012-03-13 Thread Jan Høydahl
the profiler to see what part of the code takes up the time http://docs.oracle.com/javase/6/docs/technotes/tools/share/jvisualvm.html -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. mars 2012, at 16:42, Agnieszka Kukałowicz

Re: Can solr-langid(Solr3.5.0) detect multiple languages in one text?

2012-03-13 Thread Jan Høydahl
text into chunks and then do langid on each chunk. If you'd like to open a JIRA for this, it will not be forgotten... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 13. mars 2012, at 04:25, bing wrote: > Hi, all, &

Re: solr 3.5 and indexing performance

2012-03-13 Thread Jan Høydahl
the issue your polish hunspell dictionaries. Then we'll try to reproduce the error. I wonder if this performance decrease is also seen for English dictionaries? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 13. mars 2012,

Re: Solr Monitoring / Stats

2012-03-13 Thread Jan Høydahl
And here is a page on how to wire Solr's JMX info into OpenNMS monitoring tool. Have not tried it, but as soon as a collector config is defined once I'd guess it could be re-used, maybe shipped with Solr. http://www.opennms.org/wiki/JMX_Collector -- Jan Høydahl, search solution

Re: Field Value Substitution

2012-03-16 Thread Jan Høydahl
e" at query time? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 15. mars 2012, at 08:13, tosenthu wrote: > Hi > > I have a scenario, where I store a field which is an Id, > > ID field > -- > 1

Re: whethere solr 3.3 index file is compatable with solr 4.0

2012-03-22 Thread Jan Høydahl
That's correct. Solr4 will read your existing index and let you use it with the feature set it already has. But in order for you to use new fieldTypes, you need to re-index your data. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtrainin

Re: Is it possible to grouping solr results by their domain ?

2012-04-09 Thread Jan Høydahl
Sure! http://wiki.apache.org/solr/FieldCollapsing -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. apr. 2012, at 07:27, hadi wrote: > I have crawled many site with nutch and using solr 3.4 to browse the results > but

Re: Cloud-aware request processing?

2012-04-09 Thread Jan Høydahl
Hi, Instead of using Solr, you may want to have a look at Hadoop or another framework for distributed computation, see e.g. http://java.dzone.com/articles/comparison-gridcloud-computing -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training

Re: Solr 3.5 takes very long to commit gradually

2012-04-12 Thread Jan Høydahl
What operating system? Are you using spellchecker with buildOnCommit? Anything special in your Update Chain? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. apr. 2012, at 06:45, Rohit wrote: > We recently migrated f

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-13 Thread Jan Høydahl
the hadoop cluster, being responsible for the Reduce part building the indexes? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 13. apr. 2012, at 04:23, Otis Gospodnetic wrote: > Hello Ali, > >> I'm trying to s

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-14 Thread Jan Høydahl
Hi, This won't give you the performance you need, unless you have enough RAM on the Solr box to cache the whole index in memory. Have you tested this yourself? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 12. apr. 201

Re: Location Based Searching – Long Lat?

2012-04-16 Thread Jan Høydahl
Hi, There is no geocoding API in Solr as per now so you could handle that in your PHP app. But check out https://issues.apache.org/jira/browse/SOLR-2833 for potential solution. I have a first version of the processor which I can upload to that JIRA if you're interested. -- Jan Høydahl, s

Re: Issue with Solr 3.5 while using TikaEntityProcessor on .docx files

2012-04-16 Thread Jan Høydahl
Hi, Solr3.6 is just out with Tika 1.0. Can you try that? Also, Solr TRUNK now has Tika 1.1... I recommend downloading Tika-App and testing your offending files directly with that http://tika.apache.org/1.1/gettingstarted.html -- Jan Høydahl, search solution architect Cominvent AS

Re: Can Solr solve this simple problem?

2012-04-16 Thread Jan Høydahl
he.org/solr/FunctionQuery for more functions But you could also to much of this through boost queries &sort=score desc &bq=language:FRA^50 %bq=age:[53 TO *]^20 > 3. Does Solr provide realtime index updating or updating every N minutes? Sure, there is Near Real-time indexing in TRUNK (coming 4.0) Jan

Re: Options for automagically Scaling Solr (without needing distributed index/replication) in a Hadoop environment

2012-04-17 Thread Jan Høydahl
will handle indexing the realtime shards as today, but indexing the static shards will be handled by Katta. If Katta adds a shard it will tell SolrCloud by updating the ZK tree, and SolrCloud will pick up the shard and start serving search for it.. -- Jan Høydahl, search solution ar

Re: Can I use Field Aliasing/Renaming on Solr3.3?

2012-04-17 Thread Jan Høydahl
You'll have to upgrade to 3.6. Upgrading is really easy and should be 100% back-compat. Just keep your old config and drop in the new solr.war, then you'll get the new features. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraini

Re: Can Solr solve this simple problem?

2012-04-17 Thread Jan Høydahl
here are better ways to do what you REALLY wanted instead of limiting yourself to the ORDER BY syntax etc. One of Solr's strengths is relevancy and FunctionQueries and it can do amazing things :) Further answers below.. -- Jan Høydahl, search solution architect Cominvent AS - www.cominve

Re: Jira 1540

2012-04-17 Thread Jan Høydahl
Simply try using Solr3.6 to read your old 1.3 indices. Chances are that it will work - without the exceptions :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. apr. 2012, at 11:08, Ramprakash Ramamoorthy wrote: > I

Re: Can Solr solve this simple problem?

2012-04-17 Thread Jan Høydahl
word. 3. On http://wiki.apache.org/solr/PublicServers the only one saying so explicitly is Jeeran - I'm sure others can fill in with more examples -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 17. apr. 2012, at 12:10, Al

Re: Difference between Search result from Admin console and solr/browse

2012-04-18 Thread Jan Høydahl
you'd like to output in the reults. A simple way to prototype transforms is to write an XSL and using the XSLTUpdateRequestHandler at solr/update/xslt instead of the XML handler. See http://wiki.apache.org/solr/XsltUpdateRequestHandler -- Jan Høydahl, search solution architect Cominv

Re: minimum match and not matched words / term frequency in query result

2012-04-18 Thread Jan Høydahl
instance. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 18. apr. 2012, at 17:34, giovanni.bricc...@banzai.it wrote: > Hi > > I have a dismax query with a mininimum match settings, this allows some terms > to be miss

Re: Solr 3.6 parsing and extraction files

2012-04-18 Thread Jan Høydahl
/solrconfig.xml which directives are included near the top of the file, this should give you a hint of how to configure your own solrconfig.xml depending on where you put those libs. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Problems with edismax parser and solr3.6

2012-04-18 Thread Jan Høydahl
Hi, Thanks for reporting this. I've created a bug ticket for this at https://issues.apache.org/jira/browse/SOLR-3377 -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 18. apr. 2012, at 17:00, Bernd Fehling wrote: >

Re: Language Identification

2012-04-20 Thread Jan Høydahl
Hi, Solr just reuses Tika's language identifier. But you are of course free to do your language detection on the Nutch side if you choose and not invoke the one in Solr. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 20

Re: 'Error 404: missing core name in path' in Solr

2012-04-23 Thread Jan Høydahl
Hi, Perhaps your search server uses a multi core setup? In that case you need your core name as part of the URL http://wiki.apache.org/solr/CoreAdmin#Example -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 23. apr. 2012, at 01

Re: Language Identification

2012-04-23 Thread Jan Høydahl
I think nothing has "moved". We just offer Solr users to do language detection inside of Solr, using any of these two libs. If you choose to do language detection on client side instead, using any of these, what is stopping you? -- Jan Høydahl, search solution architect Co

Re: Using Customized sorting in Solr

2012-04-26 Thread Jan Høydahl
ext batch of ads, you go group.offset=1 etc etc. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 26. apr. 2012, at 08:10, solr user wrote: > Hi, > > We are planning to move the search of one of our listing based portal to

Re: Weird query results with edismax and boolean operator +

2012-04-28 Thread Jan Høydahl
Hi, What is your "qf" parameter? Can you run the three queries with debugQuery=true&echoParams=all and attach parsed query and all params? It will probably explain what is happening. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.so

Re: Weird query results with edismax and boolean operator +

2012-04-30 Thread Jan Høydahl
Hi, I see that you have already commented on SOLR-2649 "MM ignored in edismax queries with operators". So let's continue the way towards resolution there... -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 30. a

Re: >1MB file to Zookeeper

2012-05-05 Thread Jan Høydahl
the put(key,val) method chops up the file and stores it spanning multiple 1M ZK nodes, and the get(key) method assembles all parts and returns the object. It would be good enough for most, but if you require something better you can easily impl support for CouchDb, Voldemort or whatever. -- Jan

Re: Solritas in production

2012-05-06 Thread Jan Høydahl
never expose Solr directly to the world. So you should feel totally comfortable continuing to use Solr over HTTP from PHP! -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 6. mai 2012, at 14:02, András Bártházi wrote: >

Re: Partition Question

2012-05-06 Thread Jan Høydahl
arding caches etc. Can you get back with some more details? -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 6. mai 2012, at 17:07, Yuval Dotan wrote: > Hi All > We have an index of ~2,000,000,000 Documents and the query

Re: Solritas in production

2012-05-07 Thread Jan Høydahl
ction, and not intended to either. Even if you moved the Solritas code to another Tomcat instance to avoid direct Solr access, you would still need to put extensive development effort into the Solritas templates before you could call it a finished search front end. What is so bad with PHP after al

Re: Can one determine which results are "good enough" to alert users about?

2012-05-09 Thread Jan Høydahl
t the alerts themselves contain only top-N hits, with a link in the email to see all (of course only those which are new or updated since last alert). -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. mai 2012, at 10:50, Chris H

Re: Edismax lowercaseOperators=false does not ignore lowercase and

2012-05-09 Thread Jan Høydahl
uce a "global" stopwords list somewhere, would probably make sense at QP level, alternatively as a SearchComponent. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. mai 2012, at 11:49, jpascua wrote: > Hi, > > I

Re: Is it possible to limit the bandwidth of replication

2012-05-09 Thread Jan Høydahl
> I think we have to add this for java based rep. +1

Re: Can one determine which results are "good enough" to alert users about?

2012-05-10 Thread Jan Høydahl
query for "Software License" -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 9. mai 2012, at 22:55, Otis Gospodnetic wrote: > Hi Chris, > > I think there is some confusion here. > When people say things about

Re: SOLR Security

2012-05-11 Thread Jan Høydahl
Hi, There is nothing stopping you from pointing Ajax-SOLR to a URL on your app-server, which acts as a security insulation layer between the Solr backend and the world. In this (thin) layer you can analyze the input and choose carefully what to let through and not. -- Jan Høydahl, search

Re: Editing long Solr URLs - Chrome Extension

2012-05-11 Thread Jan Høydahl
uld be nice if the edit box was multi-line, or perhaps adjusts to the size of the content -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 11. mai 2012, at 07:32, Amit Nithian wrote: > Hey all, > > I don't

Re: query with DATE FIELD AND RANGE query using dismax

2012-05-15 Thread Jan Høydahl
Hi, You can't. Try eDisMax instead: http://wiki.apache.org/solr/ExtendedDisMax -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 15. mai 2012, at 11:05, ayyappan wrote: > Hi > > My queries ar

Re: Fault tolerant Solr replication architecture

2012-05-21 Thread Jan Høydahl
ction some places. [1] https://builds.apache.org/job/Solr-trunk/lastSuccessfulBuild/artifact/artifacts/ [2] http://wiki.apache.org/solr/SolrCloud -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 21. mai 2012, at 09:

Re: Remote streaming - posting a URL which is password protected

2012-05-21 Thread Jan Høydahl
stream.url=http://username:passw...@somewebsite.com/docs/DOC2609 but I have not tested it. Why can't you download the file locally first? If you're looking for a production grade HTTP crawler you could look at ManifoldCF. -- Jan Høydahl, search solution architect Cominvent AS - www.fa

Re: How can i search site name

2012-05-22 Thread Jan Høydahl
UrlClassifyProcessor, see SOLR-2826. -- Jan Høydahl, search solution architect Cominvent AS - www.facebook.com/Cominvent Solr Training - www.solrtraining.com On 22. mai 2012, at 08:03, Shameema Umer wrote: > Sorry, > Please let me know how can I search site name using the solr query syntax

<    1   2   3   4   5   6   7   8   9   10   >