Re: Exact match works only for some of the strings

2017-03-16 Thread Alvaro Cabrerizo
Hello, I've tested on an old solr 4.3 instance and the schema and the field definition are fine. I've also checked that only the query nameExact:"Guardian EU-referendum" gives the result, the other one you have commented (nameExact:"Guardian US") gives 0 hits. Maybe, you forgot to re-index after s

Re: Hitting complex multilevel pivot queries in solr

2016-02-19 Thread Alvaro Cabrerizo
. I just need top 100 counts > from the leaf level of the pivot. > I'm also looking at facet.threads which could give responses to an extent. > But It does not solve my issue. > > Hovewer, the Endeca equivalent of this application seems to be working > well. > Example Endec

Re: Hitting complex multilevel pivot queries in solr

2016-02-18 Thread Alvaro Cabrerizo
Hi, The idea of copying fields into a new one (or various) during indexing and then facet the new field (or fields) looks promising. More information about data will be helpful (for example if the fields:country, state.. are single or multivalued). For example if all of the fields are single value

Re: Display entire string containing query string

2016-02-18 Thread Alvaro Cabrerizo
Hi, To understand Binoy's answer, please check the: The fl (Field List) Parameter . If you want to include: "*fragments of documents that match the user's query to be included

Re: Problem with queries that includes NOT

2015-02-25 Thread Alvaro Cabrerizo
Hi, The edismax parser should be able to manage the query you want to ask. I've made a test and the next both queries give me the right result (see the parenthesis): - {!edismax}(NOT id:7 AND NOT id:8 AND id:9) (gives 1 hit the id:9) - {!edismax}((NOT id:7 AND NOT

Re: Facet on TopDocs

2015-02-25 Thread Alvaro Cabrerizo
Hi, The facet component works with the whole result set, so you can't get the facets for your topN documents. A naive way you can fulfill your requirement is two implement it in two steps: - Request your data and recover the doc ids. - Create a new query with the selected ids (id:id1 OR id:

Re: Trending functionality in Solr

2015-02-09 Thread Alvaro Cabrerizo
Hi, In a project I'm working on, we adopted the same approach as the one commented by Jorge Luis (use a separate core and let the front-end send the query to solr to store it). Depending on the analysis you want to achieve it could be interesting to delete duplicates or not (have 1 solr document p

Re: Problem with faceting

2015-02-06 Thread Alvaro Cabrerizo
worked fine, and I think that the problem has nothing to do with the > definition. > Besides, I have seen that when the correct value in the facet field would > be 2, Solr shows 4, and when it would be 1 it shows 2. In conclusion, for > some reason values are being duplicated. Why? I hav

Re: Problem with faceting

2015-02-06 Thread Alvaro Cabrerizo
Hi David, Yes it sounds weird. Just for testing purpose, It would be nice to have the ID_bent fieldtype definition. Regards. On Fri, Feb 6, 2015 at 9:05 AM, wrote: > Hello, > > we have been using faceting for a long time, but now I have discovered a > problem that I can't understand: > > the

Re: Detect ongoing Solr Import and its Completion

2015-02-05 Thread Alvaro Cabrerizo
Maybe you are asking for the status command. Currently this is the url I invoke for checking whether the import process is running (or has failed) >From the cwiki: The URL is http://:/solr//dataimport?command=status. It returns statistics on the number of documents created, deleted, queries run,

Re: Reading data from another solr core

2015-01-28 Thread Alvaro Cabrerizo
Hi, I usually use the SolrEntityProcessor for moving/transform data between cores, it's a piece of cake! Regards. On Wed, Jan 28, 2015 at 8:13 AM, solrk wrote: > Hi Guys, > > I have multiple cores setup in my solr server. I would like read/import > data > from one core(source) into another co

query time join (stored or indexed value field?)

2015-01-26 Thread Alvaro Cabrerizo
Hi, Is the time join query using stored data or indexed data from the fields set in "from" and "to"? (For example, the facet feature makes the count based on the indexed data) I've made an small example (using tokenizers, stopwords...) and it seems that the join

Re: Ignore whitesapce, underscore using KeywordTokenizer... EdgeNGramFilter

2015-01-21 Thread Alvaro Cabrerizo
er@lucene.apache.org > Subject: Re: Ignore whitesapce, underscore using KeywordTokenizer... > EdgeNGramFilter > > I tried adding *PatternReplaceFilterFactory *in index section but it is > not working > > Example itemName data can be : > - "ABC E12" : if user types "A

Re: Ignore whitesapce, underscore using KeywordTokenizer... EdgeNGramFilter

2015-01-21 Thread Alvaro Cabrerizo
Hi, Not sure, but I think that the PatternReplaceFilterFactory or the PatternReplaceCharFilterFactory could help you deleting those characters. Regards. On Jan 21, 2015 7:59 PM, "Vishal Swaroop" wrote: > I am trying to implement type-ahead suggestion for single field which > should ignore white

Re: How to index data from multiple data source

2015-01-20 Thread Alvaro Cabrerizo
Hi, You can find several examples of configuring tika+dih to index pdf in internet (e.g. https://tuxdna.wordpress.com/2013/02/04/indexing-the-documents-stored-in-a-database-using-apache-solr-and-apache-tika/ ) Regards. On Jan 21, 2015 6:54 AM, "Yusniel Hidalgo Delgado" wrote: > > > Dear Solr co

Re: Filter Solr multivalued fields to be able to add pagination

2015-01-20 Thread Alvaro Cabrerizo
Hi, Currently, there is no way to sort by a multi-value field within solr (first the system should sort the content of the field, then sort documents...). Anyway, if you have a clear idea on how the sort should be done try to accomodate your data to your needs (in case it is posible). One option

Re: Newly observed Facets

2015-01-20 Thread Alvaro Cabrerizo
Hi, In case your data looks like: "id": "1", "userName": "one", "startTimeISO": "2015-01-20T17:24:32.888Z" "id": "2", "userName": "one", "startTimeISO": "2015-01-16T17:24:50.208Z" "id": "3", "userName": "two", "startTimeISO": "2015-01-20T17:25:06.109Z" You could use the next query combination

Re: Newly observed Facets

2015-01-20 Thread Alvaro Cabrerizo
ome usernames that were never seen in the last 30 days, > but they were observed today. > That is the main challenge I am having. > > How to identify which usernames from today were not seen in the last 30 > days. > > On Tue, Jan 20, 2015, 1:02 AM Alvaro Cabrerizo wrote: > &

Re: Newly observed Facets

2015-01-20 Thread Alvaro Cabrerizo
Ok, Thus as commented before, in case your starttimeISO is single-value you only need to add the range clause: startTimeISO:["2015-01-19T00: 00:00.000Z" TO "2015-01-20T00:00:00.000Z"]". There is no need to add both NOT A AND B as the documents that satisfy B will automatically satisfy A. If you q

Re: Newly observed Facets

2015-01-20 Thread Alvaro Cabrerizo
not present in the last 30 days data > and it was only observed today. So it is "new" > > Hope I could explain it to you well. Ask me for any more questions > > On Mon, Jan 19, 2015 at 11:45 PM, Alvaro Cabrerizo > wrote: > > > At first impression, everything

Re: Newly observed Facets

2015-01-19 Thread Alvaro Cabrerizo
At first impression, everything seems ok. Anyway, is the startTimeISO single-value or multivalued field? In case it is single-value the clause startTimeISO:["2015-01-19T00: 00:00.000Z" TO "2015-01-20T00:00:00.000Z"]" is sufficient to exclude other period of time. I also guess that the startTimeISO

Re: Boost documents having a field value

2014-06-02 Thread Alvaro Cabrerizo
Hi, One option (not tested by myself), could be the use of payloads ( http://wiki.apache.org/solr/Payloads). Regards. On Mon, Jun 2, 2014 at 7:58 PM, Hakim Benoudjit wrote: > Hi guys, > Is it possible in solr to boost documents having a field value (Ex. > :)? > I know that it's possible to bo

Re: Indexing DateField timezone problem

2014-05-16 Thread Alvaro Cabrerizo
I guess, you will need to modify your extraction select in order to fix it, using some date functions provided by the database manufacturer. For example, in some projects when using oracle as a data source i've been using the next recipe to modify the oracle TIMESTAMP(6) datatype to fit the solr da

Re: search multiple cores

2014-05-15 Thread Alvaro Cabrerizo
As far as I know (and how i have been using it), the join can't do what you want. The structure of the query you could try (among others) is : 1. http://SOLR_ADDRESS/coreA/select?q=A&fq={!join ... fromCore=coreB}B 2. http://SOLR_ADDRESS/coreA/select?q=A AND _query_:"{!join

commit persistence guarantee

2014-05-14 Thread Alvaro Cabrerizo
Hi, Is there any guarantee that every document is persisted on disk during a "commit avalanche" that produces the: "ERROR org.apache.solr.core.SolrCore – org.apache.solr.common.SolrException: Error opening new searcher. *exceeded limit of maxWarmingSearchers*=1, try again later". I've made some

Re: Join in solr to get data from two cores

2014-05-14 Thread Alvaro Cabrerizo
There are two previous threads in the list that i think can help you, http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201405.mbox/%3c1398929537117-4134045.p...@n3.nabble.com%3E http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201404.mbox/%3c20140403114242.horde.epx2xawezs3mvmt

Re: Join solr multi core select field mixed other core

2014-04-22 Thread Alvaro Cabrerizo
Hi, Please, check using ...q={!join+to=cat_id+from=id+fromIndex=category}name:*... Regards. On Tue, Apr 22, 2014 at 3:37 AM, hungctk33 wrote: > Hi . all. I has table > Product (id, name, cat_id) > Category (id, name); > > Select pro.id, pro.name, cat.name from Product as pro inner join Catego

Re: Solr join and lucene scoring

2014-04-09 Thread Alvaro Cabrerizo
Right, The examples posted before had a trick (both weren't pure join queries thats why the results had score). Having a pure join query (q={!join from=id to=id}printer) gives all the scores as 1 as Moritz said). Regards. On Wed, Apr 9, 2014 at 3:13 PM, Mikhail Khludnev wrote: > On Wed

Re: Does sorting skip everything having to do with relevancy?

2014-04-04 Thread Alvaro Cabrerizo
rt). The parameter boost will be overwritten if it is not defined as an invariant. Regards. On Fri, Apr 4, 2014 at 4:12 PM, Shawn Heisey wrote: > On 4/4/2014 12:48 AM, Alvaro Cabrerizo wrote: > > By default solr is using the sort parameter over the "score field". So if >

Re: Solr join and lucene scoring

2014-04-04 Thread Alvaro Cabrerizo
Hi, The defect you are referencing is closed with a resolution of *Invalid*, so it seems the scoring is working fine with the join. I've made the next two tests on my own data and seems it is working: *TestA* - fl=id,score - q=notebook - fq={!join from=product_list to=id fromIndex=prod

Re: Boosing Basic

2014-04-04 Thread Alvaro Cabrerizo
Hi, If I were you, I would start reading the edismax documentation. Apart from the wiki, you can find in every distribution a full example with the configuration of the edismax query parser (check the xml node reque

Re: Does sorting skip everything having to do with relevancy?

2014-04-03 Thread Alvaro Cabrerizo
Hi, By default solr is using the sort parameter over the "score field". So if you overwrite it using other sort field, yes solr will use the parameter you've provided. Remember, you can use multiple fields for sorting so you can make somethin

Re: Facetting by field then query

2014-03-27 Thread Alvaro Cabrerizo
I don't think you can do it, as pivot faceting doesn't let you use facet queries. The closer query I can imagine is: - q=sentence:bar OR sentence:foo - facet=true - facet.pivot=media_id,sentence At

Re: Expansion of fields in fl parameter

2013-11-28 Thread Alvaro Cabrerizo
ield names to text_01 , text01 and try again? Problem > still exists? > > Do you have dynamicField definitions? What happens when you request all > fields (fl=*)? > > > > > On Thursday, November 28, 2013 3:59 PM, Alvaro Cabrerizo < > topor...@gmail.com> wrote: > T

Expansion of fields in fl parameter

2013-11-28 Thread Alvaro Cabrerizo
The question: ¿Is there a way to avoid field expansion when adding flparameter to the request? The environment: Solr 4.4 using the next schema.xml The test has been done importing the next data: {"id": "1","text": "text","01text": "tex

Re: Thought exercise: features for Solr client

2013-11-14 Thread Alvaro Cabrerizo
Here goes my wishlist: - Transaction management - Access control at document level Regards. On Thu, Nov 14, 2013 at 10:35 AM, Alexandre Rafalovitch wrote: > Hello, > > I am trying to imagine what would a new, fresh, Solr client library look > like. There has been a number of features add

Re: Modify the querySearch to q=*:*

2013-11-13 Thread Alvaro Cabrerizo
Hi: First of all I have to say that I had never heard about *\* as the query to get all the documents in a index but *:* (maybe I'm wrong) . Re-reading "Apache Solr 4 cookbook", "Solr 1.4 Enterprise Search Server" and " Apache Solr 3 Enterprise Search Server" there is no trace for the query *\* a

Re: Why do people want to deploy to Tomcat?

2013-11-12 Thread Alvaro Cabrerizo
In my case, the selection of the servlet container has never been a hard requirement. I mean, some customers provide us a virtual machine configured with java/tomcat , others have a tomcat installed and want to share it with solr, others prefer jetty because their sysadmins are used to configure it

Re: Indexing a token to a different field in a custom filter

2013-11-12 Thread Alvaro Cabrerizo
Hi, Maybe the synonym filteris the mirror you can look in. You can start creating a new field type in your schema that is stanbol enhanced. Let's follow with the parallelism, in the case of synonym we could have

Re: SOLR keyword search with fq queries

2013-11-08 Thread Alvaro Cabrerizo
Please, check if "defaults", "appends" and "invariants" from http://wiki.apache.org/solr/SearchHandler can solve your problem. Regards. On Fri, Nov 8, 2013 at 6:05 AM, atuldj.jadhav wrote: > Hi All,I need your help to find a solution to one of the issue I am facing > with the keyword search.We

Re: Example of join using Solr/Lucene

2013-11-06 Thread Alvaro Cabrerizo
which companiId points to a company with a name that starts with "toyo" (AND _query_:{!join to=companyId from=id}name:toyo*) - *{!join to=id from=ownerId}* Use the former list of cars to make a join with the documents pointed by ownedId (i.e. people). Regards. On Wed, Nov 6,

Re: Example of join using Solr/Lucene

2013-11-06 Thread Alvaro Cabrerizo
s me all the documents and later I filter them saying just to return those which companyId has a company tha t satisfies id=23 or in the second example its name start by toyo and its id starts by 2. Regards. On Wed, Nov 6, 2013 at 7:55 PM, Alvaro Cabrerizo wrote: > You can check these link

Re: Example of join using Solr/Lucene

2013-11-06 Thread Alvaro Cabrerizo
uld be > good). > > Thanks > > > > On Tue, Nov 5, 2013 at 10:53 AM, Alvaro Cabrerizo >wrote: > > > In my case, everytime I've used joins, the FROM field was a multivalued > > string and the TO was an univalued string. > > > > Regards. &g

Re: Example of join using Solr/Lucene

2013-11-05 Thread Alvaro Cabrerizo
In my case, everytime I've used joins, the FROM field was a multivalued string and the TO was an univalued string. Regards. El 05/11/2013 18:37, "Tech Id" escribió: > I think Solr has the ability to do joins in the latest version as verified > on this issue: https://issues.apache.org/jira/browse

Re: measure result set quality

2013-10-21 Thread Alvaro Cabrerizo
> good (or better) results are in such a way that you can > > > program an evaluation?" Often the answer is "no"... > > > > > > One common technique is to have knowledgable users > > > do what's called A/B testing. You fire the query at two > &g

measure result set quality

2013-10-17 Thread Alvaro Cabrerizo
Hi, Imagine the next situation. You have a corpus of documents and a list of queries extracted from production environment. The corpus haven't been manually annotated with relvant/non relevant tags for every query. Then you configure various solr instances changing the schema (adding synonyms, sto

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
The synonyms.txt has defined the next associations defined. AIO=>All in one aio=>all-in-one Regards. On Wed, Oct 9, 2013 at 11:05 AM, Alvaro Cabrerizo wrote: > No, it has no synonyms. > > > On Wed, Oct 9, 2013 at 10:48 AM, Furkan KAMACI wrote: > >> Does "two

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
No, it has no synonyms. On Wed, Oct 9, 2013 at 10:48 AM, Furkan KAMACI wrote: > Does "two" has a synonym of "in" and "one"? > > > 2013/10/9 Furkan KAMACI > >> Does "two" has a synonym of "in" and "one"? >&g

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
> 2013/10/9 Alvaro Cabrerizo > > > Hi: > > > > I'm involved in a process o upgrade solr from 1.4 to 4.4 and I'm having a > > problem using SynonymFilterFactory within the process chain > > SynonymFilterFactory, StopFilterFactory . > > > >

synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
Hi: I'm involved in a process o upgrade solr from 1.4 to 4.4 and I'm having a problem using SynonymFilterFactory within the process chain SynonymFilterFactory, StopFilterFactory . I have configured synonyms.txt to expand the word AIO as: all-in-one. Well, when using solr 1.4 I get the following r

Re: Dynamic Field

2013-09-09 Thread Alvaro Cabrerizo
Hi: As you posted, a possibility could be, to define the fields "jobs" and "batch" as multivalued and use the partial updateto add new values to those fields. Hope it helps. On Sun, Sep 8, 2013 at 9:49 PM, anurag.jain wrote: >

Re: Solr substring search

2013-09-06 Thread Alvaro Cabrerizo
Hi: I would start looking: http://docs.lucidworks.com/display/solr/The+Standard+Query+Parser And the org.apache.lucene.queryparser.flexible.standard.StandardQueryParser.java Hope it helps. On Thu, Sep 5, 2013 at 11:30 PM, Scott Schneider < scott_schnei...@symantec.com> wrote: > Hello, > > I'm