This is the second time he has sent this shit. Kill his subscription. Is it
possible?
On Tue, Oct 26, 2010 at 10:38 PM, Yuchen Wang wrote:
> UNSUBSCRIBE
>
> On Tue, Oct 26, 2010 at 10:15 PM, Igor Chudov wrote:
>
> > UNSUBSCRIBE
> >
> > On Wed, Oct 27, 2010 at 12:14 AM, ST ST wrote:
> > > Looki
Creating a unique id for a schema is one of those design tasks:
http://wiki.apache.org/solr/UniqueKey
A marvelously lucid and well-written page, if I do say so. And I do.
On Tue, Oct 26, 2010 at 10:16 PM, Tharindu Mathew wrote:
> Really great to know you were able to fire up about 100 cores. Bu
UNSUBSCRIBE
On Tue, Oct 26, 2010 at 10:15 PM, Igor Chudov wrote:
> UNSUBSCRIBE
>
> On Wed, Oct 27, 2010 at 12:14 AM, ST ST wrote:
> > Looking for Developers Experienced in Solr/Lucene And/OR FAST Search
> Engines
> > from India (Pune)
> >
> > We are looking for off-shore India Based Developers
UNSUBSCRIBE
On Wed, Oct 27, 2010 at 12:14 AM, ST ST wrote:
> Looking for Developers Experienced in Solr/Lucene And/OR FAST Search Engines
> from India (Pune)
>
> We are looking for off-shore India Based Developers who are proficient in
> Solr/Lucene and/or FAST search engine .
> Developers in the
Really great to know you were able to fire up about 100 cores. But,
when it scales up to around 1000 or even more. I wonder how it would
perform.
I have a question regarding ids i.e. the unique key. Since there is a
potential use case that two users might add the same document, how
would we set th
Erick Erickson wrote:
> In general, the behavior when sorting is not predictable when
> sorting on a tokenized field, which "text" is. What would
> it mean to sort on a field with "erick" "Moazzam" as tokens
> in a single document? Should it be in the "e"s or the "m"s?
Might it be possible or reas
Thanks everybody for the inputs.
Looks like Steven's solution is the closest one but will lead to performance
issues when the query string has many terms.
I will try to implement the two filters suggested by Steven and see how the
performance matches up.
--
Thanks
Varun Gupta
On Wed, Oct 27, 2
Hi, I wanted to use a few fields from the dataase, but cannot use the DIH
because jdbc access to the database is not allowed. We can only go thru a
wrapper. As such, I would like to know how I can index the data obtained
through the db wrapper, using solrJ. I would have two fields to index - id
an
Do you want one number, or the sum for each group? For one number, the
stats component is fine.
For one number per group, grouping does not (yet) support the stats
component. This is the old SQL "Group By" command, right?
On Tue, Oct 26, 2010 at 6:42 AM, stockiii wrote:
>
> Hello.
>
> we want to
I think you have to write a "yet exact match" handler yourself (I mean yet
cause it's not quite exact match we normally know). Steve's answer is quite
near your request. You can do further work based on his solution.
At the last step, I'll suggest you eat up all blank within query string and
q
Facets are generated from indexed terms.
Depending on your need/use-case:
You can use a additional separate String field (which is not tokenized) for
facets, populate it via copyField. Search on tokenized field facet on
non-tokenized field.
Or
You can add solr.ShingleFilterFactory to your in
Use this field type -
On Tue, Oct 26, 2010 at 6:43 PM, Adam Estrada wrote:
> All,
> I am a new to Solr faceting and stuck on how to get multiple-word
> facets returned from a standard Solr query. See below for what is
> currently being returned.
>
>
>
>
>
> 89
> 87
> 87
> 87
> 84
You use maven-assembly-plugin's jar-with-dependencies to build a single
jar with all its dependencies
http://stackoverflow.com/questions/574594/how-can-i-create-an-executable-jar-with-dependencies-using-maven
@tommychheng
On 10/19/10 6:53 AM, Matt Mitchell wrote:
Hey thanks Tommy. To be more
All,
I am a new to Solr faceting and stuck on how to get multiple-word
facets returned from a standard Solr query. See below for what is
currently being returned.
89
87
87
87
84
60
32
22
19
15
15
14
12
11
10
9
7
7
7
6
6
6
6
...etc...
There are many terms in there that are 2 or 3 word phrases.
On Mon, Oct 25, 2010 at 2:45 AM, Alex Matviychuk wrote:
> Getting this when deploying to tomcat:
>
> [INFO][http-4443-exec-3][solr.schema.IndexSchema] readSchema():394
> Reading Solr Schema
> [INFO][http-4443-exec-3][solr.schema.IndexSchema] readSchema():408
> Schema name=tsadmin
> [ERROR][http-4
Hi Matt,
I think your concern about performance is spot-on, though.
The combinatorial explosion would be at query time, not at index time - my
solution has a single token indexed per document. My suggested query-time
filter would generate the following number of output terms, where C(n,k) is th
Try to do some changes, but it's not help:
In _http://localhost:8983/search/admin/schema.jsp I have, for example, term
"main" and frequency "7" for this term. But if I try to find this I don't
get any result. If I use wildcard, I have only 4 docs in response.
But if I try to find term "html" (fre
Hi everyone,
Do we need all lucene jars in the class path for this? Seems that the
solr-solrj and solr-core jars are not enough
(http://wiki.apache.org/solr/Solrj). It is asking for lucene jars in
the classpath. Could I know what jars are required to run this?
Thanks in advance.
--
Regards,
Th
Bah.. nope this would miss documents that only match a subset of the
given terms.
I'm going to have to go with Steven's approach as the right choice here.
Matt
On 10/26/2010 3:44 PM, Matthew Hall wrote:
Indeed, I'd missed the second part of his requirements, my and
solution is sadly insuffici
Indeed, I'd missed the second part of his requirements, my and solution
is sadly insufficient to this task.
The combinatorial part of you solution worries me a bit though Steven,
because his documents that are on the larger side of his corpus would
likely slow down query performance a bit whil
Thanks for the insight.
This is definitely a feasible solution because I only need to highlight when
the user open the document.
I guess the easiest way I can do this is to "reuse" the solr code (with some
modification) in my own application.
On Tue, Oct 26, 2010 at 2:35 PM, Pradeep Singh wrote:
Dennis,
I wasn't trying to force your admission of my rectitude - I was just getting
frustrated that the conversation was moving in spiral fashion, and was worried
that you might have intentionally engineered that.
I'm glad to hear that you weren't flame baiting.
Steve
> -Original Messag
I'm the LAST person anyone will ever need to worry about flame baiting. You did
notice that I retracted what I said and supported your point of view?
Sorry if my cryptic comment sounded critical. I was wrong, you were right :-)
Dennis Gearon
Signature Warning
It is always a good
Hi Dennis,
You wrote:
> If Solr is like Google, once documents matching only the ANDed items
> in the query ran out, then those that had only two of the terms, then
> only 1 of the terms, and then those close to it would start showing up.
[...]
> Plus, if he wants terms that contain ONLY those wor
I understand that the thread count is specified on root entities only. Does
it spawn multiple threads per root entity? Or multiple threads per
descendant entity? Can someone give an example of how you would make a
database query in an entity with 4 threads that would select 1 row per
thread?
Than
Plus, if he wants terms that contain ONLY those words, and no others, an ANDed
query would not do that, right? ANDed queries return results that must have ALL
the terms listed, and could have lots of other words, right?
Dennis Gearon
Signature Warning
It is always a good idea
If Solr is like Google, once documents matching only the ANDed items in the
query ran out, then those that had only two of the terms, then only 1 of the
terms, and then those close to it would start showing up.
Is this correct?
If so, it wouldn't match his requirements.
Dennis Gearon
Signatur
Dennis,
Do you mean to say that you read my earlier post, and disagree that it would
solve the problem? Or have you simply not read it?
Steve
> -Original Message-
> From: Dennis Gearon [mailto:gear...@sbcglobal.net]
> Sent: Tuesday, October 26, 2010 3:00 PM
> To: solr-user@lucene.apach
Good point. Since I might need such a query myself someday, how *IS* that done?
Dennis Gearon
Signature Warning
It is always a good idea to learn from your own mistakes. It is usually a
better idea to learn from others’ mistakes, so you do not have to make them
yourself. from
: [ERROR][http-4443-exec-3][util.plugin.AbstractPluginLoader] log():139
: java.lang.ClassCastException: org.apache.solr.schema.StrField cannot
: be cast to org.apache.solr.schema.FieldType
This almost certainly inidcates a classloader issue - i suspect you have
multiple solr related jars in vari
Um, maybe I'm way off base, but when Varun said:
> If I search with the text "samsung andriod GPS",
> search results should only conain "samsung", "GPS",
> "andriod" and "samsung andriod".
I interpreted that to mean that hit documents should contain terms from the
query, and nothing else. Makin
Another way you can do this is - after the search has completed, load the
field in your application, write separate code to reanalyze that
field/document, index it in RAM, and run it through highlighter classes. All
this as part of your web application outside of Solr. Considering the size
of your
Um.. you could change your default clause to AND rather than or.
That should do the trick.
Matt
On 10/26/2010 2:26 PM, Dennis Gearon wrote:
Overkill?
Dennis Gearon
I can't think of a way to do it without writing new
analysis filters.
But I think you could do what you want with two filters
(
"Son, don't touch that stove . . . .",
"OUCH! Hey Dad, I BURNED my hand on that stove, why didn't you tell me
that?!?#! You know I need to know WHY, not just DON'T!"
Dennis Gearon
> Very important: do not make a spelling or autosuggest index
> from a
> text field which some people can see and o
Overkill?
Dennis Gearon
>
> I can't think of a way to do it without writing new
> analysis filters.
>
> But I think you could do what you want with two filters
> (this is untested):
>
> 1. An index-time filter that outputs a single token
> consisting of all of the input tokens, sorted in a
> co
Can anyone tell my, why my search is so terrible? It's work realy strange.
Here my basic configs in schema.xml:
main filters:
and fields:
Hi,
we had the following problem. We added a field to schema.xml and fed our
master with the new data.
After that querying on the master is fine. But when we replicated
(solr1.4.0) to our slaves.
All slaves said they cannot find the new field (standard exception for
missing fields).
And that a
Hello esteemed Solr community --
I'm observing some inconsistent performance on our slave servers after
recently optimizing our master server.
Our configuration is as follows:
- all servers are hosted at Amazon EC2, running Ubuntu 8.04
- 1 master with heavy insert/update traffic, about 1
Hi,
I understand that I need to store the fields in order to use highlighting
"out of the box".
I'm looking for a way to highlighting using term offsets instead of the
actual text since the text is not stored. What am asking is is it possible
to modify the response (thru custom implementation) to
The Solr home is the -Dsolr.solr.home Java System property
Also make sure that -Dsolr.data.dir is define for your data directory, if it
is not already defined in the solrconfig.xml file
On Tue, Oct 26, 2010 at 10:46 AM, Upayavira wrote:
> You need to watch what you are setting your solr.home to
You need to watch what you are setting your solr.home to. That is where
your indexes are being written. Are they getting overwritten/lost
somehow. Watch the files in that dir while doing a restart.
That's a start at least.
Upayavira
On Tue, 26 Oct 2010 16:40 +0300, "Mackram Raydan"
wrote:
> Hey
Thanks Jonathan. FQ seems promising. I will give it a go.
Swapnonil Mukherjee
On 26-Oct-2010, at 7:29 PM, Jonathan Rochkind wrote:
> So, first of all, "exact" match is hard in Solr on tokenized fields.
> Tokenized fields don't really do that. So for exact match, you should
> probably use
mike anderson wrote:
I'm really curious if there is a clever solution to the obvious problem
with: "So your better off using a single index and with a user id and use
a query filter with the user id when fetching data.", i.e.. when you have
hundreds of thousands of user IDs tagged on each article
On Tue, Oct 26, 2010 at 9:15 AM, Savvas-Andreas Moysidis <
savvas.andreas.moysi...@googlemail.com> wrote:
> If I get your question right, you probably want to use the AND binary
> operator as in "samsung AND andriod AND GPS" or "+samsung +andriod +GPS"
>
>
N.b. For these queries you can also pass
So I fired up about 100 cores and used JMeter to fire off a few thousand
queries. It looks like the memory usage isn't much worse than running a
single shard. So thats good.
I'm really curious if there is a clever solution to the obvious problem
with: "So your better off using a single index and w
So, first of all, "exact" match is hard in Solr on tokenized fields.
Tokenized fields don't really do that. So for exact match, you should
probably use a non-tokenized field (string or text with keywordtokenizer
(which should really be called the non-tokenizer)). If there's only one
token in
Hi Javendra,
Thanks for the suggestion, I updated to Solr 1.4.1 and Solr Cell 1.4.1 and
tried sending a zip file that contained several html documents.
Unfortunately, that did not solve the problem.
Here's the curl command I used:
curl "
http://localhost:8983/solr/update/extract?literla.id=d...@u
I have a solr instance in my server, and I can make request with internet
explorer. However, with other browsers I can't.
Error given;
*XMLHttpRequest cannot load http://. Origin http://... is not allowed by
Access-Control-Allow-Origin.*
I changed my apache server conf file and added this lin
Andrzej Bialecki wrote:
> On 2010-10-25 11:22, Toke Eskildsen wrote:
>> On Thu, 2010-07-22 at 04:21 +0200, Li Li wrote:
>>> But itshows a problem of distrubted search without common idf.
>>> A doc will get different score in different shard.
>> Bingo.
>>
>> I really don't understand why this funda
Hey everyone,
I apologize if this question is rudimentary but it is getting to me and
I did not find anything reasonable about it online.
So basically I have a Solr 1.4.1 setup behind Tomcat 6. I used the
SolrTomcat wiki page to setup. The system works exactly the way I want
it (proper searc
Check out this link
http://wiki.apache.org/solr/FieldOptionsByUseCase
You need to store the field if you want to use the highlighting feature.
If you need to retrieve and display the highlighted snippets then the fields
definitely needs to be stored.
To use term offsets, it will be a good idea
Hi Varun,
I can't think of a way to do it without writing new analysis filters.
But I think you could do what you want with two filters (this is untested):
1. An index-time filter that outputs a single token consisting of all of the
input tokens, sorted in a consistent way, e.g.:
"mobile wi
If I get your question right, you probably want to use the AND binary
operator as in "samsung AND andriod AND GPS" or "+samsung +andriod +GPS"
On 26 October 2010 14:07, Varun Gupta wrote:
> Hi,
>
> I have lot of small documents (each containing 1 to 15 words) indexed in
> Solr. For the search qu
Hi,
I have lot of small documents (each containing 1 to 15 words) indexed in
Solr. For the search query, I want the search results to contain only those
documents that satisfy this criteria "All of the words of the search result
document are present in the search query"
For example:
If I have the
Am about to implement a custom query that is sort of mash-up of Facets,
Highlighting, and SpanQuery - but thought I'd see if anyone has done
anything similar.
In simple words, I need facet on the next word given a target word.
For example, if my index only had the following 5 documents (co
Try:
select IF(sub_cat_id=2002, DATE_FORMAT(ad_post_date,
'%Y-%m-%dT00:00:00Z/DAY'), null) as 'ad_sort_field' from
tcuser.ad_details where
Ephraim Ofir
-Original Message-
From: Pawan Darira [mailto:pawan.dar...@gmail.com]
Sent: Tuesday, October 26, 2010 1:29 PM
To: solr-user@lucene.apac
Hi Everybody,
Thanks Ephraim and Peter. I think I got my answer.
Swapnonil Mukherjee
On 26-Oct-2010, at 4:23 PM, Ephraim Ofir wrote:
> Note that usually when you change the schema.xml you have not only to
> restart solr, but also rebuild the index, so the issue of how to reload
> the file se
Hi Everybody,
Let me give you a brief idea of our Solr document. We have about 6 text type
fields, each containing IPTC data extracted from photos. Search is performed
mostly on these 6 fields.
We also have a mutlivalue field named group_id that contains a list of all the
group_ids that have a
My Sql is
select IF(sub_cat_id=2002, ad_post_date, null) as 'ad_sort_field' from
tcuser.ad_details where
+---+
| ad_sort_field |
+---+
| 2010-05-30|
| 2010-05-02|
| 2010-10-07|
| NULL|
| 2010-10-15|
| NULL|
++
T
On Tue, Oct 26, 2010 at 3:56 PM, Pawan Darira wrote:
> I am using mysql database, and, field type is "date"
[...]
Could you show us the exact SELECT statement, and some example
values returned by running the SELECT directly at a mysql console?
Regards,
Gora
Note that usually when you change the schema.xml you have not only to
restart solr, but also rebuild the index, so the issue of how to reload
the file seems like a small problem...
Ephraim Ofir
-Original Message-
From: Peter Karich [mailto:peat...@yahoo.de]
Sent: Tuesday, October 26, 201
Hi,
I've been looking thru the mailing archive for the past week and I haven't
found any useful info regarding this issue.
My requirement is to index a few terabytes worth of data to be searched.
Due to the size of the data, I would like to index without storing but I
would like to use the highli
This is probably just a date format problem, nothing to do with the IF()
statement. Try applying this on your date:
DATE_FORMAT(yourDate, '%Y-%m-%dT00:00:00Z')
Ephraim Ofir
-Original Message-
From: Pawan Darira [mailto:pawan.dar...@gmail.com]
Sent: Tuesday, October 26, 2010 12:26 PM
To:
Hi,
See this:
http://wiki.apache.org/solr/CoreAdmin#RELOAD
Solr will also load the new configuration (without restart the webapp)
on the slaves when using replication:
http://wiki.apache.org/solr/SolrReplication
Regards,
Peter.
Hi Everybody,
If I change my schema.xml to, do I have to rest
I am using mysql database, and, field type is "date"
On Tue, Oct 26, 2010 at 2:56 PM, Gora Mohanty wrote:
> On Tue, Oct 26, 2010 at 2:37 PM, Pawan Darira
> wrote:
> > Thanks Mr. Ephraim Ofir. I used the SELECT IF() for my requirement. The
> > query result is correct. But when i see it in my ind
Hi Xin,
from the wiki:
http://wiki.apache.org/solr/SolrConfigXml
The URL of the "ping" query is* /admin/ping
* You can also check (via wget) the number of documents. it might look
like a rusty hack but it works for me:
wget -T 1 -q "http://localhost:8080/solr/select?q=*:*"; -O - | tr '/>'
If you are using Solr Multicore http://wiki.apache.org/solr/CoreAdmin you can
issue a Reload command
http://localhost:8983/solr/admin/cores?action=RELOAD&core=core0
On 26 Oct 2010, at 11:09, Swapnonil Mukherjee wrote:
> Hi Everybody,
>
> If I change my schema.xml to, do I have to restart Solr.
Hi Everybody,
If I change my schema.xml to, do I have to restart Solr. Is there some way, I
can apply the changes to schema.xml without restarting Solr?
Swapnonil Mukherjee
On Tue, Oct 26, 2010 at 2:37 PM, Pawan Darira wrote:
> Thanks Mr. Ephraim Ofir. I used the SELECT IF() for my requirement. The
> query result is correct. But when i see it in my index, the value stored is
> something unusual bunch of characters e.g. "*...@6628ad5a"*
[...]
Which database are you i
Thanks Mr. Ephraim Ofir. I used the SELECT IF() for my requirement. The
query result is correct. But when i see it in my index, the value stored is
something unusual bunch of characters e.g. "*...@6628ad5a"*
Please suggest as to what went wrong.
- Pawan
On Mon, Oct 25, 2010 at 6:44 PM, Ephraim
Hi,
You need to share relevant parts of your schema for us to be able to see what's
going on.
Try using fieldType="text". Basically, you need a fieldType which has the
lowercaseFilter included.
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
On 25. okt. 2010, at 21.
Sounds like WordDelimiterFilter config issue, please refer to
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory
.
Also it will help if you could provide:
1) Tokenizers/Filters config in schema file
2) analysis.jsp output in admin page.
2010/10/26 wu liu
okay. how did you solve this ?
do you wrote an own importer ?
we have a "own" "importer" yet, but only for one instance of solr and one
index, we want to spit this in severeal cores and indexes and want to use
DIH because we think his indexing is better than a php skript ...
--
View this messa
The idea of ACL-based queries is: each document carries all of the
groups or roles that it is ok with. Each user search includes all of
the groups or roles the user has.
The roles are stored as multivalued string fields. Each ACL-based
query passes in "roles:A OR roles:B OR roles:C" and if any of
Filter queries are a set of bits which is ANDed against query results
at a very early stage of query processing. They are very useful. Note
that they are stored (I think) in parsed query order, so you have to
pass in the same filter query string each time.
On Mon, Oct 25, 2010 at 8:59 AM, Dennis
75 matches
Mail list logo