Solr + HDFS settings

2014-10-25 Thread Norgorn
I'm trying to run SOLR with HDFS
 in solrconfig.xml I've written 


  hdfs:///solr
  true
  1
  true
  16384
  true
  true
  true
  16
  192
 

But, when I'm trying to create collection, I get 
"Caused by: com.google.protobuf.InvalidProtocolBufferException: Protocol
message end-group tag did not match expected tag." 

tcpdump shows, that server answers with "Bad request", I've tried to set
path to datanode instead of namenode, but got the same result.
What am I doing wrong?

P.S. I've found, that the problem can be cause of different protobuf.jar,
I've changed that jar (and hadoop-*.jar for comparability) in my SOLR libs,
but the problem didn't change.




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-HDFS-settings-tp4165873.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Solr + HDFS settings

2014-10-25 Thread Norgorn
Ok, new problem, while collection or shard creating:

Caused by: no segments* file found in
NRTCachingDirectory(HdfsDirectory@3a19dc74
lockFactory=org.apache.solr.store.hdfs.HdfsLockFactory@43507d1b;
maxCacheMB=192.0 maxMergeSizeMB=16.0): files: [HdfsDirectory@3a19dc74
lockFactory=org.apache.solr.store.hdfs.hdfslockfact...@43507d1b-write.lock]

This problem is the same if I try to create with
"solr/admin/collections?action=CREATE&name=hdfs_test&numShards=1",
or with
"solr/admin/cores?action=CREATE&collection=hdfs_test&name=hdfs_test_sh1&shard=shard1".

Have no idea what to do with that.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-HDFS-settings-tp4165873p4165878.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: solr highlighting query

2014-10-25 Thread Erick Erickson
Well, the syntax is wrong. You probably want content_raw:"jobs
born"~15. The way slop works, it is the number of "moves" so this will
match "jobs was a man born somewhere" and "all persons born include
jobs". The difference is that the version that has "born" first and
"jobs" second will take an extra move, so if the search is "jobs born"
and the text was "born jobs", the second one would require more slop.

Best,
Erick

On Sat, Oct 25, 2014 at 2:48 AM, john eipe  wrote:
> Hi
>
> I'm trying to match keywords based on 2 fields and excluding order
> importance but with distance restriction.
>
> title:(Jobs) AND content_raw:(Jobs born)~15
>
> This throws error:
> org.apache.solr.search.SyntaxError: Cannot parse '(Jobs born)~15':
> Encountered "  "~15 "
>
> What's the correct way to frame this query?
>
> Regards,
> *John Eipe*
>
> “The Roots of Violence: Wealth without work, Pleasure without conscience,
> Knowledge without character, Commerce without morality, Science without
> humanity, Worship without sacrifice, Politics without principles”
> - Mahatma Gandhi


Re: AW: (auto)suggestions, but ony from a "filtered" set of documents

2014-10-25 Thread Jorge Luis Betancourt González
Maybe you could use copyField to populate one field with the values (fields) 
that you like to use for suggestions, then you could search only on this field 
and filter by any other field in your schema, and take advantage on the 
EdgeNGramFilter placed in the new search field.

Regards,

- Original Message -
From: "Clemens Wyss DEV" 
To: solr-user@lucene.apache.org
Sent: Saturday, October 25, 2014 2:43:46 AM
Subject: AW: (auto)suggestions, but ony from a "filtered" set of documents

Possibly making use of FilterAtomicReader?
http://lucene.apache.org/core/4_7_0/core/org/apache/lucene/index/FilterAtomicReader.html
 

-Ursprüngliche Nachricht-
Von: Clemens Wyss DEV [mailto:clemens...@mysign.ch] 
Gesendet: Freitag, 24. Oktober 2014 18:31
An: solr-user@lucene.apache.org
Betreff: AW: (auto)suggestions, but ony from a "filtered" set of documents

If possible I'd like to omit the sparate core. We have this approach in the 
current Lucene-only-solution. The main drawback IMHO is, that whenever the main 
index chnages you need to completely reindex the term-index.
Is it at all possible to write an own Suggester, which could do what I 
want/need? 

-Ursprüngliche Nachricht-
Von: Jorge Luis Betancourt Gonzalez [mailto:jlbetanco...@uci.cu] 
Gesendet: Freitag, 26. September 2014 19:15
An: solr-user@lucene.apache.org
Betreff: Re: (auto)suggestions, but ony from a "filtered" set of documents

Perhaps instead of the suggester component you could use the EdgeNGramFilter 
and provide partial matches so you will me able to configure a custom request 
handler that will "suggest" terms of phrases for you. I'm using this approach 
to provide queries suggestions, of course I'm indexing the queries into a 
separated core. 

Greetings,

On Sep 26, 2014, at 8:49 AM, Clemens Wyss DEV  wrote:

> Either my intention is dumb (pls let me know ;)), or there is no answer to 
> this problem. If so, I will have to index my sources into separate cores. 
> But then the questions arise:
> a) how do I get suggestions from more than one core? Multiple 
> suggest-requests, then merge?
> b) how doe I get (ranked) results from more than one core?
> In Lucene I was able to use a MultiIndexReader (one IndexReaders per index)
> 
> -Ursprüngliche Nachricht-
> Von: Clemens Wyss DEV [mailto:clemens...@mysign.ch] 
> Gesendet: Donnerstag, 25. September 2014 10:24
> An: solr-user@lucene.apache.org
> Betreff: (auto)suggestions, but ony from a "filtered" set of documents
> 
> What I'd like to do is
> http://localhost:8983/solr/solrpedia/suggest?q=atm&qf=source:
> 
> Through qf (or however the parameter shall be called) I'd like to restrict 
> the suggestions to documents which fit the given qf-query. 
> I need this filter if (as posted in a previous thread) I intend to put 
> "different kind of data" into one core/collection, cause suggestion shall be 
> restrictable to one or many source(s)

Concurso "Mi selfie por los 5". Detalles en 
http://justiciaparaloscinco.wordpress.com


Re: Remove indexes of XML file

2014-10-25 Thread Olivier Austina
Thank you Alex, I think I can use the file to delete corresponding indexes.

Regards
Olivier


2014-10-24 21:51 GMT+02:00 Alexandre Rafalovitch :

> You can delete individually, all (*:* query) or by specific query. So,
> if there is no common query pattern you may need to do a multi-id
> query - something like "id:(id1 id2 id3 id4)" which does require you
> knowing the IDs.
>
> Regards,
>Alex.
> Personal: http://www.outerthoughts.com/ and @arafalov
> Solr resources and newsletter: http://www.solr-start.com/ and @solrstart
> Solr popularizers community: https://www.linkedin.com/groups?gid=6713853
>
>
> On 24 October 2014 15:44, Olivier Austina 
> wrote:
> > Hi,
> >
> > This is newbie question. I have indexed some documents using some XML
> files
> > as indicating in the tutorial
> >  with the command :
> >
> > java -jar post.jar *.xml
> >
> > I have seen how to delete an index for one document but how to delete
> > all indexes
> > for documents within an XML file. For example if I have indexed some
> > files A, B, C, D etc.,
> > how to delete indexes of documents from file C. Is there a command
> > like above or other
> > solution without using individual ID? Thank you.
> >
> >
> > Regards
> > Olivier
>


How could I make more metadata indexed in Solr?

2014-10-25 Thread Mengying Wang
Hi everyone,

When I use the ./nutch parsechecker command to a pdf file, I see a number
of metadata, e.g., ETag="cbf961-5aafc-41e4319014b80"
meta:creation-date=2004-11-10T21:34:35Z
dcterms:modified=2004-11-10T21:34:35Z meta:save-date=2004-11-10T21:34:35Z
xmpTPg:NPages=10, etc. However, when I run the ./nutch indexchecker
command, only a few metadata appears, which will be indexed in the Solr. I
am wondering how could I make other metadata indexed in Solr too? Thank you!

Best,
Mengying (Angela) Wang