integrate solr and nutch index

2011-09-17 Thread hadi
I want to index my video and image files with the SolrJ API and index my
sites with Nutch 1.3, but the problem is this: to index the sites I have to
overwrite the Solr schema with the Nutch schema. If I do that, I lose the
Solr schema and can no longer index files with SolrJ. How can I solve this
problem? Should I define a separate core for each schema and its data?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/integrate-solr-and-nutch-index-tp3344176p3344176.html
Sent from the Solr - User mailing list archive at Nabble.com.


no result when searching sentences in solr

2011-09-20 Thread hadi
I indexed some PDF and DOCX files with SolrJ, but when I query for a
sentence such as "We'd be glad to have you accompany", or any other
sentence, the result is empty. Is there some configuration I am missing?
Note that I run the query in /solr/browse.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/no-result-when-searching-sentences-in-solr-tp3354659p3354659.html
Sent from the Solr - User mailing list archive at Nabble.com.


Search query doesn't work in solr/browse panel

2011-09-23 Thread hadi
When I enter a query like "something&fl=content" in solr/browse, the "&" and
"=" in the URL are converted to %26 and %3D and no results are returned. It
works in the solr/admin advanced search and also directly in the URL bar.
How can I solve this problem? Thanks
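
The conversion described above is what form encoding does to a value: the
whole text of the search box is sent as one parameter, so "&" and "=" are
escaped rather than starting a new parameter. A minimal sketch, assuming the
/browse form submits the box as q (an assumption; the class and method names
are illustrative only):

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class BrowseQueryEncoding {
    // Encodes a value the way an HTML form does before it is put in a URL.
    static String encodeFormValue(String raw) {
        return URLEncoder.encode(raw, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Typed into the search box: the whole string becomes the value of q.
        // prints /solr/browse?q=something%26fl%3Dcontent
        System.out.println("/solr/browse?q=" + encodeFormValue("something&fl=content"));

        // Written by hand in the URL bar: fl stays a separate parameter.
        // prints /solr/browse?q=something&fl=content
        System.out.println("/solr/browse?q=" + encodeFormValue("something") + "&fl=content");
    }
}
```

If this is what happens, extra parameters such as fl have to go into the URL
itself (or into the request handler defaults) rather than into the search box.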

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-query-doesn-t-work-in-solr-browse-pnnel-tp3363032p3363032.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: two cores but have single result set in solr

2011-09-23 Thread hadi
I index my files with SolrJ and crawl my sites with Nutch 1.3. As you
know, I have to overwrite the Solr schema with the Nutch schema in order
to view the results in solr/browse, so in this case I have to define two
cores. But I want a single result set, so that the user can search both
core indexes at the same time.


--
View this message in context: 
http://lucene.472066.n3.nabble.com/two-cores-but-have-single-result-set-in-solr-tp3363043p3363133.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: two cores but have single result set in solr

2011-09-24 Thread hadi
I do not know how to search both cores without defining the "shard"
parameter. Could you show me some solutions for my issue?

On 9/24/11, Yury Kats [via Lucene] wrote:
> Can you not use 'shard' parameter and specify both cores there?


--
View this message in context: 
http://lucene.472066.n3.nabble.com/two-cores-but-have-single-result-set-in-solr-tp3363043p3363901.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: two cores but have single result set in solr

2011-09-24 Thread hadi
I read the link, but
'http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solr&indent=true&q=ipod+solr'
returns an XML response, which is not useful for me. I want to run the
query in solr/browse, so the template engine needs to change. Do you
know how to change it so that it searches both cores? Thanks
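
Since shards is an ordinary request parameter, one thing to try is sending it
to the /browse handler instead of /select. The sketch below only builds such
a URL; the core names core0 and core1 and the class name are hypothetical,
and whether the Velocity templates render a distributed response cleanly is
not confirmed anywhere in this thread. Distributed search also expects both
cores to share a unique key field.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.List;

public class ShardsQueryUrl {
    // Builds a request URL that asks one handler to search several shards.
    static String build(String handlerUrl, List<String> shards, String query) {
        // Solr expects the shards as one comma-separated list of host:port/path.
        String shardList = String.join(",", shards);
        return handlerUrl
                + "?shards=" + URLEncoder.encode(shardList, StandardCharsets.UTF_8)
                + "&q=" + URLEncoder.encode(query, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // core0 and core1 are placeholder core names, not taken from the thread.
        List<String> shards = List.of(
                "localhost:8983/solr/core0",
                "localhost:8983/solr/core1");
        System.out.println(build("http://localhost:8983/solr/browse", shards, "ipod solr"));
    }
}
```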

On 9/24/11, Yury Kats [via Lucene] wrote:
> See this: http://wiki.apache.org/solr/DistributedSearch


--
View this message in context: 
http://lucene.472066.n3.nabble.com/two-cores-but-have-single-result-set-in-solr-tp3363043p3364459.html
Sent from the Solr - User mailing list archive at Nabble.com.

heap size problem when indexing files with solrj

2011-09-30 Thread hadi
I wrote a simple program with SolrJ that indexes files, but after about a
minute it crashed with
*java.lang.OutOfMemoryError: Java heap space*.

I use Eclipse and my machine has about 2 GB of memory. I set
-Xms1024M -Xmx2048M in the VM arguments of both Tomcat and my application in
the Debug Configuration, uncommented maxBufferedDocs in solrconfig and set
it to 100, then ran my application again, but it crashed as soon as it
reached files larger than 500 MB.

Is there any configuration for indexing large files with SolrJ?
The details of my SolrJ code are below:

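import java.io.File;
import org.apache.solr.client.solrj.impl.CommonsHttpSolrServer;
import org.apache.solr.client.solrj.request.AbstractUpdateRequest.ACTION;
import org.apache.solr.client.solrj.request.ContentStreamUpdateRequest;

// "file" is the java.io.File currently being indexed
String urlString = "http://localhost:8983/solr/file";
CommonsHttpSolrServer solr = new CommonsHttpSolrServer(urlString);

// Send the raw file to the extracting handler
ContentStreamUpdateRequest req = new
ContentStreamUpdateRequest("/update/extract");

req.addFile(file);
req.setParam("literal.id", file.getAbsolutePath());
req.setParam("literal.name", file.getName());
// Commit with waitFlush = true and waitSearcher = true
req.setAction(ACTION.COMMIT, true, true);

solr.request(req);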



--
View this message in context: 
http://lucene.472066.n3.nabble.com/heap-size-problem-when-indexinf-files-with-solrj-tp3382115p3382115.html
Sent from the Solr - User mailing list archive at Nabble.com.


How to change default operator in velocity?

2011-10-18 Thread hadi
In the Solr schema the defaultOperator value is OR, but when I search with
browse (http://localhost:8983/solr/browse) the default operator is AND, and
that setting in Solr has no effect on Velocity. How can I change the default
operator for the Velocity template engine?


--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-change-default-operator-in-velocity-tp3430871p3430871.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: How to change default operator in velocity?

2011-10-18 Thread hadi
Thanks for your reply. I deleted the dismax config from solrconfig.xml and
it works. Does that have any side effects?

On 10/18/11, Jan Høydahl / Cominvent [via Lucene] wrote:
>
>
> Hi,
>
> The reason why AND is default with /browse is that it uses the "dismax"
> query parser, which does not currently respect defaultOperator.
> If you want an OR like behaviour, try to add at the end of the url: &mm=0
> (which means minimum number of terms that should match=0), e.g.
> http://localhost:8983/solr/browse?q=samsung+maxtor&mm=0
>
> For more about mm, see
> http://wiki.apache.org/solr/DisMaxQParserPlugin#mm_.28Minimum_.27Should.27_Match.29
> NB: In trunk (4.0), even dismax will respect the defaultOperator from
> schema.
>
> --
> Jan Høydahl, search solution architect
> Cominvent AS - www.cominvent.com
> Solr Training - www.solrtraining.com


--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-change-default-operator-in-velocity-tp3430871p3431294.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: How to change default operator in velocity?

2011-10-18 Thread hadi
Thanks a lot, your answer is great.

On 10/18/11, Jan Høydahl / Cominvent [via Lucene] wrote:
>
>
> Rather than deleting the dismax config, I would recommend adding a new entry
> inside your "/browse" request handler config's <lst name="defaults"> tag:
>
> <str name="mm">0</str>
>
> This will go "OR" mode, and you will still benefit from all the advantages
> that DisMax gives you for weighted search across different fields. See
> http://wiki.apache.org/solr/DisMaxQParserPlugin to learn more about DisMax.
>
> --
> Jan Høydahl, search solution architect
> Cominvent AS - www.cominvent.com
> Solr Training - www.solrtraining.com


--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-change-default-operator-in-velocity-tp3430871p3433415.html
Sent from the Solr - User mailing list archive at Nabble.com.

How to update document with solrj?

2011-10-18 Thread hadi
I have indexed some files that have no tags or description, and I want to
add some fields without deleting them. How can I update or add information
to my indexed files with SolrJ?
My idea is to query for the specific file, delete it, add the information
and re-index it, but I don't think that is a good approach.


--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-update-document-with-solrj-tp3433434p3433434.html
Sent from the Solr - User mailing list archive at Nabble.com.


add thumbnail image for search result

2011-10-18 Thread hadi
I want to know how I can add a thumbnail image for my files when I am
indexing them with SolrJ.
Thanks


--
View this message in context: 
http://lucene.472066.n3.nabble.com/add-thumnail-image-for-search-result-tp3433440p3433440.html
Sent from the Solr - User mailing list archive at Nabble.com.


Update document field with solrj

2011-10-23 Thread hadi
I want to edit a document field in Solr, for example the author name, so I
use the following code in SolrJ:

params.set("literal.author", "anaconda");

But author is multiValued="true" in the schema, so "anaconda" does not
replace the previous name; it is appended to the end of the author values
instead.
If I remove the multiValued attribute or set it to false, a bad request
exception occurs when re-indexing the file with the new author field. How
can I solve this problem and delete or modify the previous document field
with SolrJ? Or is there some config I am missing in the schema? Thanks


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Update-document-field-with-solrj-tp3445488p3445488.html
Sent from the Solr - User mailing list archive at Nabble.com.


FTP mount crash when crawling with solrj

2011-12-12 Thread hadi
I have a lot of files in my FTP account. I use curlftpfs to mount them to a
folder and then index them with the SolrJ API, but after a few minutes
something strange happens: the mounted folder becomes inaccessible and
crashes. I also cannot unmount it; the message "device is in use" appears.
My SolrJ code is OK: I tested it with my local files and the result was
great, but indexing the mounted folder is a terrible problem. I have used
curlftpfs on CentOS, Fedora and Ubuntu, and the crash is the same on all of
them. How can I fix this problem? Is the problem in my code? Has anybody
ever faced this problem when indexing a mounted folder?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/FTP-mount-crash-when-crawling-with-solrj-tp3580982p3580982.html
Sent from the Solr - User mailing list archive at Nabble.com.


Is it possible to group Solr results by their domain?

2012-04-08 Thread hadi
I have crawled many sites with Nutch and use Solr 3.4 to browse the results,
but I want to group the results by their domain.
For example, if I search for a site such as "tabnak", the first result should
contain only http://tabnak.ir and the other results from that domain should
not be shown.

For example, I want to ignore results such as these (as Google does):


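Solr 3.3 and later offer result grouping (group=true&group.field=...), which
needs an indexed field holding one domain value per document. A minimal
sketch of deriving such a value from a page URL at index time; the class and
method names are hypothetical, and the Nutch schema may already carry a
suitable host field, so check it before adding a new one:

```java
import java.net.URI;
import java.util.Locale;

public class DomainKey {
    // Returns the lower-cased host of a URL, or "" when it cannot be parsed.
    static String hostOf(String url) {
        try {
            String host = URI.create(url).getHost();
            return host == null ? "" : host.toLowerCase(Locale.ROOT);
        } catch (IllegalArgumentException e) {
            // URI.create rejects malformed input such as strings with spaces.
            return "";
        }
    }

    public static void main(String[] args) {
        // The path below is made up; only the host matters for grouping.
        System.out.println(hostOf("http://tabnak.ir/fa/news/1")); // prints tabnak.ir
    }
}
```

With such a field in place, a query with group=true and group.field set to
that field should return one group per domain rather than every page.
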
--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-grouping-solr-results-by-their-domain-tp3895995p3895995.html
Sent from the Solr - User mailing list archive at Nabble.com.


Categorize the search results

2012-04-13 Thread hadi
Hi,

I am new to Solr. I crawled about 1000 news sites with Nutch and I use Solr
to browse the results, but I want to sort the sites into categories (sports
news, political news, science, etc.).
I know I have to use Solr faceting, but I do not know how to implement it,
or at least how to make Solr aware of my category fields.

Thanks

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Category-the-result-search-tp3909710p3909710.html
Sent from the Solr - User mailing list archive at Nabble.com.