Max physical size of document cache would be (size * avg byte size of a
document in the index). Is this correct ?
Thank you
-Rahul
Hi,
Is there any mechanism by which we can track and trend the incoming Solr
search requests ?
Some mechanisms like logging all incoming Solr requests to a different log
file than Tomcat's and have a tool to trend the patterns ?
--
Thanks and Regards
Rahul A. Warawdekar
dataSource attributes (autoCommit=true) ??
--
Thanks and Regards
Rahul A. Warawdekar
till don't see any highlighting. I've managed to get snippets of text
> but
> the actual word is not highlighted. I don't know where I am going wrong?
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Not-able-to-use-the-highlighting-feature-Want-to-return-snippets-of-text-Urgent-tp3985012p3985174.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
--
Thanks and Regards
Rahul A. Warawdekar
e.472066.n3.nabble.com/Not-able-to-use-the-highlighting-feature-Want-to-return-snippets-of-text-Urgent-tp3985012p3985212.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
--
Thanks and Regards
Rahul A. Warawdekar
ents.
>
> failure:
> Indexing completed. Added/Updated: 603378 documents. Deleted 0
> documents.
>
> thank you,
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/possible-status-codes-from-solr-during-a-DIH-data-import-process-tp3987110.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
--
Thanks and Regards
Rahul A. Warawdekar
Hi,
Turn the Solr logging level to "FINE" for the DIH packages/classes and they
will show up in the log.
http://:/solr//admin/logging
On Fri, Jun 1, 2012 at 9:34 AM, wangjing wrote:
> how to show DIH query's sql in log file for troubleshooting?
>
> thanks.
>
--
>
>
>
> schema.xml
>
>
>
> omitNorms="true"/>
> sortMissingLast="true" omitNorms="true"/>
>
>
>
> required="true" />
>
>stored="true"/>
>
>
>
>
>
>
> iddocumento
>
>
> nrodocumento
>
>
>
>
>
>
> solrconfig.xml
>
>
> LUCENE_36
>
> class="${solr.directoryFactory:solr.StandardDirectoryFactory}"/>
>
>
>
>
> 3
>
>
>
>
> multipartUploadLimitInKB="2048" />
>
>
> default="true" />
>
> class="org.apache.solr.handler.admin.AdminHandlers" />
> class="org.apache.solr.handler.dataimport.DataImportHandler">
>
>db-data-config.xml
>
>
>
>
>solr
>
>
>
>
> Thanks for your help.
>
> --
> Rafael Taboada
>
> /*
> * Phone >> 992 741 026
> */
>
--
Thanks and Regards
Rahul A. Warawdekar
Hi All,
This is Rahul and am using Solr for one of my upcoming projects.
I had a query regarding search term count using Solr.
We have a requirement in one of our search based projects to search the
results based on search term counts per document.
For eg,
if a user searches for something like
Please refer
http://wiki.apache.org/solr/MultipleIndexes
On 3/29/11, Amel Fraisse wrote:
> Hello every body,
>
> Is it possible to create 2 index within the same Solr server ?
>
> Thank you.
>
> Amel.
>
--
Thanks and Regards
Rahul A. Warawdekar
next document to be indexed ?
--
Thanks and Regards
Rahul A. Warawdekar
is hung without throwing any
>> exception
>> which in tuen causes the indexing to be hung on the server.
>>
>> Has anyone faced a similar kind of issue in the past with
>> TikaEntityProcessor ?
>>
>> Also, does someone know of a way to just skip this type
seems to fail when indexing attachments, even after appying a few Solr
fix patches.
My question is, Is the current multithreading feature stable in Solr
3.1 or it needs further enhancements ?
--
Thanks and Regards
Rahul A. Warawdekar
STICS_NORECOMPUTE = OFF, IGNORE_DUP_KEY =
> OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON) ON [PRIMARY]
> ) ON [PRIMARY]
>
> GO
>
>
>
> What am I doing wrong?
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Delta-import-issue-tp3162581p3162581.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
--
Thanks and Regards
Rahul A. Warawdekar
On Tue, Jul 12, 2011 at 11:34 AM, PeterKerk wrote:
> Hi Rahul,
>
> Not sure how I would do this "Try adding the primary key attribute to the
> root entity 'ad'"?
>
> In my entity ad I already have these fields (I left those out earlier for
> readabil
ferent
> handlers and so on.
>
> What is a good way to deal with this?
>
>
> *Pranav Prakash*
>
> "temet nosce"
>
--
Thanks and Regards
Rahul A. Warawdekar
t;
> "temet nosce"
>
>
>
> On Wed, Jul 11, 2012 at 1:49 PM, Rahul Warawdekar <
> rahul.warawde...@gmail.com> wrote:
>
> > Hi Pranav,
> >
> > If you are using Tomcat to host Solr, you can define your data source in
> > context.xml file under tom
we can implement fuzzy searches using Solr.
> > Can someone provide any links to any relevant documentation ?
>
>
--
Thanks and Regards
Rahul A. Warawdekar
t; >> --
> >> Regards,
> >> Rafał Kuć
> >> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch -
> ElasticSearch
> >>
> >> > Hi,
> >>
> >> > I need to know how we can implement fuzzy searches using Solr.
> >> > Can someone provide any links to any relevant documentation ?
> >>
> >>
>
>
>
--
Thanks and Regards
Rahul A. Warawdekar
; I need to know how we can implement fuzzy searches using Solr.
>> Can someone provide any links to any relevant documentation ?
>>
>
>
--
Thanks and Regards
Rahul A. Warawdekar
Indexing is not happening after 'x' documents.
I am using Bitnami and had upgraded Mysql server from Mysql 5.1.* to Mysql
5.5.* version. After up gradation when I ran indexing on solr, it not get
indexed.
I am using a procedure in which i am finding the parent of a child and
inserting it in a
Hi,
I am using mysql for solr indexing data in solr. I have two fields: "name"
and "college". How can I add auto suggest based on these two fields?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-Autocomplete-tp4013859.html
Sent from the Solr - User mailing list archive
?
- Will having a large number of properties defined as dynamic fields, reduce
performance ?
Thank you.
Regards
Rahul
VM.
I think I need to go back and check if I am not using all the fields in the
query. I understand that setting indexed=false alone will not ensure that
all fields don't participate in the query.
Thanks a lot for your response.
Regards
Rahul
On Fri, Jul 31, 2009 at 3:33 PM, Erik Hatcher wrot
to around 10 seconds.
This really helped. Thanks a lot !
Regards
Rahul
On Fri, Jul 31, 2009 at 6:34 PM, Erik Hatcher wrote:
>
> On Jul 31, 2009, at 7:17 AM, Rahul R wrote:
>
> Erik,
>> I understand that caching is going to improve performance. Infact we did a
>> PSR run wi
;Erik
>
>
> On Jul 31, 2009, at 10:00 AM, Rahul R wrote:
>
> In a production environment, having the caches enabled makes a lot of
>> sense.
>> And most definitely we will be enabling them. However, the primary idea of
>> this exercise is to verify if lim
eaks in my application ? Any thoughts, suggestions would be welcome.
Regards
Rahul
Philip,
I cannot answer your question, but I do have a question for you. Does
aggregation happen at the primary shard ? For eg : if I have three JVMs
JVM 1 : My application powered by Solr
JVM 2 : Shard 1
JVM 3 : Shard 2
I initialize my SolrServer like this
SolrServer _solrServer = *new* CommonsHt
the
request is first sent to the server (with which SolrServer is initialized)
and from there it is sent to all the other shards ?
Regards
Rahul
On Tue, Aug 4, 2009 at 2:29 PM, Shalin Shekhar Mangar <
shalinman...@gmail.com> wrote:
> On Tue, Aug 4, 2009 at 11:26 AM, Rahul R wrote:
&g
Shalin, thank you for the clarification.
Philip, I just realized that I have diverted the original topic of the
thread. My apologies.
Regards
Rahul
On Tue, Aug 4, 2009 at 3:35 PM, Shalin Shekhar Mangar <
shalinman...@gmail.com> wrote:
> On Tue, Aug 4, 2009 at 2:37 PM, Rahul
couldn't find anything of significance.
Regards
Rahul
On Tue, Aug 4, 2009 at 7:35 PM, Otis Gospodnetic wrote:
> Hi Rahul,
>
> A) There are no known (to me) memory leaks.
> I think there are too many variables for a person to tell you what exactly
> is happening, plus you are deali
s a try and see how it goes. Thanks for your input.
Rahul
On Wed, Aug 12, 2009 at 2:15 PM, Gunnar Wagenknecht
wrote:
> Rahul R schrieb:
> > I tried using a profiling tool - Yourkit. The trial version was free for
> 15
> > days. But I couldn't find anything of significanc
t least!)
>
>
> I didn't notice any memory leaks after I started to use 16Gb RAM for SOLR
> instance (almost a year without any restart!)
>
>
>
>
> -Original Message-
> From: Rahul R [mailto:rahul.s...@gmail.com]
> Sent: August-13-09 1:25 AM
> To: solr-u
Fuad,
We have around 5 million documents and around 3700 fields. All documents
will not have values for all the fields JRockit is not approved for use
within my organization. But thanks for the info anyway.
Regards
Rahul
On Tue, Aug 18, 2009 at 9:41 AM, Funtick wrote:
>
> BTW, you
All these 3700 fields are single valued non-boolean fields. Thanks
Regards
Rahul
On Wed, Aug 19, 2009 at 8:33 PM, Fuad Efendi wrote:
>
> Hi Rahul,
>
> JRockit could be used at least in a test environment to monitor JVM (and
> troubleshoot SOLR, licensed for-free for develope
.
Regards
Rahul
t is the case, then my
query is answered.
Thank you all.
Regards
Rahul
On Sun, Aug 23, 2009 at 7:16 AM, Lance Norskog wrote:
> Sorry, hit 'send' too soon. You can kill the servlet process, but it is
> much
> better to use the servlet container's shutdown protocol.
>
*"release any SOLR resources" - no need.*
My query is answered. Thank you.
Regards
Rahul
On Mon, Aug 24, 2009 at 12:32 AM, Fuad Efendi wrote:
> Truly correct:
>
> - SOLR does not create HttpSession for user access to Admin screens (do we
> have any other screens of UI?)
the cache)
- set intersection of above results (Will be done again even with filter
cache enabled)
Is there any flag/setting that I can enable to monitor how much time the
above operations take separately i.e. the querying and the set-intersection
?
Regards
Rahul
Thank you Martijn.
On Tue, Sep 1, 2009 at 8:07 PM, Martijn v Groningen <
martijn.is.h...@gmail.com> wrote:
> Hi Rahul,
>
> Yes you are understanding is correct, but it is not possible to
> monitor these actions separately with Solr.
>
> Martijn
>
> 2009/9/1 Rahul R
ike
The location however, I want to add it to the 'text' field programmatically.
I don't want to store the location as a separate field in the index. Can I
do this ?
Thank you.
Regards
Rahul
Would appreciate any help on this. Thanks
Rahul
On Mon, Sep 14, 2009 at 5:12 PM, Rahul R wrote:
> Hello,
> I have a few questions regarding the copyField directive in schema.xml
>
> 1. Does the destination field store a reference or the actual data ?
> If I have soemthing like
' from the program, will either one copy
(schema or program) not over-write the other ?
*Also note, that you don't really need to index/store the source field. You
can make the location field's type as ignored in the schema.*
Understood
Thank you for your response.
Regards
Rahul
Thank you Shalin.
Regards
Rahul
On Thu, Sep 17, 2009 at 11:49 AM, Shalin Shekhar Mangar <
shalinman...@gmail.com> wrote:
> On Thu, Sep 17, 2009 at 11:19 AM, Rahul R wrote:
>
> > Shalin,
> > Can you please elaborate a little more on the third response
> > *You
. For eg:
Now, will these dynamic fields have omitNorms=true for it ? I have read
about significant RAM usage when omitNorms is not set to true. Hence would
like to ensure that it is set to true for most of my fields.
Regards
Rahul
the above result was for a faceted query. I can't
understand why the FacetComponent would be zero. Any thoughts ?
Rahul
the QueryComponent
takes around 2.4s. I am doing a faceted and keyword query ie I have both 'q'
and 'fq' params in my query Thank you for your response.
Regards
Rahul
On Mon, Sep 28, 2009 at 1:20 AM, Yonik Seeley wrote:
> The response times in a Solr request don
878&facet.field=S9156&facet.field=S0369&facet.field=S9926&facet.field=S1421&facet.field=S8990&facet.field=S6881&facet.field=S3552&debugQuery=true
Other information
Solr 1.3, JDK 1.5.0_14
regards
Rahul
On Mon, Sep 28, 2009 at 6:48 PM, Yonik Seeley wrote:
> On Mon,
can look to improve accordingly. Thank you.
Regards
Rahul
On Tue, Sep 29, 2009 at 7:12 PM, Rahul R wrote:
> Sorry for the delayed response
> **
> *How big are your documents?*
> I have totally 1 million documents. I have totally 1950 fields in the
> index. Every document woul
Factory was
completely removed (both index and query), then the above problem was
resolved. But this seems too harsh a measure.
Is there a way by which I can prevent the WordDelimiterFilterFactory from
totally acting on numerical data ?
Regards
Rahul
Hello,
Would really appreciate any inputs/suggestions on this. Thank you.
On Tue, Nov 24, 2009 at 10:59 PM, Rahul R wrote:
> Hello,
> In our application we have a catch-all field (the 'text' field) which is
> cofigured as the default search field. Now this field will have
rds to "0", the way term texts are created
for ".355" does not change.
Thank you for your time.
Regards
Rahul
On Sun, Nov 29, 2009 at 1:07 AM, Steven A Rowe wrote:
> Hi Rahul,
>
> On 11/26/2009 at 12:53 AM, Rahul R wrote:
> > Is there a way by which I can
proper result
but when i create index table for large table...index is not propery
created..
and it does not return any result for search ...
what's the problem...
can any body help me out .
my data-config file looks like this..
--
Thanks and Regards
Rahul G.Brid
ImportHandler FAQ (
> http://wiki.apache.org/solr/DataImportHandlerFaq) the first issue mentions
> memory problems. Try adding the batchSize="-1" attribute to your datasource,
> it fixed the problem for me.
>
> Regards,
>
> gwk
>
--
Thanks and Regards
Rahul G.Brid
es not returns any result set..this
happens when database table has large number of rows
On Mon, Jan 12, 2009 at 9:17 AM, Rahul Brid wrote:
> Hi,thnx for the reply ...but an you tell me where to set this batchSize???
> in dataconfig.xml
>
>
> On Mon, Jan 12, 2009 at 8:48 AM, gwk w
you.
Regards
Rahul
the formula ?
Thank you.
Regards
Rahul
On Fri, May 21, 2010 at 3:03 PM, MitchK wrote:
>
> Rahul,
>
> the IndexSearcher of Solr gets shared with every request within two
> commits.
> That means one IndexSearcher + its caches got a lifetime of one commit.
> After every commit
solrServer = new EmbeddedSolrServer( coreContainer, coreName );
//Next two lines executed for every user
SolrIndexSearcher solrSearcher = solrCore.newSearcher("s1");
SolrRequestParsers solrRequestParsers = new SolrRequestParsers(solrConfig);
Many thanks for the response(s).
Regards
Thank you I found the API to get the existing SolrIndexSearcher to be
present in SolrCore:
SolrCore.getSearcher().get()
So if now the Index changes (a commit is done) in between, will I
automatically get the new SolrIndexSearcher from this call ?
Regards
Rahul
On Mon, May 24, 2010 at 11:25
red across multiple users (as statics) till the next update happens.
Thank you.
Regards
Rahul
On Tue, May 25, 2010 at 4:18 AM, Chris Hostetter
wrote:
>
> : Thank you I found the API to get the existing SolrIndexSearcher to be
> : present in SolrCore:
> : SolrCore.getSearcher().get()
&
I wrote something related to this topic a while ago.
https://www.google.com/amp/s/blog.anant.us/resources-for-monitoring-datastax-cassandra-spark-solr-performance/amp/
Rahul
On Aug 16, 2018, 3:35 PM -0700, Jan Høydahl , wrote:
> Check out the Reference Guide chapter on monitoring with o
” query.
Rahul Singh
Chief Executive Officer
m 202.905.2818
Anant Corporation
1010 Wisconsin Ave NW, Suite 250
Washington, D.C. 20007
We build and manage digital business technology platforms.
On Sep 3, 2018, 6:29 AM -0400, Emir Arnautović ,
wrote:
> Hi,
> The requirement is not 100% cl
waste of space.
Rahul Singh
Chief Executive Officer
m 202.905.2818
Anant Corporation
1010 Wisconsin Ave NW, Suite 250
Washington, D.C. 20007
We build and manage digital business technology platforms.
On Sep 11, 2018, 11:23 PM -0400, John Smith , wrote:
> On Tue, Sep 11, 2018 at 11:05 PM Wal
Depends on whether you are using Solr or solrcloud. Solrcloud distributes data
into shards so it increases overall capacity.
Rahul Singh
Chief Executive Officer
m 202.905.2818
Anant Corporation
1010 Wisconsin Ave NW, Suite 250
Washington, D.C. 20007
We build and manage digital business
https://github.com/bazaarvoice/jolt
On Thu, Sep 13, 2018 at 9:18 AM Joel Bernstein wrote:
> Solr Streaming Expressions allow you to do this with the cartesianProduct
> function:
>
>
> http://lucene.apache.org/solr/guide/7_4/stream-decorator-reference.html#cartesianproduct
>
> The structure of th
I am using SolrCloud 7.2.1. My understanding is that setting docvalues=true
would optimize faceting, grouping and sorting; but for a field to be
searchable it needs to be indexed=true. However I was dumbfounded today
when I executed a successful search on a field with below configuration:
However
What’s your update query?
You need to provide the unique id field of the document you are updating.
Rahul
On Mon, Nov 19, 2018 at 10:58 PM Rajeswari Kolluri <
rajeswari.koll...@oracle.com> wrote:
> Hi,
>
>
>
>
>
> Using Solr 7.5.0. While performing atomic upd
What is the Router name for your collection? Is it "implicit" (You can
know this from the "Overview" of you collection in the admin UI) ? If yes,
what is the router.field parameter the collection was created with?
Rahul
On Mon, Nov 19, 2018 at 11:19 PM Rajeswari Koll
particularly
functional for any industry size load anyway.
Thanks,
Rahul
On Tue, Nov 20, 2018 at 3:37 AM Toke Eskildsen wrote:
> On Mon, 2018-11-19 at 22:19 -0500, Rahul Goswami wrote:
> > I am using SolrCloud 7.2.1. My understanding is that setting
> > docvalues=true would optimize fac
Hello,
I am using SolrCloud on Solr 7.2.1.
I get the NullPointerException in the Solr logs (in ExportWriter.java) when
the /stream handler is invoked with a search() streaming expression with
qt="/export" containing fq="{!collapse field=id_field sort="time desc"}
(among other fq's. I tried elimina
Hello,
Following up on my query. I know this might be too specific an issue. But I
just want to know that it's a legitimate bug and the supported operation is
allowed with the /export handler. If someone has an idea about this and
could confirm, that would be great.
Thanks,
Rahul
On Thu, J
ve is coming from
documents not present in the same shard. I'll verify this tomorrow and
update the thread.
Thanks,
Rahul
On Mon, Jan 21, 2019 at 2:26 PM Joel Bernstein wrote:
> I haven't had time to look into the details of this issue but it's not
> clear that these two fea
mage.png]
Thanks,
Rahul
ink you need a
> screenshot here, what you describe is the default behaviour.
>
> --
> Jan Høydahl, search solution architect
> Cominvent AS - www.cominvent.com
>
> > 28. jan. 2019 kl. 09:05 skrev Rahul Goswami :
> >
> > Hello,
> > I am using Solr 7.2.1. I c
sc",fl="fileld1,field2,field3",qt="/export",q="*:*",fq="((field4:1)
OR (field4:2))",fq="{!collapse field=id_field sort='field3 desc'}")
The same query with "select" handler does return the collapse result fine.
Looks like this m
created post split?
Regards,
Rahul
On Wed, Jan 30, 2019 at 1:18 AM Rahul Goswami wrote:
> Thanks for the reply Jan. I have been referring to documentation for
> SPLISHARD on 7.2.1
> <https://lucene.apache.org/solr/guide/7_2/collections-api.html#splitshard>
> which
> see
47C-6673-4778-847D-2DE0FDE56C66_shard12_replica_n46]
org.apache.solr.update.PeerSync PeerSync:
core=DataIndex_1C6F947C-6673-4778-847D-2DE0FDE56C66_shard12_replica_n46 url=
http://indexnode1:2/solr too many updates received since start -
startingUpdates no longer overlaps with our currentUpdates
Thanks,
Rahul
our currentUpdates
Regards,
Rahul
On Thu, Feb 7, 2019 at 12:59 PM Erick Erickson
wrote:
> bq. We have a heavy indexing load of about 10,000 documents every 150
> seconds.
> Not so heavy query load.
>
> It's unlikely that changing numRecordsToKeep will help all that much if
> y
in Solr
to know whether a replica is falling behind from the leader ?
Thanks,
Rahul
On Mon, Feb 11, 2019 at 10:28 PM Erick Erickson
wrote:
> bq. To answer your question about index size on
> disk, it is 3 TB on every node. As mentioned it's a 32 GB machine and I
> allocated 24G
results, which in
turn has a cascading effect on other parts of the application. Is there a
setting in Solr which would prevent Solr from serving search requests
before log replay has finished?
Thanks,
Rahul
1
On Thu, Mar 7, 2019 at 11:36 PM Zheng Lin Edwin Yeo
wrote:
> Hi,
>
> Do you mean that when you startup Solr, it will automatically do the search
> request even before the Solr is fully started up?
>
> Regards,
> Edwin
>
>
> On Fri, 8 Mar 2019 at 10:13, Rahul Goswami
autoCommit interval (with openSearcher=false) is the number of
segments that would drastically increase, eventually causing merges,slower
searches etc.
Thanks,
Rahul
On Fri, Mar 8, 2019 at 12:08 PM Erick Erickson
wrote:
> Yes, you’ll get stale values. There’s no way I know of to change that,
>
;ll continue to monitor this for now.
Thanks,
Rahul
On Fri, Mar 8, 2019 at 2:14 PM Erick Erickson
wrote:
> (1) no, and Shawn’s comments are well taken.
>
> (2) bq. is the number of segments would drastically increase
>
> Not true. First of all, TieredMergePolicy will take care of m
on the data at
a certain fine granularity but don't want to index the data at such high
granularity for indexing performance reasons i.e. we need to keep the total
number of documents small.
Appreciate any help.
Regards,
Rahul Chhiber
ok at streaming expressions, looks interesting.
Regards,
Rahul Chhiber
-Original Message-
From: Atita Arora [mailto:atitaar...@gmail.com]
Sent: Tuesday, January 23, 2018 3:29 PM
To: solr-user@lucene.apache.org
Subject: Re: Using lucene to post-process Solr query results
Hi Rahul,
Looks
document
for each different value for that field in the same Solr document.
Regards,
Rahul
-Original Message-
From: Diego Ceccarelli (BLOOMBERG/ LONDON) [mailto:dceccarel...@bloomberg.net]
Sent: Tuesday, January 23, 2018 7:17 PM
To: solr-user@lucene.apache.org
Subject: Re: Using lucene to
Exactly. I want to validate each lucene document with the query and discard the
ones that don't match.
Regards,
Rahul
-Original Message-
From: Diego Ceccarelli (BLOOMBERG/ LONDON) [mailto:dceccarel...@bloomberg.net]
Sent: Tuesday, January 23, 2018 7:35 PM
To: solr
Hi,
Struggling to Import an XML containing an XSL transformation from
dataImport.
Do we need to run in Cloud mode for this ?
When I start solr in DIH mode, my other Cores are not visible.
1) My SolrConfig.XML has this:
rahul-data-config.xml
2) My rahul-data-config.xml
Hi Deepak,
Look at the score of your response results.
You can do this in Debug mode.
Rahul.
On Wed, Jan 31, 2018 at 4:18 AM, Deepak Udapudi wrote:
> Hi all,
>
> I have the below scenario in full name search that we are trying to
> implement.
>
> Solr configuration :-
may be more work but it’s more
scalable. Go big or go home. ;)
Hope it helps
--
Rahul Singh
rahul.si...@anant.us
Anant Corporation
On Mar 18, 2018, 11:14 AM -0400, Steven White , wrote:
> Hi everyone,
>
> I have a design problem that i"m not sure how to solve best so I figured I
&
Use a proxy server that only gives access to the update / select handlers
(URLs). Can do it with a numerous programming languages or with a simple proxy
in nginx.
The whole web server running SolR is not supposed to be out in the open. You
are opening yourself up to too many issues.
--
Rahul
Parallel processing in any way will help, including Spark w/ a DFS like S3 or
HDFS. Your three machines could end up being a bottleneck and you may need more
nodes.
On Mar 20, 2018, 2:36 AM -0500, LOPEZ-CORTES Mariano-ext
, wrote:
> CSV file is 5GB aprox. for 29 millions.
>
> As you say Christo
because the
updates / selects are fast.
Ultimately I think SolR is like a 18 wheel tractor trailer and Elastic is like
a uhaul trucks and you can chain a bunch of them up to do what SolR does.
--
Rahul Singh
rahul.si...@anant.us
Anant Corporation
On Mar 22, 2018, 9:04 AM -0500, Liu, Daphne
Maybe overthinking this. There is a “more like this” feature at basically does
this. Give that a try before digging deeper into the LTR methods. It may be
good enough for rock and roll.
--
Rahul Singh
rahul.si...@anant.us
Anant Corporation
On Mar 28, 2018, 12:25 PM -0400, Xavier Schepler
May need to extract outside SolR and index pure text with an external ingestion
process. You have much more control over the Tika attributes and behaviors.
--
Rahul Singh
rahul.si...@anant.us
Anant Corporation
On Apr 9, 2018, 10:23 PM -0400, Zheng Lin Edwin Yeo ,
wrote:
> Hi,
>
> Cu
How much data and what is the database source? Spark is probably the fastest
way.
--
Rahul Singh
rahul.si...@anant.us
Anant Corporation
On Apr 12, 2018, 7:28 AM -0400, Sujay Bawaskar , wrote:
> Hi,
>
> We are using DIH with SortedMapBackedCache but as data size increases we
> nee
If you want speed, Spark is the fastest easiest way. You can connect to
relational tables directly and import or export to CSV / JSON and import from a
distributed filesystem like S3 or HDFS.
Combining a dfs with spark and a highly available SolR - you are maximizing all
threads.
--
Rahul
CSV -> Spark -> SolR
https://github.com/lucidworks/spark-solr/blob/master/docs/examples/csv.adoc
If speed is not an issue there are other methods. Spring Batch / Spring Data
might have all the tools you need to get speed without Spark.
--
Rahul Singh
rahul.si...@anant.us
Anant Corpo
process can improve the overall stability of the SolR service.
--
Rahul Singh
rahul.si...@anant.us
Anant Corporation
On Apr 25, 2018, 12:49 PM -0400, Shawn Heisey , wrote:
> On 4/25/2018 4:02 AM, Lee Carroll wrote:
> > *We don't recommend using solr-cell for production indexing.*
>
pipeline.
Best,
--
Rahul Singh
rahul.si...@anant.us
Anant Corporation
On Apr 29, 2018, 6:27 AM -0700, Doug Turnbull
, wrote:
> Morphlines is a cloudera specific tool. I suspect moving Solr platforms
> will require you to rework your indexing somewhat. You may need to step
> back and think
101 - 200 of 267 matches
Mail list logo