.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4221066.html
Sent from the Solr - User mailing list archive at Nabble.com.
@Mikhail If I use the Data Import Handler and define my baseDir as
D:/work/folder, will it also work for sub-folders, sub-folders of
sub-folders, and so on?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4221063.html
Sent
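On the sub-folder question: within DIH, FileListEntityProcessor has a
recursive attribute for descending into sub-directories. If the directory
walk is done on the client side instead, a minimal sketch could look like
the one below. The function name iter_files and the .pdf suffix filter are
assumptions made for illustration; they do not come from the thread.

```python
import os


def iter_files(base_dir, suffix=".pdf"):
    """Yield paths of files under base_dir, at any depth, ending with suffix."""
    # os.walk visits base_dir, its sub-folders, their sub-folders, and so on.
    for dirpath, _dirnames, filenames in os.walk(base_dir):
        for name in sorted(filenames):
            # Case-insensitive match so that "X.PDF" is picked up as well.
            if name.lower().endswith(suffix):
                yield os.path.join(dirpath, name)
```

Because this is a generator, the full list of 40 million paths is never held
in memory at once; paths can be consumed and sent to Solr in batches.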
interval.
Will it take the same time compared with HTTP or bin/post?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4221060.html
Sent from the Solr - User mailing list archive at Nabble.com.
o we ensure that the data is correctly stored in Solr ?
>
> Or is XML the correct way to parse it?
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4221051.html
> Sent from the Solr - User mailing list archive at Nabble.com.
.griddynamics.com/2015/07/how-to-import-structured-data-into-solr.html
I have googled but could not find anything that matches this kind of requirement.
Please provide me with a link for it, or some suggestion on how to do it.
On Tue, Aug 4, 2015, at 06:13 PM, Mugeesh Husain wrote:
> @Upayavira If I use SolrJ for indexing, will autoCommit or autoSoftCommit
> work with SolrJ?
There are two ways to get content into Solr:
* push it in via an HTTP post.
- this is what SolrJ uses, what bin/post uses, and every
@Upayavira If I use SolrJ for indexing, will autoCommit or autoSoftCommit
work with SolrJ?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4220796.html
Sent from the Solr - User mailing list archive at Nabble.com.
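To illustrate the "push it in via an HTTP post" route mentioned in the reply:
autoCommit and autoSoftCommit are configured on the Solr side (in
solrconfig.xml), so they apply whichever client does the posting. A rough
sketch of what such a post looks like follows. The base URL, the core name
"files" and the helper name build_update_request are assumptions for
illustration only; the /update endpoint and the commitWithin parameter are
standard Solr.

```python
import json
import urllib.request


def build_update_request(base_url, core, docs, commit_within_ms=None):
    """Build (but do not send) a JSON update request for a Solr core."""
    url = f"{base_url.rstrip('/')}/{core}/update"
    if commit_within_ms is not None:
        # Ask Solr to commit within the given number of milliseconds,
        # rather than issuing an explicit commit for every batch.
        url += f"?commitWithin={int(commit_within_ms)}"
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Sending it is then urllib.request.urlopen(req) against a running Solr.
Posting documents in batches (say, a few thousand per request) and leaving
commits to autoCommit or commitWithin is usually kinder to the server than
committing after every document.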
.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4220793.html
Sent from the Solr - User mailing list archive at Nabble.com.
> understanding of DIH for the kind of operation that my
> requirement needs. I googled but could not find a DIH example that I
> can apply to my problem.
ement. I googled but could not find a DIH example that I
can apply to my problem.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4220673.html
Sent from the Solr - User mailing list archive at Nabble.com.
e. I am repeating my
> > requirement.
> > > >
> > > > I have 40 million files stored in a file system,
> > > > with filenames such as ARIA_SSN10_0007_LOCATION_129.pdf
> > > >
> > > > I just split all the values fro
om the filename only; these values are what I have to
> index.
> > >
> > > I want to index these values in Solr, not the file contents.
> > >
> > > I have tested DIH against a file system and it works fine, but I don't know how
> > > I can implement my code i
tem and it works fine, but I don't know how
> > I can implement my code in DIH.
> > If my code extracts some values, how can I index them using DIH?
> >
> > If I use DIH, how do I perform the split operation and get the values from
> > it?
.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4220552.html
Sent from the Solr - User mailing list archive at Nabble.com.
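For the split operation itself, done outside DIH (for example in a small
client program that then posts to Solr), a minimal sketch is shown below.
The thread does not say what each part of the filename means, so the field
names part1_s, part2_s, ... are hypothetical, as is the use of the full
filename as the document id. The _s suffix assumes a schema with a string
dynamic field of that pattern.

```python
import os


def filename_to_doc(path):
    """Turn a filename like ARIA_SSN10_0007_LOCATION_129.pdf into a flat dict."""
    name = os.path.basename(path)
    # Drop the extension, then split the remainder on underscores.
    stem, _ext = os.path.splitext(name)
    parts = stem.split("_")
    # Hypothetical choice: use the whole filename as the unique id.
    doc = {"id": name}
    for i, value in enumerate(parts, start=1):
        # Hypothetical field names; replace with the real meaning of each part.
        doc[f"part{i}_s"] = value
    return doc
```

Only the filename is read here; the file contents are never opened, which
matches the requirement of indexing the values and not the text.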
on which way I should start on my
requirement.
Please tell me: you said yes, but is that yes for SolrJ or for DIH?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4220550.html
Sent from the Solr - User mailing list archive at
these values have
> to be indexed in Solr.
> 2.) I do not need to index the file contents (text).
>
> You told me "The answer is Yes", but I didn't understand in which sense you meant yes.
>
> Thanks
"The answer is Yes", but I didn't understand in which sense you meant yes.
Thanks
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4220527.html
Sent from the Solr - User mailing list archive at Nabble.com.
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p4220469.html
> Sent from the Solr - User mailing list archive at Nabble.com.
suggest which way I should do it.
1.) Should I use SolrJ?
2.) Should I use DIH?
3.) Should I use the post method (in a terminal)?
Or is there any other way to index this amount of data?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large
ent: Tuesday, January 17, 2012 12:15 AM
>Subject: Re: Can Apache Solr Handle TeraByte Large Data
I've been toying with the idea of setting up an experiment to index a large
document set 1+ TB -- any thoughts on an open data set that one could use
for this purpose?
Thanks.
On Mon, Jan 16, 2012 at 5:00 PM, Burton-West, Tom wrote:
Hello,
Searching real-time sounds difficult with that amount of data. With large
documents, 3 million documents, and 5TB of data the index will be very large.
With indexes that large your performance will probably be I/O bound.
Do you plan on allowing phrase or proximity searches? If so, you
Hello,
>
> From: mustafozbek
>
>All documents that we use are rich text documents and we parse them with
>Tika. We need to search in real time.
Because of the real-time requirement, you'll need to use an unreleased/dev version of
Solr.
>Robert Stewart wrote
>> Any idea
Hello,
Inline
- Original Message -
> From: mustafozbek
>
> I have been an Apache Solr user for about a year. I have used Solr for simple search tools,
> but now I want to use Solr with 5TB of data. I assume that the 5TB of data will become
> 7TB once Solr indexes it, given the filters that I use. And then I will a
>>> a- How many shards should I use?
>>> b- Should I use Solr cores?
>>> c- What commit frequency do you suggest? (Is 1 hour OK?)
>>> 3- Are there any test results for data this large?
>>
>> There is no 5TB data set available; I just want to estimate what the
>> result will be.
>> Note: You can assume that hardware resources are not a problem.
>>
>> --
>> View this message in context:
>> http://lucene.472066.n3.nabble.com/Can-Apache-Solr-Handle-TeraByte-Large-Data-tp3656484p3656484.html
>> Sent from the Solr - User mailing list archive at Nabble.com.