I am wondering that the performance of facet of DocValued SortableText
Field is slower than non Docvalued String Field.
Does anyone know why?
Thanks,
Jae
Is non CDCR replication in SolrCloud still working in Solr 9.0?
Jae
Does anyone have the experience to use HugePage and UseLargePage? How much
can we get the performance benefits from utilizing it?
The disk is NOT SSD and the sole node has 256 GB. The Heap is 31.99 G.
Thanks,
Jae
just try
> with a subquery
>
> https://lucene.apache.org/solr/guide/6_6/transforming-result-documents.html#TransformingResultDocuments-_subquery_
>
> On Fri, Dec 11, 2020 at 3:31 PM Jae Joo wrote:
>
> > I have the requirement to create field - xyz to be returned based on the
&
I have the requirement to create field - xyz to be returned based on the
matched result.
Here Is the code .
XYZ:concat(
if(exists(query({!v='field1:12345'})), '12345', ''),
if(exists(query({!v='field1:23456'})), '23456', ''),
if(exists(query({!v='field1:34567'})), '34567', ''),
But if you’re
> missing some info, please feel free to comment (here or there), I could add
> some more info.
>
> Best regards,
> Radu
> --
> Sematext Cloud - Full Stack Observability - https://sematext.com
> Solr and Elasticsearch Consulting, Training and Production Support
&
Is there any way to apply facet to the partial search result?
For ex, we have 10m return by "dog" and like to apply facet to first 10K.
Possible?
Jae
Is "smart" really smarter than one explicitly defined?
For "emun" type, would it be faster to define facet.method=enum than smart?
Jae
Does increasing of Position Gap make Search Slow?
Jae
I forgot how to enter multivalued in Analysis Page in Admin.
Can anyone help?
Jae
I have timeAllowed=2000 (2sec) and having mostly 0 hits coming out. Should
I have more than 0 results?
Jae
HI,
Is there any way to stop the job running in Async mode?
Thanks,
I found it the root cause. I have 3 collections assigned to a alias and one
of them are NOT synched.
By the alias.
Collection 1
Collection 2
Collection 3
On Wed, Aug 12, 2020 at 7:29 PM Jae Joo wrote:
> Good question. How can I validate if the repli
Good question. How can I validate if the replicas are all synched?
On Wed, Aug 12, 2020 at 7:28 PM Jae Joo wrote:
> numFound is same but different score.
>
>
>
>
>
>
>
>
> On Wed, Aug 12, 2020 at 6:01 PM Aroop Ganguly
> wrote:
>
>> Try a simpl
ou may think is sync’d may actually not be. How do
> you validate correct sync ?
>
> > On Aug 12, 2020, at 10:55 AM, Jae Joo wrote:
> >
> > The replications are all synched and there are no updates while I was
> > testing.
> >
> >
> > On Wed, Aug
out of sync.
>
> > On Aug 12, 2020, at 10:47 AM, Jae Joo wrote:
> >
> > I have 10 collections in single alias and having different result sets
> for
> > every time with the same query.
> >
> > Is it as designed or do I miss something?
> >
> >
I have 10 collections in single alias and having different result sets for
every time with the same query.
Is it as designed or do I miss something?
The configuration and schema for all 10 collections are identical.
Thanks,
Jae
I have a cluster with 8 nodes for 24 shards with replicaFactor=3 and have
only 4 nodes have the leader replicas.
Is there any way to redistribute the lead nodes to cross of all 8 nodes?
Thanks,
ould be supplied at
> QueryComponent.doFieldSortValues(ResponseBuilder, SolrIndexSearcher).
>
> On Tue, Jul 7, 2020 at 4:19 PM Jae Joo wrote:
>
> > 8.3.1
> >
> > > required="true" multiValued="false" docValues="true"/>
> &
8.3.1
the field "id" is for nested document.
On Mon, Jul 6, 2020 at 4:17 PM Mikhail Khludnev wrote:
> Hi,
> What's the version? What's uniqueKey? is it stored? what's fl param?
>
> On Mon, Jul 6, 2020 at 5:12 PM Jae Joo wrote:
>
> > I
I am seeing the nullPointerException in the list below and I am
looking for how to fix the exception.
Thanks,
NamedList sortFieldValues =
(NamedList)(srsp.getSolrResponse().getResponse().get("sort_values"));
if (sortFieldValues.size()==0 && // we bypass merging this response
only if it's partial
putting original term in the synonym list works.
On Fri, May 8, 2020 at 1:05 PM atin janki wrote:
> Hi Jae,
>
> Do try to explain your problem with an example. Also share how you are
> writing the synonyms file.
> Best Regards,
> Atin Janki
>
>
> On Fri, May 8, 202
In 8.3, There should be the way to preserve the original terms, but could
not find it.
Does anyone know?
Thanks,
Jae
I have multiple 100 M documents using Nested Document for joining. It is
the fastest way for joining in a single replica. By adding more replicas (2
or 3), the performance is slow down significantly. (about 100x times).
Does anyone have same experience?
Jae
Found the root cause. I disabled the transaction log.
Thanks,
On Mon, Oct 19, 2015 at 1:07 PM, Jae Joo wrote:
> Solr Version " 5.3
>
> I just built the SoleCloud with 5 shards and 3 replicationfactor in 15
> nodes. It means that I have shards and replicas running in it's
Solr Version " 5.3
I just built the SoleCloud with 5 shards and 3 replicationfactor in 15
nodes. It means that I have shards and replicas running in it's own servers.
When I see the Cloud page, I see that the status of replica is
"recovery-failed".
For testing, I downed the leader, but a replica
BM25 though. You should upgrade to 5.3.
>
> Markus
>
> -Original message-
> > From:Jae Joo
> > Sent: Wednesday 9th September 2015 21:23
> > To: solr-user@lucene.apache.org
> > Subject: statsCache issue
> >
> > Solr Version: 5
Solr Version: 5.2.1
Container: Tomcat (still).
in SolrConfig.xml:
However, I see the class is not plugged in.
in log file:
org.apache.solr.core.SolrCore; Using default statsCache cache:
org.apache.solr.search.stats.LocalStatsCache
Any reason why?
Thanks,
Jae
I am having some issue regarding "start" and "End" position of token.
Here is the CharFilterFactory.
Then the input data is
1
In the Analysis page,
textraw_bytesstartendpositionLengthtypeposition
1[31]21311word1
Should the "end" position "22"? It breaks the Highlighting...
HTMLStripCharFilte
ver, for the term (without tags) p-xylene,
p-xylene --> 1
p --> 1
xylene --> 2
pxylene --> 3
Only difference I can see is the start and end position because of xml tag.
Does any one know why?
Thanks,
Jae Joo
Any way to apply different edismax parameter to field by field?
For ex.
q=keywords:(lung cancer) AND title:chemotherapy
I would like to apply different qf for fields, keywords and title.
f.keywords.qf=keywords^40 subkeywords^20
f.title.qf=title^80 subtitle^20
I know it can be done by field aliasi
In the synonym file,
antigravity, anti gravity
In the analysis, I see the position of anti is 1 and gravity is 2.
Is there any way to keep postions of "anti" and "gravity" to 1?
And any ways to configure or define to have synonym "anti gravity" rather
than "anti" and "gravity" for "antigravity"
If I use a combined query - range query and others (term query), all terms
in field matched is highlighted. Any way to highlight only the term(s) in
term query?
Here is example.
+date:{20031231 TO *] +(title:red)
It highlight all terms except stopword.
using fq would not be an option because th
I have a synonyms.txt file which has
20th,twentieth
Once I apply the synonym, I see "20th", "twentieth" and "20" for "20th".
Does anyone know where "20" comes from? How can I have only "20th" and
"twentieth"?
Thanks,
Jae
Thank you.
However, fq is already in use.
In my opinion, it is to think that it might be slow data of 70 million
reviews is contained in the core of one, but do you have examples of
performance of a certain number or more may decrease maybe?
--
View this message in context:
http://lucene.472066
Search speed at which data is loaded is more than 7 ten millon current will
be reduced too.
About 50 seconds it will take, but the number is often just this, it is not
possible to know whether such.
Will there is a problem with the Query I use it to know the Query Optimizing
Solr and fall.
The Quer
Indexed properties in a constant field current to the field, I did not give
the search.
indexed attribute is set to true by default, does not turn you on?
--
View this message in context:
http://lucene.472066.n3.nabble.com/without-the-indexed-property-is-set-to-true-by-default-tp4060973.html
Se
Hi,
I am looking for the way to retrieve a ranking (or position) of the
document matched in the result set.
I can get the data, then parse it to find the position of the document
matched, but am looking for the way if there is a feature.
Thanks,
Jae
Hi,
Is NRT in Solr 4.0 from trunk? I have checkouted from Trunk, but could not
find the configuration for NRT.
Regards
Jae
ring from
> > distributed IDF issues.
> >
> > Upayavira
> >
> > On Wed, 02 Mar 2011 10:10 -0500, "Jae Joo" wrote:
> > > Is there still issue regarding distributed idf in sharding environment
> in
> > > Solr 1.4 or 4.0?
> > >
Is there still issue regarding distributed idf in sharding environment in
Solr 1.4 or 4.0?
If yes, any suggestions to resolve it?
Thanks,
Jae
Hi,
I am implementing spatial search and found some odd things. As I know that
the returning distance is still being implemented, so I have implement
algorithm to calculate the actual distance based on lat and long returned.
when I do it, I have found the sort is not working properly. Any thing I
All,
I have two indices - one has 23 M document and the other has less than 1000.
The small index is for real time update.
Does updating small index (with commit) hurt the overall performance?
(We can not update realtime for 23M big index because of heavy traffic and
size).
Thanks,
Jae Joo
Hi,
Here is the string to be indexed without duplication.
Kitchen Cabinet Utah Kitchen Remodeling Utah
Is RemoveDuplicatesTokenFilterFactory for this solution? or for something
else?
Jae
Hi,
I would like to make the synonym for internal medicine to physician or
doctor. but it is not working properly. Anyone help me?
synonym.index.txt
internal medicine => physician
synonyms.query.txt
physician, internal medicine => physician, doctor
In the Analysis tool, I can see clearly that
I have found that the stamming in solr 1.2 and 1.3 is different for
"communication". We have index built in Solr 1.2 and the index is being
queried by 1.3. Is there any way to adjust it?
Jae joo
Hi,
Is there any way to join multiple indexes in Solr?
Thanks,
Jae
Hi,
I would like to know how it can be implemented.
Index1 has fields id,1,2,3 and index2 has fields id,5,6,7.
The ID in both indexes are unique id.
Can I use "a kind of " distributed search and/or multicore to search, sort,
and facet through 2 indexes (index1 and index2)?
Thanks,
Jae joo
Hi,
We do have 16 millions of company name and would like to find the way for
"prefetching" by using Solr.
Does anyone have experience and/or suggestions?
Thanks,
Jae Joo
;spellcheck=true&spellcheck.collate=true
Can any one help me?
Thanks,
Jae Joo
I do have same experience.
What is the CPU in the Solaris box? it is not depending on the operating
system (linux or Solaris). It is depenong on the CPU (Intel ro SPARC).
Don't know why, but based on my performance test, SPARC machine requires
MORE memory for java application.
Jae
On Thu, Dec 4,
In the dataimport.properties file, there is the timespamp.
#Thu Dec 04 15:36:22 EST 2008
last_index_time=2008-12-04 15\:36\:20
I am using the Oracle (10g) and would like to know which format of timestamp
I have to use in Oracle.
Thanks,
Jae
I actually found the problem. Oracle returns the field name as "Capital".
On Tue, Dec 2, 2008 at 1:57 PM, Jae Joo <[EMAIL PROTECTED]> wrote:
> Hey,
>
> I am trying to connect the Oracle database and index the values into solr,
> but I ma getting the
> "Docume
Hey,
I am trying to connect the Oracle database and index the values into solr,
but I ma getting the
"Document [null] missing required field: id".
Here is the debug output.
1
2
0
2008-12-02 13:49:35
−
Indexing completed. Added/Updated: 0 documents. Deleted 0 documents.
schema.xml
id
>
> I am having some trouble to utilize the facet Query. As I know that the
> facet Query has better performance that simple query (q).
> Here is the example.
>
>
> http://localhost:8080/test_solr/select?q=*:*&facet=true&fq=state:CA&facet.mincount=1&facet.field=city&facet.field=sector&facet.limit=-
I am having some trouble to utilize the facet Query. As I know that the
facet Query has better performance that simple query (q).
Here is the example.
http://localhost:8080/test_solr/select?q=*:*&facet=true&fq=state:CA&facet.mincount=1&facet.field=city&facet.field=sector&facet.limit=-1&sort=score+
Here is what I am doing to check the memory statues.
1. Run the Servelt and Solr application.
2. On command prompt, jstat -gc 5s (5s means that getting data every 5
seconds.)
3. Watch it or pipe to the file.
4. Analyze the data gathered.
Jae
On Tue, Oct 21, 2008 at 9:48 PM, Willie Wong <[EMAIL P
If there any way to mix dismax and standard query type?
Here is the example..
(wendys columbus) AND state:Georgia
wendys columbus --> qt=dismax
Thanks,
Jae Joo
.com/ -- Lucene - Solr - Nutch
>
>
> ----- Original Message
> > From: Jae Joo <[EMAIL PROTECTED]>
> > To: solr-user@lucene.apache.org
> > Sent: Thursday, May 15, 2008 9:07:54 AM
> > Subject: Multi core weight
> >
> > Hi,
> >
> > I am lo
Hi,
I am looking for "the best or possible" way to add WEIGHT to each core in
multi core environment.
core 1 has about 10 millions articles from same publisher and core 2 and 3
have less than 10k.
I would like to have BALANCED Query result - ex. 10 from core 1, 10 from
core 2 and 10 from core 3..
I do have a same situation. Got 30 million indexed and deleted 3
millions.
DELETE can not be posted as same way as "ADD". We can add multiple
documents in the file, but not for DELETE.
If there is "RANGE" of ID, make the range first then delete the record
in index by
{URL} id:[xxx TO yyy].
T
You can adjust the performance of indexing by configuring of these parameters.
false
10
1000
2147483647
1
Jae
-Original Message-
From: Britske [mailto:[EMAIL PROTECTED]
Sent: Sat 4/5/2008 10:09 AM
To: solr-user@lucene.apache.org
Subject: indexing slow,
Finding the way how to sort by internal_docid desc.
Thanks,
Jae
On Tue, Mar 18, 2008 at 11:41 AM, Jae Joo <[EMAIL PROTECTED]> wrote:
> Is there any way to sort by index id - descending? (by order of indexed)
>
> Thanks,
> Jae
>
Is there any way to sort by index id - descending? (by order of indexed)
Thanks,
Jae
I have 30 millions document indexed and tried sort by "sequenceid" which is
unique over the document.
I am experiencing "very slow" than sort by pub_date.
sequenceid is not defined as "unique key" in the schema.xml and there is the
"unique key" defined in schema.xml - item_id.
Anyone knows why?
T
Hi,
I have many document to be deleted and the xml file is built shown as below.
delete.xml
id:0286-14582373
id:0286-14582372
id:0286-14582371
id:0286-14582415
id:0286-14582414
id:0286-14582413
id:0286-14582412
id:0286-14582411
id:0286-14582410
id:0286-14582409
id:0286-14582408
Once I post it
While the job is running, you can monitor the memory usage.
Use the following command - jstat (you can find in the java/bin directory)
jstat -gc 5s --> every 5 seconds.
Jae
-Original Message-
From: Reece [mailto:[EMAIL PROTECTED]
Sent: Mon 3/3/2008 8:20 PM
To: solr-user@lucene.apache.
As I know that tomcat is the server to support servlet and jsp and solr is just
one of application of tomcat.
So, theer is no meaning of port# of Solr.
Thanks
Jae
Hi All,
I have installed solr through tomcat(5.5.23). Its up and running on port
8080. Its like, if tomcat is running, solr i
In my environment, there is NO big difference between local disk and SAN based
file system.
A little slow down, but not a problem (1 or 2 %)
I do have 4 sets of solr indices each has more than 10G in 3 servers.
I think that it is not good way to share SINGLE Index. - disk is pretty cheap
and we
Or, if you have multiple files to be updated, please make sure "Index
Multiple Files" and commit "Once" at the end of Indexing..
Jae
-Original Message-
From: Jae Joo [mailto:[EMAIL PROTECTED]
Sent: Tuesday, February 12, 2008 10:50 AM
To: solr-user@lucene.apache.org
I have same experience.. I do have 6.5G Index and update it daily.
Have you ever check that the updated file does not have any document and
tried "commit"? I don't know why, but it takes so long - more than 10
minutes.
Jae Joo
-Original Message-
From: Ken Krugler [mailto:[
I have downloaded version 1.3 and built multiple indices.
I could not find any way for multiple indices search at Solr level, I
have written the Lucene application. It is working well.
Jae Joo
-Original Message-
From: Niveen Nagy [mailto:[EMAIL PROTECTED]
Sent: Monday, February 04
In the firstsearch listner, I need to use special character "&" in the q
string, but it complains "Error - filterStart"
company_desc:"Advertising & Marketing"
0
20
company_name, score
Thanks,
Jae Joo
.
Thanks,
Jae joo
I have a problem - memory and performance issues for more than 10 request
(solr Search and Facet) per second.
On tomcat, it requires 4 to 5 G Bytes, but still not enough.
Does anyone have any experience regarding high volume and performance issue
on Tomcat and JBOss and resolutions share with me?
I have built two cores - core0 and core1.
each core has different set of index.
I can access core0 and core 1 by
http://localhost:8983/solr/core[01]/admin/form.jsp.
Is there any way to access multiple indexes with single query?
Thanks,
Jae
I have set multicores - core0 and core1, core0 is default.
Once I update the index by http://localhost:8983/solr/update, it updates
core1 not core0.
Also, I tried to set the deault core using SETASDEFAULT, but it is "unknown
action command".
Can any one help me?
Thanks,
Jae
If the number of results > 2500 then sort by company_name
otherwise, sort by revenue;
Do I have to access 2 times? One is to get the number of results and the
other one is for sort.
The second query should be accessed by necessary.
Any efficient way?
Thanks,
Jae
Hi,
What happens if Solr application hit the max. memory of heap assigned?
Will be die or just slow down?
Jae
and document2?
Jae
On Jan 4, 2008 10:54 AM, Robert Young <[EMAIL PROTECTED]> wrote:
> I don't quite understand what you're getting at. What is the problem
> you're encountering or what are you trying to achieve?
>
> Cheers
> Rob
>
> On Jan 4, 2008 3:26 PM
Hi,
Is there any way to dedup the keyword cross the document?
Ex.
"china" keyword is in doc1 and doc2. Will Solr index have only 1 "china"
keyword for both document?
Thanks,
Jae Joo
try it.
/search/replication_test/0/index/solr/bin/snapshooter
.
true
Jae
On Dec 19, 2007 9:10 AM, Bill Au <[EMAIL PROTECTED]> wrote:
> Just changing the permission on the script is not enough. The id
> executing
> the script needs to have write permission to create the snapshot
Is there any parameter to set the max. number of error messages..
The Solr system was killed after a couple of error messages which caused by
WRONG QUERY
Thanks,
Jae
Hi,
I have about 20G bytes of index with 1 Million transactions per day.
I am considering the disk system between local disk and SAN based system
(not NFS).
Is there any performance difference to run solr instance with 20 G index on
local disk and on SAN based disk which is connected with fiber ch
Hi,
Can anyone help me how to facet and/or search for associated fields? -
1234
Baseball hall of Fame opens Jackie Robinson
exhibit
Description about the new JR hall of fame
exhibit.
20071114
200711
0
press
Sports
Baseball
Major League Baseball
Arts and Culture
C
Hi,
I am looking for the way to get the score - only hundredth - ex.
4.09something like that.
Currently, it has 7 decimal digits. 1.8032384
Thanks,
Jae
In the snappuller, the "solr" is hardcoded. Should it be
"${master_data_dir}?
# rsync over files that have changed
rsync -Wa${verbose}${compress} --delete ${sizeonly} \
${stats} rsync://${master_host}:${rsyncd_port}/solr/${name}/
${data_dir}/${name}-wip
Thanks,
Jae
est one and it works.
>
> -Original Message-
> From: Jae Joo [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, November 13, 2007 9:06 AM
> To: solr-user
> Subject: two solr instances - index and commit
>
> Hi,
>
> I have two solr instance running under different tomcat envi
Hi,
I have two solr instance running under different tomcat environment.
One solr instance is for indexing and would like to commit to the other solr
instance.
This is what I tried, but failed.
using post.sh (without commit), the docs are indexed in solr-1 instance.
After indexed,
call commit co
all files same as index directory.
I have changed the snapshooter script because the bash in solaris do not
have cp -l option.
#cp -lr ${data_dir}/index ${temp} --> original
mkdir ${temp}
ln ${data_dir}/index/* ${temp}
Thanks,
Jae Joo
Hi,
Is there Solaris bash based script available? The couple of command is not
working, and wondering any available scripts I can use before I update it.
For ex. snapshooter, snappuller, snapinstaller
Thanks,
Jae
her. when indexing is done
> (or finishes a big chunk) send to the 'searching' one and it
> will see the new stuff.
>
> ryan
>
>
>
> Jae Joo wrote:
> > Here is my situation.
> >
> > I have 6 millions articles indexed and adding about 10k
one is for update. Once update is done, switch the index by
automatically and/or my application.
Thanks,
Jae joo
On Nov 12, 2007 8:48 AM, Ryan McKinley <[EMAIL PROTECTED]> wrote:
> The advantages of a multi-core setup are configuration flexibility and
> dynamically changing avail
In my system, the heap size (old generation) keeps growing up caused by
heavy traffic.
I have adjusted the size of young generation, but it does not work well.
Does anyone have any recommendation regarding this issue? - Solr
configuration and/or web.xml ...etc...
Thanks,
Jae
Hi,
I am looking for the way to utilize the multiple indexes for signle sole
instance.
I saw that there is the patch 215 available and would like to ask someone
who knows how to use multiple indexes.
Thanks,
Jae Joo
did something wrong and/or does Solr use multiple threading or
something else to get the good indexing performance?
Thanks
Jae Joo
Hi,
I am trying to delete the document remotly through curl command, but got the
internal server error - Permission Denied.
Anyone knows how to solve this problem?
Thanks,
Jae
..
and I am looking the way to update the specific field by
12345
Y
--> it may keep the unchanged field.
Thanks,
Jae Joo
Hi,
I have 9g index and try to delete a couple of document. The actual deletion
is working fine.
Here is my question.
Do I have to OPTIMIZE the index after deleting? or just COMMIT it? The
original index already optimized.
Thanks,
Jae Joo
Do I have to define the name/values as exactly same as acturl query
(order...)?
Here is actual query
indent=on&version=2.2&facet=true&facet.mincount=1
&facet.field=phys_state&facet.field=sic1&facet.limit=-1
&sort=sales_volume_us+desc&q=%28phys_country%3A%22United+States%22%29
&start=0&rows=20&fl
1 - 100 of 136 matches
Mail list logo