date:20191221

[jira] [Commented] (SOLR-14095) Remove serialization and/or support serialization filtering

2019-12-21 Thread Ishan Chattopadhyaya (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001635#comment-17001635
 ] 

Ishan Chattopadhyaya commented on SOLR-14095:
-

bq. Noble Paul, Ishan Chattopadhyaya, I took another look today. I still feel 
there is no way to use Json here unless we build a full serialization that 
retains types correctly. I feel Javabin here is less risky compared to Json, so 
I want to take that route and I want to do it soon to unblock the security 
manager work.

+1. Thanks Tomas.

> Remove serialization and/or support serialization filtering
> ---
>
> Key: SOLR-14095
> URL: https://issues.apache.org/jira/browse/SOLR-14095
> Project: Solr
>  Issue Type: Task
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Robert Muir
>Priority: Major
> Attachments: SOLR-14095-json.patch, json-nl.patch
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Removing the use of serialization is greatly preferred.
> But if serialization over the wire must really happen, then we must use JDK's 
> serialization filtering capability to prevent havoc.
> https://docs.oracle.com/javase/10/core/serialization-filtering1.htm#JSCOR-GUID-3ECB288D-E5BD-4412-892F-E9BB11D4C98A



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-8876) Morphlines fails with "No command builder registered for ..." when using Java 9 due to morphline "importCommands" config option attempting to resolve classname globs

2019-12-21 Thread sdhalex (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-8876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001654#comment-17001654
 ] 

sdhalex commented on SOLR-8876:
---

Excuse me,  has this bug been fixed in the newest version?  Or need we still do 
the same workaround to avoid this bug now ? Thx. [~hossman][~uschindler]






> Morphlines fails with "No command builder registered for ..." when using Java 
> 9 due to morphline "importCommands" config option attempting to resolve 
> classname globs
> -
>
> Key: SOLR-8876
> URL: https://issues.apache.org/jira/browse/SOLR-8876
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - MapReduce, contrib - morphlines-cell, contrib 
> - morphlines-core
>Reporter: Uwe Schindler
>Assignee: Chris M. Hostetter
>Priority: Major
>  Labels: Java9
> Fix For: 6.5, 7.0
>
> Attachments: SOLR-8876.patch
>
>
> When running Solr in java9, and using the morphlines contrib(s) users may 
> encounter vague errors such as...
> {noformat}
> No command builder registered for COMMAND_NAME
> {noformat}
> This error comes directly from the morphlines code, and relates to the use of 
> wildcards in the {{importCommands}} declaration of of morphlines {{\*.conf}} 
> files used -- for example...
> {noformat}
> importCommands : ["org.kitesdk.**", "org.apache.solr.**"]
> {noformat}
> Using wildcards like {{\*}} and {{\*\*}} in morphline's {{importCommands}} 
> config options do not work in java9 due to changes in the underlying JVM 
> classloader.
> This issue is tracked up stream in: 
> https://github.com/kite-sdk/kite/issues/469
> 
> *WORK AROUND*
> The workaround is to only use fully qualified command class names in 
> {{importCommands}} declaration, one for each distinct command used in that 
> {{conf}} file.
> Example:
> {noformat}
> # Old config, does not work in java9
> # importCommands : ["org.kitesdk.**", "org.apache.solr.**"]
> # replaced with...
> # using globs (foo.bar.* or foo.bar.**) will not work in Java9 due to 
> classpath scanning limitations
> # so we enumarate every command (builder) we know this config uses below. 
> (see SOLR-8876)
> importCommands : ["org.kitesdk.morphline.stdlib.LogDebugBuilder",
>   
> "org.apache.solr.morphlines.solr.SanitizeUnknownSolrFieldsBuilder",
>   "org.apache.solr.morphlines.solr.LoadSolrBuilder"]
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-8876) Morphlines fails with "No command builder registered for ..." when using Java 9 due to morphline "importCommands" config option attempting to resolve classname globs

2019-12-21 Thread sdhalex (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-8876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001655#comment-17001655
 ] 

sdhalex commented on SOLR-8876:
---

[~sdhalex] gogogo

> Morphlines fails with "No command builder registered for ..." when using Java 
> 9 due to morphline "importCommands" config option attempting to resolve 
> classname globs
> -
>
> Key: SOLR-8876
> URL: https://issues.apache.org/jira/browse/SOLR-8876
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - MapReduce, contrib - morphlines-cell, contrib 
> - morphlines-core
>Reporter: Uwe Schindler
>Assignee: Chris M. Hostetter
>Priority: Major
>  Labels: Java9
> Fix For: 6.5, 7.0
>
> Attachments: SOLR-8876.patch
>
>
> When running Solr in java9, and using the morphlines contrib(s) users may 
> encounter vague errors such as...
> {noformat}
> No command builder registered for COMMAND_NAME
> {noformat}
> This error comes directly from the morphlines code, and relates to the use of 
> wildcards in the {{importCommands}} declaration of of morphlines {{\*.conf}} 
> files used -- for example...
> {noformat}
> importCommands : ["org.kitesdk.**", "org.apache.solr.**"]
> {noformat}
> Using wildcards like {{\*}} and {{\*\*}} in morphline's {{importCommands}} 
> config options do not work in java9 due to changes in the underlying JVM 
> classloader.
> This issue is tracked up stream in: 
> https://github.com/kite-sdk/kite/issues/469
> 
> *WORK AROUND*
> The workaround is to only use fully qualified command class names in 
> {{importCommands}} declaration, one for each distinct command used in that 
> {{conf}} file.
> Example:
> {noformat}
> # Old config, does not work in java9
> # importCommands : ["org.kitesdk.**", "org.apache.solr.**"]
> # replaced with...
> # using globs (foo.bar.* or foo.bar.**) will not work in Java9 due to 
> classpath scanning limitations
> # so we enumarate every command (builder) we know this config uses below. 
> (see SOLR-8876)
> importCommands : ["org.kitesdk.morphline.stdlib.LogDebugBuilder",
>   
> "org.apache.solr.morphlines.solr.SanitizeUnknownSolrFieldsBuilder",
>   "org.apache.solr.morphlines.solr.LoadSolrBuilder"]
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Issue Comment Deleted] (SOLR-8876) Morphlines fails with "No command builder registered for ..." when using Java 9 due to morphline "importCommands" config option attempting to resolve classna

2019-12-21 Thread sdhalex (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-8876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sdhalex updated SOLR-8876:
--
Comment: was deleted

(was: [~sdhalex] gogogo)

> Morphlines fails with "No command builder registered for ..." when using Java 
> 9 due to morphline "importCommands" config option attempting to resolve 
> classname globs
> -
>
> Key: SOLR-8876
> URL: https://issues.apache.org/jira/browse/SOLR-8876
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - MapReduce, contrib - morphlines-cell, contrib 
> - morphlines-core
>Reporter: Uwe Schindler
>Assignee: Chris M. Hostetter
>Priority: Major
>  Labels: Java9
> Fix For: 6.5, 7.0
>
> Attachments: SOLR-8876.patch
>
>
> When running Solr in java9, and using the morphlines contrib(s) users may 
> encounter vague errors such as...
> {noformat}
> No command builder registered for COMMAND_NAME
> {noformat}
> This error comes directly from the morphlines code, and relates to the use of 
> wildcards in the {{importCommands}} declaration of of morphlines {{\*.conf}} 
> files used -- for example...
> {noformat}
> importCommands : ["org.kitesdk.**", "org.apache.solr.**"]
> {noformat}
> Using wildcards like {{\*}} and {{\*\*}} in morphline's {{importCommands}} 
> config options do not work in java9 due to changes in the underlying JVM 
> classloader.
> This issue is tracked up stream in: 
> https://github.com/kite-sdk/kite/issues/469
> 
> *WORK AROUND*
> The workaround is to only use fully qualified command class names in 
> {{importCommands}} declaration, one for each distinct command used in that 
> {{conf}} file.
> Example:
> {noformat}
> # Old config, does not work in java9
> # importCommands : ["org.kitesdk.**", "org.apache.solr.**"]
> # replaced with...
> # using globs (foo.bar.* or foo.bar.**) will not work in Java9 due to 
> classpath scanning limitations
> # so we enumarate every command (builder) we know this config uses below. 
> (see SOLR-8876)
> importCommands : ["org.kitesdk.morphline.stdlib.LogDebugBuilder",
>   
> "org.apache.solr.morphlines.solr.SanitizeUnknownSolrFieldsBuilder",
>   "org.apache.solr.morphlines.solr.LoadSolrBuilder"]
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-6209) Database connections lost during data import

2019-12-21 Thread sdhalex (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-6209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001662#comment-17001662
 ] 

sdhalex commented on SOLR-6209:
---

 has this bug been fixed in the newest version?  Or need we still do the same 
workaround to avoid this bug now ? Thx. [~sanket.thakkar007]  [~arafalov]
   






> Database connections lost during data import
> 
>
> Key: SOLR-6209
> URL: https://issues.apache.org/jira/browse/SOLR-6209
> Project: Solr
>  Issue Type: Bug
> Environment: OS: Windows 7
> RAM: 8 GB
>Reporter: SANKET
>Priority: Major
>  Labels: solr
>   Original Estimate: 30h
>  Remaining Estimate: 30h
>
> Follow the steps to generate the error:
> 1. Configure the large amount of data (around 4  GB  or more than 50 millions 
> of records)
> 2. Give proper data-config.xml file for indexing the data from remote 
> database server.
> 3. During indexing the data into solr from SQL SERVER 2010, at the half way 
> unplug the network cable and see the status in solr.
> e.g.
> localhost:8083/solr/core1/dataimport?command=status
> or
> localhost:8083/solr/core1/dataimport
> 4. Pass few seconds then again plug back the cable.
> 5. You can clearly see that there is just only "Time Elapsed" parameter 
> increase. "Total Rows Fetched" & "Total Documents Processed" remains same for 
> infinite time.
> 6. You can regenerate this for small data also.
> 7. Work around is you need to restart the solr. (But this is not good 
> solution)
> Note:
> This is very important issue because, so many organizations not using this 
> valuable products just because of the this database infinite connection issue.
> Solution can be: Forcefully abort the data indexing or provide mechanism for 
> forcefully abort the indexing.
> Hope you guys knows that abort command is also not working.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] bruno-roustant commented on a change in pull request #1103: LUCENE-9102: add maxQueryLength option to DirectSpellChecker

2019-12-21 Thread GitBox

bruno-roustant commented on a change in pull request #1103: LUCENE-9102: add 
maxQueryLength option to DirectSpellChecker
URL: https://github.com/apache/lucene-solr/pull/1103#discussion_r360642971
 
 

 ##
 File path: 
lucene/suggest/src/java/org/apache/lucene/search/spell/DirectSpellChecker.java
 ##
 @@ -195,9 +197,27 @@ public int getMinQueryLength() {
* metric.
*/
   public void setMinQueryLength(int minQueryLength) {
+if (minQueryLength > this.maxQueryLength)
 
 Review comment:
   Actually I think we don't need these checks because whatever the value for 
minQueryLength and maxQueryLength, the logic is always the same, no surprise.
   So personally I would remove these checks, but I'm ok to keep them, your 
choice.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13618) The core admin API should fail if the instanceDir is not under SOLR_HOME or a child or ancestor of instanceDir contains core.properties

2019-12-21 Thread Vincenzo D'Amore (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001675#comment-17001675
 ] 

Vincenzo D'Amore commented on SOLR-13618:
-

Hi, I have some experience with Solr and would like to contribute to the 
community. I think being able to fix this. What is not clear, is exactly what 
to do. I think to double check the instanceDir is within SOLR_HOME, when 
someone calls the core admin API, in case is outside raise an informative error 
message. On the other hand I didn't get what to do in the other case when a 
child or ancestor of instanceDir contains core.properties.

> The core admin API should fail if the instanceDir is not under SOLR_HOME or a 
> child or ancestor of instanceDir contains core.properties
> ---
>
> Key: SOLR-13618
> URL: https://issues.apache.org/jira/browse/SOLR-13618
> Project: Solr
>  Issue Type: Bug
>Reporter: Talvinder Matharu
>Priority: Major
>
> When new 'instanceDir' is set outside the SOLR_HOME directory then core will 
> fail to reload after restart as the server only discovers cores within 
> SOLR_HOME, looking for a 'core.properties'.  
> So what we ideally want is to check for all the “core.properties” defined 
> within all 'instanceDir' paths, which may exist outside the SOLR_HOME dir.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Comment Edited] (SOLR-13618) The core admin API should fail if the instanceDir is not under SOLR_HOME or a child or ancestor of instanceDir contains core.properties

2019-12-21 Thread Vincenzo D'Amore (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001675#comment-17001675
 ] 

Vincenzo D'Amore edited comment on SOLR-13618 at 12/21/19 11:32 AM:


Hi, I have some experience with Solr and would like to contribute to the 
community. I think being able to fix this. What is not clear, is exactly what 
to do. I think to double check the instanceDir is within SOLR_HOME when someone 
calls the core admin API, in case is outside raise an informative error 
message. On the other hand I didn't get what to do in the other case when a 
child or ancestor of instanceDir contains core.properties.


was (Author: v.dam...@gmail.com):
Hi, I have some experience with Solr and would like to contribute to the 
community. I think being able to fix this. What is not clear, is exactly what 
to do. I think to double check the instanceDir is within SOLR_HOME, when 
someone calls the core admin API, in case is outside raise an informative error 
message. On the other hand I didn't get what to do in the other case when a 
child or ancestor of instanceDir contains core.properties.

> The core admin API should fail if the instanceDir is not under SOLR_HOME or a 
> child or ancestor of instanceDir contains core.properties
> ---
>
> Key: SOLR-13618
> URL: https://issues.apache.org/jira/browse/SOLR-13618
> Project: Solr
>  Issue Type: Bug
>Reporter: Talvinder Matharu
>Priority: Major
>
> When new 'instanceDir' is set outside the SOLR_HOME directory then core will 
> fail to reload after restart as the server only discovers cores within 
> SOLR_HOME, looking for a 'core.properties'.  
> So what we ideally want is to check for all the “core.properties” defined 
> within all 'instanceDir' paths, which may exist outside the SOLR_HOME dir.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14120) Solr Admin UI breaks when using IE 11

2019-12-21 Thread Lucene/Solr QA (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001686#comment-17001686
 ] 

Lucene/Solr QA commented on SOLR-14120:
---

| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
|| || || || {color:brown} Prechecks {color} ||
|| || || || {color:brown} master Compile Tests {color} ||
|| || || || {color:brown} Patch Compile Tests {color} ||
| {color:green}+1{color} | {color:green} Validate source patterns {color} | 
{color:green}  0m  2s{color} | {color:green} the patch passed {color} |
|| || || || {color:brown} Other Tests {color} ||
| {color:black}{color} | {color:black} {color} | {color:black}  0m 30s{color} | 
{color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Issue | SOLR-14120 |
| JIRA Patch URL | 
https://issues.apache.org/jira/secure/attachment/12989279/SOLR-14120.patch |
| Optional Tests |  validatesourcepatterns  |
| uname | Linux lucene1-us-west 4.15.0-54-generic #58-Ubuntu SMP Mon Jun 24 
10:55:24 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | ant |
| Personality | 
/home/jenkins/jenkins-slave/workspace/PreCommit-SOLR-Build/sourcedir/dev-tools/test-patch/lucene-solr-yetus-personality.sh
 |
| git revision | master / c4f68bdab91 |
| ant | version: Apache Ant(TM) version 1.10.5 compiled on March 28 2019 |
| modules | C: solr solr/webapp U: solr |
| Console output | 
https://builds.apache.org/job/PreCommit-SOLR-Build/637/console |
| Powered by | Apache Yetus 0.7.0   http://yetus.apache.org |


This message was automatically generated.



> Solr Admin UI breaks when using IE 11
> -
>
> Key: SOLR-14120
> URL: https://issues.apache.org/jira/browse/SOLR-14120
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: Admin UI
>Affects Versions: master (9.0), 8.3.1
> Environment: Operation System: Microsoft Windows 10 Enterprise 
> (German)
> Version: 10.0.17763 Build 17763
> Browser: Internet Explorer 11
> Version: 11.864.17763
> Update: 11.0.160
>Reporter: Jakob Furrer
>Priority: Minor
> Attachments: SOLR-14120.patch, SOLR-14120.patch, SOLR-14120.patch, 
> Solr-8.3.1_ IE11_AdminUI_Error.png
>
>
> A customer of mine is restricted to use no other browser than Internet 
> Explorer to access the Solr Administration User Interface.
> He reported that the website does not display any content in the main area 
> when any button of the menu on the left hand side is clicked.
> I can replicate that behavior on my machine.
> Symptoms
> -
> When the Admin UI is loaded, the menu on the left hand side is visible but no 
> content is ever displayed in the main area.
> Any menu item (e.g. "Dashboard", "logging", "Java Properties" etc.) can be 
> clicked but no content appears.
> An error with a stacktrace is shown in the JavaScript console.
> History
> -
> I can confirm, that some month ago the Admin UI worked well under IE.
> However, I have not researched with what update of IE the problems exactly 
> started.
> Analysis
> -
> The displayed stacktrace indicates that Internet Explorer does not support 
> the method "startsWith" (and subsequently, the same is true for the method 
> "includes").
> This behavior is documented:
> https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/startsWith#Browser_compatibility
> https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Array/includes#Browser_compatibility
> https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/includes#Browser_compatibility



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] shalinmangar commented on a change in pull request #1082: SOLR-13984: add (experimental, disabled by default) security manager support

2019-12-21 Thread GitBox

shalinmangar commented on a change in pull request #1082: SOLR-13984: add 
(experimental, disabled by default) security manager support
URL: https://github.com/apache/lucene-solr/pull/1082#discussion_r360646424
 
 

 ##
 File path: solr/bin/solr
 ##
 @@ -2071,6 +2071,17 @@ else
   REMOTE_JMX_OPTS=()
 fi
 
+# Enable java security manager (limiting filesystem access and other things)
+# Don't cache DNS lookups forever, set the value back to the original JDK 
default.
+if [ "$SOLR_SECURITY_MANAGER_ENABLED" == "true" ]; then
+  SECURITY_MANAGER_OPTS=('-Djava.security.manager' \
+"-Djava.security.policy=${SOLR_SERVER_DIR}/etc/security.policy" \
+'-Dsun.net.inetaddr.ttl=30' \
 
 Review comment:
   Shouldn't this be configurable by the user? Perhaps we keep this default but 
make it configurable with a `SOLR_NETWORKADDRESS_CACHE_TTL` variable.
   
   Although most (all?) JVMs still honor the `sun.net.inetaddr.ttl` but it is 
officially deprecated so we should use `-Dnetworkaddress.cache.ttl`


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-13618) The core admin API should fail to create a core if the instanceDir is not under SOLR_HOME or a child or ancestor of instanceDir contains core.properties

2019-12-21 Thread Erick Erickson (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson updated SOLR-13618:
--
Summary: The core admin API should fail to create a core if the instanceDir 
is not under SOLR_HOME or a child or ancestor of instanceDir contains 
core.properties  (was: The core admin API should fail if the instanceDir is not 
under SOLR_HOME or a child or ancestor of instanceDir contains core.properties)

> The core admin API should fail to create a core if the instanceDir is not 
> under SOLR_HOME or a child or ancestor of instanceDir contains core.properties
> 
>
> Key: SOLR-13618
> URL: https://issues.apache.org/jira/browse/SOLR-13618
> Project: Solr
>  Issue Type: Bug
>Reporter: Talvinder Matharu
>Priority: Major
>
> When new 'instanceDir' is set outside the SOLR_HOME directory then core will 
> fail to reload after restart as the server only discovers cores within 
> SOLR_HOME, looking for a 'core.properties'.  
> So what we ideally want is to check for all the “core.properties” defined 
> within all 'instanceDir' paths, which may exist outside the SOLR_HOME dir.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13618) The core admin API should fail to create a core if the instanceDir is not under SOLR_HOME or a child or ancestor of instanceDir contains core.properties

2019-12-21 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001691#comment-17001691
 ] 

Erick Erickson commented on SOLR-13618:
---

Vincenzo:

Sure, give a whirl. Short form:

when the core admin API instanceDir method is called (note, core admin API, 
_not_ collection API) a couple of things should happen if the instanceDir 
property is specified:

1> make sure instanceDir is a descendant of SOLR_HOME

2> Insure that no _other_ core.properties files exist above or below it, i.e. 
in any directory between instanceDir and SOLR_HOME, or any directory below 
SOLR_HOME.

You're probably looking at CoreContainer.create...

> The core admin API should fail to create a core if the instanceDir is not 
> under SOLR_HOME or a child or ancestor of instanceDir contains core.properties
> 
>
> Key: SOLR-13618
> URL: https://issues.apache.org/jira/browse/SOLR-13618
> Project: Solr
>  Issue Type: Bug
>Reporter: Talvinder Matharu
>Priority: Major
>
> When new 'instanceDir' is set outside the SOLR_HOME directory then core will 
> fail to reload after restart as the server only discovers cores within 
> SOLR_HOME, looking for a 'core.properties'.  
> So what we ideally want is to check for all the “core.properties” defined 
> within all 'instanceDir' paths, which may exist outside the SOLR_HOME dir.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-8876) Morphlines fails with "No command builder registered for ..." when using Java 9 due to morphline "importCommands" config option attempting to resolve classname globs

2019-12-21 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-8876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001692#comment-17001692
 ] 

Erick Erickson commented on SOLR-8876:
--

See SOLR-9221. This is no longer part of Solr.

> Morphlines fails with "No command builder registered for ..." when using Java 
> 9 due to morphline "importCommands" config option attempting to resolve 
> classname globs
> -
>
> Key: SOLR-8876
> URL: https://issues.apache.org/jira/browse/SOLR-8876
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - MapReduce, contrib - morphlines-cell, contrib 
> - morphlines-core
>Reporter: Uwe Schindler
>Assignee: Chris M. Hostetter
>Priority: Major
>  Labels: Java9
> Fix For: 6.5, 7.0
>
> Attachments: SOLR-8876.patch
>
>
> When running Solr in java9, and using the morphlines contrib(s) users may 
> encounter vague errors such as...
> {noformat}
> No command builder registered for COMMAND_NAME
> {noformat}
> This error comes directly from the morphlines code, and relates to the use of 
> wildcards in the {{importCommands}} declaration of of morphlines {{\*.conf}} 
> files used -- for example...
> {noformat}
> importCommands : ["org.kitesdk.**", "org.apache.solr.**"]
> {noformat}
> Using wildcards like {{\*}} and {{\*\*}} in morphline's {{importCommands}} 
> config options do not work in java9 due to changes in the underlying JVM 
> classloader.
> This issue is tracked up stream in: 
> https://github.com/kite-sdk/kite/issues/469
> 
> *WORK AROUND*
> The workaround is to only use fully qualified command class names in 
> {{importCommands}} declaration, one for each distinct command used in that 
> {{conf}} file.
> Example:
> {noformat}
> # Old config, does not work in java9
> # importCommands : ["org.kitesdk.**", "org.apache.solr.**"]
> # replaced with...
> # using globs (foo.bar.* or foo.bar.**) will not work in Java9 due to 
> classpath scanning limitations
> # so we enumarate every command (builder) we know this config uses below. 
> (see SOLR-8876)
> importCommands : ["org.kitesdk.morphline.stdlib.LogDebugBuilder",
>   
> "org.apache.solr.morphlines.solr.SanitizeUnknownSolrFieldsBuilder",
>   "org.apache.solr.morphlines.solr.LoadSolrBuilder"]
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14115) Deprecate zkcli and bin/solr zk support

2019-12-21 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001696#comment-17001696
 ] 

Erick Erickson commented on SOLR-14115:
---

Ah, ok. bin/solr doesn't use the HTTP APIs, it uses SolrZkClient. Sounds like 
we should keep it that way.

Oh, actually I did restrict one thing. You can't recursively remove everything, 
this fails: "bin/solr zk rm -r /".

> Deprecate zkcli and bin/solr zk support
> ---
>
> Key: SOLR-14115
> URL: https://issues.apache.org/jira/browse/SOLR-14115
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: scripts and tools
>Reporter: Erick Erickson
>Priority: Major
>
> I think it's a valid argument that these have outlived their usefulness and 
> we should remove them and have APIs to do what Solr requires. Especially if 
> we can find and point to a third-party visual ZK tool for _changing_ 
> arbitrary data in ZK. Zookeeper 3.5.5 has the admin server which has a UI 
> (although I don't see how to change data in ZK with it. Haven't looked very 
> much).
> While we're ripping stuff out of Solr, are these candidates? It would break 
> my heart to rip ZK support out from bin/solr, but all good things must come 
> to an end. Why do we maintain three (zkcli, bin/solr and the APIs) ways of 
> doing the same thing?
> Mark put the zkcli stuff in before we had APIs to do what Solr needs to do 
> with ZK, mainly uploading configsets at the time. I put the zk support in 
> bin/solr also before the APIs existed because I thought having to learn our 
> custom wrapper for ZK was yet another orphan bit of code laying around. All 
> before we had things like the configsets API.
> Personally, I'd prefer removing zkcli rather than bin/solr, but that's 
> because I originated the bin/solr code ;)
> This occurred when reading SOLR-14109, I'm not entirely sure what I _really_ 
> think about it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] rmuir commented on a change in pull request #1082: SOLR-13984: add (experimental, disabled by default) security manager support

2019-12-21 Thread GitBox

rmuir commented on a change in pull request #1082: SOLR-13984: add 
(experimental, disabled by default) security manager support
URL: https://github.com/apache/lucene-solr/pull/1082#discussion_r360649127
 
 

 ##
 File path: solr/bin/solr
 ##
 @@ -2071,6 +2071,17 @@ else
   REMOTE_JMX_OPTS=()
 fi
 
+# Enable java security manager (limiting filesystem access and other things)
+# Don't cache DNS lookups forever, set the value back to the original JDK 
default.
+if [ "$SOLR_SECURITY_MANAGER_ENABLED" == "true" ]; then
+  SECURITY_MANAGER_OPTS=('-Djava.security.manager' \
+"-Djava.security.policy=${SOLR_SERVER_DIR}/etc/security.policy" \
+'-Dsun.net.inetaddr.ttl=30' \
 
 Review comment:
   {quote}
   Shouldn't this be configurable by the user? Perhaps we keep this default but 
make it configurable with a SOLR_NETWORKADDRESS_CACHE_TTL variable.
   {quote}
   
   It is not configurable today: I'm not trying to add that feature here. I am 
just preventing dns from cached indefinitely.
   
   {quote}
   Although most (all?) JVMs still honor the sun.net.inetaddr.ttl but it is 
officially deprecated so we should use -Dnetworkaddress.cache.ttl
   {quote}
   
   nope: `networkaddress.cache.ttl` is not a system property at all.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] rmuir commented on a change in pull request #1082: SOLR-13984: add (experimental, disabled by default) security manager support

2019-12-21 Thread GitBox

rmuir commented on a change in pull request #1082: SOLR-13984: add 
(experimental, disabled by default) security manager support
URL: https://github.com/apache/lucene-solr/pull/1082#discussion_r360649127
 
 

 ##
 File path: solr/bin/solr
 ##
 @@ -2071,6 +2071,17 @@ else
   REMOTE_JMX_OPTS=()
 fi
 
+# Enable java security manager (limiting filesystem access and other things)
+# Don't cache DNS lookups forever, set the value back to the original JDK 
default.
+if [ "$SOLR_SECURITY_MANAGER_ENABLED" == "true" ]; then
+  SECURITY_MANAGER_OPTS=('-Djava.security.manager' \
+"-Djava.security.policy=${SOLR_SERVER_DIR}/etc/security.policy" \
+'-Dsun.net.inetaddr.ttl=30' \
 
 Review comment:
   > Shouldn't this be configurable by the user? Perhaps we keep this default 
but make it configurable with a SOLR_NETWORKADDRESS_CACHE_TTL variable.
   
   It is not configurable today: I'm not trying to add that feature here. I am 
just preventing dns from cached indefinitely.
   
   > Although most (all?) JVMs still honor the sun.net.inetaddr.ttl but it is 
officially deprecated so we should use -Dnetworkaddress.cache.ttl
   
   nope: `networkaddress.cache.ttl` is not a system property at all.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-14130) Add postlogs command line tool for indexing Solr logs

2019-12-21 Thread Joel Bernstein (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joel Bernstein updated SOLR-14130:
--
Attachment: Screen Shot 2019-12-21 at 8.46.51 AM.png

> Add postlogs command line tool for indexing Solr logs
> -
>
> Key: SOLR-14130
> URL: https://issues.apache.org/jira/browse/SOLR-14130
> Project: Solr
>  Issue Type: Task
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Joel Bernstein
>Assignee: Joel Bernstein
>Priority: Major
> Attachments: SOLR-14130.patch, Screen Shot 2019-12-19 at 2.04.41 
> PM.png, Screen Shot 2019-12-19 at 2.16.01 PM.png, Screen Shot 2019-12-19 at 
> 2.35.41 PM.png, Screen Shot 2019-12-19 at 8.48.08 PM.png, Screen Shot 
> 2019-12-21 at 8.46.51 AM.png
>
>
> This ticket adds a simple command line tool for posting Solr logs to a solr 
> index. The tool works with the out of the box Solr log format. Still a work 
> in progress but currently indexes:
>  * queries
>  * updates
>  * commits
>  * new searchers
>  * errors - including stack traces
> Attached are some sample visualizations using Solr Streaming Expressions and 
> Math Expressions after the data has been loaded. The visualizations show: 
> time series, scatter plots, histograms and quantile plots, but really this is 
> just scratching the surface of the visualizations that can be done with the 
> Solr logs.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-14130) Add postlogs command line tool for indexing Solr logs

2019-12-21 Thread Joel Bernstein (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joel Bernstein updated SOLR-14130:
--
Attachment: (was: Screen Shot 2019-12-19 at 8.48.08 PM.png)

> Add postlogs command line tool for indexing Solr logs
> -
>
> Key: SOLR-14130
> URL: https://issues.apache.org/jira/browse/SOLR-14130
> Project: Solr
>  Issue Type: Task
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Joel Bernstein
>Assignee: Joel Bernstein
>Priority: Major
> Attachments: SOLR-14130.patch, Screen Shot 2019-12-19 at 2.04.41 
> PM.png, Screen Shot 2019-12-19 at 2.16.01 PM.png, Screen Shot 2019-12-19 at 
> 2.35.41 PM.png, Screen Shot 2019-12-21 at 8.46.51 AM.png
>
>
> This ticket adds a simple command line tool for posting Solr logs to a solr 
> index. The tool works with the out of the box Solr log format. Still a work 
> in progress but currently indexes:
>  * queries
>  * updates
>  * commits
>  * new searchers
>  * errors - including stack traces
> Attached are some sample visualizations using Solr Streaming Expressions and 
> Math Expressions after the data has been loaded. The visualizations show: 
> time series, scatter plots, histograms and quantile plots, but really this is 
> just scratching the surface of the visualizations that can be done with the 
> Solr logs.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-14056) Solr admin api's are not authenticated after enabling blockUnknown parameter in jwt authentication.

2019-12-21 Thread Lakhan Gupta (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lakhan Gupta updated SOLR-14056:

Description: 
Hi, 

I am enabling jwt authentication using solr 8.1.1 in standalone mode. below is 
my security.json file. 
{code:java}
{{  "authentication":{ "blockUnknown": true,    "class":"solr.JWTAuthPlugin", 
"jwk":{      "kty":"oct",      "use":"sig",      "kid":"k1",      
"k":"8347834radajklajdlkajskdjaskldjkasljdklasjdklaskdjaslkdjklasjdlkasjkldaskldjlkasdasjdlkasjdlkdsl",
      "alg":"HS256"},    "aud":"solr"},  "authorization":{      
"class":"solr.RuleBasedAuthorizationPlugin",      "permissions":[  {            
"name":"all", "path":"/*",            "role":"admin"         }      ],      
"user-role":{         "solr":"admin"      }   }}
{code}
as you can see in above code, blockunknown parameter is enabled because of that 
my solr admin/info/system api is not authenticated. 

I've read in 8.1.1 documentation as blockunknown parameter block unknown 
request. so that, my admin/info/system api causing problem. I need an urgent 
help! 

 

Really appreciate if someone can give me a quick solution.

 

Thanks 

Laksh Gupta 

 

  was:
Hi, 

I am enabling jwt authentication using solr 8.1.1 in standalone mode. below is 
my security.json file. 
{code:java}
{{  "authentication":{ "blockUnknown": true,    "class":"solr.JWTAuthPlugin", 
"jwk":{      "kty":"oct",      "use":"sig",      "kid":"k1",      
"k":"7A02618BE6943C22FD81CAB9F6FCF063B6E1732C3614BC3ACA6032B6B3215CAF0D28A34FD423423CA3AC34BEA27D3F79",
      "alg":"HS256"},    "aud":"solr"},  "authorization":{      
"class":"solr.RuleBasedAuthorizationPlugin",      "permissions":[  {            
"name":"all", "path":"/*",            "role":"admin"         }      ],      
"user-role":{         "solr":"admin"      }   }}
{code}
as you can see in above code, blockunknown parameter is enabled because of that 
my solr admin/info/system api is not authenticated. 

I've read in 8.1.1 documentation as blockunknown parameter block unknown 
request. so that, my admin/info/system api causing problem. I need an urgent 
help! 

 

Really appreciate if someone can give me a quick solution.

 

Thanks 

Laksh Gupta 

 


> Solr admin api's are not authenticated after enabling blockUnknown parameter 
> in jwt authentication.
> ---
>
> Key: SOLR-14056
> URL: https://issues.apache.org/jira/browse/SOLR-14056
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: Authentication, Authorization
>Affects Versions: 8.1.1
> Environment: solr stand
>Reporter: Lakhan Gupta
>Priority: Critical
>  Labels: beginner
>
> Hi, 
> I am enabling jwt authentication using solr 8.1.1 in standalone mode. below 
> is my security.json file. 
> {code:java}
> {{  "authentication":{ "blockUnknown": true,    "class":"solr.JWTAuthPlugin", 
> "jwk":{      "kty":"oct",      "use":"sig",      "kid":"k1",      
> "k":"8347834radajklajdlkajskdjaskldjkasljdklasjdklaskdjaslkdjklasjdlkasjkldaskldjlkasdasjdlkasjdlkdsl",
>       "alg":"HS256"},    "aud":"solr"},  "authorization":{      
> "class":"solr.RuleBasedAuthorizationPlugin",      "permissions":[  {          
>   "name":"all", "path":"/*",            "role":"admin"         }      ],      
> "user-role":{         "solr":"admin"      }   }}
> {code}
> as you can see in above code, blockunknown parameter is enabled because of 
> that my solr admin/info/system api is not authenticated. 
> I've read in 8.1.1 documentation as blockunknown parameter block unknown 
> request. so that, my admin/info/system api causing problem. I need an urgent 
> help! 
>  
> Really appreciate if someone can give me a quick solution.
>  
> Thanks 
> Laksh Gupta 
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] yonik merged pull request #1081: SOLR-13101: Concurrency tests for SHARED collection.

2019-12-21 Thread GitBox

yonik merged pull request #1081: SOLR-13101: Concurrency tests for SHARED 
collection.
URL: https://github.com/apache/lucene-solr/pull/1081
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13101) Shared storage support in SolrCloud

2019-12-21 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001716#comment-17001716
 ] 

ASF subversion and git services commented on SOLR-13101:


Commit 036ce539525aa9c0eb0270d94e4105cd1a0313e9 in lucene-solr's branch 
refs/heads/jira/SOLR-13101 from Bilal Waheed
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=036ce53 ]

SOLR-13101: Concurrency tests for SHARED collection. (#1081)

* SOLR-13101: Concurrency tests for SHARED collection.

-Concurrency tests for SHARED collection.
-On core container shutdown, shutdown the background blob delete manager too.
-Use correct casing for SharedCoreStage enum.
-Added INDEXING_BATCH_FINISHED to indicate the end of a batch 
(BLOB_PUSH_FINISHED does not help if push itself runs into error)

* -CR feedback.


> Shared storage support in SolrCloud
> ---
>
> Key: SOLR-13101
> URL: https://issues.apache.org/jira/browse/SOLR-13101
> Project: Solr
>  Issue Type: New Feature
>  Components: SolrCloud
>Reporter: Yonik Seeley
>Priority: Major
>  Time Spent: 7h 50m
>  Remaining Estimate: 0h
>
> Solr should have first-class support for shared storage (blob/object stores 
> like S3, google cloud storage, etc. and shared filesystems like HDFS, NFS, 
> etc).
> The key component will likely be a new replica type for shared storage.  It 
> would have many of the benefits of the current "pull" replicas (not indexing 
> on all replicas, all shards identical with no shards getting out-of-sync, 
> etc), but would have additional benefits:
>  - Any shard could become leader (the blob store always has the index)
>  - Better elasticity scaling down
>- durability not linked to number of replcias.. a single replica could be 
> common for write workloads
>- could drop to 0 replicas for a shard when not needed (blob store always 
> has index)
>  - Allow for higher performance write workloads by skipping the transaction 
> log
>- don't pay for what you don't need
>- a commit will be necessary to flush to stable storage (blob store)
>  - A lot of the complexity and failure modes go away
> An additional component a Directory implementation that will work well with 
> blob stores.  We probably want one that treats local disk as a cache since 
> the latency to remote storage is so large.  I think there are still some 
> "locking" issues to be solved here (ensuring that more than one writer to the 
> same index won't corrupt it).  This should probably be pulled out into a 
> different JIRA issue.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13101) Shared storage support in SolrCloud

2019-12-21 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001717#comment-17001717
 ] 

ASF subversion and git services commented on SOLR-13101:


Commit 036ce539525aa9c0eb0270d94e4105cd1a0313e9 in lucene-solr's branch 
refs/heads/jira/SOLR-13101 from Bilal Waheed
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=036ce53 ]

SOLR-13101: Concurrency tests for SHARED collection. (#1081)

* SOLR-13101: Concurrency tests for SHARED collection.

-Concurrency tests for SHARED collection.
-On core container shutdown, shutdown the background blob delete manager too.
-Use correct casing for SharedCoreStage enum.
-Added INDEXING_BATCH_FINISHED to indicate the end of a batch 
(BLOB_PUSH_FINISHED does not help if push itself runs into error)

* -CR feedback.


> Shared storage support in SolrCloud
> ---
>
> Key: SOLR-13101
> URL: https://issues.apache.org/jira/browse/SOLR-13101
> Project: Solr
>  Issue Type: New Feature
>  Components: SolrCloud
>Reporter: Yonik Seeley
>Priority: Major
>  Time Spent: 7h 50m
>  Remaining Estimate: 0h
>
> Solr should have first-class support for shared storage (blob/object stores 
> like S3, google cloud storage, etc. and shared filesystems like HDFS, NFS, 
> etc).
> The key component will likely be a new replica type for shared storage.  It 
> would have many of the benefits of the current "pull" replicas (not indexing 
> on all replicas, all shards identical with no shards getting out-of-sync, 
> etc), but would have additional benefits:
>  - Any shard could become leader (the blob store always has the index)
>  - Better elasticity scaling down
>- durability not linked to number of replcias.. a single replica could be 
> common for write workloads
>- could drop to 0 replicas for a shard when not needed (blob store always 
> has index)
>  - Allow for higher performance write workloads by skipping the transaction 
> log
>- don't pay for what you don't need
>- a commit will be necessary to flush to stable storage (blob store)
>  - A lot of the complexity and failure modes go away
> An additional component a Directory implementation that will work well with 
> blob stores.  We probably want one that treats local disk as a cache since 
> the latency to remote storage is so large.  I think there are still some 
> "locking" issues to be solved here (ensuring that more than one writer to the 
> same index won't corrupt it).  This should probably be pulled out into a 
> different JIRA issue.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13984) Solr should run inside a SecurityManager

2019-12-21 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001727#comment-17001727
 ] 

ASF subversion and git services commented on SOLR-13984:


Commit 420400063479a27bb24878abd2f86d9a50415535 in lucene-solr's branch 
refs/heads/jira/SOLR-13984 from Robert Muir
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=4204000 ]

SOLR-13984: use a custom security properties file so nobody freaks out about 
sun.net.inetaddr.ttl


> Solr should run inside a SecurityManager
> 
>
> Key: SOLR-13984
> URL: https://issues.apache.org/jira/browse/SOLR-13984
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Ishan Chattopadhyaya
>Priority: Major
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> To reduce the effect of attacks, esp. RCE, Solr should run inside a 
> SecurityManager.
> Quoting Uwe here:
> {quote}
> The correct way to fix all issues we have seen the last time is very simple: 
> LET'S RUN SOLR INSIDE A SECURITY MANAGER IN PRODUCTION (like in tests). 
> Elasticsearch is doing this, so please please let's do this instead. But this 
> requires to finally get rid of the webapplication and start.jar and add our 
> own bootstrapping (like in tests) that configure Jetty and Security Manager 
> from our own org.apache.solr.bootstrap.Main.java (or similar).
> {quote}
> https://jira.apache.org/jira/browse/SOLR-12316?focusedCommentId=16465038&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-16465038



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] rmuir commented on a change in pull request #1082: SOLR-13984: add (experimental, disabled by default) security manager support

2019-12-21 Thread GitBox

rmuir commented on a change in pull request #1082: SOLR-13984: add 
(experimental, disabled by default) security manager support
URL: https://github.com/apache/lucene-solr/pull/1082#discussion_r360654988
 
 

 ##
 File path: solr/bin/solr
 ##
 @@ -2071,6 +2071,17 @@ else
   REMOTE_JMX_OPTS=()
 fi
 
+# Enable java security manager (limiting filesystem access and other things)
+# Don't cache DNS lookups forever, set the value back to the original JDK 
default.
+if [ "$SOLR_SECURITY_MANAGER_ENABLED" == "true" ]; then
+  SECURITY_MANAGER_OPTS=('-Djava.security.manager' \
+"-Djava.security.policy=${SOLR_SERVER_DIR}/etc/security.policy" \
+'-Dsun.net.inetaddr.ttl=30' \
 
 Review comment:
   @shalinmangar i added a commit with a custom `java.security.properties` 
file, to override this via the `networkaddress.cache.ttl`, setting it back to 
the default of 30 (same as when no security manager is active). It does the 
same thing as before, just in a more roundabout way...
   
   If users want to mess with java security properties, they must edit java 
security properties files either way: no variables can be supported since they 
are not system properties and can't be set with `-D`.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Created] (SOLR-14137) Boosting by date (and perhaps others) shows a steady decline 6.6->8.3

2019-12-21 Thread Erick Erickson (Jira)

Erick Erickson created SOLR-14137:
-

 Summary: Boosting by date (and perhaps others) shows a steady 
decline 6.6->8.3
 Key: SOLR-14137
 URL: https://issues.apache.org/jira/browse/SOLR-14137
 Project: Solr
  Issue Type: Improvement
  Security Level: Public (Default Security Level. Issues are Public)
Reporter: Erick Erickson


Moving a user's list discussion over here.

{color:#00}The very short form is that from Solr 6.6.1 to Solr 8.3.1, the 
throughput for date boosting in my tests dropped by 40+%{color}

{color:#00}I’ve been hearing about slowdowns in successive Solr releases 
with boost functions, so I dug into it a bit. The test setup is just a 
boost-by-date with an additional big OR clause of 100 random words so I’d be 
sure to hit a bunch of docs. I figured that if there were few hits, the signal 
would be lost in the noise, but I didn’t look at the actual hit counts.{color}

{color:#00}I saw several Solr JIRAs about this subject, but they were 
slightly different, although quite possibly the same underlying issue. So I 
tried to get this down to a very specific form of a query.{color}

{color:#00}I’ve also seen some cases in the wild where the response was 
proportional to the number of segments, thus my optimize experiments.{color}

{color:#00}Here are the results, explanation below. O stands for optimized 
to one segment. I spot checked pdate against 7x and 8x and they weren’t 
significantly different performance wise from tdate. All have docValues 
enabled. I ran these against a multiValued=“false” field. All the tests pegged 
all my CPUs. Jmeter is being run on a different machine than Solr. Only one 
Solr was running for any test.{color}

{color:#00}Solr version   queries/min   {color}
{color:#00}6.6.1  3,400  {color}
{color:#00}6.6.1 O   4,800  {color}

{color:#00}7.1 2,800   {color}
{color:#00}7.1 O 4,200   {color}

{color:#00}7.7.1  2,400   {color}
{color:#00}7.7.1 O  3,500    {color}

{color:#00}8.3.1 2,000    {color}
{color:#00}8.3.1 O  2,600    {color}


{color:#00}The tests I’ve been running just index 20M docs into a single 
core, then run the exact same 10,000 queries against them from jmeter with 24 
threads. Spot checks showed no hits on the queryResultCache.{color}

{color:#00}A query looks like this: {color}
{color:#00}rows=0&\{!boost b=recip(ms(NOW, 
INSERT_FIELD_HERE),3.16e-11,1,1)}text_txt:(campaigners OR adjourned OR 
anyplace…97 more random words){color}

{color:#00}There is no faceting. No grouping. No sorting.{color}

{color:#00}I fill in INSERT_FIELD_HERE through jmeter magic. I’m running 
the exact same queries for every test.{color}

{color:#00}One wildcard is that I did regenerate the index for each major 
revision, and the chose random words from the same list of words, as well as 
random times (bounded in the same range though) so the docs are not completely 
identical. The index was in the native format for that major version even if 
slightly different between versions. I ran the test once, then ran it again 
after optimizing the index.{color}

{color:#00}I haven’t dug any farther, if anyone’s interested I can throw a 
profiler at, say, 8.3 and see what I can see, although I’m not going to have 
time to dive into this any time soon. I’d be glad to run some tests though. I 
saved the queries and the indexes so running a test would  only take a few 
minutes.{color}

{color:#00}While I concentrated on date fields, the docs have date, int, 
and long fields, both docValues=true and docValues=false, each variant with 
multiValued=true and multiValued=false and both Trie and Point (where possible) 
variants as well as a pretty simple text field.{color}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14137) Boosting by date (and perhaps others) shows a steady decline 6.6->8.3

2019-12-21 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001738#comment-17001738
 ] 

Erick Erickson commented on SOLR-14137:
---

>From David Smiley:

{color:#00}This unfolding story shows us why we need nightly benchmarks of 
Solr -- {color}SOLR-10317

> Boosting by date (and perhaps others) shows a steady decline 6.6->8.3
> -
>
> Key: SOLR-14137
> URL: https://issues.apache.org/jira/browse/SOLR-14137
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Erick Erickson
>Priority: Major
>
> Moving a user's list discussion over here.
> {color:#00}The very short form is that from Solr 6.6.1 to Solr 8.3.1, the 
> throughput for date boosting in my tests dropped by 40+%{color}
> {color:#00}I’ve been hearing about slowdowns in successive Solr releases 
> with boost functions, so I dug into it a bit. The test setup is just a 
> boost-by-date with an additional big OR clause of 100 random words so I’d be 
> sure to hit a bunch of docs. I figured that if there were few hits, the 
> signal would be lost in the noise, but I didn’t look at the actual hit 
> counts.{color}
> {color:#00}I saw several Solr JIRAs about this subject, but they were 
> slightly different, although quite possibly the same underlying issue. So I 
> tried to get this down to a very specific form of a query.{color}
> {color:#00}I’ve also seen some cases in the wild where the response was 
> proportional to the number of segments, thus my optimize experiments.{color}
> {color:#00}Here are the results, explanation below. O stands for 
> optimized to one segment. I spot checked pdate against 7x and 8x and they 
> weren’t significantly different performance wise from tdate. All have 
> docValues enabled. I ran these against a multiValued=“false” field. All the 
> tests pegged all my CPUs. Jmeter is being run on a different machine than 
> Solr. Only one Solr was running for any test.{color}
> {color:#00}Solr version   queries/min   {color}
> {color:#00}6.6.1  3,400  {color}
> {color:#00}6.6.1 O   4,800  {color}
> {color:#00}7.1 2,800   {color}
> {color:#00}7.1 O 4,200   {color}
> {color:#00}7.7.1  2,400   {color}
> {color:#00}7.7.1 O  3,500    {color}
> {color:#00}8.3.1 2,000    {color}
> {color:#00}8.3.1 O  2,600    {color}
> {color:#00}The tests I’ve been running just index 20M docs into a single 
> core, then run the exact same 10,000 queries against them from jmeter with 24 
> threads. Spot checks showed no hits on the queryResultCache.{color}
> {color:#00}A query looks like this: {color}
> {color:#00}rows=0&\{!boost b=recip(ms(NOW, 
> INSERT_FIELD_HERE),3.16e-11,1,1)}text_txt:(campaigners OR adjourned OR 
> anyplace…97 more random words){color}
> {color:#00}There is no faceting. No grouping. No sorting.{color}
> {color:#00}I fill in INSERT_FIELD_HERE through jmeter magic. I’m running 
> the exact same queries for every test.{color}
> {color:#00}One wildcard is that I did regenerate the index for each major 
> revision, and the chose random words from the same list of words, as well as 
> random times (bounded in the same range though) so the docs are not 
> completely identical. The index was in the native format for that major 
> version even if slightly different between versions. I ran the test once, 
> then ran it again after optimizing the index.{color}
> {color:#00}I haven’t dug any farther, if anyone’s interested I can throw 
> a profiler at, say, 8.3 and see what I can see, although I’m not going to 
> have time to dive into this any time soon. I’d be glad to run some tests 
> though. I saved the queries and the indexes so running a test would  only 
> take a few minutes.{color}
> {color:#00}While I concentrated on date fields, the docs have date, int, 
> and long fields, both docValues=true and docValues=false, each variant with 
> multiValued=true and multiValued=false and both Trie and Point (where 
> possible) variants as well as a pretty simple text field.{color}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14137) Boosting by date (and perhaps others) shows a steady decline 6.6->8.3

2019-12-21 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001739#comment-17001739
 ] 

Erick Erickson commented on SOLR-14137:
---

>From Joel Bernstein:

Here is a quantile plot that shows the performance slowdown of the boost query, 
from Ericks test runs. 
 
{color:#00}The x-axis is the percentiles and the y-axis is the QTime for 
the different Solr versions at each percentile. The median QTime's are shown in 
the tooltip.{color}
{color:#00}(I'll attach){color}

> Boosting by date (and perhaps others) shows a steady decline 6.6->8.3
> -
>
> Key: SOLR-14137
> URL: https://issues.apache.org/jira/browse/SOLR-14137
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Erick Erickson
>Priority: Major
> Attachments: Screen Shot 2019-12-19 at 2.35.41 PM.png
>
>
> Moving a user's list discussion over here.
> {color:#00}The very short form is that from Solr 6.6.1 to Solr 8.3.1, the 
> throughput for date boosting in my tests dropped by 40+%{color}
> {color:#00}I’ve been hearing about slowdowns in successive Solr releases 
> with boost functions, so I dug into it a bit. The test setup is just a 
> boost-by-date with an additional big OR clause of 100 random words so I’d be 
> sure to hit a bunch of docs. I figured that if there were few hits, the 
> signal would be lost in the noise, but I didn’t look at the actual hit 
> counts.{color}
> {color:#00}I saw several Solr JIRAs about this subject, but they were 
> slightly different, although quite possibly the same underlying issue. So I 
> tried to get this down to a very specific form of a query.{color}
> {color:#00}I’ve also seen some cases in the wild where the response was 
> proportional to the number of segments, thus my optimize experiments.{color}
> {color:#00}Here are the results, explanation below. O stands for 
> optimized to one segment. I spot checked pdate against 7x and 8x and they 
> weren’t significantly different performance wise from tdate. All have 
> docValues enabled. I ran these against a multiValued=“false” field. All the 
> tests pegged all my CPUs. Jmeter is being run on a different machine than 
> Solr. Only one Solr was running for any test.{color}
> {color:#00}Solr version   queries/min   {color}
> {color:#00}6.6.1  3,400  {color}
> {color:#00}6.6.1 O   4,800  {color}
> {color:#00}7.1 2,800   {color}
> {color:#00}7.1 O 4,200   {color}
> {color:#00}7.7.1  2,400   {color}
> {color:#00}7.7.1 O  3,500    {color}
> {color:#00}8.3.1 2,000    {color}
> {color:#00}8.3.1 O  2,600    {color}
> {color:#00}The tests I’ve been running just index 20M docs into a single 
> core, then run the exact same 10,000 queries against them from jmeter with 24 
> threads. Spot checks showed no hits on the queryResultCache.{color}
> {color:#00}A query looks like this: {color}
> {color:#00}rows=0&\{!boost b=recip(ms(NOW, 
> INSERT_FIELD_HERE),3.16e-11,1,1)}text_txt:(campaigners OR adjourned OR 
> anyplace…97 more random words){color}
> {color:#00}There is no faceting. No grouping. No sorting.{color}
> {color:#00}I fill in INSERT_FIELD_HERE through jmeter magic. I’m running 
> the exact same queries for every test.{color}
> {color:#00}One wildcard is that I did regenerate the index for each major 
> revision, and the chose random words from the same list of words, as well as 
> random times (bounded in the same range though) so the docs are not 
> completely identical. The index was in the native format for that major 
> version even if slightly different between versions. I ran the test once, 
> then ran it again after optimizing the index.{color}
> {color:#00}I haven’t dug any farther, if anyone’s interested I can throw 
> a profiler at, say, 8.3 and see what I can see, although I’m not going to 
> have time to dive into this any time soon. I’d be glad to run some tests 
> though. I saved the queries and the indexes so running a test would  only 
> take a few minutes.{color}
> {color:#00}While I concentrated on date fields, the docs have date, int, 
> and long fields, both docValues=true and docValues=false, each variant with 
> multiValued=true and multiValued=false and both Trie and Point (where 
> possible) variants as well as a pretty simple text field.{color}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional comman

[jira] [Updated] (SOLR-14137) Boosting by date (and perhaps others) shows a steady decline 6.6->8.3

2019-12-21 Thread Erick Erickson (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson updated SOLR-14137:
--
Attachment: Screen Shot 2019-12-19 at 2.35.41 PM.png
Status: Open  (was: Open)

Joel's screenshot

> Boosting by date (and perhaps others) shows a steady decline 6.6->8.3
> -
>
> Key: SOLR-14137
> URL: https://issues.apache.org/jira/browse/SOLR-14137
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Erick Erickson
>Priority: Major
> Attachments: Screen Shot 2019-12-19 at 2.35.41 PM.png
>
>
> Moving a user's list discussion over here.
> {color:#00}The very short form is that from Solr 6.6.1 to Solr 8.3.1, the 
> throughput for date boosting in my tests dropped by 40+%{color}
> {color:#00}I’ve been hearing about slowdowns in successive Solr releases 
> with boost functions, so I dug into it a bit. The test setup is just a 
> boost-by-date with an additional big OR clause of 100 random words so I’d be 
> sure to hit a bunch of docs. I figured that if there were few hits, the 
> signal would be lost in the noise, but I didn’t look at the actual hit 
> counts.{color}
> {color:#00}I saw several Solr JIRAs about this subject, but they were 
> slightly different, although quite possibly the same underlying issue. So I 
> tried to get this down to a very specific form of a query.{color}
> {color:#00}I’ve also seen some cases in the wild where the response was 
> proportional to the number of segments, thus my optimize experiments.{color}
> {color:#00}Here are the results, explanation below. O stands for 
> optimized to one segment. I spot checked pdate against 7x and 8x and they 
> weren’t significantly different performance wise from tdate. All have 
> docValues enabled. I ran these against a multiValued=“false” field. All the 
> tests pegged all my CPUs. Jmeter is being run on a different machine than 
> Solr. Only one Solr was running for any test.{color}
> {color:#00}Solr version   queries/min   {color}
> {color:#00}6.6.1  3,400  {color}
> {color:#00}6.6.1 O   4,800  {color}
> {color:#00}7.1 2,800   {color}
> {color:#00}7.1 O 4,200   {color}
> {color:#00}7.7.1  2,400   {color}
> {color:#00}7.7.1 O  3,500    {color}
> {color:#00}8.3.1 2,000    {color}
> {color:#00}8.3.1 O  2,600    {color}
> {color:#00}The tests I’ve been running just index 20M docs into a single 
> core, then run the exact same 10,000 queries against them from jmeter with 24 
> threads. Spot checks showed no hits on the queryResultCache.{color}
> {color:#00}A query looks like this: {color}
> {color:#00}rows=0&\{!boost b=recip(ms(NOW, 
> INSERT_FIELD_HERE),3.16e-11,1,1)}text_txt:(campaigners OR adjourned OR 
> anyplace…97 more random words){color}
> {color:#00}There is no faceting. No grouping. No sorting.{color}
> {color:#00}I fill in INSERT_FIELD_HERE through jmeter magic. I’m running 
> the exact same queries for every test.{color}
> {color:#00}One wildcard is that I did regenerate the index for each major 
> revision, and the chose random words from the same list of words, as well as 
> random times (bounded in the same range though) so the docs are not 
> completely identical. The index was in the native format for that major 
> version even if slightly different between versions. I ran the test once, 
> then ran it again after optimizing the index.{color}
> {color:#00}I haven’t dug any farther, if anyone’s interested I can throw 
> a profiler at, say, 8.3 and see what I can see, although I’m not going to 
> have time to dive into this any time soon. I’d be glad to run some tests 
> though. I saved the queries and the indexes so running a test would  only 
> take a few minutes.{color}
> {color:#00}While I concentrated on date fields, the docs have date, int, 
> and long fields, both docValues=true and docValues=false, each variant with 
> multiValued=true and multiValued=false and both Trie and Point (where 
> possible) variants as well as a pretty simple text field.{color}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14137) Boosting by date (and perhaps others) shows a steady decline 6.6->8.3

2019-12-21 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001742#comment-17001742
 ] 

Erick Erickson commented on SOLR-14137:
---

Likely the same underlying cause?

> Boosting by date (and perhaps others) shows a steady decline 6.6->8.3
> -
>
> Key: SOLR-14137
> URL: https://issues.apache.org/jira/browse/SOLR-14137
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Erick Erickson
>Priority: Major
> Attachments: Screen Shot 2019-12-19 at 2.35.41 PM.png
>
>
> Moving a user's list discussion over here.
> {color:#00}The very short form is that from Solr 6.6.1 to Solr 8.3.1, the 
> throughput for date boosting in my tests dropped by 40+%{color}
> {color:#00}I’ve been hearing about slowdowns in successive Solr releases 
> with boost functions, so I dug into it a bit. The test setup is just a 
> boost-by-date with an additional big OR clause of 100 random words so I’d be 
> sure to hit a bunch of docs. I figured that if there were few hits, the 
> signal would be lost in the noise, but I didn’t look at the actual hit 
> counts.{color}
> {color:#00}I saw several Solr JIRAs about this subject, but they were 
> slightly different, although quite possibly the same underlying issue. So I 
> tried to get this down to a very specific form of a query.{color}
> {color:#00}I’ve also seen some cases in the wild where the response was 
> proportional to the number of segments, thus my optimize experiments.{color}
> {color:#00}Here are the results, explanation below. O stands for 
> optimized to one segment. I spot checked pdate against 7x and 8x and they 
> weren’t significantly different performance wise from tdate. All have 
> docValues enabled. I ran these against a multiValued=“false” field. All the 
> tests pegged all my CPUs. Jmeter is being run on a different machine than 
> Solr. Only one Solr was running for any test.{color}
> {color:#00}Solr version   queries/min   {color}
> {color:#00}6.6.1  3,400  {color}
> {color:#00}6.6.1 O   4,800  {color}
> {color:#00}7.1 2,800   {color}
> {color:#00}7.1 O 4,200   {color}
> {color:#00}7.7.1  2,400   {color}
> {color:#00}7.7.1 O  3,500    {color}
> {color:#00}8.3.1 2,000    {color}
> {color:#00}8.3.1 O  2,600    {color}
> {color:#00}The tests I’ve been running just index 20M docs into a single 
> core, then run the exact same 10,000 queries against them from jmeter with 24 
> threads. Spot checks showed no hits on the queryResultCache.{color}
> {color:#00}A query looks like this: {color}
> {color:#00}rows=0&\{!boost b=recip(ms(NOW, 
> INSERT_FIELD_HERE),3.16e-11,1,1)}text_txt:(campaigners OR adjourned OR 
> anyplace…97 more random words){color}
> {color:#00}There is no faceting. No grouping. No sorting.{color}
> {color:#00}I fill in INSERT_FIELD_HERE through jmeter magic. I’m running 
> the exact same queries for every test.{color}
> {color:#00}One wildcard is that I did regenerate the index for each major 
> revision, and the chose random words from the same list of words, as well as 
> random times (bounded in the same range though) so the docs are not 
> completely identical. The index was in the native format for that major 
> version even if slightly different between versions. I ran the test once, 
> then ran it again after optimizing the index.{color}
> {color:#00}I haven’t dug any farther, if anyone’s interested I can throw 
> a profiler at, say, 8.3 and see what I can see, although I’m not going to 
> have time to dive into this any time soon. I’d be glad to run some tests 
> though. I saved the queries and the indexes so running a test would  only 
> take a few minutes.{color}
> {color:#00}While I concentrated on date fields, the docs have date, int, 
> and long fields, both docValues=true and docValues=false, each variant with 
> multiValued=true and multiValued=false and both Trie and Point (where 
> possible) variants as well as a pretty simple text field.{color}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-14137) Boosting by date (and perhaps others) shows a steady decline 6.6->8.3

2019-12-21 Thread Erick Erickson (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson updated SOLR-14137:
--
Attachment: Screen Shot 2019-12-19 at 3.09.37 PM.png
Screen Shot 2019-12-19 at 3.31.16 PM.png
Status: Open  (was: Open)

Here's a couple of screenshots of a bit of profiling I did from 6.6 and 8.3. 
The problem here is that they claim the time is spent in DisiPriorityQueue in 
both, which hasn't changed. So the difference must be higher up in the 
process

I'll dig a bit more sometime over the holidays.

> Boosting by date (and perhaps others) shows a steady decline 6.6->8.3
> -
>
> Key: SOLR-14137
> URL: https://issues.apache.org/jira/browse/SOLR-14137
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Erick Erickson
>Priority: Major
> Attachments: Screen Shot 2019-12-19 at 2.35.41 PM.png, Screen Shot 
> 2019-12-19 at 3.09.37 PM.png, Screen Shot 2019-12-19 at 3.31.16 PM.png
>
>
> Moving a user's list discussion over here.
> {color:#00}The very short form is that from Solr 6.6.1 to Solr 8.3.1, the 
> throughput for date boosting in my tests dropped by 40+%{color}
> {color:#00}I’ve been hearing about slowdowns in successive Solr releases 
> with boost functions, so I dug into it a bit. The test setup is just a 
> boost-by-date with an additional big OR clause of 100 random words so I’d be 
> sure to hit a bunch of docs. I figured that if there were few hits, the 
> signal would be lost in the noise, but I didn’t look at the actual hit 
> counts.{color}
> {color:#00}I saw several Solr JIRAs about this subject, but they were 
> slightly different, although quite possibly the same underlying issue. So I 
> tried to get this down to a very specific form of a query.{color}
> {color:#00}I’ve also seen some cases in the wild where the response was 
> proportional to the number of segments, thus my optimize experiments.{color}
> {color:#00}Here are the results, explanation below. O stands for 
> optimized to one segment. I spot checked pdate against 7x and 8x and they 
> weren’t significantly different performance wise from tdate. All have 
> docValues enabled. I ran these against a multiValued=“false” field. All the 
> tests pegged all my CPUs. Jmeter is being run on a different machine than 
> Solr. Only one Solr was running for any test.{color}
> {color:#00}Solr version   queries/min   {color}
> {color:#00}6.6.1  3,400  {color}
> {color:#00}6.6.1 O   4,800  {color}
> {color:#00}7.1 2,800   {color}
> {color:#00}7.1 O 4,200   {color}
> {color:#00}7.7.1  2,400   {color}
> {color:#00}7.7.1 O  3,500    {color}
> {color:#00}8.3.1 2,000    {color}
> {color:#00}8.3.1 O  2,600    {color}
> {color:#00}The tests I’ve been running just index 20M docs into a single 
> core, then run the exact same 10,000 queries against them from jmeter with 24 
> threads. Spot checks showed no hits on the queryResultCache.{color}
> {color:#00}A query looks like this: {color}
> {color:#00}rows=0&\{!boost b=recip(ms(NOW, 
> INSERT_FIELD_HERE),3.16e-11,1,1)}text_txt:(campaigners OR adjourned OR 
> anyplace…97 more random words){color}
> {color:#00}There is no faceting. No grouping. No sorting.{color}
> {color:#00}I fill in INSERT_FIELD_HERE through jmeter magic. I’m running 
> the exact same queries for every test.{color}
> {color:#00}One wildcard is that I did regenerate the index for each major 
> revision, and the chose random words from the same list of words, as well as 
> random times (bounded in the same range though) so the docs are not 
> completely identical. The index was in the native format for that major 
> version even if slightly different between versions. I ran the test once, 
> then ran it again after optimizing the index.{color}
> {color:#00}I haven’t dug any farther, if anyone’s interested I can throw 
> a profiler at, say, 8.3 and see what I can see, although I’m not going to 
> have time to dive into this any time soon. I’d be glad to run some tests 
> though. I saved the queries and the indexes so running a test would  only 
> take a few minutes.{color}
> {color:#00}While I concentrated on date fields, the docs have date, int, 
> and long fields, both docValues=true and docValues=false, each variant with 
> multiValued=true and multiValued=false and both Trie and Point (where 
> possible) variants as well as a pretty simple text field.{color}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-

[jira] [Commented] (SOLR-13709) Race condition on core reload while core is still loading?

2019-12-21 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001746#comment-17001746
 ] 

Erick Erickson commented on SOLR-13709:
---

[~hossman] any thoughts?

I've finally gotten back to this and think I'm getting close. The short form is 
that I think the test calls delete collection before prior operations are 
complete.

During a successful run, I see:
 * a bunch of reloads
 * a bunch of unloads

- The reloads seem OK, they're being run from 
{code}SolrCore.getConfListener{code}. 

- The unload is coming when the collection is being deleted near the end of the 
test.

AFAIK, the only way the CoreDescriptor could be disappearing form the various 
lists is through unload and that's only being called by deleteCollection in 
this test.

So what my next best guess is is that the reloads aren't complete by the time 
the test gets to the delete collection call and that's where the race condition 
is coming from.

I'm putting some more debugging, I want to see all thread dumps when the NPE 
occurs. I'm wondering if the sequencing in from the Overseer is part of the 
problem, but that's a guess.

> Race condition on core reload while core is still loading?
> --
>
> Key: SOLR-13709
> URL: https://issues.apache.org/jira/browse/SOLR-13709
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Chris M. Hostetter
>Assignee: Erick Erickson
>Priority: Major
> Attachments: apache_Lucene-Solr-Tests-8.x_449.log.txt
>
>
> A recent jenkins failure from {{TestSolrCLIRunExample}} seems to suggest that 
> there may be a race condition when attempting to re-load a SolrCore while the 
> core is currently in the process of (re)loading that can leave the SolrCore 
> in an unusable state.
> Details to follow...



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-13985) bind to localhost by default

2019-12-21 Thread Jason Gerlowski (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-13985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Gerlowski updated SOLR-13985:
---
Attachment: SOLR-13985.patch
Status: Open  (was: Open)

Uploading a patch to address some of the feedback given so far.  This mostly 
fixes some duplication in the docs and some bugs in the Windows batch script.

bq. [re: docs duplication] Alternatively it should be possible to tag the 
paragraphs in securing-solr in some way so that you can include them with a 
reference in taking-solr-to-production

This is the path I ended up going down.  Worked out pretty nicely.

bq. should it be instead IF DEFINED

Yep, fixed.

*On Naming* 

Jan pointed out that we could come up with a better name than 
{{SOLR_JETTY_HOST}}.  David pointed out that we have a smattering of other 
properties with at least some overlap ("host" in solr.xml, for one), and that 
we should aim for consistency in how these are set, and better documentation 
around what each does.

These are both good points, and confusing pieces of config that could benefit 
from straightening out.  I straightened out the "jetty.host" vs 
"solr.jetty.host" http/https inconsistency.  But in the interest of not letting 
the perfect get in the way of the good, I'd rather move that 
investigation/untangling for a separate jira.  I'll file a follow-up jira to 
re-examine the names and documentation around these host/port config values, 
and try to make progress on it over the holidays.  I'm not sure I have enough 
time to fully untangle these additional settings and put together fixes, but 
I'll try to put together at least a good writeup so someone else can tackle it 
if I'm not able to.

*Moving Forward*
I just finished testing this.  It looks good on both Windows and Linux.  Any 
last qualms or votes against merging this?  It's a pretty high impact change 
for users, but it addresses a real security need and the documentation makes it 
pretty clear how to change the default when necessary.  I'll aim to commit 
after the holidays if there's no additional concerns/feedback.  I might send 
out a dev mail to let others know this is happening as well.

> bind to localhost by default
> 
>
> Key: SOLR-13985
> URL: https://issues.apache.org/jira/browse/SOLR-13985
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Robert Muir
>Assignee: Jason Gerlowski
>Priority: Major
> Attachments: SOLR-13985.patch, SOLR-13985.patch, SOLR-13985.patch, 
> SOLR-13985.patch
>
>
> Currently solr binds to all interfaces by default. 
> The default should be safer, so that e.g. the user is not exposed to the 
> internet until they make an explicit step to do so.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Created] (SOLR-14138) Fix commented-out RequestLog in jetty.xml to use non-deprecated class

2019-12-21 Thread Robert Muir (Jira)

Robert Muir created SOLR-14138:
--

 Summary: Fix commented-out RequestLog in jetty.xml to use 
non-deprecated class
 Key: SOLR-14138
 URL: https://issues.apache.org/jira/browse/SOLR-14138
 Project: Solr
  Issue Type: Improvement
  Security Level: Public (Default Security Level. Issues are Public)
Reporter: Robert Muir


Currently the jetty request logging is disabled (commented out). 

But it can be useful, e.g. since it uses a standard logging format and there 
are tools to analyze it by default. Also it can be used to detect some attacks 
not otherwise logged anywhere else, since they don't make it to solr servlet: 
requests blocked at the jetty level (invalid/malformed requests, ones filtered 
by jetty IP filtering, etc).

We should switch it from the deprecated NCSARequestLog class, instead to use 
the CustomRequestLog with either NCSA_FORMAT or EXTENDED_NCSA_FORMAT.

{quote}
Deprecated.
use CustomRequestLog given format string CustomRequestLog.EXTENDED_NCSA_FORMAT 
with a RequestLogWriter
{quote}




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Resolved] (LUCENE-7635) Kuromoji fails if user dictionary contains #

2019-12-21 Thread Michael Sokolov (Jira)



 [ 
https://issues.apache.org/jira/browse/LUCENE-7635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Sokolov resolved LUCENE-7635.
-
Resolution: Duplicate

This same issue came up again recently in LUCENE-8596; we'll address it there 
since this patch can no longer be applied, but I'll mention the OP in the commit

> Kuromoji fails if user dictionary contains #
> 
>
> Key: LUCENE-7635
> URL: https://issues.apache.org/jira/browse/LUCENE-7635
> Project: Lucene - Core
>  Issue Type: Bug
>Reporter: Masaru Hasegawa
>Priority: Major
> Attachments: LUCENE-7635.patch
>
>
> If user dictionary contains entries like:
> {code}
> withsharp#,withsharp#,withsharp#,カスタム名詞
> {code}
> It fails to create dictionary throwing 
> java.lang.ArrayIndexOutOfBoundsException.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-8596) The replacement of comments is a bug, in "UserDictionary.java"

2019-12-21 Thread Michael Sokolov (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001766#comment-17001766
 ] 

Michael Sokolov commented on LUCENE-8596:
-

See also LUCENE-7635

> The replacement of comments is a bug, in "UserDictionary.java"
> --
>
> Key: LUCENE-8596
> URL: https://issues.apache.org/jira/browse/LUCENE-8596
> Project: Lucene - Core
>  Issue Type: Bug
>  Components: modules/analysis
>Reporter: miyaharas
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> [https://github.com/apache/lucene-solr/blob/1d85cd783863f75cea133fb9c452302214165a4d/lucene/analysis/kuromoji/src/java/org/apache/lucene/analysis/ja/dict/UserDictionary.java#L68]
>  
> hi
> I think that this is bug.
> I think the following is correct
> {code:java}
> line = line.replaceAll ("^ #. * $", "");  
> {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] mkhludnev commented on a change in pull request #1034: SOLR-13863: payload query function now handles string encoded payload field (delimited_payloads_string)

2019-12-21 Thread GitBox

mkhludnev commented on a change in pull request #1034: SOLR-13863: payload 
query function now handles string encoded payload field 
(delimited_payloads_string)
URL: https://github.com/apache/lucene-solr/pull/1034#discussion_r360661332
 
 

 ##
 File path: lucene/core/src/java/org/apache/lucene/index/PostingsEnum.java
 ##
 @@ -67,6 +67,56 @@ public static boolean featureRequested(int flags, short 
feature) {
   protected PostingsEnum() {
   }
 
+  /**
+   * Dummy PostingsEnum object.
+   */
+  public static PostingsEnum DUMMY_INSTANCE = new PostingsEnum() {
 
 Review comment:
   Please look at org.apache.lucene.search.DocIdSetIterator.empty() 
   There's subtle exhausting logic which needs to be supported and you can 
follow naming as well.  


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] mkhludnev commented on a change in pull request #1034: SOLR-13863: payload query function now handles string encoded payload field (delimited_payloads_string)

2019-12-21 Thread GitBox

mkhludnev commented on a change in pull request #1034: SOLR-13863: payload 
query function now handles string encoded payload field 
(delimited_payloads_string)
URL: https://github.com/apache/lucene-solr/pull/1034#discussion_r360661472
 
 

 ##
 File path: solr/core/src/java/org/apache/solr/search/PayloadValueSource.java
 ##
 @@ -0,0 +1,66 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.solr.search;
+
+import org.apache.lucene.queries.function.ValueSource;
+import org.apache.lucene.util.BytesRef;
+
+public abstract class PayloadValueSource extends ValueSource {
+  protected final String field;
+  protected final String val;
+  protected final String indexedField;
+  protected final BytesRef indexedBytes;
+  protected final ValueSource defaultValueSource;
+
+  public PayloadValueSource(String field, String val, String indexedField, 
BytesRef indexedBytes, ValueSource defaultValueSource) {
+this.field = field;
+this.val = val;
+this.indexedField = indexedField;
+this.indexedBytes = indexedBytes;
+this.defaultValueSource = defaultValueSource;
+  }
+
+//  public abstract SortField getSortField(boolean reverse);
+
+  // TODO: should this be formalized at the ValueSource level?  Seems to be 
the convention
+  public abstract String name();
+
+  @Override
+  public abstract String description();
+
+  @Override
+  public boolean equals(Object o) {
+if (this == o) return true;
+if (o == null || getClass() != o.getClass()) return false;
+
+PayloadValueSource that = (PayloadValueSource) o;
+
+if (!indexedField.equals(that.indexedField)) return false;
+if (indexedBytes != null ? !indexedBytes.equals(that.indexedBytes) : 
that.indexedBytes != null) return false;
+return defaultValueSource.equals(that.defaultValueSource);
+
+  }
+
+  @Override
+  public int hashCode() {
 
 Review comment:
   How it will be supported by descendants? 


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] mkhludnev commented on a change in pull request #1034: SOLR-13863: payload query function now handles string encoded payload field (delimited_payloads_string)

2019-12-21 Thread GitBox

mkhludnev commented on a change in pull request #1034: SOLR-13863: payload 
query function now handles string encoded payload field 
(delimited_payloads_string)
URL: https://github.com/apache/lucene-solr/pull/1034#discussion_r360662836
 
 

 ##
 File path: solr/core/src/java/org/apache/solr/search/ValueSourceParser.java
 ##
 @@ -722,13 +722,29 @@ public ValueSource parse(FunctionQParser fp) throws 
SyntaxError {
 
 TInfo tinfo = parseTerm(fp); // would have made this parser a new 
separate class and registered it, but this handy method is private :/
 
-ValueSource defaultValueSource;
-if (fp.hasMoreArguments()) {
-  defaultValueSource = fp.parseValueSource();
-} else {
-  defaultValueSource = new ConstValueSource(0.0f);
+IndexSchema schema = fp.getReq().getCore().getLatestSchema();
+final FieldType fieldType = schema.getFieldType(tinfo.field);
+
+final String payloadEncoder = 
PayloadUtils.getPayloadEncoder(fieldType);
+if (payloadEncoder.equals("identity")) {
+
+  ValueSource defaultValueSource = (fp.hasMoreArguments()) ? 
fp.parseValueSource() : new LiteralValueSource("");
+
+  if (fp.hasMoreArguments()) {
+  // functions are not supported with strings
+  throw new SolrException(SolrException.ErrorCode.BAD_REQUEST, 
"Invalid payload function: " + fp.parseArg());
 
 Review comment:
   Please explain why it;s invalid in exception message. It seems like in/mas 
isn't supported so far.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] mkhludnev commented on a change in pull request #1034: SOLR-13863: payload query function now handles string encoded payload field (delimited_payloads_string)

2019-12-21 Thread GitBox

mkhludnev commented on a change in pull request #1034: SOLR-13863: payload 
query function now handles string encoded payload field 
(delimited_payloads_string)
URL: https://github.com/apache/lucene-solr/pull/1034#discussion_r360661577
 
 

 ##
 File path: 
solr/core/src/java/org/apache/solr/search/StringPayloadValueSource.java
 ##
 @@ -0,0 +1,271 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.solr.search;
+
+import java.io.IOException;
+import java.util.Map;
+
+import org.apache.lucene.index.LeafReaderContext;
+import org.apache.lucene.index.PostingsEnum;
+import org.apache.lucene.index.Terms;
+import org.apache.lucene.index.TermsEnum;
+import org.apache.lucene.queries.function.FunctionValues;
+import org.apache.lucene.queries.function.ValueSource;
+import org.apache.lucene.queries.function.docvalues.StrDocValues;
+import org.apache.lucene.search.DocIdSetIterator;
+import org.apache.lucene.search.FieldComparator;
+import org.apache.lucene.search.FieldComparatorSource;
+import org.apache.lucene.search.IndexSearcher;
+import org.apache.lucene.search.SimpleFieldComparator;
+import org.apache.lucene.search.SortField;
+import org.apache.lucene.util.BytesRef;
+
+public class StringPayloadValueSource extends PayloadValueSource {
+
+  public StringPayloadValueSource(String field, String val, String 
indexedField, BytesRef indexedBytes, ValueSource defaultValueSource) {
+super(field, val, indexedField, indexedBytes, defaultValueSource);
+  }
+
+  public SortField getSortField(boolean reverse) {
+return new StringPayloadValueSourceSortField(reverse);
+  }
+
+  @Override
+  public FunctionValues getValues(Map context, LeafReaderContext 
readerContext) throws IOException {
+
+final Terms terms = readerContext.reader().terms(indexedField);
+
+FunctionValues defaultValues = defaultValueSource.getValues(context, 
readerContext);
+
+// copied the bulk of this from TFValueSource - TODO: this is a very 
repeated pattern - base-class this advance logic stuff?
+return new StrDocValues(this) {
+  PostingsEnum docs;
+  int atDoc;
+  int lastDocRequested = -1;
+  String docValue = null;
+
+  {
 
 Review comment:
   @eribeiro , to invoke reset() on instance construction.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] mkhludnev commented on a change in pull request #1034: SOLR-13863: payload query function now handles string encoded payload field (delimited_payloads_string)

2019-12-21 Thread GitBox

mkhludnev commented on a change in pull request #1034: SOLR-13863: payload 
query function now handles string encoded payload field 
(delimited_payloads_string)
URL: https://github.com/apache/lucene-solr/pull/1034#discussion_r360662950
 
 

 ##
 File path: 
solr/core/src/test/org/apache/solr/search/function/TestFunctionQuery.java
 ##
 @@ -569,6 +570,57 @@ public void testPayloadFunction() {
 assertQ(req("fl","*,score","q", "{!func}payload(vals_dpf,A)", 
CommonParams.DEBUG, "true"), "//float[@name='score']='1.0'");
   }
 
+  @Test
+  public void testStringPayloadFunction() {
+clearIndex();
+
+SolrInputDocument doc = new SolrInputDocument();
+doc.setField("id", "1");
+doc.setField("vals_dpss", new String[]{"A|USD", "C|EUR", "SORT|A"});
+assertU(adoc(doc));
+
+doc = new SolrInputDocument();
+doc.setField("id", "2");
+doc.setField("vals_dpss", new String[]{"A|usd", "C|eur", "SORT|B"});
+assertU(adoc(doc));
+
+doc = new SolrInputDocument();
+doc.setField("id", "3");
+doc.setField("vals_dpss", new String[]{"A|Usd", "C|Eur", "SORT|C"});
+assertU(adoc(doc));
+
+assertU(commit());
+
+assertQ(req("q", "id:1", "fl", "*,score,field_test:payload(vals_dpss,A)"), 
"//str[@name='field_test']='USD'");
+assertQ(req("q", "id:2", "fl", "*,score,field_test:payload(vals_dpss,C)"), 
"//str[@name='field_test']='eur'");
+assertQ(req("q", "*:*",
+   "rows", "1",
+   "sort", "payload(vals_dpss,C) asc",
+   "fl", "*,score,field_test:payload(vals_dpss,C)"), 
"//str[@name='field_test']='EUR'");
 
 Review comment:
   I don't see how sort order is asserted.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14138) Fix commented-out RequestLog in jetty.xml to use non-deprecated class

2019-12-21 Thread Jira



[ 
https://issues.apache.org/jira/browse/SOLR-14138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001774#comment-17001774
 ] 

Jan Høydahl commented on SOLR-14138:


+1
I recently missed some Jetty logging when debugging SSL handshake problems and 
having jetty logs around would help.

Is there a way we can control enabling jetty request log through an env var 
instead of having to edit xml?

> Fix commented-out RequestLog in jetty.xml to use non-deprecated class
> -
>
> Key: SOLR-14138
> URL: https://issues.apache.org/jira/browse/SOLR-14138
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Robert Muir
>Priority: Major
>
> Currently the jetty request logging is disabled (commented out). 
> But it can be useful, e.g. since it uses a standard logging format and there 
> are tools to analyze it by default. Also it can be used to detect some 
> attacks not otherwise logged anywhere else, since they don't make it to solr 
> servlet: requests blocked at the jetty level (invalid/malformed requests, 
> ones filtered by jetty IP filtering, etc).
> We should switch it from the deprecated NCSARequestLog class, instead to use 
> the CustomRequestLog with either NCSA_FORMAT or EXTENDED_NCSA_FORMAT.
> {quote}
> Deprecated.
> use CustomRequestLog given format string 
> CustomRequestLog.EXTENDED_NCSA_FORMAT with a RequestLogWriter
> {quote}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] asfgit merged pull request #520: LUCENE-8596: The replacement of comments is a bug, in "UserDictionary.java"

2019-12-21 Thread GitBox

asfgit merged pull request #520: LUCENE-8596: The replacement of comments is a 
bug, in "UserDictionary.java"
URL: https://github.com/apache/lucene-solr/pull/520
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-8596) The replacement of comments is a bug, in "UserDictionary.java"

2019-12-21 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001775#comment-17001775
 ] 

ASF subversion and git services commented on LUCENE-8596:
-

Commit 93309e9728210b6db413361ae4d83382408b3627 in lucene-solr's branch 
refs/heads/master from Michael Sokolov
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=93309e9 ]

LUCENE-8596: Treat hash mark as comment only at beginning of line in kuromoji
  user dictionary. Via Masaru Hasegawa and Satoshi Kato


> The replacement of comments is a bug, in "UserDictionary.java"
> --
>
> Key: LUCENE-8596
> URL: https://issues.apache.org/jira/browse/LUCENE-8596
> Project: Lucene - Core
>  Issue Type: Bug
>  Components: modules/analysis
>Reporter: miyaharas
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> [https://github.com/apache/lucene-solr/blob/1d85cd783863f75cea133fb9c452302214165a4d/lucene/analysis/kuromoji/src/java/org/apache/lucene/analysis/ja/dict/UserDictionary.java#L68]
>  
> hi
> I think that this is bug.
> I think the following is correct
> {code:java}
> line = line.replaceAll ("^ #. * $", "");  
> {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14138) Fix commented-out RequestLog in jetty.xml to use non-deprecated class

2019-12-21 Thread Robert Muir (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001776#comment-17001776
 ] 

Robert Muir commented on SOLR-14138:


i can investigate it. i would like to make it easier since these logs contain 
ip address and are easy to process. Maybe its not easy to conditionalize with 
the current start.jar approach, but maybe there is a simple way. At the least 
lets get off the deprecated class.

> Fix commented-out RequestLog in jetty.xml to use non-deprecated class
> -
>
> Key: SOLR-14138
> URL: https://issues.apache.org/jira/browse/SOLR-14138
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Robert Muir
>Priority: Major
>
> Currently the jetty request logging is disabled (commented out). 
> But it can be useful, e.g. since it uses a standard logging format and there 
> are tools to analyze it by default. Also it can be used to detect some 
> attacks not otherwise logged anywhere else, since they don't make it to solr 
> servlet: requests blocked at the jetty level (invalid/malformed requests, 
> ones filtered by jetty IP filtering, etc).
> We should switch it from the deprecated NCSARequestLog class, instead to use 
> the CustomRequestLog with either NCSA_FORMAT or EXTENDED_NCSA_FORMAT.
> {quote}
> Deprecated.
> use CustomRequestLog given format string 
> CustomRequestLog.EXTENDED_NCSA_FORMAT with a RequestLogWriter
> {quote}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13808) Query DSL should let to cache filter

2019-12-21 Thread Mikhail Khludnev (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001777#comment-17001777
 ] 

Mikhail Khludnev commented on SOLR-13808:
-

ok. I think I'll push it next week.

> Query DSL should let to cache filter
> 
>
> Key: SOLR-13808
> URL: https://issues.apache.org/jira/browse/SOLR-13808
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mikhail Khludnev
>Priority: Major
> Attachments: SOLR-13808.patch
>
>
> Query DSL let to express Lucene BQ's filter
>  
> {code:java}
> { query: {bool: { filter: {term: {f:name,query:"foo bar"}}} }}{code}
> However, it might easily catch the need in caching it in filter cache. This 
> might rely on ExtensibleQuery and QParser: 
> {code:java}
> { query: {bool: { filter: {term: {f:name,query:"foo bar", cache:true}}} }}
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Assigned] (SOLR-13808) Query DSL should let to cache filter

2019-12-21 Thread Mikhail Khludnev (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-13808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Khludnev reassigned SOLR-13808:
---

Assignee: Mikhail Khludnev

> Query DSL should let to cache filter
> 
>
> Key: SOLR-13808
> URL: https://issues.apache.org/jira/browse/SOLR-13808
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Mikhail Khludnev
>Assignee: Mikhail Khludnev
>Priority: Major
> Attachments: SOLR-13808.patch
>
>
> Query DSL let to express Lucene BQ's filter
>  
> {code:java}
> { query: {bool: { filter: {term: {f:name,query:"foo bar"}}} }}{code}
> However, it might easily catch the need in caching it in filter cache. This 
> might rely on ExtensibleQuery and QParser: 
> {code:java}
> { query: {bool: { filter: {term: {f:name,query:"foo bar", cache:true}}} }}
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14125) plugins must be hot loadable

2019-12-21 Thread Noble Paul (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001788#comment-17001788
 ] 

Noble Paul commented on SOLR-14125:
---

[~jbernste] your comments are welcome

 

>  plugins must be hot loadable
> --
>
> Key: SOLR-14125
> URL: https://issues.apache.org/jira/browse/SOLR-14125
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Noble Paul
>Priority: Major
>  Labels: packagemanager
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Today, the class is loaded at the core load time an a stong reference is 
> held. Package loading need it to be lazily loaded



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14138) Fix commented-out RequestLog in jetty.xml to use non-deprecated class

2019-12-21 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001792#comment-17001792
 ] 

ASF subversion and git services commented on SOLR-14138:


Commit b2db7a282b1bf0f9e3c1a8ad5440d98032d8111f in lucene-solr's branch 
refs/heads/jira/SOLR-14138 from Robert Muir
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=b2db7a2 ]

SOLR-14138: enable request log via environ var


> Fix commented-out RequestLog in jetty.xml to use non-deprecated class
> -
>
> Key: SOLR-14138
> URL: https://issues.apache.org/jira/browse/SOLR-14138
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Robert Muir
>Priority: Major
>
> Currently the jetty request logging is disabled (commented out). 
> But it can be useful, e.g. since it uses a standard logging format and there 
> are tools to analyze it by default. Also it can be used to detect some 
> attacks not otherwise logged anywhere else, since they don't make it to solr 
> servlet: requests blocked at the jetty level (invalid/malformed requests, 
> ones filtered by jetty IP filtering, etc).
> We should switch it from the deprecated NCSARequestLog class, instead to use 
> the CustomRequestLog with either NCSA_FORMAT or EXTENDED_NCSA_FORMAT.
> {quote}
> Deprecated.
> use CustomRequestLog given format string 
> CustomRequestLog.EXTENDED_NCSA_FORMAT with a RequestLogWriter
> {quote}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] rmuir opened a new pull request #1110: SOLR-14138: enable request log via environ var

2019-12-21 Thread GitBox

rmuir opened a new pull request #1110: SOLR-14138: enable request log via 
environ var
URL: https://github.com/apache/lucene-solr/pull/1110
 
 
   Today you have to edit XML to enable the jetty request log. This makes it 
work via environment variable.
   
   TODO:
   * get this thing off the non-deprecated classes.
   * plumb SOLR_LOGS_DIR to the config so it respects that (today it uses 
hardcoded `logs/`


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14138) Fix commented-out RequestLog in jetty.xml to use non-deprecated class

2019-12-21 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001797#comment-17001797
 ] 

ASF subversion and git services commented on SOLR-14138:


Commit c3731a7c085172d87109df6a3866419d17d47eac in lucene-solr's branch 
refs/heads/jira/SOLR-14138 from Robert Muir
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=c3731a7 ]

SOLR-14138: remove deprecated class and respect SOLR_LOGS_DIR


> Fix commented-out RequestLog in jetty.xml to use non-deprecated class
> -
>
> Key: SOLR-14138
> URL: https://issues.apache.org/jira/browse/SOLR-14138
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Robert Muir
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Currently the jetty request logging is disabled (commented out). 
> But it can be useful, e.g. since it uses a standard logging format and there 
> are tools to analyze it by default. Also it can be used to detect some 
> attacks not otherwise logged anywhere else, since they don't make it to solr 
> servlet: requests blocked at the jetty level (invalid/malformed requests, 
> ones filtered by jetty IP filtering, etc).
> We should switch it from the deprecated NCSARequestLog class, instead to use 
> the CustomRequestLog with either NCSA_FORMAT or EXTENDED_NCSA_FORMAT.
> {quote}
> Deprecated.
> use CustomRequestLog given format string 
> CustomRequestLog.EXTENDED_NCSA_FORMAT with a RequestLogWriter
> {quote}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] rmuir commented on issue #1110: SOLR-14138: enable request log via environ var

2019-12-21 Thread GitBox

rmuir commented on issue #1110: SOLR-14138: enable request log via environ var
URL: https://github.com/apache/lucene-solr/pull/1110#issuecomment-568220195
 
 
   ok, i fixed it to respect SOLR_LOGS_DIR as well, and moved it to the new 
non-deprecated way. I think its in much better shape.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Resolved] (LUCENE-8596) The replacement of comments is a bug, in "UserDictionary.java"

2019-12-21 Thread Michael Sokolov (Jira)



 [ 
https://issues.apache.org/jira/browse/LUCENE-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Sokolov resolved LUCENE-8596.
-
Fix Version/s: trunk
   Resolution: Fixed

This fix won't be backported to 8.x branch since it changes comment processing 
in kuromoji user dictionaries and could cause breakage

> The replacement of comments is a bug, in "UserDictionary.java"
> --
>
> Key: LUCENE-8596
> URL: https://issues.apache.org/jira/browse/LUCENE-8596
> Project: Lucene - Core
>  Issue Type: Bug
>  Components: modules/analysis
>Reporter: miyaharas
>Priority: Major
> Fix For: trunk
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> [https://github.com/apache/lucene-solr/blob/1d85cd783863f75cea133fb9c452302214165a4d/lucene/analysis/kuromoji/src/java/org/apache/lucene/analysis/ja/dict/UserDictionary.java#L68]
>  
> hi
> I think that this is bug.
> I think the following is correct
> {code:java}
> line = line.replaceAll ("^ #. * $", "");  
> {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] noblepaul commented on issue #1108: SOLR-14125 : Streaming expressions to be loadable from packages

2019-12-21 Thread GitBox

noblepaul commented on issue #1108: SOLR-14125 : Streaming expressions to be 
loadable from packages
URL: https://github.com/apache/lucene-solr/pull/1108#issuecomment-568223643
 
 
   It's in a good shape. reviews are welcome


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14125) plugins must be hot loadable

2019-12-21 Thread Noble Paul (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001805#comment-17001805
 ] 

Noble Paul commented on SOLR-14125:
---

I've added testcases. I shall merge this soon

>  plugins must be hot loadable
> --
>
> Key: SOLR-14125
> URL: https://issues.apache.org/jira/browse/SOLR-14125
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Noble Paul
>Priority: Major
>  Labels: packagemanager
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Today, the class is loaded at the core load time an a stong reference is 
> held. Package loading need it to be lazily loaded



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13101) Shared storage support in SolrCloud

2019-12-21 Thread Noble Paul (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001807#comment-17001807
 ] 

Noble Paul commented on SOLR-13101:
---

Is there  any reason why wee can't release this as a third party plugin using 
the new package manager instead of pushing this to Solr codebase?

 

> Shared storage support in SolrCloud
> ---
>
> Key: SOLR-13101
> URL: https://issues.apache.org/jira/browse/SOLR-13101
> Project: Solr
>  Issue Type: New Feature
>  Components: SolrCloud
>Reporter: Yonik Seeley
>Priority: Major
>  Time Spent: 7h 50m
>  Remaining Estimate: 0h
>
> Solr should have first-class support for shared storage (blob/object stores 
> like S3, google cloud storage, etc. and shared filesystems like HDFS, NFS, 
> etc).
> The key component will likely be a new replica type for shared storage.  It 
> would have many of the benefits of the current "pull" replicas (not indexing 
> on all replicas, all shards identical with no shards getting out-of-sync, 
> etc), but would have additional benefits:
>  - Any shard could become leader (the blob store always has the index)
>  - Better elasticity scaling down
>- durability not linked to number of replcias.. a single replica could be 
> common for write workloads
>- could drop to 0 replicas for a shard when not needed (blob store always 
> has index)
>  - Allow for higher performance write workloads by skipping the transaction 
> log
>- don't pay for what you don't need
>- a commit will be necessary to flush to stable storage (blob store)
>  - A lot of the complexity and failure modes go away
> An additional component a Directory implementation that will work well with 
> blob stores.  We probably want one that treats local disk as a cache since 
> the latency to remote storage is so large.  I think there are still some 
> "locking" issues to be solved here (ensuring that more than one writer to the 
> same index won't corrupt it).  This should probably be pulled out into a 
> different JIRA issue.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-7879) Null pointer exception when

2019-12-21 Thread Munendra S N (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-7879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17001828#comment-17001828
 ] 

Munendra S N commented on SOLR-7879:


The syntax is wrong here. Correct syntax is shared below. custom sort should be 
computed as part of subfacet and then used for sorting. NPEs are fixes in 
SOLR-13022

{code:java}
{
  "uniq_viewerNum": "unique(viewer_ID)",
  "powerPointName": {
"type": "terms",
"field": "powerPointName",
"limit": 1,
"sort": {
  "lastUpdatedTime": "desc"
},
"facet": {
  "lastUpdatedTime": "max(powerPoint_lastUpdatedTime)"
}
  }
}
{code}
Closing this

> Null pointer exception when 
> 
>
> Key: SOLR-7879
> URL: https://issues.apache.org/jira/browse/SOLR-7879
> Project: Solr
>  Issue Type: Bug
>  Components: Facet Module, search
>Affects Versions: 5.2.1
> Environment: Oracle jdk1.8
>Reporter: Gary Yang
>Priority: Major
>  Labels: faceting, jsonapi, subfacet
>
> This can be reproduce by certain inputs, but I can't find a pattern in the 
> inputs that cause this error:
> Caused by: 
> org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error 
> from server at http://192.168.1.47:8983/solr/core1: 
> java.lang.NullPointerException
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorFCBase.findTopSlots(FacetField.java:301)
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorFCBase.getFieldCacheCounts(FacetField.java:254)
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorFCBase.process(FacetField.java:218)
>   at 
> org.apache.solr.search.facet.FacetProcessor.processSubs(FacetRequest.java:267)
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorNumeric.calcFacets(FacetFieldProcessorNumeric.java:472)
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorNumeric.process(FacetFieldProcessorNumeric.java:151)
>   at 
> org.apache.solr.search.facet.FacetProcessor.processSubs(FacetRequest.java:267)
>   at 
> org.apache.solr.search.facet.FacetProcessor.fillBucket(FacetRequest.java:354)
>   at 
> org.apache.solr.search.facet.FacetQueryProcessor.process(FacetQuery.java:57)
>   at org.apache.solr.search.facet.FacetModule.process(FacetModule.java:87)
>   at 
> org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:255)
>   at 
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:143)
>   at org.apache.solr.core.SolrCore.execute(SolrCore.java:2064)
>   at org.apache.solr.servlet.HttpSolrCall.execute(HttpSolrCall.java:654)
>   at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:450)
>   at 
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:227)
>   at 
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:196)
>   at 
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1652)
>   at 
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)
>   at 
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
>   at 
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:577)
>   at 
> org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:223)
>   at 
> org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1127)
>   at 
> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)
>   at 
> org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)
>   at 
> org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1061)
>   at 
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
>   at 
> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:215)
>   at 
> org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:110)
>   at 
> org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)
>   at org.eclipse.jetty.server.Server.handle(Server.java:497)
>   at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:310)
>   at 
> org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:257)
>   at 
> org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)
>   at 
> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:635)
>   at 
> org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:555)
>   at java.lang.Thread.run(Thread.java:745)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-

[jira] [Resolved] (SOLR-7879) Null pointer exception when

2019-12-21 Thread Munendra S N (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-7879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Munendra S N resolved SOLR-7879.

Resolution: Fixed

> Null pointer exception when 
> 
>
> Key: SOLR-7879
> URL: https://issues.apache.org/jira/browse/SOLR-7879
> Project: Solr
>  Issue Type: Bug
>  Components: Facet Module, search
>Affects Versions: 5.2.1
> Environment: Oracle jdk1.8
>Reporter: Gary Yang
>Priority: Major
>  Labels: faceting, jsonapi, subfacet
>
> This can be reproduce by certain inputs, but I can't find a pattern in the 
> inputs that cause this error:
> Caused by: 
> org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error 
> from server at http://192.168.1.47:8983/solr/core1: 
> java.lang.NullPointerException
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorFCBase.findTopSlots(FacetField.java:301)
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorFCBase.getFieldCacheCounts(FacetField.java:254)
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorFCBase.process(FacetField.java:218)
>   at 
> org.apache.solr.search.facet.FacetProcessor.processSubs(FacetRequest.java:267)
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorNumeric.calcFacets(FacetFieldProcessorNumeric.java:472)
>   at 
> org.apache.solr.search.facet.FacetFieldProcessorNumeric.process(FacetFieldProcessorNumeric.java:151)
>   at 
> org.apache.solr.search.facet.FacetProcessor.processSubs(FacetRequest.java:267)
>   at 
> org.apache.solr.search.facet.FacetProcessor.fillBucket(FacetRequest.java:354)
>   at 
> org.apache.solr.search.facet.FacetQueryProcessor.process(FacetQuery.java:57)
>   at org.apache.solr.search.facet.FacetModule.process(FacetModule.java:87)
>   at 
> org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:255)
>   at 
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:143)
>   at org.apache.solr.core.SolrCore.execute(SolrCore.java:2064)
>   at org.apache.solr.servlet.HttpSolrCall.execute(HttpSolrCall.java:654)
>   at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:450)
>   at 
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:227)
>   at 
> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:196)
>   at 
> org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1652)
>   at 
> org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)
>   at 
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
>   at 
> org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:577)
>   at 
> org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:223)
>   at 
> org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1127)
>   at 
> org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)
>   at 
> org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)
>   at 
> org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1061)
>   at 
> org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
>   at 
> org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:215)
>   at 
> org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:110)
>   at 
> org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)
>   at org.eclipse.jetty.server.Server.handle(Server.java:497)
>   at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:310)
>   at 
> org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:257)
>   at 
> org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)
>   at 
> org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:635)
>   at 
> org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:555)
>   at java.lang.Thread.run(Thread.java:745)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-13195) NPE caused by distributed queries when no shards are configured

2019-12-21 Thread Munendra S N (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-13195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Munendra S N updated SOLR-13195:

Component/s: (was: Facet Module)
 FacetComponent

> NPE caused by distributed queries when no shards are configured
> ---
>
> Key: SOLR-13195
> URL: https://issues.apache.org/jira/browse/SOLR-13195
> Project: Solr
>  Issue Type: Bug
>  Components: FacetComponent
>Affects Versions: master (9.0)
> Environment: h1. Steps to reproduce
> * Use a Linux machine.
> *  Build commit {{ea2c8ba}} of Solr as described in the section below.
> * Build the films collection as described below.
> * Start the server using the command {{./bin/solr start -f -p 8983 -s 
> /tmp/home}}
> * Request the URL given in the bug description.
> h1. Compiling the server
> {noformat}
> git clone https://github.com/apache/lucene-solr
> cd lucene-solr
> git checkout ea2c8ba
> ant compile
> cd solr
> ant server
> {noformat}
> h1. Building the collection
> We followed [Exercise 
> 2|http://lucene.apache.org/solr/guide/7_5/solr-tutorial.html#exercise-2] from 
> the [Solr 
> Tutorial|http://lucene.apache.org/solr/guide/7_5/solr-tutorial.html]. The 
> attached file ({{home.zip}}) gives the contents of folder {{/tmp/home}} that 
> you will obtain by following the steps below:
> {noformat}
> mkdir -p /tmp/home
> echo '' > 
> /tmp/home/solr.xml
> {noformat}
> In one terminal start a Solr instance in foreground:
> {noformat}
> ./bin/solr start -f -p 8983 -s /tmp/home
> {noformat}
> In another terminal, create a collection of movies, with no shards and no 
> replication, and initialize it:
> {noformat}
> bin/solr create -c films
> curl -X POST -H 'Content-type:application/json' --data-binary '{"add-field": 
> {"name":"name", "type":"text_general", "multiValued":false, "stored":true}}' 
> http://localhost:8983/solr/films/schema
> curl -X POST -H 'Content-type:application/json' --data-binary 
> '{"add-copy-field" : {"source":"*","dest":"_text_"}}' 
> http://localhost:8983/solr/films/schema
> ./bin/post -c films example/films/films.json
> {noformat}
>Reporter: Johannes Kloos
>Priority: Major
>  Labels: diffblue, newdev
>
> Requesting the following URL causes Solr to return an HTTP 500 error response:
> {noformat}
> http://localhost:8983/solr/films/select?distrib=true&facet=on&facet.field=genre
> {noformat}
> The error response seems to be caused by the following uncaught exception:
> {noformat}
> null:java.lang.NullPointerException
> at 
> org.apache.solr.handler.component.FacetComponent$DistribFieldFacet.(FacetComponent.java:1432)
> at 
> org.apache.solr.handler.component.FacetComponent$FacetInfo.parse(FacetComponent.java:1266)
> at 
> org.apache.solr.handler.component.FacetComponent.modifyRequest(FacetComponent.java:528)
> at 
> org.apache.solr.handler.component.ResponseBuilder.addRequest(ResponseBuilder.java:162)
> at 
> org.apache.solr.handler.component.QueryComponent.createMainQuery(QueryComponent.java:755)
> at 
> org.apache.solr.handler.component.QueryComponent.regularDistributedProcess(QueryComponent.java:546)
> at 
> org.apache.solr.handler.component.QueryComponent.distributedProcess(QueryComponent.java:499)
> at 
> org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:348)
> {noformat}
> Since the query contains “distrib=true”, it is handled by calling the 
> distributedProcess methods of the search pipeline. At no point in the call 
> chain that leads to the above exception is there any check if the 
> ResponseBuilder rb used to handle the request actually has its shared 
> property configured. Nevertheless, the DistribFieldFact constructor tries to 
> access rb.shards.length, but rb.shared is null in this example.
> We found this bug using [Diffblue Microservices 
> Testing|https://www.diffblue.com/labs/?utm_source=solr-br]. Find more 
> information on this [fuzz testing 
> campaign|https://www.diffblue.com/blog/2018/12/19/diffblue-microservice-testing-a-sneak-peek-at-our-early-product-and-results?utm_source=solr-br].



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

58 matches

Mail list logo