[ 
https://issues.apache.org/jira/browse/SOLR-13973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17190728#comment-17190728
 ] 

Alexandre Rafalovitch commented on SOLR-13973:
----------------------------------------------

Sanity check on the links and numbers (I use both Drupal 8/9 and Solr, though 
not currently together):

1) [https://www.drupal.org/project/apachesolr] is for Drupal 7 and before. 
Those people will also continue using Solr 4 or whatever was the configuration 
last updated to (2 years ago at the latest)

2) [https://www.drupal.org/project/search_api_solr] starting from v4 release 
does support Drupal 8.8+/9 (only) and Solr versions more recent than Solr 6.4. 
It was released in May 2020 and updated since. The current adoption is probably 
still fairly low, but will accelerate next year, once previous release version 
is no longer supported (notice said December). 

Drupal was never known for chasing latest Solr version as they have their own 
configuration that is designed for field definitions with wildcards and maybe 
only recently (if at all) with managed-schema API manipulation. They can also 
can keep using Solr 8 for another 5-6 years with Tika built in.

If Tika is removed from Solr (in version 9 the earliest), this will only affect 
the choices of those setting up new Drupal installation and wanting new 
features of Solr 9. At that point (say in 4 years), we can figure something out 
for Solr 11. Most likely a variation on preconfigured Solr and Tika colocated 
in a Docker container.

 

On the other hand, I honestly don't know much about solarium library directly. 
Perhaps it is a serious issue there, though we have to look again at number of 
active installations*percentage of those using /extract handler*percentage of 
people able to run _latest_ Solr process but not a second (also Java) process.

So, to me, this sounds less like a -1, then as an awareness for a bit of an 
extra education around that edge case. And, yes, awareness of the greater 
community; something we really need to pay more attention to in general.

 

Of course, any improvement of workflow we can do between Solr and Tika, both 
standalone, would be very good regardless of this particular use case.

 

> Deprecate Tika
> --------------
>
>                 Key: SOLR-13973
>                 URL: https://issues.apache.org/jira/browse/SOLR-13973
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Ishan Chattopadhyaya
>            Assignee: Ishan Chattopadhyaya
>            Priority: Blocker
>             Fix For: 8.7
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Solr's primary responsibility should be to focus on search and scalability. 
> Having to deal with the problems (CVEs) of Velocity, Tika etc. can slow us 
> down. I propose that we deprecate it going forward.
> Tika can be run outside Solr. Going forward, if someone wants to use these, 
> it should be possible to bring them into third party packages and installed 
> via package manager.
> Plan is to just to throw warnings in logs and add deprecation notes in 
> reference guide for now. Removal can be done in 9.0.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to