[
https://issues.apache.org/jira/browse/TINKERPOP-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15191021#comment-15191021
]
ASF GitHub Bot commented on TINKERPOP-1217:
-------------------------------------------
GitHub user okram opened a pull request:
https://github.com/apache/incubator-tinkerpop/pull/258
TINKERPOP-1217: Repeated Logging of "The HadoopPools has not been
initialized, using the default pool"
https://issues.apache.org/jira/browse/TINKERPOP-1217
This is a simple fix. However, the more troubling question for me (and what
could be another ticket) is why you (@rspitzer) are getting that warning at
all. I don't know where in Spark the pool isn't being initialized, and I
haven't seen it in my experiments with Friendster, etc.
Perhaps we can discuss on the mailing list where in Spark this WARN is
popping up for you.
For this issue VOTE +1.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/apache/incubator-tinkerpop TINKERPOP-1217
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-tinkerpop/pull/258.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #258
----
commit 035dc36f4c0f077963949b5222650ee7b4497533
Author: Marko A. Rodriguez <[email protected]>
Date: 2016-03-11T14:43:43Z
If the HadoopPool hasn't been initialized, then initialize it with a default
configuration. This ensures that this WARN happens once and only once. The more
troubling issue is why the HadoopPool isn't getting initialized for @rspitzer
in Spark, but that is another ticket.
----
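For readers following along, the idea in the commit message above can be
sketched roughly as follows. This is not the actual HadoopPools code: the
class WarnOncePool, its String "pool", and the WARN_COUNT counter are
hypothetical stand-ins that use only the JDK. The sketch assumes that the
accessor falls back to a default configuration the first time it finds the
pool uninitialized, so later calls skip the warning.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical stand-in for a lazily initialized pool holder (not TinkerPop code).
final class WarnOncePool {
    // Counts emitted warnings so the behavior is easy to check.
    static final AtomicInteger WARN_COUNT = new AtomicInteger();

    private static boolean initialized = false;
    private static String pool; // placeholder for the real pool object

    // Explicit initialization; a no-op if the pool already exists.
    static synchronized void initialize(String config) {
        if (!initialized) {
            pool = "pool(" + config + ")";
            initialized = true;
        }
    }

    // Accessor: warns and self-initializes on first use only.
    static synchronized String getPool() {
        if (!initialized) {
            WARN_COUNT.incrementAndGet();
            System.err.println("WARN: pool has not been initialized, using the default pool");
            initialize("default"); // assumed default configuration
        }
        return pool;
    }

    public static void main(String[] args) {
        for (int i = 0; i < 1000; i++) {
            getPool();
        }
        System.out.println("warnings emitted: " + WARN_COUNT.get());
    }
}
```

Because the fallback path sets the initialized flag itself, the warning can
only fire on the first uninitialized access per JVM, no matter how many
times the accessor is called afterwards.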
> Repeated Logging of "The HadoopPools has not been initialized, using the
> default pool"
> ---------------------------------------------------------------------------------------
>
> Key: TINKERPOP-1217
> URL: https://issues.apache.org/jira/browse/TINKERPOP-1217
> Project: TinkerPop
> Issue Type: Bug
> Components: hadoop
> Affects Versions: 3.1.1-incubating
> Reporter: Russell Alexander Spitzer
>
> When running a Spark job against a rather large database, my Spark log fills
> with the following log line repeatedly:
> {code}WARN 2016-03-10 15:58:20,123 HadoopPools.java:55 -
> org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph: The HadoopPools
> has not been initialized, using the default pool{code}
> This amounted to about 5 GB of logging per Spark Executor over the course of
> 90 minutes.