mattcasters opened a new issue, #5468: URL: https://github.com/apache/hop/issues/5468
### Apache Hop version? 2.14.0 ### Java version? openjdk 17.0.15 2025-04-15 LTS ### Operating system Linux ### What happened? The data samplers (First, Last, Random, Data Profile) in the Execution Data Profile plugins are not cloning rows. This can lead to rare cases where sample rows are being collected to be sent off somewhere to disk or database and subsequent transforms already changed the data type or content. The simplest case is one where you have an Epoch date as an Integer (ms since 1970/01/01) and convert it to a Date. If you enable first or last rows data profiling in an execution information you get an error like: ``` 025/07/01 14:34:59 - select-values-ei-error - Error writing cache entry to file '/tmp/executions/ebb22d6d-e5fd-4d1e-86c6-252d3993e098.json' 2025/07/01 14:34:59 - select-values-ei-error - Stream Closed 2025/07/01 14:34:59 - select-values-ei-error - 2025/07/01 14:34:59 - select-values-ei-error - 2025/07/01 14:34:59 - select-values-ei-error - 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.execution.caching.BaseCachingExecutionInfoLocation.close(BaseCachingExecutionInfoLocation.java:193) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.pipeline.engines.local.LocalPipelineEngine.stopTransformExecutionInfoTimer(LocalPipelineEngine.java:512) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.pipeline.engines.local.LocalPipelineEngine.pipelineCompleted(LocalPipelineEngine.java:460) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.pipeline.Pipeline.fireExecutionFinishedListeners(Pipeline.java:1344) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.pipeline.Pipeline.lambda$startThreads$0(Pipeline.java:1141) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.pipeline.transform.BaseTransform.fireTransformFinishedListeners(BaseTransform.java:2840) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.pipeline.transform.BaseTransform.markStop(BaseTransform.java:2833) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.pipeline.transform.RunThread.run(RunThread.java:141) 2025/07/01 14:34:59 - select-values-ei-error - at java.base/java.lang.Thread.run(Thread.java:840) 2025/07/01 14:34:59 - select-values-ei-error - Caused by: org.apache.hop.core.exception.HopException: 2025/07/01 14:34:59 - select-values-ei-error - Error writing caching file entry to disk in folder /tmp/executions 2025/07/01 14:34:59 - select-values-ei-error - 2025/07/01 14:34:59 - select-values-ei-error - Error writing cache entry to file '/tmp/executions/ebb22d6d-e5fd-4d1e-86c6-252d3993e098.json' 2025/07/01 14:34:59 - select-values-ei-error - Stream Closed 2025/07/01 14:34:59 - select-values-ei-error - 2025/07/01 14:34:59 - select-values-ei-error - 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.execution.caching.CachingFileExecutionInfoLocation.persistCacheEntry(CachingFileExecutionInfoLocation.java:99) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.execution.caching.BaseCachingExecutionInfoLocation.close(BaseCachingExecutionInfoLocation.java:189) 2025/07/01 14:34:59 - select-values-ei-error - ... 8 more 2025/07/01 14:34:59 - select-values-ei-error - Caused by: org.apache.hop.core.exception.HopException: 2025/07/01 14:34:59 - select-values-ei-error - Error writing cache entry to file '/tmp/executions/ebb22d6d-e5fd-4d1e-86c6-252d3993e098.json' 2025/07/01 14:34:59 - select-values-ei-error - Stream Closed 2025/07/01 14:34:59 - select-values-ei-error - 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.execution.caching.CacheEntry.writeToDisk(CacheEntry.java:100) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.execution.caching.CachingFileExecutionInfoLocation.persistCacheEntry(CachingFileExecutionInfoLocation.java:95) 2025/07/01 14:34:59 - select-values-ei-error - ... 9 more 2025/07/01 14:34:59 - select-values-ei-error - Caused by: java.io.IOException: Stream Closed 2025/07/01 14:34:59 - select-values-ei-error - at java.base/java.io.FileOutputStream.writeBytes(Native Method) 2025/07/01 14:34:59 - select-values-ei-error - at java.base/java.io.FileOutputStream.write(FileOutputStream.java:349) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.core.json.UTF8JsonGenerator._flushBuffer(UTF8JsonGenerator.java:2203) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.core.json.UTF8JsonGenerator._writeStringSegment2(UTF8JsonGenerator.java:1515) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.core.json.UTF8JsonGenerator._writeStringSegment(UTF8JsonGenerator.java:1462) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.core.json.UTF8JsonGenerator._writeStringSegments(UTF8JsonGenerator.java:1345) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.core.json.UTF8JsonGenerator.writeString(UTF8JsonGenerator.java:517) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.std.StringSerializer.serialize(StringSerializer.java:41) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.BeanPropertyWriter.serializeAsField(BeanPropertyWriter.java:732) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.std.BeanSerializerBase.serializeFields(BeanSerializerBase.java:772) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.BeanSerializer.serialize(BeanSerializer.java:178) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.BeanPropertyWriter.serializeAsField(BeanPropertyWriter.java:732) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.std.BeanSerializerBase.serializeFields(BeanSerializerBase.java:772) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.BeanSerializer.serialize(BeanSerializer.java:178) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.DefaultSerializerProvider._serialize(DefaultSerializerProvider.java:479) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ser.DefaultSerializerProvider.serializeValue(DefaultSerializerProvider.java:318) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ObjectMapper._writeValueAndClose(ObjectMapper.java:4719) 2025/07/01 14:34:59 - select-values-ei-error - at com.fasterxml.jackson.databind.ObjectMapper.writeValue(ObjectMapper.java:3923) 2025/07/01 14:34:59 - select-values-ei-error - at org.apache.hop.execution.caching.CacheEntry.writeToDisk(CacheEntry.java:97) 2025/07/01 14:34:59 - select-values-ei-error - ... 10 more ``` ### Issue Priority Priority: 3 ### Issue Component Component: API -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
