liyunzhang_intel created PIG-5192:
-------------------------------------
Summary: Remove schema tuple reference overhead for replicate join
hashmap in POFRJoinSpark
Key: PIG-5192
URL: https://issues.apache.org/jira/browse/PIG-5192
Project: Pig
Issue Type: Sub-task
Reporter: liyunzhang_intel
Currently even if pig.schematuple is set to false which is the default, the
usage of TupleToMapKey and TuplesToSchemaTupleList instead of plain
HashMap<Object, ArrayList<Tuple>> costs a lot of memory. Also key is currently
converted to a tuple which is unnecessary. Detail see PIG-4874
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)