liyunzhang_intel created PIG-5192:
-------------------------------------

             Summary: Remove schema tuple reference overhead for replicate join 
hashmap in POFRJoinSpark
                 Key: PIG-5192
                 URL: https://issues.apache.org/jira/browse/PIG-5192
             Project: Pig
          Issue Type: Sub-task
            Reporter: liyunzhang_intel


Currently even if pig.schematuple is set to false which is the default, the 
usage of TupleToMapKey and TuplesToSchemaTupleList instead of plain 
HashMap<Object, ArrayList<Tuple>> costs a lot of memory. Also key is currently 
converted to a tuple which is unnecessary. Detail see PIG-4874




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to