I think Srikant's other reply addressed this?
Matt
________________________________
From: David Fong <[email protected]>
Sent: Monday, March 7, 2022 11:12 AM
To: Poremba, Matthew <[email protected]>; David Fong via gem5-users
<[email protected]>; Bharadwaj, Srikant <[email protected]>
Cc: Bobby Bruce <[email protected]>; Matt Sinclair <[email protected]>
Subject: gem5 + APU latency numbers
Hi Matt P.,
I notice these stat numbers in the overall number for cpu3 (APU).
For 40, overall cpu3 (APU) latency numbers are reduced but shaderActiveTicks
increased.
Do these numbers make sense?
David
Modified:
gem5/build/GCN3_X86/gpu-compute/GPU.py
mem_req_latency = Param.Int(40, "Latency for request from the cu to ruby. "\
"Represents the pipeline to reach the TCP "\
"and specified in GPU clock cycles")
mem_resp_latency = Param.Int(40, "Latency for responses from ruby to the "\
"cu. Represents the pipeline between the "\
"TCP and cu as well as TCP data array "\
"access. Specified in GPU clock cycles")
m5out/stats.txt
40 (mem_req_latency, mem_resp_latency) (smaller is better)
system.cpu3.allLatencyDist::mean 458572.656250 #
delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::stdev 429452.145064 #
delay distribution for all (Unspecified)
50 (mem_req_latency, mem_resp_latency)
system.cpu3.allLatencyDist::mean 491744.531250 #
delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::stdev 439992.936927 #
delay distribution for all (Unspecified)
Latency is reduced for mean and stdev.
40 (mem_req_latency, mem_resp_latency) (smaller is better)
system.cpu3.allLatencyDist::overflows 97 1.52% 100.00% #
delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::min_value 84000 #
delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::max_value 3796000 #
delay distribution for all (Unspecified)
50 (mem_req_latency, mem_resp_latency)
system.cpu3.allLatencyDist::overflows 125 1.95% 100.00% #
delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::min_value 104000 #
delay distribution for all (Unspecified)
system.cpu3.allLatencyDist::max_value 2651000 #
delay distribution for all (Unspecified)
40 (mem_req_latency, mem_resp_latency) (larger is better ??????)
system.cpu3.shaderActiveTicks 172369999 #
Total ticks that any CU attached to this shader is active (Unspecified)
50 (mem_req_latency, mem_resp_latency)
system.cpu3.shaderActiveTicks 171038999 #
Total ticks that any CU attached to this shader is active (Unspecified)
_______________________________________________
gem5-users mailing list -- [email protected]
To unsubscribe send an email to [email protected]
%(web_page_url)slistinfo%(cgiext)s/%(_internal_name)s