Re: [Mesa-dev] [PATCH 6/9] i965/fs: Fetch one cacheline of pull constants at a time.

2016-12-14 Thread Francisco Jerez
Kenneth Graunke writes:
> On Wednesday, December 14, 2016 2:18:16 PM PST Francisco Jerez wrote:
>> Francisco Jerez writes:
>>
>> > Kenneth Graunke writes:
>> >
>> >> On Friday, December 9, 2016 11:03:29 AM PST Francisco Jerez wrote:
>> >>> Asking the DC for less than one cacheline (4 owords) o

Re: [Mesa-dev] [PATCH 6/9] i965/fs: Fetch one cacheline of pull constants at a time.

2016-12-14 Thread Kenneth Graunke
On Wednesday, December 14, 2016 2:18:16 PM PST Francisco Jerez wrote:
> Francisco Jerez writes:
>
> > Kenneth Graunke writes:
> >
> >> On Friday, December 9, 2016 11:03:29 AM PST Francisco Jerez wrote:
> >>> Asking the DC for less than one cacheline (4 owords) of data for
> >>> uniform pull cons

Re: [Mesa-dev] [PATCH 6/9] i965/fs: Fetch one cacheline of pull constants at a time.

2016-12-14 Thread Francisco Jerez
Francisco Jerez writes:
> Kenneth Graunke writes:
>
>> On Friday, December 9, 2016 11:03:29 AM PST Francisco Jerez wrote:
>>> Asking the DC for less than one cacheline (4 owords) of data for
>>> uniform pull constants is suboptimal because the DC cannot request
>>> less than that from L3, result

Re: [Mesa-dev] [PATCH 6/9] i965/fs: Fetch one cacheline of pull constants at a time.

2016-12-13 Thread Francisco Jerez
Kenneth Graunke writes:
> On Friday, December 9, 2016 11:03:29 AM PST Francisco Jerez wrote:
>> Asking the DC for less than one cacheline (4 owords) of data for
>> uniform pull constants is suboptimal because the DC cannot request
>> less than that from L3, resulting in wasted bandwidth and unnec

Re: [Mesa-dev] [PATCH 6/9] i965/fs: Fetch one cacheline of pull constants at a time.

2016-12-13 Thread Kenneth Graunke
On Friday, December 9, 2016 11:03:29 AM PST Francisco Jerez wrote:
> Asking the DC for less than one cacheline (4 owords) of data for
> uniform pull constants is suboptimal because the DC cannot request
> less than that from L3, resulting in wasted bandwidth and unnecessary
> message dispatch overh

[Mesa-dev] [PATCH 6/9] i965/fs: Fetch one cacheline of pull constants at a time.

2016-12-09 Thread Francisco Jerez
Asking the DC for less than one cacheline (4 owords) of data for uniform pull constants is suboptimal because the DC cannot request less than that from L3, resulting in wasted bandwidth and unnecessary message dispatch overhead, and exacerbating the IVB L3 serialization bug. The following table su