On Mon, Sep 5, 2022 at 3:04 AM Yury Zhuravlev <[email protected]> wrote:
>
> Hello,
>
> During the setup, the fan manager https://github.com/markusressel/fan2go I
> found that my Vega56 was not working correctly. This fan manager expects what
> read PWM value should be the same as you wrote before, but it's not the case.
> PWM value was volatile, and what is more critical, if I wrote 200, after
> reading I saw ~70-100, which is very confusing.
> After that, I started reading the amdgpu driver, and how fan speed works, and
> I found what PWM value was calculated from RPM speed and not correct for my
> case (different BIOS or fan configuration?).
> Because it looked wrong, I started looking into different implementations and
> found that Vega20 used mmCG_FDO_CTRL1 and mmCG_THERMAL_STATUS registers to
> calculate the PWM value.
> I also checked how we set PWM for Vega10 and found the same registers. After
> that, I copy-pasted the function from Vega20 to Vega10, and it started
> working much better. It still has some fluctuation, but as I understand, this
> behavior is expected.
>
> I have no in-depth information about amdgpu, and the original function may
> have been for some reason (maybe for some broken BIOS?), but I suppose
> somebody forgot to backport this code after prototype implementation.
>
> It would be my first patch here. Sorry if I skipped some procedures, will be
> appreciated it if you help me.
Please send this as a proper patch with your Signed-off-by using git-send-email.
Alex
>
> Regards,
>
> ---
> diff --git a/drivers/gpu/drm/amd/pm/powerplay/hwmgr/vega10_thermal.c
> b/drivers/gpu/drm/amd/pm/powerplay/hwmgr/vega10_thermal.c
> index dad3e3741a4e..190af79f3236 100644
> --- a/drivers/gpu/drm/amd/pm/powerplay/hwmgr/vega10_thermal.c
> +++ b/drivers/gpu/drm/amd/pm/powerplay/hwmgr/vega10_thermal.c
> @@ -67,22 +67,21 @@ int vega10_fan_ctrl_get_fan_speed_info(struct pp_hwmgr
> *hwmgr,
> int vega10_fan_ctrl_get_fan_speed_pwm(struct pp_hwmgr *hwmgr,
> uint32_t *speed)
> {
> - uint32_t current_rpm;
> - uint32_t percent = 0;
> -
> - if (hwmgr->thermal_controller.fanInfo.bNoFan)
> - return 0;
> + struct amdgpu_device *adev = hwmgr->adev;
> + uint32_t duty100, duty;
> + uint64_t tmp64;
>
> - if (vega10_get_current_rpm(hwmgr, ¤t_rpm))
> - return -1;
> + duty100 = REG_GET_FIELD(RREG32_SOC15(THM, 0, mmCG_FDO_CTRL1),
> + CG_FDO_CTRL1, FMAX_DUTY100);
> + duty = REG_GET_FIELD(RREG32_SOC15(THM, 0, mmCG_THERMAL_STATUS),
> + CG_THERMAL_STATUS, FDO_PWM_DUTY);
>
> - if (hwmgr->thermal_controller.
> - advanceFanControlParameters.usMaxFanRPM != 0)
> - percent = current_rpm * 255 /
> - hwmgr->thermal_controller.
> - advanceFanControlParameters.usMaxFanRPM;
> + if (!duty100)
> + return -EINVAL;
>
> - *speed = MIN(percent, 255);
> + tmp64 = (uint64_t)duty * 255;
> + do_div(tmp64, duty100);
> + *speed = MIN((uint32_t)tmp64, 255);
>
> return 0;
> }
> --