> On 20.11.2018, at 17:43, Kyrill Tkachov <kyrylo.tkac...@foss.arm.com> wrote: > > On 20/11/18 16:20, Christoph Müllner wrote: >>> On 20.11.2018, at 17:14, Kyrill Tkachov <kyrylo.tkac...@foss.arm.com> wrote: >>> >>> Hi Christoph, >>> >>> On 20/11/18 15:22, Christoph Muellner wrote: >>>> Tested with "make check" and no regressions found. >>>> >>>> *** gcc/ChangeLog *** >>>> >>>> 2018-xx-xx Christoph Muellner <christoph.muell...@theobroma-system.com> >>>> >>>> * config/aarch64/aarch64-cores.def: Define emag. >>>> * config/aarch64/aarch64-tune.md: Regenerated with emag. >>>> * config/aarch64/aarch64.c (emag_tunings): New struct. >>>> * doc/invoke.texi: Document mtune value. >>>> >>>> Signed-off-by: Christoph Muellner >>>> <christoph.muell...@theobroma-systems.com> >>>> --- >>>> gcc/config/aarch64/aarch64-cores.def | 3 +++ >>>> gcc/config/aarch64/aarch64-tune.md | 2 +- >>>> gcc/config/aarch64/aarch64.c | 25 +++++++++++++++++++++++++ >>>> gcc/doc/invoke.texi | 2 +- >>>> 4 files changed, 30 insertions(+), 2 deletions(-) >>>> >>>> diff --git a/gcc/config/aarch64/aarch64-cores.def >>>> b/gcc/config/aarch64/aarch64-cores.def >>>> index 1f3ac56..8eee97f 100644 >>>> --- a/gcc/config/aarch64/aarch64-cores.def >>>> +++ b/gcc/config/aarch64/aarch64-cores.def >>>> @@ -61,6 +61,9 @@ AARCH64_CORE("thunderxt88", thunderxt88, thunderx, >>>> 8A, AARCH64_FL_FOR_ARCH >>>> AARCH64_CORE("thunderxt81", thunderxt81, thunderx, 8A, >>>> AARCH64_FL_FOR_ARCH8 | AARCH64_FL_CRC | AARCH64_FL_CRYPTO, thunderx, >>>> 0x43, 0x0a2, -1) >>>> AARCH64_CORE("thunderxt83", thunderxt83, thunderx, 8A, >>>> AARCH64_FL_FOR_ARCH8 | AARCH64_FL_CRC | AARCH64_FL_CRYPTO, thunderx, >>>> 0x43, 0x0a3, -1) >>>> +/* Ampere Computing cores. */ >>>> +AARCH64_CORE("emag", emag, xgene1, 8A, >>>> AARCH64_FL_FOR_ARCH8, emag, 0x50, 0x000, 3) >>>> + >>> According to your previous reply, the 0x3 should be in the "PART" field, >>> that is >>> ..., 0x50, 0x3, -1) >> Should have been "variant field" in the email. >> The v2 patch is correct and tested: > > I see, that does look correct then. > >> processor : 0 >> BogoMIPS : 100.00 >> Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 cpuid > > This line says that the CPU supports the "crc" and "crypto" option extensions > by default so > the 5th field should include AARCH64_FL_CRC | AARCH64_FL_CRYPTO so that the > user gets them > by default when they use -mcpu=emag.
Good hint! Thanks, Christoph > > Thanks, this is really helpful. > Kyrill > >> CPU implementer : 0x50 >> CPU architecture: 8 >> CPU variant : 0x3 >> CPU part : 0x000 >> CPU revision : 1 >> >> gcc -mcpu=native -Q --help=target >> -mcpu= emag+crypto+crc+aes+sha2+profile >> >> Thanks, >> Christoph >> >> >> >>> Thanks, >>> Kyrill >>> >>>> /* APM ('P') cores. */ >>>> AARCH64_CORE("xgene1", xgene1, xgene1, 8A, >>>> AARCH64_FL_FOR_ARCH8, xgene1, 0x50, 0x000, -1) >>>> diff --git a/gcc/config/aarch64/aarch64-tune.md >>>> b/gcc/config/aarch64/aarch64-tune.md >>>> index fade1d4..2fc7f03 100644 >>>> --- a/gcc/config/aarch64/aarch64-tune.md >>>> +++ b/gcc/config/aarch64/aarch64-tune.md >>>> @@ -1,5 +1,5 @@ >>>> ;; -*- buffer-read-only: t -*- >>>> ;; Generated automatically by gentune.sh from aarch64-cores.def >>>> (define_attr "tune" >>>> - >>>> "cortexa35,cortexa53,cortexa57,cortexa72,cortexa73,thunderx,thunderxt88p1,thunderxt88,thunderxt81,thunderxt83,xgene1,falkor,qdf24xx,exynosm1,phecda,thunderx2t99p1,vulcan,thunderx2t99,cortexa55,cortexa75,cortexa76,ares,tsv110,saphira,cortexa57cortexa53,cortexa72cortexa53,cortexa73cortexa35,cortexa73cortexa53,cortexa75cortexa55,cortexa76cortexa55" >>>> + >>>> "cortexa35,cortexa53,cortexa57,cortexa72,cortexa73,thunderx,thunderxt88p1,thunderxt88,thunderxt81,thunderxt83,emag,xgene1,falkor,qdf24xx,exynosm1,phecda,thunderx2t99p1,vulcan,thunderx2t99,cortexa55,cortexa75,cortexa76,ares,tsv110,saphira,cortexa57cortexa53,cortexa72cortexa53,cortexa73cortexa35,cortexa73cortexa53,cortexa75cortexa55,cortexa76cortexa55" >>>> (const (symbol_ref "((enum attr_tune) aarch64_tune)"))) >>>> diff --git a/gcc/config/aarch64/aarch64.c b/gcc/config/aarch64/aarch64.c >>>> index f7f88a9..995aafe 100644 >>>> --- a/gcc/config/aarch64/aarch64.c >>>> +++ b/gcc/config/aarch64/aarch64.c >>>> @@ -957,6 +957,31 @@ static const struct tune_params xgene1_tunings = >>>> &xgene1_prefetch_tune >>>> }; >>>> +static const struct tune_params emag_tunings = >>>> +{ >>>> + &xgene1_extra_costs, >>>> + &xgene1_addrcost_table, >>>> + &xgene1_regmove_cost, >>>> + &xgene1_vector_cost, >>>> + &generic_branch_cost, >>>> + &xgene1_approx_modes, >>>> + 6, /* memmov_cost */ >>>> + 4, /* issue_rate */ >>>> + AARCH64_FUSE_NOTHING, /* fusible_ops */ >>>> + "16", /* function_align. */ >>>> + "16", /* jump_align. */ >>>> + "16", /* loop_align. */ >>>> + 2, /* int_reassoc_width. */ >>>> + 4, /* fp_reassoc_width. */ >>>> + 1, /* vec_reassoc_width. */ >>>> + 2, /* min_div_recip_mul_sf. */ >>>> + 2, /* min_div_recip_mul_df. */ >>>> + 17, /* max_case_values. */ >>>> + tune_params::AUTOPREFETCHER_OFF, /* autoprefetcher_model. */ >>>> + (AARCH64_EXTRA_TUNE_NO_LDP_STP_QREGS), /* tune_flags. */ >>>> + &xgene1_prefetch_tune >>>> +}; >>>> + >>>> static const struct tune_params qdf24xx_tunings = >>>> { >>>> &qdf24xx_extra_costs, >>>> diff --git a/gcc/doc/invoke.texi b/gcc/doc/invoke.texi >>>> index e016dce..ac81fb2 100644 >>>> --- a/gcc/doc/invoke.texi >>>> +++ b/gcc/doc/invoke.texi >>>> @@ -15288,7 +15288,7 @@ Specify the name of the target processor for which >>>> GCC should tune the >>>> performance of the code. Permissible values for this option are: >>>> @samp{generic}, @samp{cortex-a35}, @samp{cortex-a53}, @samp{cortex-a55}, >>>> @samp{cortex-a57}, @samp{cortex-a72}, @samp{cortex-a73}, >>>> @samp{cortex-a75}, >>>> -@samp{cortex-a76}, @samp{ares}, @samp{exynos-m1}, @samp{falkor}, >>>> +@samp{cortex-a76}, @samp{ares}, @samp{exynos-m1}, @samp{emag}, >>>> @samp{falkor}, >>>> @samp{qdf24xx}, @samp{saphira}, @samp{phecda}, @samp{xgene1}, >>>> @samp{vulcan}, >>>> @samp{thunderx}, @samp{thunderxt88}, @samp{thunderxt88p1}, >>>> @samp{thunderxt81}, >>>> @samp{tsv110}, @samp{thunderxt83}, @samp{thunderx2t99},