Request to improve docs: Fortran integer ranges vs. machine representation
Hi, I stumbled upon a Fortran quirk regarding the ranges for integers. The standard stipulates that the ranges be symmetric even though as we know two's complement signed integers have asymmetrical ranges, e.g. int8 kind can vary between -128 and 127. This is sure to cause confusion as for example the following program does not compile: ``` program main use iso_fortran_env implicit none print *, -128_int8 end program ``` GFortran 13.2.0 provides the error message Error: Integer too big for its kind at (1). This check can be disabled with the option '-fno-range-check' While a workaround to disable the range check is provided, the error message is a bit misleading. For one, the number is too small for the int8 number model. For the second, it is not obvious to the lay why removing the kind literal leads to a clean compile. Codewise the topic has been discussed in PRs 13490 and 17912, and I think the current implementation is correct. What could be improved are the pages https://gcc.gnu.org/bugs/ and the above error message. I would like to propose that the error message should hint more towards the root cause. Something along the lines of: 'Out of range integer for its selected kind at (1). This check can be disabled with the option '-fno-range-check'. Secondly, I think this quirk could be a good candidate to the known bugs section of your website. I tried to file a bug / craft a PR but access to your repos is a bit limited at this point, so I decided to write this message as my 2 cents. Anyway, thanks for your good work! I can also volunteer to help you with this or some other tasks. Haven't done it before but would be interesting to try it out. -- Best regards / Ystävällisesti Elias Toivanen elias.a.toiva...@gmail.com
Re: Request to improve docs: Fortran integer ranges vs. machine representation
On Fri, Jan 26, 2024 at 03:44:26PM +0200, Elias Toivanen wrote: > > I stumbled upon a Fortran quirk regarding the ranges for integers. > The standard stipulates that the ranges be symmetric even though > as we know two's complement signed integers have asymmetrical > ranges, e.g. int8 kind can vary between -128 and 127. This is not quite true. The model number for integer is symmetric, but the machine representable does not need to be symmetric. > > This is sure to cause confusion as for example the following > program does not compile: > > ``` > program main > use iso_fortran_env > implicit none > print *, -128_int8 This is a unary minus operator with an operand of 128. gfortran does not have negative integers. > end program > ``` > > GFortran 13.2.0 provides the error message > > Error: Integer too big for its kind at (1). This check can > be disabled with the option '-fno-range-check' which gives you this error because 128 is greater 127. It is a little known fact, but you can force a symmetric range with the -pedantic option. -- Steve
Re: Request to improve docs: Fortran integer ranges vs. machine representation
>> print *, -128_int8 > > This is a unary minus operator with an operand of 128. > gfortran does not have negative integers. To be clear, it’s not gfortran, it’s the standard(s). It’s not exactly true either, because there is a "signed-int-literal-constant” which is allowed in some contexts: - as real part or imaginary part of a complex literal constant - as a data-stmt-constant - in some edit descriptors (DT and P control-edit-desc) This is, for example, legal code (and handled by gfortran as expected): use iso_fortran_env implicit none integer(int8) :: j data j /-128_int8/ print *, j end On the other hand, in your code, -128_int8 cannot be a literal constant, because integer literal constants cannot have a sign in Fortran. As Steve said, “-128_int8” in this context in Fortran is the minus unary operator applied to the integer literal constant 128_int8. We current output this: 4 | j = -128_int8 | 1 Error: Integer too big for its kind at (1). This check can be disabled with the option '-fno-range-check’ I think the only better output we could give is to make it clear what the integer value is, either by underlining it (but we don’t have a framework to do that right now) or by outputting the value in the message (?). Best, FX
Re: Request to improve docs: Fortran integer ranges vs. machine representation
On Fri, Jan 26, 2024 at 09:23:56PM +0100, FX Coudert wrote: > > This is, for example, legal code (and handled by gfortran as expected): > > use iso_fortran_env > implicit none > integer(int8) :: j > data j /-128_int8/ > print *, j > end > Interesting example. % gfcx -o z a.f90 && ./z -128 % gfcx -o z -pedantic a.f90 && ./z a.f90:5:20: 5 | data j /-128_int8/ |1 Error: Integer too big for its kind at (1). This check can be disabled with the option ‘-fno-range-check’ % gfcx -o z -pedantic -fno-range-check a.f90 && ./z a.f90:5:20: 5 | data j /-128_int8/ |1 Error: Integer too big for its kind at (1). This check can be disabled with the option ‘-fno-range-check’ :-) -- Steve
Re: Request to improve docs: Fortran integer ranges vs. machine representation
> Interesting example. > > % gfcx -o z a.f90 && ./z > -128 > % gfcx -o z -pedantic a.f90 && ./z > a.f90:5:20: > >5 | data j /-128_int8/ > |1 > Error: Integer too big for its kind at (1). This check can be disabled with > the option ‘-fno-range-check’ That qualifies as a compiler bug, I think. Our documentation for -pedantic states: “Issue warnings for uses of extensions to Fortran.” and "Valid Fortran programs should compile properly with or without this option.” The same is true of the following, which is also valid Fortran since 95 : use iso_fortran_env implicit none complex, parameter :: z = (-128_int8, -128_int8) print *, z end Right now it fails to compile with -pedantic. Or are they illegal because of how the range should be be symmetric? I can’t quite find the language in the standard for that, actually. To me, they’re valid signed-int-literal-constant. FX PS: I’m going to ignore the cases of the P and DT edit descriptors, because they’re not allowed to have a kind value, and therefore the corner cases occur for values too big to be actually relevant to anything.
Re: Request to improve docs: Fortran integer ranges vs. machine representation
On Fri, Jan 26, 2024 at 11:38:01PM +0100, FX Coudert wrote: > > Interesting example. > > > > % gfcx -o z a.f90 && ./z > > -128 > > % gfcx -o z -pedantic a.f90 && ./z > > a.f90:5:20: > > > >5 | data j /-128_int8/ > > |1 > > Error: Integer too big for its kind at (1). This check can be disabled with > > the option ‘-fno-range-check’ > > That qualifies as a compiler bug, I think. Our documentation for -pedantic > states: “Issue warnings for uses of extensions to Fortran.” and "Valid > Fortran programs should compile properly with or without this option.” > > The same is true of the following, which is also valid Fortran since 95 : > > use iso_fortran_env > implicit none > complex, parameter :: z = (-128_int8, -128_int8) > print *, z > end > > Right now it fails to compile with -pedantic. > > Or are they illegal because of how the range should be be symmetric? I can’t > quite find the language in the standard for that, actually. To me, they’re > valid signed-int-literal-constant. > Symmetry comes from the definition of a model integer number in 16.3.3 "Bit sequences as arguments to INT and REAL". If 'i' and 'j' are INTEGER(1), then this leads to something like 'j = abs(i)' being comforming except if 'i = -128'. As far as literal constants, one has from Fortran 2023 p. 66 R707 signed-int-literal-constant is [ sign ] int-literal-constant R708 int-literal-constant is digit-string [ _ kind-param ] R709 kind-param is digit-string R710 signed-digit-string is [ sign ] digit-string R711 digit-string is digit [ digit ] ... R712 sign is + or - I cannot find in the Standard any statement about the conversion of a digit-string to an internal representation. I suspect it depends on the path through the compiler. In assignments, the signed-digit-string is likely handled as a unary operator and operand. -- Steve