Package: mawk
Version: 1.3.3-17+b3
Severity: normal

Dear Maintainer,

NOTE: this affects both 'mawk' and 'gawk' equally, so i'm not sure if this is 
some utterly esoteric behavior i'm just not "getting", but my expectations are 
definitely not being met.
(i also am unaware if a bug can be co-assigned to different packages)

Attempting to do a search for filesystems > 90% inode % or size % was giving 
anomolous results. (hair-pulling time)

I was using 'sub("%","",$1) to strip the % symbol off the results from:
    df --output=ipcent,pcent,target /var /var/log
e.g.:
    $ df --output=ipcent,pcent,target -t ext4  | mawk 
'NR>1{sub("%","",$1);sub("%","",$2); if (($1>30)||($2>30)) print}'
    40 79 /
    6 30 /var
    1 7 /opt
    20 78 /home
    1 89 /d1
(clearly /opt doesn't meet the reporting criteria)

I narrowed this down to what appears to be a field-size truncation comparison 
issue with the result from the sub() against the comparison numeric on the 
right-side of the relop:

# The following all work as expected: (no output)
$ for i in $(seq 20 29); do echo "$i%" | mawk '{if (int($1)>100) print}'; done
$ for i in $(seq 20 29); do echo "$i%" | gawk '{if (int($1)>100) print}'; done
$ for i in $(seq 20 29); do echo "$i%" | mawk '{if (($1+0)>100) print}'; done
$ for i in $(seq 20 29); do echo "$i%" | gawk '{if (($1+0)>100) print}'; done

This does NOT work, though it should: (none of these values should print, they 
are all < 100, which is the comparison being made)
$ for i in $(seq 20 29); do echo "$i%" | mawk '{sub("%","",$1);if ($1>100) 
printf("[%s][%d]\n",$1,$1)}'; done
[20][20]
[21][21]
[22][22]
[23][23]
[24][24]
[25][25]
[26][26]
[27][27]
[28][28]
[29][29]

The man pages clearly state that a dual-type (numeric&string) variable SHOULD 
be implicitly cast to numeric if a comparison is against a numeric.

I also did this against a single digit input and a two-digit RHS value, and it 
shows the same issue, where only the same number of digits of the RHS that the 
input value contain are being compared.
$ for i in $(seq 0 20); do echo "$i%" | gawk '{sub("%","",$1);if ($1>40) 
printf("[%s][%d]\n",$1,$1)}'; done
[5][5]
[6][6]
[7][7]
[8][8]
[9][9]

Using an RHS of a float also fails:
$ for i in $(seq 0 20); do echo "$i%" | gawk '{sub("%","",$1);if ($1>40.0) 
printf("[%s][%d]\n",$1,$1)}'; done
[5][5]
[6][6]
[7][7]
[8][8]
[9][9]

I don't know how if there's a way to evaluate the internal data-type 
representation for $1 here, so i printed it with printf both as string and 
integer, and they report identical.

Thanks,
--stephen


-- System Information:
Debian Release: 10.2
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'proposed-updates'), (500, 
'stable')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 4.19.0-7-amd64 (SMP w/16 CPU cores)
Kernel taint flags: TAINT_PROPRIETARY_MODULE, TAINT_OOT_MODULE, 
TAINT_UNSIGNED_MODULE
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE= 
(charmap=UTF-8)
Shell: /bin/sh linked to /usr/bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages mawk depends on:
ii  libc6  2.28-10

mawk recommends no packages.

mawk suggests no packages.

-- no debconf information

Reply via email to