On 26.08.2025 17:23, Miquel Raynal wrote: > Hello Mikhail, > > On 26/08/2025 at 02:48:29 +03, Mikhail Kshevetskiy > <[email protected]> wrote: > >> The shown speed inverse linearly depends on size of data. >> See the output: >> >> spi-nand: spi_nand nand@0: Micron SPI NAND was found. >> spi-nand: spi_nand nand@0: 256 MiB, block size: 128 KiB, page size: 2048, >> OOB size: 128 >> ... >> => mtd read.benchmark spi-nand0 $loadaddr 0 0x40000 >> Reading 262144 byte(s) (128 page(s)) at offset 0x00000000 >> Read speed: 63kiB/s >> => mtd read.benchmark spi-nand0 $loadaddr 0 0x20000 >> Reading 131072 byte(s) (64 page(s)) at offset 0x00000000 >> Read speed: 127kiB/s >> => mtd read.benchmark spi-nand0 $loadaddr 0 0x10000 >> Reading 65536 byte(s) (32 page(s)) at offset 0x00000000 >> Read speed: 254kiB/s >> >> In the spi-nand case 'io_op.len' is not the same as 'len', >> thus we divide a size of the single block on total time. >> This is wrong, we should divide on the time for a single >> block. >> >> Signed-off-by: Mikhail Kshevetskiy <[email protected]> > Happy to see this is useful :-) But you're totally right, it didn't use > the correct length. Maybe I would rephrase a bit the last two sentences > to make the commit clearer: > > "In the spi-nand case 'io_op.len' is not always the same as 'len', thus > we are using the wrong amount of data to derive the speed." > > However, regarding the diff, > >> @@ -594,9 +594,10 @@ static int do_mtd_io(struct cmd_tbl *cmdtp, int flag, >> int argc, >> >> if (benchmark && bench_start) { >> bench_end = timer_get_us(); >> + block_time = (bench_end - bench_start) / (len / io_op.len); >> printf("%s speed: %lukiB/s\n", >> read ? "Read" : "Write", >> - ((io_op.len * 1000000) / (bench_end - bench_start)) / >> 1024); >> + ((io_op.len * 1000000) / block_time) / 1024); > Why not just dividing the length by the benchmark time instead of > reducing and rounding the denominator in the first place, which I > believe makes the final result less precise?
Do we use 64 bit math? If not we may easily get an overflow. Actually for 32-bit math it's better use a less precise formula: (io_op.len * (1000000/1024)) / block_time; thus we will have about 22 bit for length. > > Thanks, > Miquèl

