Hi Stephen, On Wed, Feb 13, 2019 at 01:55:34PM -0800, Stephen Hemminger wrote: > On Wed, 13 Feb 2019 22:17:16 +0100 > Stefano Brivio <sbri...@redhat.com> wrote: > > > On Wed, 13 Feb 2019 09:31:03 -0800 > > Eric Dumazet <eric.duma...@gmail.com> wrote: > > > > > On 02/13/2019 12:37 AM, Stefano Brivio wrote: > > > > On Tue, 12 Feb 2019 16:42:04 -0800 > > > > Eric Dumazet <eric.duma...@gmail.com> wrote: > > > > > > > >> I do not get it. > > > >> > > > >> "ss -emoi " uses almost 1KB per socket. > > > >> > > > >> 10,000,000 sockets -> we need about 10GB of memory ??? > > > >> > > > >> This is a serious regression. > > > > > > > > I guess this is rather subjective: the worst case I considered back then > > > > was the output of 'ss -tei0' (less than 500 bytes) for one million > > > > sockets, which gives 500M of memory, which should in turn be fine on a > > > > machine handling one million sockets. > > > > > > > > Now, if 'ss -emoi' on 10 million sockets is an actual use case (out of > > > > curiosity: how are you going to process that output? Would JSON help?), > > > > I see two easy options to solve this: > > > > > > > > > ss -temoi | parser (written in shell or awk or whatever...) > > > > > > This is a use case, I just got bitten because using ss command > > > actually OOM my container, while trying to debug a busy GFE. > > > > > > The host itself can have 10,000,000 TCP sockets, but usually sysadmin > > > shells > > > run in a container with no more than 500 MB available. > > > > > > Otherwise, it would be too easy for a buggy program to OOM the whole > > > machine > > > and have angry customers. > > > > > > > > > > > 1. flush the output every time we reach a given buffer size (1M > > > > perhaps). This might make the resulting blocks slightly unaligned, > > > > with occasional loss of readability on lines occurring every 1k to > > > > 10k sockets approximately, even though after 1k sockets column sizes > > > > won't change much (it looks anyway better than the original), and I > > > > don't expect anybody to actually scroll that output > > > > > > > > 2. add a switch for unbuffered output, but then you need to remember to > > > > pass it manually, and the whole output would be as bad as the > > > > original in case you need the switch. > > > > > > > > I'd rather go with 1., it's easy to implement (we already have partial > > > > flushing with '--events') and it looks like a good compromise on > > > > usability. Thoughts? > > > > > > > > > > 1 seems fine, but a switch for 'please do not try to format' would be > > > fine. > > > > > > I wonder why we try to 'format' when stdout is a pipe or a regular file . > > > > > > > On a second thought: what about | less, or | grep [ports], > > or > readable.log? I guess those might also be rather common use cases, > > what do you think? > > > > I'm tempted to skip this for the moment and just go with option 1. > > > > What I would favor: > * use big enough columns that for the common case everything lines up > fine > * if column is to wide just print that element wider (which is what > print %Ns does)
This is pretty much the situation Stefano attempted to improve, minus scaling the columns to max terminal width. ss output formatting being quirky and unreadable with either small or large terminals was the number one reason I heard so far why people prefer netstat. > and > * add json output for programs that want to parse > * use print_uint etc for that For Eric's use-case, skipping any buffering and tabular output if stdout is not a TTY suffices. In fact, iproute2 does this already for colored output (see check_enable_color() for reference). Adding JSON output support everywhere is a nice feature when it comes to scripting, but it won't help console users. Unless you expect CLI frontends to come turning that JSON into human-readable output. IMHO, JSON output wouldn't even help in this case - unless Eric indeed prefers to write/use a JSON parser for his analysis instead of something along 'ss | grep'. > The buffering patch (in iproute2-next) can/will be reverted. It's not fair to claim that despite Stefano's commitment to fix the reported issues. His ss output rewrite is there since v4.15.0 and according to git history it needed only two fixes so far. I've had one-liners which required more follow-ups than that! Also, we're still discovering issues introduced by all the jsonify patches. Allowing for people to get things right not the first time but after a few tries is important. If you want to revert something, start with features which have a fundamental design issue in the exact situation they tried to improve, like the MSG_PEEK | MSG_TRUNC thing Hangbin and me wrote. Thanks, Phil