Hi
I have big problem with soft bounces. Take a look for following transaction
log (I have change dest mail to fake one):
Dec 17 12:03:39 www postfix/smtp[17276]: vstream_buf_get_ready: fd 12 got 87
12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.8 220
mx.poczta.onet.pl ESMTP (8) our local time is now Wed, 17 Dec 2008 12:03:39
Dec 17 12:03:39 www postfix/smtp[17276]: >
mx.poczta.onet.pl[213.180.130.86]:25:www.dentonline.pl
Dec 17 12:03:39 www postfix/smtp[17276]: vstream_fflush_some: fd 12 flush 24
12:03:39 www postfix/smtp[17276]: vstream_buf_get_ready: fd 12 got Dec 17
12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.8
250-mx.poczta.onet.pl Hello www.dentonline.pl
12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.86]:25SIZE
52428800 Dec 17
12:03:39 www postfix/smtp[17276]: <
mx.poczta.onet.pl[213.180.130.86]:258BITMIME
Dec 17 12:03:39 www postfix/smtp[17276]: <
mx.poczta.onet.pl[213.180.130.86]:25PIPELINING
Dec 17 12:03:39 www postfix/smtp[17276]: <
mx.poczta.onet.pl[213.180.130.86]:25CHUNKING
Dec 17 12:03:39 www postfix/smtp[17276]: <
mx.poczta.onet.pl[213.180.130.86]:25ENHANCEDSTATUSCODES
Dec 17 12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.8
250-AUTH=LOGIN Dec
17 12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.8
250-AUTH LOGIN Dec
17 12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.8
250-STARTTLS Dec
17 12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.8 250
HELP
Dec 17 12:03:39 www postfix/smtp[17276]: server features: 0x100f size 5242Dec
17 12:03:39 www postfix/smtp[17276]: Using ESMTP PIPELINING, TCP send size
is 4096 Dec 17
12:03:39 www postfix/smtp[17276]: > mx.poczta.onet.pl[213.180.130.8 MAIL
FROM:<[email protected]> SIZE=3449
12:03:39 www postfix/smtp[17276]: > mx.poczta.onet.pl[213.180.130.8 RCPT
TO:<[email protected]>
12:03:39 www postfix/smtp[17276]: > mx.poczta.onet.pl[213.180.130.8 DATA
Dec 17 12:03:39 www postfix/smtp[17276]: vstream_fflush_some: fd 12 flush Dec
17 12:03:39 www postfix/smtp[17276]: vstream_buf_get_ready: fd 12 got Dec 17
12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.8 250 2.1.0
Sender syntax Ok;
12:03:39 www postfix/smtp[17276]: < mx.poczta.onet.pl[213.180.130.8 450 4.7.1
W chwili obecnej nie mozesz wyslac listu do: <[email protected]>, j za chwile
[0400.-1] Dec 17 12:03:39
www postfix/smtp[17276]: connect to subsystem private/defeDec 17 12:03:39 www
postfix/smtp[17276]: send attr nrequest = 0 Dec 17 12:03:39 www
postfix/smtp[17276]: send attr flags = 0 Dec 17 12:03:39 www
postfix/smtp[17276]: send attr queue_id = AC7D8637C3 Dec 17 12:03:39 www
postfix/smtp[17276]: send attr original_recipient = [email protected]
Dec 17 12:03:39 www postfix/smtp[17276]: send attr recipient = x...@opdec 17
12:03:39 www postfix/smtp[17276]: send attr offset = 3644 Dec 17
12:03:39 www postfix/smtp[17276]: send attr dsn_orig_rcpt = Dec 17
12:03:39 www postfix/smtp[17276]: send attr notify_flags = 0 Dec 17
12:03:39 www postfix/smtp[17276]: send attr status = 4.7.1 Dec 17
12:03:39 www postfix/smtp[17276]: send attr diag_type = smtp Dec 17
12:03:39 www postfix/smtp[17276]: send attr diag_text = 450 4.7.1 Wi obecnej
nie mozesz wyslac listu do: <[email protected]>, sprobuj za chwile -1] / At the
moment you cannot sen a message to <[email protected]>, try agair [0400.-1]
Dec 17 12:03:39 www postfix/smtp[17276]: send attr mta_type = dns Dec
17 12:03:39 www postfix/smtp[17276]: send attr mta_mname = mx.poczta.oDec 17
12:03:39 www postfix/smtp[17276]: send attr action = delayed Dec 17
12:03:39 www postfix/smtp[17276]: send attr reason = host
mx.pocztapl[213.180.130.86] said: 450 4.7.1 W chwili obecnej nie mozesz
wyslac list<[email protected]>, sprobuj za chwile [0400.-1] / At the moment you
cannot sessage to <[email protected]>, try again later [0400.-1] (in reply to RCPT
Tand)
Dec 17 12:03:39 www postfix/smtp[17276]: vstream_fflush_some: fd 13 flush
After that the smpt proces hangs on poll syscall. Because I have a lot of mail
to send, eventually all available smtp processes will hang. This
cause "active" queue to stall - almost no mail are delivered. Only way to
make some delivery mail is server restarting. Because eventually
all "deliverable" mail are gone, rest only those greylisted (and other 4xx).
In this case postifix restart give only some delivery - but I have thousands
mails wait in queue!
I have trace problem by debug build and gdb. This is a call backtrace:
#0 0xa7baf9d8 in poll () from /lib/tls/libc.so.6
#1 0x0806d01f in read_wait (fd=-4, timeout=3600) at read_wait.c:120
#2 0x08072241 in timed_read (fd=13, buf=0x8092d60, len=4096, timeout=3600,
unused_context=0x0) at timed_read.c:73
#3 0x0806f20f in vstream_buf_get_ready (bp=0x80918f0) at vstream.c:731
#4 0x0806eaf7 in vbuf_get (bp=0x80918f0) at vbuf.c:157
#5 0x08063bcb in attr_vscan0 (fp=0x80918f0, flags=3, ap=0xafb16988 "\001") at
attr_scan0.c:272
#6 0x08063f9b in attr_scan0 (fp=0x80918f0, flags=3) at attr_scan0.c:427
#7 0x08056e67 in mail_command_client (class=0x807462e "private",
name=0x8085810 "defer") at mail_command_client.c:76
#8 0x080548e9 in defer_append (flags=0, id=0x808cb60 "ABA9B63E79",
stats=0x809019c, rcpt=0x808ad78,
relay=0x808eef0 "mx.poczta.onet.pl[213.180.130.86]:25", dsn=0x808d084) at
defer.c:204
#9 0x08050385 in smtp_rcpt_fail (state=0x80823a0, rcpt=0x808ad78,
mta_name=0x808eca0 "mx.poczta.onet.pl",
resp=0x80809bc, format=0x8074de4 "host %s said: %s (in reply to %s)") at
smtp_trouble.c:396
#10 0x0804de43 in smtp_loop (state=0x80823a0, send_state=5, recv_state=3) at
smtp_proto.c:1570
#11 0x0804e87c in smtp_xfer (state=0x80823a0) at smtp_proto.c:1961
#12 0x0804c0c4 in smtp_connect_remote (state=0x80823a0,
nexthop=0x808ec28 "\200ë\b\bhÔ\b\b ě\b\bŔě\b\bŕě\b\b`\022\t\b",
def_service=0x8074868 "smtp") at smtp_connect.c:913
#13 0x0804c356 in smtp_connect (state=0x80823a0) at smtp_connect.c:1038
#14 0x0804ac85 in deliver_message (service=0xfffffffc <Address 0xfffffffc out
of bounds>, request=0x8090178) at smtp.c:829
#15 0x0804acd5 in smtp_service (client_stream=0x808d498,
service=0xafb17f8f "smtp", argv=0xfffffffc) at smtp.c:861
#16 0x08051513 in single_server_wakeup (fd=134807704) at single_server.c:257
#17 0x08051658 in single_server_accept_local (unused_event=1, context=0x6
<Address 0x6 out of bounds>)
at single_server.c:299
#18 0x08069619 in event_loop (delay=-1347326752) at events.c:1079
#19 0x0805204d in single_server_main (argc=134788304, argv=0xafb17314,
service=0x804ac98 <smtp_service>)
at single_server.c:722
#20 0x0804b07b in main (argc=4, argv=0xafb17314) at smtp.c:1066
Please note that read_wait call has file descriptor equal to -4 - it looks
very _strange_ to me, especially that source code indicates that this value
should by equal to timed_read parameter ('13').
lsof tell that '13' is unix socket:
smtp 25061 postfix 13u unix 0xe0cb3b00 313337814 socket
Can you tell me what's going wrong?
--
Konrad Rzepecki - Wydawnictwo Bestom DENTOnet.pl Sp.z o.o.