Hi Apollon!

On Fri, 16 Nov 2018 at 21:18, Apollon Oikonomopoulos (apoi...@debian.org) wrote:
> Control: tags -1 moreinfo
>
> Hi again,
>
> On 09:06 Wed 14 Nov , Maximiliano Boscovich wrote:
> > root@lisa:~# gnt-backup export -n lisa acme.sf-az2-fe
> > Wed Nov 14 08:13:51 2018 Shutting down instance acme.sf-az2-fe
> > Wed Nov 14 08:15:53 2018 Creating a snapshot of disk/0 on node lisa
> > Wed Nov 14 08:15:53 2018 Starting instance acme.sf-az2-fe
> > Wed Nov 14 08:15:54 2018 Exporting snapshot/0 from lisa to lisa
> > Wed Nov 14 08:15:57 2018 snapshot/0 is now listening, starting export
> > Wed Nov 14 08:16:05 2018 snapshot/0 sent 0M, 0.0 MiB/s
> > Wed Nov 14 08:16:55 2018 - WARNING: import 'import-disk0-2018-11-14_08_15_54-a5_3EV' on lisa failed: Exited due to signal 15
> > Wed Nov 14 08:16:55 2018 snapshot/0 failed to receive data: Exited due to signal 15 (recent output: Child process didn't establish connection in time (60s), sending SIGTERM\nsocat: W exiting on signal 15)
> > Wed Nov 14 08:16:55 2018 - WARNING: Aborting export 'export-disk0-2018-11-14_08_15_59-6wxfk8' on b2e076b8-499a-45a0-8a65-1a4d2005708f
> > Wed Nov 14 08:16:57 2018 - WARNING: export 'export-disk0-2018-11-14_08_15_59-6wxfk8' on lisa failed: Exited due to signal 15
> > Wed Nov 14 08:16:57 2018 snapshot/0 failed to send data: Exited due to signal 15 (recent output: DUMP: Date of this level 0 dump: Wed Nov 14 08:15:59 2018\n DUMP: Dumping /dev/mapper/vg--ganeti-76049ec2--8232--413d--8151--d263c3efa80e.disk0.snap-1 (an unlisted file system) to standard output\n DUMP: Label: none\n DUMP: Writing 10 Kilobyte records\n DUMP: mapping (Pass I) [regular files]\n DUMP: mapping (Pass II) [directories]\n DUMP: estimated 247771 blocks.\n DUMP: Volume 1 started with block 1 at: Wed Nov 14 08:15:59 2018\n DUMP: dumping (Pass III) [directories]\n DUMP: DUMP: The ENTIRE dump is aborted.\nSignal on pipe: cannot recover\n DUMP: The ENTIRE dump is aborted.\nsocat: W exiting on signal 15)
> > Wed Nov 14 08:16:57 2018 Removing snapshot of disk/0 on node lisa
> > Wed Nov 14 08:16:57 2018 - WARNING: Some disk exports have failed; there may be leftover data for instance acme.sf-az2-fe on node lisa
> > Failure: command execution error:
> > Export failed, errors in export finalization, disk export: disk(s) 0
>
> I can reproduce this when blocking the import/export connections between
> nodes. Are you running a firewall on the machine? If so, does it filter
> traffic on the loopback interface? Keep in mind that import/export uses
> two completely random, high ports for the transfer.

I don't have any firewall on that server, and the export was done within the same node:

root@lisa:~# iptables -S
-P INPUT ACCEPT
-P FORWARD ACCEPT
-P OUTPUT ACCEPT

This is my loopback interface:

root@lisa:~# ip a|grep lo
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
    inet 172.21.0.111/24 brd 172.21.0.255 scope global administracion

And this is the lisa IP (cluster IP):

root@lisa:~# ip a|grep admini
8: vlan300@bond0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue master administracion state UP group default qlen 1000
12: administracion: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
    inet 172.21.0.111/24 brd 172.21.0.255 scope global administracion

And this is the bridge config:

root@lisa:~# brctl show
bridge name     bridge id           STP enabled   interfaces
administracion  8000.6cae8b616faa   no            vlan300
sf-az1-be       8000.6cae8b616faa   no            vlan502
sf-az1-fe       8000.6cae8b616faa   no            vlan500
sf-az1-mw       8000.6cae8b616faa   no            vlan501
sf-az2-be       8000.6cae8b616faa   no            vlan505
sf-az2-fe       8000.6cae8b616faa   no            tap0
                                                  vlan503
sf-az2-mw       8000.6cae8b616faa   no            vlan504

I ran a test with socat and it works fine (I think).

I ran a server:

root@lisa:~# socat -T 1 -d -d tcp-l:10081,reuseaddr,fork,crlf system:"echo -e \"\\\"HTTP/1.0 200 OK\\\nDocumentType: text/html\\\n\\\n<html>date: \$\(date\)<br>server:\$SOCAT_SOCKADDR:\$SOCAT_SOCKPORT<br>client: \$SOCAT_PEERADDR:\$SOCAT_PEERPORT\\\n<pre>\\\"\"; cat; echo -e \"\\\"\\\n</pre></html>\\\"\""&
[1] 20135
root@lisa:~# 2018/11/17 11:33:17 socat[20135] N listening on AF=2 0.0.0.0:10081

I connected to it using the localhost IP:

root@lisa:~# socat - TCP:localhost:10081
2018/11/17 11:33:24 socat[20135] N accepting connection from AF=2 127.0.0.1:49816 on AF=2 127.0.0.1:10081
2018/11/17 11:33:24 socat[20135] N forked off child process 20137
2018/11/17 11:33:24 socat[20135] N listening on AF=2 0.0.0.0:10081
2018/11/17 11:33:24 socat[20137] N forking off child, using socket for reading and writing
2018/11/17 11:33:24 socat[20137] N forked off child process 20138
2018/11/17 11:33:24 socat[20137] N forked off child process 20138
2018/11/17 11:33:24 socat[20137] N starting data transfer loop with FDs [6,6] and [5,5]
-e HTTP/1.0 200 OK
DocumentType: text/html

<html>date: sáb nov 17 11:33:24 -03 2018<br>server:127.0.0.1:10081<br>client: 127.0.0.1:49816
<pre>
2018/11/17 11:33:25 socat[20137] N inactivity timeout triggered
2018/11/17 11:33:25 socat[20137] N exiting with status 0
2018/11/17 11:33:25 socat[20135] W exiting on signal 15
2018/11/17 11:33:25 socat[20138] N socat_signal(): handling signal 15
2018/11/17 11:33:25 socat[20135] N socat_signal(): finishing signal 15
sh: echo: I/O error
2018/11/17 11:33:25 socat[20135] N childdied(): handling signal 17

Then I connected to it using the cluster IP (lisa.intranet):

root@lisa:~# socat - TCP:lisa.intranet:10081
2018/11/17 11:33:39 socat[20135] N accepting connection from AF=2 172.21.0.111:40834 on AF=2 172.21.0.111:10081
2018/11/17 11:33:39 socat[20135] N forked off child process 20143
2018/11/17 11:33:39 socat[20135] N listening on AF=2 0.0.0.0:10081
2018/11/17 11:33:39 socat[20143] N forking off child, using socket for reading and writing
2018/11/17 11:33:39 socat[20143] N forked off child process 20144
2018/11/17 11:33:39 socat[20143] N forked off child process 20144
2018/11/17 11:33:39 socat[20143] N starting data transfer loop with FDs [6,6] and [5,5]
-e HTTP/1.0 200 OK
DocumentType: text/html

<html>date: sáb nov 17 11:33:39 -03 2018<br>server:172.21.0.111:10081<br>client: 172.21.0.111:40834
<pre>
2018/11/17 11:33:40 socat[20143] N inactivity timeout triggered
2018/11/17 11:33:40 socat[20143] N exiting with status 0
2018/11/17 11:33:40 socat[20135] N socat_signal(): handling signal 15
2018/11/17 11:33:40 socat[20135] N socat_signal(): finishing signal 15
2018/11/17 11:33:40 socat[20144] W exiting on signal 15
sh: echo: I/O error
2018/11/17 11:33:40 socat[20135] N childdied(): handling signal 17

If you need extra information, please let me know.

Regards,
Maximiliano