Hi!
When upgrading from OpenLDAP 2.4 (SLES12) to OpenLDAP 2.5 (SLES15) I gave
delta-syncrepl a try.
That was a hard way in several aspects. Meanwhile I think I understand most of
the details (docs could be much better IMHO).
Where delta-syncrepl has big problems is when sync has been set up, but one
database is reloaded and some UUIDs are newly created for entries that exist on
the other server(s).
Somehow slapd detects that problem and claims that a “content sync” is
required, but after some time it seems to start a refresh anyway.
When I did the content load on the other server, slapd quit with a core dump.
Unfortunately I had quite a lot of core dumps during my testing.
So I feel delta-syncrepl is not as solid as it should be (in the version
provided with SLES15 SP6).
Some of the logs:
May 27 13:37:05 v06 slapd[27194]: do_syncrep1: rid=105 starting refresh
(sending
cookie=rid=105,sid=006,csn=20130708113956.028116Z#000000#000#000000;20240708090039.074592Z#000000#001#0>
May 27 13:37:05 v06 slapd[27194]: do_syncrep2: rid=105 LDAP_RES_SEARCH_RESULT
May 27 13:37:05 v06 slapd[27194]: do_syncrep2: rid=105 delta-sync lost sync,
switching to REFRESH
May 27 13:37:05 v06 slapd[27194]: do_syncrep2: rid=105 (4096) Content Sync
Refresh Required
May 27 13:41:36 v06 slapd[27194]: do_syncrep1: rid=105 starting refresh
(sending
cookie=rid=105,sid=006,csn=20130708113956.028116Z#000000#000#000000;20240708090039.074592Z#000000#001#0>
May 27 13:41:36 v06 slapd[27194]: do_syncrep2: rid=105 LDAP_RES_INTERMEDIATE -
SYNC_ID_SET
May 27 13:41:36 v06 slapd[27194]: syncrepl_message_to_entry: rid=105 DN:
cn=config, UUID: 85353d82-cf3a-103f-9d14-755ae58f92b6
May 27 13:41:36 v06 slapd[27194]: syncrepl_entry: rid=105
LDAP_RES_SEARCH_ENTRY(LDAP_SYNC_ADD) csn=(none) tid 0x7f6b1b7fe6c0
…
May 27 13:43:33 v06 slapd[27194]: do_syncrepl: rid=105 rc -1 retrying (4
retries left)
May 27 13:43:34 v06 slapd[27194]: conn=1008 op=2 syncprov_op_search: got a
persistent search with a
cookie=rid=106,sid=005,csn=20130708113956.028116Z#000000#000#000000;20240708090039.0>
May 27 13:43:34 v06 slapd[27194]: conn=1008 op=2 syncprov_findbase: searching
May 27 13:43:34 v06 slapd[27194]: conn=1008 op=2 syncprov_op_search: registered
persistent search
May 27 13:43:34 v06 slapd[27194]: conn=1008 op=2 syncprov_sendinfo:
refreshDelete cookie=
May 27 13:43:34 v06 slapd[27194]: conn=1008 op=2 syncprov_search_response:
detaching op
May 27 13:43:34 v06 slapd[27194]: conn=1009 op=2 syncprov_op_search: got a
persistent search with a
cookie=rid=116,sid=005,csn=20130719093756.074776Z#000000#000#000000;20250217105250.3>
May 27 13:43:34 v06 slapd[27194]: conn=1009 op=2 syncprov_findbase: searching
May 27 13:43:34 v06 slapd[27194]: conn=1009 op=2 syncprov_op_search: registered
persistent search
May 27 13:43:34 v06 slapd[27194]: conn=1009 op=2 syncprov_op_search: no change,
skipping log replay
May 27 13:43:34 v06 slapd[27194]: conn=1009 op=2 syncprov_op_search: nothing
changed, finishing up initial search early
May 27 13:43:34 v06 slapd[27194]: conn=1009 op=2 syncprov_sendinfo:
refreshDelete cookie=
May 27 13:43:34 v06 slapd[27194]: conn=1009 op=2 syncprov_search_response:
detaching op
May 27 13:43:34 v06 slapd[27194]: conn=1011 op=2 syncprov_op_search: got a
persistent search with a
cookie=rid=106,sid=005,csn=20130708113956.028116Z#000000#000#000000;20240708090039.0>
May 27 13:43:34 v06 slapd[27194]: conn=1011 op=2 syncprov_findbase: searching
May 27 13:43:34 v06 slapd[27194]: conn=1011 op=2 syncprov_op_search: registered
persistent search
May 27 13:43:34 v06 slapd[27194]: conn=1011 op=2 syncprov_sendinfo:
refreshDelete cookie=
May 27 13:43:34 v06 slapd[27194]: conn=1011 op=2 syncprov_search_response:
detaching op
May 27 13:43:34 v06 slapd[27194]: conn=1010 op=2 syncprov_op_search: got a
persistent search with a
cookie=rid=116,sid=005,csn=20130719093756.074776Z#000000#000#000000;20250217105250.3>
May 27 13:43:34 v06 slapd[27194]: conn=1010 op=2 syncprov_findbase: searching
May 27 13:43:34 v06 slapd[27194]: conn=1010 op=2 syncprov_op_search: registered
persistent search
May 27 13:43:34 v06 slapd[27194]: conn=1010 op=2 syncprov_op_search: no change,
skipping log replay
May 27 13:43:34 v06 slapd[27194]: conn=1010 op=2 syncprov_op_search: nothing
changed, finishing up initial search early
May 27 13:43:34 v06 slapd[27194]: conn=1010 op=2 syncprov_sendinfo:
refreshDelete cookie=
May 27 13:43:34 v06 slapd[27194]: conn=1010 op=2 syncprov_search_response:
detaching op
May 27 13:43:34 v06 slapd[27194]: conn=1012 op=2 syncprov_op_search: got a
persistent search with a
cookie=rid=116,sid=005,csn=20130719093756.074776Z#000000#000#000000;20250217105250.3>
May 27 13:43:34 v06 slapd[27194]: conn=1012 op=2 syncprov_findbase: searching
May 27 13:43:34 v06 slapd[27194]: conn=1012 op=2 syncprov_op_search: registered
persistent search
May 27 13:43:34 v06 slapd[27194]: conn=1012 op=2 syncprov_op_search: no change,
skipping log replay
May 27 13:43:34 v06 slapd[27194]: conn=1012 op=2 syncprov_op_search: nothing
changed, finishing up initial search early
May 27 13:43:34 v06 slapd[27194]: conn=1012 op=2 syncprov_sendinfo:
refreshDelete cookie=
May 27 13:43:34 v06 slapd[27194]: conn=1012 op=2 syncprov_search_response:
detaching op
May 27 13:43:34 v06 slapd[27194]: conn=1013 op=2 syncprov_op_search: got a
persistent search with a
cookie=rid=106,sid=005,csn=20130708113956.028116Z#000000#000#000000;20240708090039.0>
May 27 13:43:34 v06 slapd[27194]: conn=1013 op=2 syncprov_findbase: searching
May 27 13:43:34 v06 slapd[27194]: conn=1013 op=2 syncprov_op_search: registered
persistent search
May 27 13:43:34 v06 slapd[27194]: conn=1013 op=2 syncprov_sendinfo:
refreshDelete cookie=
May 27 13:43:34 v06 slapd[27194]: conn=1013 op=2 syncprov_search_response:
detaching op
May 27 13:43:34 v06 slapd[27194]: conn=1014 op=2 syncprov_op_search: got a
persistent search with a
cookie=rid=106,sid=005,csn=20130708113956.028116Z#000000#000#000000;20240708090039.0>
May 27 13:43:34 v06 slapd[27194]: conn=1014 op=2 syncprov_findbase: searching
May 27 13:43:34 v06 slapd[27194]: conn=1014 op=2 syncprov_op_search: registered
persistent search
May 27 13:43:34 v06 systemd[1]: Started Process Core Dump (PID 27241/UID 0).
May 27 13:43:35 v06 systemd-coredump[27242]: [🡕] Process 27194 (slapd) of user
76 dumped core.
Stack trace of thread 27199:
#0 0x00007f6b34ca8dfc
__pthread_kill_implementation (libc.so.6 + 0xa8dfc)
#1 0x00007f6b34c57842 raise
(libc.so.6 + 0x57842)
#2 0x00007f6b34c3f5cf abort
(libc.so.6 + 0x3f5cf)
#3 0x00007f6b34c3f4e7
__assert_fail_base.cold (libc.so.6 + 0x3f4e7)
#4 0x00007f6b34c4fb32
__assert_fail (libc.so.6 + 0x4fb32)
#5 0x00007f6b34787258 n/a
(syncprov.so + 0xc258)
#6 0x000055765d7e04f3
overlay_op_walk (slapd + 0xb74f3)
#7 0x000055765d7e06be n/a
(slapd + 0xb76be)
#8 0x000055765d76ee54
fe_op_search (slapd + 0x45e54)
#9 0x000055765d76e726
do_search (slapd + 0x45726)
#10 0x000055765d76c18f n/a
(slapd + 0x4318f)
#11 0x000055765d76c98d n/a
(slapd + 0x4398d)
#12 0x00007f6b34ff7da0 n/a
(libldap-2.5.releng.so.0 + 0x48da0)
#13 0x00007f6b34ca6f6c
start_thread (libc.so.6 + 0xa6f6c)
#14 0x00007f6b34d2e338
__clone3 (libc.so.6 + 0x12e338)
Stack trace of thread 27194:
#0 0x00007f6b34ca3c1e
__futex_abstimed_wait_common (libc.so.6 + 0xa3c1e)
#1 0x00007f6b34ca8b83
__pthread_clockjoin_ex (libc.so.6 + 0xa8b83)
#2 0x000055765d7691ca
slapd_daemon (slapd + 0x401ca)
#3 0x000055765d74c88e main
(slapd + 0x2388e)
#4 0x00007f6b34c40e6c
__libc_start_call_main (libc.so.6 + 0x40e6c)
#5 0x00007f6b34c40f35
__libc_start_main@@GLIBC_2.34 (libc.so.6 + 0x40f35)
#6 0x000055765d74c90a _start
(slapd + 0x2390a)
Stack trace of thread 27204:
#0 0x00007f6b34ca3c1e
__futex_abstimed_wait_common (libc.so.6 + 0xa3c1e)
#1 0x00007f6b34ca6548
pthread_cond_wait@@GLIBC_2.3.2 (libc.so.6 + 0xa6548)
#2 0x00007f6b34ff7e40 n/a
(libldap-2.5.releng.so.0 + 0x48e40)
#3 0x00007f6b34ca6f6c
start_thread (libc.so.6 + 0xa6f6c)
#4 0x00007f6b34d2e338
__clone3 (libc.so.6 + 0x12e338)
Stack trace of thread 27198:
#0 0x00007f6b34ca3c1e
__futex_abstimed_wait_common (libc.so.6 + 0xa3c1e)
#1 0x00007f6b34ca6548
pthread_cond_wait@@GLIBC_2.3.2 (libc.so.6 + 0xa6548)
#2 0x00007f6b34ff7e40 n/a
(libldap-2.5.releng.so.0 + 0x48e40)
#3 0x00007f6b34ca6f6c
start_thread (libc.so.6 + 0xa6f6c)
#4 0x00007f6b34d2e338
__clone3 (libc.so.6 + 0x12e338)
Stack trace of thread 27197:
#0 0x00007f6b34ca3c1e
__futex_abstimed_wait_common (libc.so.6 + 0xa3c1e)
#1 0x00007f6b34ca6548
pthread_cond_wait@@GLIBC_2.3.2 (libc.so.6 + 0xa6548)
#2 0x00007f6b34ff7e40 n/a
(libldap-2.5.releng.so.0 + 0x48e40)
#3 0x00007f6b34ca6f6c
start_thread (libc.so.6 + 0xa6f6c)
#4 0x00007f6b34d2e338
__clone3 (libc.so.6 + 0x12e338)
Stack trace of thread 27202:
#0 0x00007f6b34ca3c1e
__futex_abstimed_wait_common (libc.so.6 + 0xa3c1e)
#1 0x00007f6b34ca6548
pthread_cond_wait@@GLIBC_2.3.2 (libc.so.6 + 0xa6548)
#2 0x00007f6b34ff7e40 n/a
(libldap-2.5.releng.so.0 + 0x48e40)
#3 0x00007f6b34ca6f6c
start_thread (libc.so.6 + 0xa6f6c)
#4 0x00007f6b34d2e338
__clone3 (libc.so.6 + 0x12e338)
Stack trace of thread 27200:
#0 0x00007f6b34ca3c1e
__futex_abstimed_wait_common (libc.so.6 + 0xa3c1e)
#1 0x00007f6b34ca6548
pthread_cond_wait@@GLIBC_2.3.2 (libc.so.6 + 0xa6548)
#2 0x00007f6b34ff7e40 n/a
(libldap-2.5.releng.so.0 + 0x48e40)
#3 0x00007f6b34ca6f6c
start_thread (libc.so.6 + 0xa6f6c)
#4 0x00007f6b34d2e338
__clone3 (libc.so.6 + 0x12e338)
Stack trace of thread 27203:
#0 0x00007f6b34ca3c1e
__futex_abstimed_wait_common (libc.so.6 + 0xa3c1e)
#1 0x00007f6b34ca6548
pthread_cond_wait@@GLIBC_2.3.2 (libc.so.6 + 0xa6548)
#2 0x00007f6b34ff7e40 n/a
(libldap-2.5.releng.so.0 + 0x48e40)
#3 0x00007f6b34ca6f6c
start_thread (libc.so.6 + 0xa6f6c)
#4 0x00007f6b34d2e338
__clone3 (libc.so.6 + 0x12e338)
Stack trace of thread 27196:
#0 0x00007f6b34d2e796
epoll_wait (libc.so.6 + 0x12e796)
#1 0x000055765d76607b n/a
(slapd + 0x3d07b)
#2 0x00007f6b34ca6f6c
start_thread (libc.so.6 + 0xa6f6c)
#3 0x00007f6b34d2e338
__clone3 (libc.so.6 + 0x12e338)
ELF object binary
architecture: AMD x86-64
May 27 13:43:35 v06 systemd[1]: [email protected]: Deactivated
successfully.
Kind regards,
Ulrich Windl