All -
This is going to be a long email; my apologies for that. I have a question regarding the performance of OpenNTPD vs. the generic (Dave Mills) NTPD.

Setup:
1. We have 3 machines in our DMZ which act as our primary NTP servers.
2. We have close to 8 machines in our Intranet which act as secondary NTP servers. They serve our whole network, and they all get (or rather, try to get) their time from our primary NTP servers.

All 3 primary NTP servers run OpenBSD 3.9 stable (i.e. from the CD) with the GENERIC kernel, and all are in the DMZ. All of them have identical hardware (Dell PowerEdge 650) and the same setup:
* same release install, i.e. 3.9 installed from the OpenBSD CD
* GENERIC kernel with NO modification via ukc or recompile

DMZ NTP servers.
dmz-ntp1
uname: OpenBSD dmz-ntp1.XXX.YYY 3.9 GENERIC#617 i386
Snippet of dmesg:
OpenBSD 3.9 (GENERIC) #617: Thu Mar  2 02:26:48 MST 2006
    [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC

dmz-ntp2
uname: OpenBSD dmz-ntp2.XXX.YYY 3.9 GENERIC#617 i386
OpenBSD 3.9 (GENERIC) #617: Thu Mar  2 02:26:48 MST 2006
    [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC

dmz-ntp3
uname: OpenBSD dmz-ntp3.XXX.YYY 3.9 GENERIC#617 i386
OpenBSD 3.9 (GENERIC) #617: Thu Mar  2 02:26:48 MST 2006
    [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC

The ntpd configuration is the same on all the DMZ boxes.
It is shown below with comments removed.
[Begin Conf]
server 0.us.pool.ntp.org
server 1.us.pool.ntp.org
server 2.us.pool.ntp.org

server dmz-ntp1.XXX.YYY
server dmz-ntp2.XXX.YYY
server dmz-ntp3.XXX.YYY

listen on *
[End Conf]

All our internal boxes run RHEL4u2
Linux arrowhead.XXX.YYY 2.6.9-22.ELsmp #1 SMP Mon Sep 19 18:32:14 EDT 2005 i686 i686 i386 GNU/Linux

The ntpd configuration is the same on all internal boxes.
It is shown below with comments removed.

[Begin Conf]
server dmz-ntp1.XXX.YYY
server dmz-ntp2.XXX.YYY
server dmz-ntp3.XXX.YYY


peer ntp01.XXX.YYY
peer ntp02.XXX.YYY
peer ntp03.XXX.YYY
peer ntp04.XXX.YYY
peer ntp05.XXX.YYY
peer ntp06.XXX.YYY
peer ntp07.XXX.YYY
peer ntp08.XXX.YYY
peer ntp09.XXX.YYY
peer ntp10.XXX.YYY

server 127.127.1.0
fudge 127.127.1.0 stratum 10

driftfile /var/lib/ntp/drift
[End Conf]

All NTP machines are reachable on port 123.

Problem:
There seems to be a lot of jitter on the OpenNTPD-based machines (i.e. the DMZ machines). This prevents the Intranet machines from syncing to the OpenBSD machines. So we installed NTPD (not OpenNTPD) from ports on dmz-ntp3 and restarted NTP there, and lo and behold, all the Linux boxes started syncing with this box, but not with the remaining OpenNTPD-based machines (two right now).

NTP stats from the Intranet boxes.
Please ignore

NTP stats from our Intranet box called arrowhead running RHEL4u2
--== ntpq -p arrowhead ==--
 remote      refid            st t when poll reach    delay   offset   jitter
=============================================================================
 dmz-ntp1    93.5.230.181      4 u   22 1024   377    0.734  -270.18   87.034
-dmz-ntp3    24.123.214.97     3 u  903 1024   377    1.373  -10.758    3.727
xdmz-ntp2    220.249.119.159   4 u  960 1024   377    0.932  -213.16  246.533
 arrowhead   .STEP.           16 u    - 1024     0    0.000    0.000  4000.00
+aspen       10.200.2.65       5 u  559 1024   376    4.154   -0.673    0.142
+baldy       10.200.1.243      5 u  270 1024   377    3.364    0.145    1.597
 buttermilk  10.200.2.87       5 u  576 1024   376    2.387    0.718    0.203
-copper      10.200.1.222      4 u  971 1024   376    3.130    1.433    0.584
 cypress     10.200.2.87       5 u  426 1024   376    4.032   -0.456    0.111
*heavenly    10.200.1.222      4 u  772 1024   376    3.314   -0.136    0.630
-kirkwood    10.200.2.68       3 u  610 1024   377    4.017   -2.248    3.995
-shasta      66.92.68.11       2 u  747 1024   376   10.674  -12.836    3.089
 LOCAL(0)    LOCAL(0)         10 l   63   64

NTP stats from our Intranet box called aspen running RHEL4u2
--== ntpq -p aspen ==--
 remote      refid            st t when poll reach    delay   offset   jitter
=============================================================================
xdmz-ntp1    93.5.230.181      4 u  292 1024   377    0.867  -240.39   86.761
-dmz-ntp3    24.123.214.97     3 u  797 1024   377    1.143  -12.227    3.712
xdmz-ntp2    220.249.119.159   4 u  908 1024   377    2.641  -222.98  247.827
-arrowhead   10.200.1.245      5 u  552 1024   377    3.845    0.828    0.409
 aspen       .STEP.           16 u    - 1024     0    0.000    0.000  4000.00
-baldy       10.200.1.222      4 u 1015 1024   376    4.710   -0.608    0.941
+buttermilk  10.200.1.245      5 u  888 1024   376    3.913    0.013    0.210
*copper      10.200.1.222      4 u  979 1024   376    6.086    0.130    0.248
-cypress     10.200.2.87       5 u  989 1024   376    3.789    0.254    0.437
-heavenly    10.200.2.87       5 u  299 1024   377    5.326    1.349    0.759
+kirkwood    199.184.165.135   3 u  284 1024   377    9.448   -0.471    0.523
-shasta      66.33.216.11      3 u   26 1024   376    5.536  -16.241    4.331
 LOCAL(0)    LOCAL(0)         10 l   52   64   377    0.000    0.000    0.001

NTP stats from our Intranet box called baldy running RHEL4u2
--== ntpq -p baldy ==--
 remote      refid            st t when poll reach    delay   offset   jitter
=============================================================================
 LOCAL(0)    LOCAL(0)         10 l   73   64   377    0.000    0.000    0.004
xdmz-ntp1    93.5.230.181      4 u   20 1024   377    0.712  -269.32   86.366
-dmz-ntp3    24.123.214.97     3 u  427 1024   377    0.741  -11.506    4.435
xdmz-ntp2    220.249.119.159   4 u  918 1024   377    0.704  -219.70  247.097
-arrowhead   10.200.1.222      4 u  620 1024   376    3.329   -0.162    1.737
+aspen       10.200.2.65       5 u  335 1024   377    3.949    0.988    0.622
 baldy       0.0.0.0          16 u    - 1024     0    0.000    0.000  4000.00
+buttermilk  10.200.1.231      5 u   80 1024   377    1.331    0.930    0.041
-copper      10.200.1.222      4 u  969 1024   376    3.394    2.795    0.740
-cypress     10.200.2.87       5 u  572 1024   376    2.765    0.420    2.524
*heavenly    10.200.1.222      4 u  684 1024   376    1.964    0.571    0.174
-kirkwood    199.184.165.135   3 u  321 1024   376    2.006   -1.500    1.571
-shasta      66.33.216.11      3 u  132 1024   377    2.625  -13.549    3.691


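As you can see from the jitter column, the jitter reported for the OpenNTPD-based dmz-ntp servers is high compared to the other servers: roughly 87 ms for dmz-ntp1 and 247 ms for dmz-ntp2, versus about 4 ms for dmz-ntp3, which is currently running Dave Mills' NTPD. The offsets show the same pattern (about -213 to -270 ms versus about -11 to -12 ms).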
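
For anyone who wants to run the same comparison on their own output, here is a minimal Python sketch that parses `ntpq -p` rows and lists the peers whose jitter exceeds a threshold. The function names, the 50 ms threshold, and the rule of skipping peers with reach 0 are my own assumptions for illustration; they are not part of ntpq or OpenNTPD. It treats the first character of a row as the tally code only when it is one of ntpq's tally characters, so a host name that starts with `x` or `o` in whitespace-stripped output would be misread.

```python
TALLY_CODES = " x.-+#*o"  # ntpq tally characters (blank means no tally shown)

# Five rows copied from the "ntpq -p aspen" output in this email.
SAMPLE = """\
xdmz-ntp1  93.5.230.181     4 u  292 1024  377    0.867  -240.39  86.761
-dmz-ntp3 24.123.214.97    3 u  797 1024  377    1.143  -12.227   3.712
xdmz-ntp2 220.249.119.159  4 u  908 1024  377    2.641  -222.98 247.827
 aspen     .STEP.          16 u    - 1024    0    0.000    0.000 4000.00
*copper    10.200.1.222     4 u  979 1024  376    6.086    0.130   0.248
"""


def parse_ntpq_peers(text):
    """Parse `ntpq -p` rows into dicts; headers and truncated rows are skipped."""
    peers = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith(("remote", "=", "--")):
            continue
        if line[0] in TALLY_CODES:
            tally, rest = line[0], line[1:]
        else:
            tally, rest = " ", line
        fields = rest.split()
        if len(fields) != 10:
            continue  # row is missing columns
        remote, refid, st, kind, when, poll, reach, delay, offset, jitter = fields
        try:
            peers.append({
                "tally": tally,
                "remote": remote,
                "refid": refid,
                "stratum": int(st),
                "reach": int(reach, 8),   # reach is an octal shift register
                "delay": float(delay),    # milliseconds
                "offset": float(offset),  # milliseconds
                "jitter": float(jitter),  # milliseconds
            })
        except ValueError:
            continue  # non-numeric column, e.g. a stray header line
    return peers


def flag_high_jitter(peers, threshold_ms=50.0):
    """Return names of reachable peers whose jitter exceeds threshold_ms."""
    return [p["remote"] for p in peers
            if p["reach"] != 0 and p["jitter"] > threshold_ms]


flagged = flag_high_jitter(parse_ntpq_peers(SAMPLE))
print(flagged)  # ['dmz-ntp1', 'dmz-ntp2']
```

On the sample rows this flags only the two OpenNTPD-based servers; the local self-entry with reach 0 and jitter 4000.00 is ignored.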

Question:
Why is OpenNTPD causing more jitter? Is anybody else experiencing this? Is there a workaround? Does anybody need more logs beyond what I have provided?

Anything else needed?
I still have logs from all our Intranet NTP servers. I don't think dmesg will be useful here, but I am appending the dmesg from one of the dmz-ntp machines anyway; it is identical on all three dmz-ntp machines.

dmesg from dmz-ntp1:

OpenBSD 3.9 (GENERIC) #617: Thu Mar  2 02:26:48 MST 2006
    [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Pentium(R) 4 CPU 3.06GHz ("GenuineIntel" 686-class) 3.07 GHz
cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,CNXT-ID
real mem  = 1073262592 (1048108K)
avail mem = 972619776 (949824K)
using 4278 buffers containing 53764096 bytes (52504K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(00) BIOS, date 06/02/03, BIOS32 rev. 0 @ 0xffe90
pcibios0 at bios0: rev 2.1 @ 0xf0000/0x10000
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfc500/96 (4 entries)
pcibios0: no compatible PCI ICU found: ICU vendor 0x1166 product 0x0203
pcibios0: Warning, unable to fix up PCI interrupt routing
pcibios0: PCI bus #0 is the last bus
bios0: ROM list: 0xc0000/0x8000 0xc8000/0x4800 0xcc800/0x1000 0xec000/0x4000!
cpu0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "ServerWorks GCNB-LE Host" rev 0x32
pchb1 at pci0 dev 0 function 1 "ServerWorks GCNB-LE Host" rev 0x00
pci1 at pchb1 bus 1
em0 at pci1 dev 3 function 0 "Intel PRO/1000MT (82546EB)" rev 0x01: irq 10, address 00:04:23:a6:47:16
em1 at pci1 dev 3 function 1 "Intel PRO/1000MT (82546EB)" rev 0x01: irq 7, address 00:04:23:a6:47:17
vga1 at pci0 dev 4 function 0 "ATI Rage XL" rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
pciide0 at pci0 dev 5 function 0 "CMD Technology PCI0680" rev 0x02
pciide0: bus-master DMA support present
pciide0: channel 0 wired to native-PCI mode
pciide0: using irq 15 for native-PCI interrupt
wd0 at pciide0 channel 0 drive 0: <ST3120026A>
wd0: 16-sector PIO, LBA48, 114473MB, 234441648 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5
pciide0: channel 1 wired to native-PCI mode
piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB6" rev 0xa0: SMBus disabled
pciide1 at pci0 dev 15 function 1 "ServerWorks CSB6 RAID/IDE" rev 0xa0: DMA
atapiscsi0 at pciide1 channel 0 drive 0
scsibus0 at atapiscsi0: 2 targets
cd0 at scsibus0 targ 0 lun 0: <TEAC, CD-224E, K.9A> SCSI0 5/cdrom removable
cd0(pciide1:0:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 2
pciide1: no compatibility interrupt for use by channel 1
ohci0 at pci0 dev 15 function 2 "ServerWorks CSB6 USB" rev 0x05: irq 11, version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
uhub0 at usb0
uhub0: ServerWorks OHCI root hub, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
pcib0 at pci0 dev 15 function 3 "ServerWorks GCLE-2 Host" rev 0x00
pchb2 at pci0 dev 16 function 0 "ServerWorks CIOB-E" rev 0x12
pchb3 at pci0 dev 16 function 2 "ServerWorks CIOB-E" rev 0x12
pci2 at pchb3 bus 2
isa0 at pcib0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pmsi0 at pckbc0 (aux slot)
pckbc0: using irq 12 for aux slot
wsmouse0 at pmsi0 mux 0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
spkr0 at pcppi0
npx0 at isa0 port 0xf0/16: using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
biomask eb6d netmask efed ttymask ffef
pctr: user-level cycle counter enabled
dkcsum: wd0 matches BIOS drive 0x80
root on wd0a
rootdev=0x0 rrootdev=0x300 rawdev=0x302
