Viewing Delta Synchronization

Heavily loaded clusters and clusters with geographically separated members pose special challenges.

High connection rates and large distances between the members can lead to delays that affect the operation of the cluster.

Monitor the operation of the State Synchronization mechanism in highly loaded and distributed clusters.

Perform these troubleshooting steps:

  1. Examine the Delta Sync statistics counters:

    Shell          Command
    Gaia Clish     show cluster statistics sync
    Expert mode    cphaprob syncstat

  2. Change the values of the applicable synchronization global configuration parameters.

  3. Reset the Delta Sync statistics counters:

    Shell          Command
    Gaia Clish     show cluster statistics sync reset
    Expert mode    cphaprob -reset syncstat

  4. Examine the Delta Sync statistics to see if the problem is solved.

  5. Solve any identified problem.
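To compare the statistics from before and after a configuration change, it can help to save each reading to a file. The following is a minimal sketch for Expert mode, not a Check Point tool. The function name save_syncstat, the SNAP_DIR variable, and its default path are assumptions made for this illustration. Only the cphaprob syncstat and cphaprob -reset syncstat commands come from the steps above.

```shell
#!/bin/bash
# Hypothetical directory for saved readings (an assumption of this sketch).
SNAP_DIR="${SNAP_DIR:-/var/tmp/syncstat}"

# Save the current Delta Sync statistics to a timestamped file
# and print the path of that file.
save_syncstat() {
    # $1 = short tag for the file name, for example "before" or "after"
    mkdir -p "$SNAP_DIR" || return 1
    local out
    out="$SNAP_DIR/syncstat.$1.$(date +%Y%m%d-%H%M%S)"
    cphaprob syncstat > "$out" || return 1
    printf '%s\n' "$out"
}

# Possible use, following steps 1 to 4:
#   before=$(save_syncstat before)   # step 1: examine the counters
#   ...change the applicable parameters (step 2)...
#   cphaprob -reset syncstat         # step 3: reset the counters
#   ...let the cluster run under typical load...
#   after=$(save_syncstat after)     # step 4: examine the counters again
#   diff "$before" "$after"
```

Because step 3 resets the counters, the second file shows only what accumulated after the change, which makes it easier to judge whether the change helped.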

Example output of the "show cluster statistics sync" and "cphaprob syncstat" commands from a Cluster Member:

Delta Sync Statistics
 
Sync status: OK
 
Drops:
Lost updates.................................  0
Lost bulk update events......................  0
Oversized updates not sent...................  0
 
Sync at risk:
Sent reject notifications....................  0
Received reject notifications................  0
 
Sent messages:
Total generated sync messages................  26079
Sent retransmission requests.................  0
Sent retransmission updates..................  0
Peak fragments per update....................  1
 
Received messages:
Total received updates.......................  3710
Received retransmission requests.............  0
 
Sync Interface:
Name.........................................  eth1
Link speed...................................  1000Mb/s
Rate.........................................  46000 [Bps]
Peak rate....................................  46000 [Bps]
Link usage...................................   0%
Total........................................  376827[KB]
 
Queue sizes (num of updates):
Sending queue size...........................  512
Receiving queue size.........................  256
Fragments queue size.........................  50
 
Timers:
Delta Sync interval (ms).....................  100
 
Reset on Sun Sep  8 16:09:15 2019 (triggered by fullsync).
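The counters in this output can also be checked from a script. The following is a minimal sketch, assuming the output keeps the "label, dots, value" layout shown above. The function names sync_counter and sync_has_no_drops are hypothetical helpers written for this illustration and are not part of the product.

```shell
#!/bin/bash
# Print the value of one counter from Delta Sync statistics read on stdin.
# $1 = counter label exactly as printed, for example "Lost updates".
sync_counter() {
    awk -v label="$1" '
        # Match only a line that starts with the label followed by a dot.
        index($0, label ".") == 1 {
            line = $0
            # Remove the label, the run of dots, and the padding spaces.
            sub(/^[^.]*\.+[ \t]*/, "", line)
            print line
            exit
        }
    '
}

# Succeed only if every counter in the "Drops" section is 0.
# A counter that is missing from the input counts as a failure.
sync_has_no_drops() {
    local stats label value
    stats=$(cat)
    for label in "Lost updates" "Lost bulk update events" \
                 "Oversized updates not sent"; do
        value=$(printf '%s\n' "$stats" | sync_counter "$label")
        [ "$value" = "0" ] || return 1
    done
}
```

In Expert mode, a possible use is: cphaprob syncstat | sync_has_no_drops || echo "Delta Sync drops detected". Values keep any unit printed after them, so sync_counter "Rate" returns "46000 [Bps]" for the example output above.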

Each section of the output is described below.