How to solve in.mpathd[-]: [ID –daemon.error] NIC failure detected on vnet1 of group bond0 ?
Today we will see one of a common issue and its solution how to proceed with this.
I. Error
Oct 7 21:52:14 ngeaix01 in.mpathd[281]: [ID 585766 daemon.error] Cannot meet requested failure detection time of 5000 ms on (inet vnet0) new failure detection time for group "bond0" is 43120 ms Oct 7 21:52:15 ngeaix01 in.mpathd[281]: [ID 594170 daemon.error] NIC failure detected on vnet1 of group bond0 Oct 7 21:53:14 ngeaix01 in.mpathd[281]: [ID 302819 daemon.error] Improved failure detection time 21560 ms on (inet vnet0) for group "bond0" Oct 7 21:53:15 ngeaix01 in.mpathd[281]: [ID 302819 daemon.error] Improved failure detection time 10780 ms on (inet vnet0) for group "bond0" Oct 7 21:53:16 ngeaix01 in.mpathd[281]: [ID 302819 daemon.error] Improved failure detection time 5390 ms on (inet vnet0) for group "bond0" Oct 7 21:53:17 ngeaix01 in.mpathd[281]: [ID 302819 daemon.error] Improved failure detection time 5000 ms on (inet vnet0) for group "bond0" Oct 7 21:53:18 ngeaix01 in.mpathd[281]: [ID 299542 daemon.error] NIC repair detected on vnet1 of group bond0
II. Solution
Everything is fine with the server except few occasional packet loss for a long-running ping and is making the application unusable. Check system logs if no hardware or cable fault is there. If no issue at system end found, it is most probable that the issue was down to a core network switch that was cpu-bound. It should be dealt by the networks team to check and reload reloading SMTP monitoring if required.