~/netref / Monitoring & Management
##

Monitoring & Management

Keeping the network observable: collecting metrics, logs and flows — and measuring against an SLA.

ToolPortWhat it does
SNMP161 / 162Poll metrics & receive traps (MIB / OID)
Syslog514Centralized event logging (severity 0–7)
NetFlow / IPFIXFlow records: who talked to whom
sFlowSampled flow export
NTP123Time sync — vital for logs & certs
SPAN / mirrorCopy port traffic to an analyzer
// syslog severity
0Emergency
1Alert
2Critical
3Error
4Warning
5Notice
6Informational
7Debug
// SLA → downtime / yr
99%3.65 days / yr
99.9%8.77 h / yr
99.99%52.6 min / yr
99.999%5.26 min / yr

// facts

  • Baseline normal behaviour to spot anomalies
  • MTBF = reliability · MTTR = repair speed
  • Observability = metrics + logs + traces
  • SLA defines the uptime you promise/consume
// SNMP versions
v1Cleartext community string; no integrity
v2cAdds bulk gets; still a cleartext community
v3Auth + privacy (encryption) + integrity — use this
// flow & capture
NetFlowCisco flow records (v5/v9) — who talked to whom
IPFIXIETF standard based on NetFlow v9
sFlowPacket sampling in switch ASICs — scalable
SPAN / mirrorCopy a port / VLAN to an analyzer
snmpv3 + syslog + netflow · cisco ios
! SNMPv3 (auth + privacy)
snmp-server group NETADM v3 priv
snmp-server user mon NETADM v3 auth sha <key> priv aes 128 <key>
snmp-server host 10.0.0.50 version 3 priv mon
!
! Syslog + NTP
logging host 10.0.0.50
logging trap informational
service timestamps log datetime msec
ntp server 10.0.0.1
!
! NetFlow export
ip flow-export version 9
ip flow-export destination 10.0.0.50 2055
interface Gi0/0
 ip flow ingress
verify
show snmp
show logging
show ntp status
show ip cache flow