...
Overview
BGP is THE protocol if of Internet, it is used to exchange routing information between other BGP systems between Internet domains. It comes in two flavours:
External BGP(eBGP): Network Layer Reachability Information (NLRI) is exchanged between network domain called Autonomous system usually administratively independant. We are speaking about BGP inter-domain routing. As an example, let's us aassume assume a BGP speaker from AS2200 (RENATER) advertising NLRI information to AS20965 (GÉANT R&E). From that point AS20965 has the knowledge of how to reach any network advertised by AS2200 based on the NLRI information.
...
Expand |
---|
title | BGP Route Reflector implementation |
---|
|
Info |
---|
title | BGP RR main requirements |
---|
|
SR655 1 x EPYC 7302P, 64GB RAM, 2G CONTROLLER CACHE FLASH, 4x10G ports + SFP+ and 4x1G ports, 3 SSD 480GB MAINSTREAM, XCLARITY ENTERPRISE. SR655 AMD EPYC 7302P (16C 2.8GHz 128MB Cache/155W) 32GB (2x32GB, 2Rx4 3200MHz RDIMM), No Backplane, SATA, 1x750W, Tooless Rails ThinkSystem 2x32GB TruDDR4 3200MHz (2Rx4 1.2V) RDIMM-A ThinkSystem SR655 2.5 SATA/SAS 8-Bay Backplane Kit ThinkSystem RAID 930-8i 2GB Flash PCIe 12Gb Adapter ThinkSystem 2.5 5300 480GB Mainstream SATA 6Gb Hot Swap SSD ThinkSystem SR655 x16/x8/x8 PCIe Riser1 FH Kit ThinkSystem SR635/SR655 x8 PCIe Internal Riser Kit ThinkSystem Broadcom 57454 10/25GbE SFP28 4-port OCP Ethernet Adapter ThinkSystem Broadcom 5720 1GbE RJ45 2-Port PCIe Ethernet Adapter SFP+ SR Transceiver ThinkSystem 750W(230/115V) Platinum Hot-Swap Power Supply 2.8m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable ThinkSystem Toolless Slide Rail Kit with 2U CMA ThinkSystem SR655 Fan Option Kit ThinkSystem SR635/SR655 Supercap Installation Kit |
Info |
---|
title | BGP RR main requirements |
---|
| RR is a specific component inside a service provider environment: - The BGP RR is not in the data path inside the backbone, this can be adjusted by setting hight IGP metrics inside the code backbone.
- BGP traffic does not require a tremendous throughput so no need to have hardware NIC assisted forwarding mechanism such as dpdk.
- A NREN route reflector with 2xIPv4 and 2xIPv6 full views coming from 2 upstream provider requires steady ~ 10 Mbps traffic rates, so we can assume that 10GE connection will be sufficient for the next decades all address-family included.
- As of 2020/07/13, the Internet IPv4 routing table size is 839945 entries
- As of 2020/07/13, the Internet IPv6 routing table size is 91062 entries
both cumulated with BGP other address families needs a constant usage of ~ 4GB of memory: # show watchdog memory
- So in the config above 64 Gbytes of RAM is sufficient in order to cache all the IPv4 and IPv6 routing table in memory (and also other BGP address family BGP tables). It will be also largely enough in case of network instability, events that involves more CPU/memory usage related convergence computation.
|
Warning |
---|
| - We have no incentive in proposing a server with the above brand. It just happen that this server was already bought and its configuration is matching perfectly the use case requirement but again, this is pure coincidence
- 10GE port connection might be overkill, but in a Service Provider context this is the norm. It will avoid adjacent core routers to implement 1GE connectivity
- PCIe GEN4 is available, and thus provide a tremendous amount of bandwidth for disk R/W operation. Though useful for the OS application, BGP RR setup won't take a direct advantage from PCIE GEN4.
- Indeed in this configuration considering the amount of RAM we have we will disable SWAP operations.
|
|
...
Expand |
---|
|
Code Block |
---|
language | bash |
---|
theme | Midnight |
---|
title | freeRouter launch with supplied rr1-hw.txt and rr1-sw.txt with a console prompt |
---|
| ╭─[6:06:13]floui@debian ~/freeRouter
╰─➤ java -jar lib/rtr.jar routersc etc/rr1-hw.txt etc/rr1-sw.txt 3 ↵
info cfg.cfgInit.doInit:cfgInit.java:556 booting
info cfg.cfgInit.doInit:cfgInit.java:680 initializing hardware
info cfg.cfgInit.doInit:cfgInit.java:687 applying defaults
info cfg.cfgInit.doInit:cfgInit.java:695 applying configuration
info cfg.cfgInit.doInit:cfgInit.java:721 done
welcome
line ready
rr1# |
Code Block |
---|
language | bash |
---|
theme | Midnight |
---|
title | Launch pcapInt in order to bind socket for both interface enp0s9 |
---|
| ╭─[6:06:13]floui@debian[1] ~/freeRouter/bin
╰─➤ sudo ./pcapInt.bin enp0s9 10012 127.0.0.1 10011 127.0.0.1 1 ↵
binded to local port 127.0.0.1 10012.
will send to 127.0.0.1 10011.
pcap version: libpcap version 1.8.1
opening interface enp0s9 with pcap1.x api
serving others
> |
Code Block |
---|
language | bash |
---|
theme | Midnight |
---|
title | Launch pcapInt in order to bind socket for both interface enp0s10 |
---|
| ╭─[6:06:13]floui@debian[1] ~/freeRouter/bin
╰─➤ sudo ./pcapInt.bin enp0s10 10022 127.0.0.1 10021 127.0.0.1 1 ↵
binded to local port 127.0.0.1 10022.
will send to 127.0.0.1 10021.
pcap version: libpcap version 1.8.1
opening interface enp0s10 with pcap1.x api
serving others
> |
|
...
Expand |
---|
title | Check telnet access for rr1@10010 |
---|
|
Code Block |
---|
language | bash |
---|
theme | Midnight |
---|
title | R1 rr1 telnet access from Virtualbox VM guest via port 112310010 |
---|
| ╭─[1:09:28]floui@debian ~
╰─➤ telnet localhost 232310010
Trying ::1...
Trying 127.0.0.1...
Connected to localhost.
Escape character is '^]'.
welcome
line ready
rr1# |
|
...
Expand |
---|
title | Connectivity test between rr1 and other BGP speakers |
---|
|
Code Block |
---|
language | bash |
---|
theme | Midnight |
---|
title | Check reachability from one RR client (c5 for example) |
---|
| c5#sh ipv4 route v1
typ prefix metric iface hop time
O 1.1.1.1/32 110/1 ethernet1 10.1.5.1 00:07:22
O 2.2.2.2/32 110/2 ethernet1 10.1.5.1 00:07:07
O 3.3.3.3/32 110/2 ethernet2 10.4.5.4 00:06:49
O 4.4.4.4/32 110/1 ethernet2 10.4.5.4 00:06:35
C 5.5.5.5/32 0/0 loopback1 null 00:25:07
O 6.6.6.6/32 110/2 ethernet1 10.1.5.1 00:06:00
O 7.7.7.7/32 110/3 ethernet1 10.1.5.1 00:05:46
O 8.8.8.8/32 110/2 ethernet2 10.4.5.4 00:05:17
O 10.1.2.0/24 110/1 ethernet1 10.1.5.1 00:25:06
O 10.1.4.0/24 110/1 ethernet2 10.4.5.4 00:25:05
C 10.1.5.0/24 0/0 ethernet1 null 00:25:07
LOC 10.1.5.5/32 0/1 ethernet1 null 00:25:07
O 10.1.6.0/24 110/1 ethernet1 10.1.5.1 00:25:06
O 10.1.10.0/24 110/1 ethernet1 10.1.5.1 00:25:06
O 10.2.3.0/24 110/2 ethernet2 10.4.5.4 00:24:53
O 10.2.6.0/24 110/2 ethernet1 10.1.5.1 00:25:05
O 10.2.7.0/24 110/2 ethernet1 10.1.5.1 00:24:56
O 10.2.11.0/24 110/2 ethernet1 10.1.5.1 00:24:56
O 10.3.4.0/24 110/1 ethernet2 10.4.5.4 00:25:05
O 10.3.7.0/24 110/2 ethernet2 10.4.5.4 00:24:53
O 10.3.8.0/24 110/2 ethernet2 10.4.5.4 00:24:50
O 10.3.11.0/24 110/2 ethernet2 10.4.5.4 00:24:53
C 10.4.5.0/24 0/0 ethernet2 null 00:25:07
LOC 10.4.5.5/32 0/1 ethernet2 null 00:25:07
O 10.4.8.0/24 110/1 ethernet2 10.4.5.4 00:25:05
O 10.4.10.0/24 110/1 ethernet2 10.4.5.4 00:25:05
O 10.10.10.10/32 110/4445 ethernet1 10.1.5.1 00:11:05
O 11.11.11.11/32 110/4446 ethernet1 10.1.5.1 00:09:01
c5#sh ipv4 ospf 1 topo 0
node reach via ifc met hop conn sr br neighbors
4.4.4.1 true 10.1.5.1 ethernet1 1 1 5 0 0 4.4.4.2=1=10.1.2.1 4.4.4.4=1=10.1.4.1 4.4.4.5=1=10.1.5.1 4.4.4.6=1=10.1.6.1 4.4.4.10=4444=10.1.10.1
4.4.4.2 true 10.1.5.1 ethernet1 2 2 5 0 0 4.4.4.1=1=10.1.2.2 4.4.4.3=1=10.2.3.2 4.4.4.7=1=10.2.7.2 4.4.4.6=1=10.2.6.2 4.4.4.11=4444=10.2.11.2
4.4.4.3 true 10.4.5.4 ethernet2 2 2 4 0 0 4.4.4.2=1=10.2.3.3 4.4.4.4=1=10.3.4.3 4.4.4.8=1=10.3.8.3 4.4.4.7=1=10.3.7.3
4.4.4.4 true 10.4.5.4 ethernet2 1 1 5 0 0 4.4.4.3=1=10.3.4.4 4.4.4.8=1=10.4.8.4 4.4.4.5=1=10.4.5.4 4.4.4.1=1=10.1.4.4 4.4.4.10=4444=10.4.10.4
4.4.4.5 true null null 0 0 2 0 0 4.4.4.1=1=10.1.5.5 4.4.4.4=1=10.4.5.5
4.4.4.6 true 10.1.5.1 ethernet1 2 2 2 0 0 4.4.4.1=1=10.1.6.6 4.4.4.2=1=10.2.6.6
4.4.4.7 true 10.1.5.1 ethernet1 3 3 2 0 0 4.4.4.2=1=10.2.7.7 4.4.4.3=1=10.3.7.7
4.4.4.8 true 10.4.5.4 ethernet2 2 2 2 0 0 4.4.4.3=1=10.3.8.8 4.4.4.4=1=10.4.8.8
4.4.4.10 true 10.1.5.1 ethernet1 4445 2 2 0 0 4.4.4.1=4444=10.1.10.10 4.4.4.4=4444=10.4.10.10
4.4.4.11 true 10.1.5.1 ethernet1 4446 3 1 0 0 4.4.4.2=4444=10.2.11.11
c5#sh ipv6 route v1
typ prefix metric iface hop time
O fd00::1/128 110/1 ethernet1 fd00:cafe::1:5:1 00:08:06
O fd00::2/128 110/2 ethernet1 fd00:cafe::1:5:1 00:07:51
O fd00::3/128 110/2 ethernet2 fd00:cafe::4:5:4 00:07:33
O fd00::4/128 110/1 ethernet2 fd00:cafe::4:5:4 00:07:19
C fd00::5/128 0/0 loopback1 null 00:25:51
O fd00::6/128 110/2 ethernet1 fd00:cafe::1:5:1 00:06:43
O fd00::7/128 110/3 ethernet1 fd00:cafe::1:5:1 00:06:29
O fd00::8/128 110/2 ethernet2 fd00:cafe::4:5:4 00:06:01
O fd00::a/128 110/6667 ethernet1 fd00:cafe::1:5:1 00:11:45
O fd00::b/128 110/6668 ethernet1 fd00:cafe::1:5:1 00:09:45
O fd00:cafe::1:2:0/112 110/1 ethernet1 fd00:cafe::1:5:1 00:25:49
O fd00:cafe::1:4:0/112 110/1 ethernet2 fd00:cafe::4:5:4 00:25:49
C fd00:cafe::1:5:0/112 0/0 ethernet1 null 00:25:51
LOC fd00:cafe::1:5:5/128 0/1 ethernet1 null 00:25:51
O fd00:cafe::1:6:0/112 110/1 ethernet1 fd00:cafe::1:5:1 00:25:49
O fd00:cafe::1:10:0/112 110/1 ethernet1 fd00:cafe::1:5:1 00:25:49
O fd00:cafe::2:3:0/112 110/2 ethernet1 fd00:cafe::1:5:1 00:25:37
O fd00:cafe::2:6:0/112 110/2 ethernet1 fd00:cafe::1:5:1 00:25:37
O fd00:cafe::2:7:0/112 110/2 ethernet1 fd00:cafe::1:5:1 00:25:37
O fd00:cafe::2:11:0/112 110/2 ethernet1 fd00:cafe::1:5:1 00:25:37
O fd00:cafe::3:4:0/112 110/1 ethernet2 fd00:cafe::4:5:4 00:25:49
O fd00:cafe::3:7:0/112 110/2 ethernet2 fd00:cafe::4:5:4 00:25:37
O fd00:cafe::3:8:0/112 110/2 ethernet2 fd00:cafe::4:5:4 00:25:37
O fd00:cafe::3:11:0/112 110/2 ethernet2 fd00:cafe::4:5:4 00:25:37
C fd00:cafe::4:5:0/112 0/0 ethernet2 null 00:25:51
LOC fd00:cafe::4:5:5/128 0/1 ethernet2 null 00:25:51
O fd00:cafe::4:8:0/112 110/1 ethernet2 fd00:cafe::4:5:4 00:25:49
O fd00:cafe::4:10:0/112 110/1 ethernet2 fd00:cafe::4:5:4 00:25:49
c5#sh ipv6 ospf 1 topo 0
node reach via ifc met hop conn sr br neighbors
6.6.6.1/00000000 true fd00:cafe::1:5:1 ethernet1 1 1 5 0 0 6.6.6.2/00000000=1=10012 6.6.6.4/00000000=1=10015 6.6.6.5/00000000=1=10012 6.6.6.6/00000000=1=10012 6.6.6.10/00000000=6666=10012
6.6.6.2/00000000 true fd00:cafe::1:5:1 ethernet1 2 2 5 0 0 6.6.6.1/00000000=1=10012 6.6.6.3/00000000=1=10012 6.6.6.7/00000000=1=10012 6.6.6.6/00000000=1=10013 6.6.6.11/00000000=6666=10012
6.6.6.3/00000000 true fd00:cafe::4:5:4 ethernet2 2 2 4 0 0 6.6.6.2/00000000=1=10013 6.6.6.4/00000000=1=10012 6.6.6.8/00000000=1=10012 6.6.6.7/00000000=1=10013
6.6.6.4/00000000 true fd00:cafe::4:5:4 ethernet2 1 1 5 0 0 6.6.6.3/00000000=1=10013 6.6.6.8/00000000=1=10013 6.6.6.5/00000000=1=10013 6.6.6.1/00000000=1=10013 6.6.6.10/00000000=6666=10013
6.6.6.5/00000000 true null null 0 0 2 0 0 6.6.6.1/00000000=1=10014 6.6.6.4/00000000=1=10014
6.6.6.6/00000000 true fd00:cafe::1:5:1 ethernet1 2 2 2 0 0 6.6.6.1/00000000=1=10015 6.6.6.2/00000000=1=10015
6.6.6.7/00000000 true fd00:cafe::1:5:1 ethernet1 3 3 2 0 0 6.6.6.2/00000000=1=10014 6.6.6.3/00000000=1=10015
6.6.6.8/00000000 true fd00:cafe::4:5:4 ethernet2 2 2 2 0 0 6.6.6.3/00000000=1=10014 6.6.6.4/00000000=1=10013
6.6.6.10/00000000 true fd00:cafe::1:5:1 ethernet1 6667 2 2 0 0 6.6.6.1/00000000=6666=10016 6.6.6.4/00000000=6666=10016
6.6.6.11/00000000 true fd00:cafe::1:5:1 ethernet1 6668 3 1 0 0 6.6.6.2/00000000=6666=10016 |
Code Block |
---|
language | bash |
---|
theme | Midnight |
---|
title | Ping from R2 -> R1from rr1 from c5 |
---|
| c5#ping 10.10.10.10 /vrf v1
pinging 10.10.10.10, src=null, vrf=v1, cnt=5, len=64, tim=1000, ttl=255, tos=0, sweep=false
!!!!!
result=100%, recv/sent/lost=5/5/0, rtt min/avg/max/total=0/0/1/4
c5#ping fd00::a /vrf v1
pinging fd00::a, src=null, vrf=v1, cnt=5, len=64, tim=1000, ttl=255, tos=0, sweep=false
!!!!!
result=100%, recv/sent/lost=5/5/0, rtt min/avg/max/total=0/0/1/4
c5# |
|
...