Anuket Project

2022-03-23 ViNePerf Agenda and Minutes

Attendees

Sridhar Rao

Luc Provoost

Daniele Zulberti

Al Morton

Federica Paganelli


Sl. No

Topic

Presenters

Notes

1

Development Update


Trying to successfully deploy L3 network without manual configuration... 

Limited to 2 CNIs: SRIOV and Userspace

  1. Userspace OVS - Only Bridge mode is supported (L2).
  2. Userspace VPP - CNI is not adding IP/routes to the Interfaces in VPP. It assigns the IP on the container-side, but not on the host-side. Expect some forwarding rules to be added on the host-side.
2.Discussion of the results

Summary :

  1. VPP performs better than OVS. OVS performance is also inconsistent. OVS: Latency too is high ???
  2. Prox performs better than T-Rex
  3. Bi-Directional Prox is more inconsistent (throughput) than unidirectional
  4. Trex results are not consistent.
  5. Increasing hops, even with single node, affects the performance.  Better summary for varying topologies
  6. Prox has some limitations - mainly w.r.t core assignments - which affects the performance.

To Try:

Topology-1

VPP + Prox

Bidirectional, 64Bytes, ...

Variables: Cores (6), Rx_queues, Queue_size, Tx_Queues - double these values. 

3.Prox-L3

Prox-Tgen and Swap respond to arp - Include this in core configurations.

sub mode=l3

If You want an interface with only latency configured - but still need to respond to ARPs, configure a generator but set bps=0.

Need an interface that is configured for both receiving and sending

sub mode=l2 is also possible (that's a lower-case  L)

bps=0 on a core configured as a generator means that the interface will still respond to ARP packets


${local_hex_mac1} 00 00 00 00 00 00 08 00 45 00 00 2e 00 01 00 00 40 11 f7 7d ${local_hex_ip2} ${local_hex_ip1} 0b b8 0b b9 00 1a 55 7b

Source-MAC, Dest-Mac, .....

Copy/paste the source dest MAC and higher addresses  into https://hpd.gasmi.net/ for analysis

( a rough rule of thumb is the a core using DPDK requires 100 cycles to deal with a packet. so take the CPU clock and divide by 100 to get the Maximum packet header processing rate. the denominator could be higher in practice, Luc indicated it could be 200 or so)


4.There Is a Korean team working on Benchmarking CN scenarios

https://datatracker.ietf.org/meeting/113/materials/slides-113-bmwg-considerations-for-benchmarking-network-performance-in-containerized-infrastructures-slides-113-bmwg-considerations-for-benchmar-00