CMIS (Common Management Interface Specification) has emerged as the leading management interface for the modules ranging from copper cables to coherent pluggable modules.\nSONiC provides support for custom-built CMIS compliant modules with extended capabilities like updating the firmware on the remote end of the cables like AEC and ACC and support of custom memory map.\nLearn how SONiC is extending the capabilities of CMIS compliant modules and providing following advantages:\n1. Custom feature support with custom memory map\n2. Platform/ASIC/Switch and module vendor agnostic support\n2. Dynamic port breakout support for modules supporting multiple applications\n3. Streaming telemetry for DOM (Digital Optical Monitoring)\n4. Custom host side and module side signal integrity support\nAlso- we can learn from Nvidia’s story of moving from their custom firmware handling of the modules to SONiC adoption for managing the modules with improved link reliability along with hitless upgrade of the NOS.
Sflow uses UDP datagrams to send packet samples to the collector and the samples include first N bytes of the actual payload data that may have sensitive information. UDP packets are not encrypted and hence prone to man-in-the-middle attacks. UDP being connectionless is also prone to packet losses that lead to inaccurate traffic measurements.
gNPSI (gRPC Network Packet Sampling Interface) addresses the security vulnerabilities in sending Sflow packets over UDP. gNPSI encapsulates the Sflow samples in gRPC format and hence adds authentication, re-transmission and encryption to the samples, enabling usage of these samples in critical network-control loops. gNPSI also changes the mode of the collector connection from a dial-out to dial-in.
This talk will cover the integration of gNPSI into SONiC Sflow stack and the benefits it brings to SONiC. It will cover the configuration, migration, monitoring and performance parity aspects of gNPSI in SONiC.
The control of transceivers is becoming increasingly complex with the advent of technologies like 400ZR and Co-Packaged Optics (CPO). Platforms utilizing these transceivers require coordinated control with switch ASICs. Enabling development in a virtual environment can enhance software quality and allow for experimentation with novel transceiver control mechanisms. In SONiC, transceiver control is primarily managed by xcvrd. However, xcvrd is not enabled in SONiC's virtual environment (VS), making it challenging to conduct such tests and experiments.
This session will provide an overview of the necessary modifications to SONiC to enable xcvrd in the virtual environment. By making these adjustments, we can facilitate the development and testing of advanced transceiver control software in the VS environment without the need for physical hardware. Additionally, we will introduce a transceiver emulator developed to support the testing of sophisticated transceiver controls, such as those required for 400ZR and CPO.
Join us at the OCP Summit 2024 to explore the revolutionary world of Smart Switch for BYOD(Bring Your Own Datapath)- a cutting-edge form factor empowered by SONiC. In this session- we will delve into the software architecture behind Smart Switch for enabling BYOD and its seamless integration with SONiC. \nIn this session- we will demonstrate SONiC's Bring Your Own Datapath (BYOD) capability through a critical Azure scenario involving policy support on Private Endpoints. This session showcases the collaborative efforts between Microsoft and NVIDIA- illustrating how our existing application can be seamlessly transitioned to NVIDIA DPUs in Smart-Switches without requiring any application modifications. Join us to explore how SONiC empowers intelligent networking with Smart Switches- ensuring you stay ahead of the curve in embracing future networking paradigms.
SONiC is revolutionizing enterprise networking. Dell- a key contributor from SONiC's inception- has extended its capabilities for enterprise use- promoting widespread adoption. Partnering with BE Networks- Dell streamlines SONiC deployment across Data Centers and Edge- enhancing automation- agility- and fabric resiliency. The solution supports advanced features like Network Time Traveler- Firmware Management- Cabling Autodetection- PoE Mgmt- Device & Port Security- and Topology Assurance.\nRecent updates introduce packet broker capabilities with SONiC- allowing for advanced traffic management and monitoring. Along with support for GenAI fabric management and enhanced fabric observability for AI workloads. These innovations are essential for managing complex AI environments and ensuring optimal network performance.
Broadcom is a large corporation (~25000 employee base- 23 semiconductor and infra software divisions) has a huge R&D investment with over 13+ DCs worldwide. In early 2019- Broadcom decided to upgrade its DC infrastructure from a legacy 3-tier architecture with high vendor lock-in costs to a more robust and scalable CLOS architecture using an Open NOS SONiC with an Open ethernet ecosystem. This presentation describes Broadcom's journey using SONiC from 2020 to 2024 - how we had to have a paradigm shift for moving towards disaggregation- initial challenges we faced- lessons learnt after upgrading to SONiC. In the end- we were able to achieve a tenant based architecture- remove vendor lock-ins and high hardware costs whilst improving quality- much faster- scalable and reliable DC- with lessened overall TCOP costs for the entire DC infrastructure.
Accomplished and Passionate Product Line Manager with 11+ years of progressive product management, technical marketing and solutions engineer experience spanning Enterprise Data Center and Campus Switching technologies.
Collaborating with SONiC routing WG- we have introduced the prefix independent convergence into SONiC. Current FRR / SONiC’s routes handling come from data center use cases. But routing domain is another story due to the scale and the introduction of VPN routes. Since the DCN solution leads to prolonged routes convergence time at network outages- the traffic loss would be significant. This is one of the key infrastructure enhancements required to extend SONiC domain from data center to routing use cases. \n\nIn this talk- we introduces how we reformat FRR/ SONiC routing domain related data structure:\n- Decouple prefix information with next hop information\n- Decouple next hop information and VPN context information. \n- Introduce next hop group ID and VPN context ID programming from FRR to SONiC\nWe introduce e quick fix based on existing routing information to minimize traffic loss window. We also use BFD to detect BGP next hop connectivity.
Applications experience/detect the network problem and then we troubleshoot. We repeatedly encounter this issue. Why legacy network performance measurement solutions cannot measure the application experience? The network fabric is built on top of IP. The nature of IP is ECMP. There are many ECMP paths between the fabric edges that connect our applications. Legacy solutions lack the scale required to measure all the ECMP paths. In addition, they rely on metrics such as min/max/average that don’t reflect the experience of each application. The IP Measurements solution (IPM) is a network performance measurement solution applicable to any IP Fabric (SRv6/MPLS/VXLAN). It provides the scale required to monitor the network experience of all applications via the HW integration. IPM provides detailed insights into the application experience through accurate metrics (latency histograms). Additionally, it reduces CAPEX and OPEX via elimination of external probing appliances. The SAI/SONiC implementation of IPM is available and currently in the process of upstreaming to the community SAI/SONiC. In this presentation we review the IPM solution and provide an update on the SAI/SONIC support.
Ahmed Abdelsalam is Technical Leader at Cisco. His main focus is the IP technology. Ahmed has been working on several IP engineering projects including SRv6 uSID and IP Measurements. He helps the customers in their SRv6 and IP Measurement adoption journey. He contributed to the SRv6... Read More →
As AI workloads and digital transformations continue to reshape enterprise IT landscapes- the demand for robust- scalable- and flexible networking solutions is more critical than ever. This panel will examine the potential of SONiC (Software for Open Networking in the Cloud) for enterprise adoption. We will explore its journey from being successfully utilized by hyperscalers such as Google and Microsoft to the unique challenges and opportunities it presents for enterprises. Key discussion points include the role of enterprise distributions- the importance of vendor-agnostic integration- and the essential support mechanisms necessary for large-scale enterprise deployment. Join us to evaluate if SONiC is truly equipped to meet the needs of modern enterprises.
Scaling data centers for GPU-as-a-Service (GPUaaS) customers- connecting multiple data halls poses significant challenges- especially as the number of VxLAN endpoints grows within the network fabric. This presentation explores how EVPN VxLAN Multisite can address these challenges by dividing the network fabric into smaller segments and isolating them with split-horizon groups. We'll discuss the standard implementation and how SONiC- combined with Free Range Routing (FRR) and Switch Abstraction Interface (SAI)- achieves this. Additionally- we'll cover important considerations for network design to avoid common pitfalls and ensure a robust- scalable infrastructure for AI clusters.
ECMP (Equal Cost Multi-pathing) is the most commonly used technique to distribute traffic across various types of networks. WCMP (Weighted Cost Multipathing) is an additional method that further optimizes traffic distribution. In this talk- we will discuss WCMP implementation in SONiC disaggregated chassis and its benefits for Microsoft Azure Data Centers.