Job Description:
• Lead GFiber’s peering, caching and transit infrastructure design and IP planning. Optimizing traffic engineering, IX/Transit/CDN integrations, capacity planning to ensure optimal latency and cost-efficiency across the edge.
• Partner with software teams and Network Reliability Engineering team on the design and development of the GFiber automation stack, advancing low & zero touch operations and rapid configuration deployments across the core and edge
• Define and evolve standards for network observability and network health, integrating telemetry, fault management, and incident data to drive actionable insights and implement auto-remediation strategies.
• Serve as the Tier-3 escalation for complex routing, convergence, or large-scale DDoS issues, contribute to root cause analysis and propose remediation workflows to prevent recurrence.
• Collaborate with Product and Software teams to enable advanced service offerings (e.g., L2/L3 VPNs, DIA) and influence vendor roadmaps to align with GFiber’s business goals.
Requirements:
• Bachelor’s degree in Computer Science, Electrical Engineering, a related field, or equivalent practical experience.
• 7 years in service provider network design and operations, focusing on high-availability core environments.
• Knowledge of IP/MPLS protocols - BGP (v4/v6), IS-IS/OSPF, Segment Routing (SR-MPLS/SRv6), RSVP-TE, and EVPN, network troubleshooting and packet-level analysis tools (NetFlow, SNMP, Wireshark, TCPdump) and direct experience with multi-vendor platforms (e.g., Juniper, Nokia, Arista, or Cisco).
• Experience with automation and scripting for device configuration and validation (Python, Go, Ansible, or Netconf/YANG).
• Experience in incident management, telemetry design, and maintaining high-availability systems in a large network production environment.
• It's preferred if you have:
• Experience with building end to end automation workflows and event driven automation concepts.
• Familiarity with implementing Site Reliability Engineering (SRE) principles within a network context, including SLO/SLA definition and error budget management.
• Experience with modern telemetry stacks such as Prometheus, Grafana, or custom alerting systems.
• Deep understanding of BGP attributes and traffic engineering across an ISP core network.
• JNCIE-SP, CCIE Service Provider, or equivalent deep industry experience.
Benefits:
• bonuses
• cash award
• benefits
Apply tot his job
Apply To this Job