Job Description:
• Support our live streaming pipeline team and day-to-day live-streaming operations for Netflix
• Responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin)
• Instrumenting end to end observability and visualizing the data to achieve the desired availability at scale
• Working with cross functional teams in the preparation, validation, and execution of live streaming focused initiatives
• Impact multiple areas of the live event lifecycle, from the planning phase through testing and event launch days
• Lead innovation initiatives, driving new features that will enhance our live streaming services, encoding & content delivery
• Drive continual improvement in resilience, observability, monitoring, instrumentation, and automation with the primary goal to maintain highly scalable and reliable services worldwide
• Implement, automate, execute, and analyze results from a broad range of live streaming delivery focused functional, performance, resilience, and fault injection testing
• Coordinate, collaborate, and partner across multiple stakeholders for the smooth execution of live-streaming events
• Aggregate, analyze, and correlate large amounts of server and application performance data
• Use the innovative Netflix Big Data platform as a highly flexible, specialized and efficient toolset for service delivery optimization and system reliability improvements
• Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule
Requirements:
• 5+ years service reliability/operational experience running large scale, high performance systems & internet services with focus on live-streaming and video-on-demand (VOD) delivery
• Experience with video transport protocols such as RTP, RTMP, SRT, UDP, Zixi, RIST, HLS, MPEG-DASH
• Knowledge of and proven experience with HTTP cache/proxy technologies
• Experience supporting live-streaming delivery at scale
• Expert-level knowledge of Unix or Linux system engineering fundamentals (networking, storage, operating systems) at scale
• Proficient understanding of networking principles, transport, and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S
• Experience with using distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
• Proficient in a programming language such as Python or Go
• Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience)
Benefits:
• Health Plans
• Mental Health support
• 401(k) Retirement Plan with employer match
• Stock Option Program
• Disability Programs
• Health Savings and Flexible Spending Accounts
• Family-forming benefits
• Life and Serious Injury Benefits
• Paid leave of absence programs
• Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off
• Full-time salaried employees are immediately entitled to flexible time off
Apply tot his job
Apply To this Job