The Role
Netflix has transformed the way audiences engage with shows and films by providing on-demand access to an extensive library of movies and television content. Recently, the company has broadened its entertainment portfolio to include live content, featuring events such as Chris Rock’s comedy specials, the SAG Awards and The Netflix Slam tennis match. Delivering real-time storytelling to over 270 million viewers worldwide presents a complex challenge, requiring outstanding levels of stability and reliability from numerous services and systems between the camera and the viewer’s device.
About the Role
In this position, you will be instrumental in supporting the live-streaming pipeline team and overseeing daily live-streaming operations at Netflix. As a Site Reliability Engineer (SRE) specializing in Live Streaming, you will be accountable for the reliability of our live-streaming pipeline, which encompasses transmission, encoding, packaging and origin processes. Your responsibilities will include implementing end-to-end observability and visualizing data to achieve the desired availability at scale. You will collaborate with cross-functional teams in the planning, validation and execution of initiatives focused on live streaming. Your role will influence every stage of the live event lifecycle, from planning through testing to the launch days of events. You will spearhead innovation initiatives, driving new features that enhance our live-streaming services, encoding processes and content delivery.
Responsibilities
- Drive Continuous Improvement: Work towards enhancing resilience, observability, monitoring, instrumentation and automation with the primary aim of maintaining highly scalable and reliable services across the globe.
- Testing and Analysis: Implement, automate and execute a comprehensive range of functional, performance, resilience and fault-injection tests focused on live-streaming delivery, and analyze the results.
- Stakeholder Coordination: Collaborate with various stakeholders to ensure the smooth execution of live-streaming events through effective coordination and partnership.
- Data Aggregation and Analysis: Collect, analyze and correlate vast amounts of server and application performance data. Utilize Netflix’s innovative Big Data platform as a highly flexible, specialized and efficient toolset for optimizing service delivery and enhancing system reliability.
- On-Call Participation: Engage in an on-call rotation and maintain flexibility in working hours based on the schedule of live events.
Qualifications
- Experience: A minimum of 5 years of experience in service reliability or operations, managing large-scale, high-performance systems and internet services, with a specific focus on live-streaming and video-on-demand (VOD) delivery.
- Video Transport Protocols: Familiarity with video transport protocols such as RTP, RTMP, SRT, UDP, Zixi, RIST, HLS and MPEG-DASH.
- Cache/Proxy Technologies: Knowledge and hands-on experience with HTTP cache/proxy technologies, specifically in supporting large-scale live-streaming delivery.
- Unix/Linux Proficiency: Advanced knowledge of Unix or Linux system engineering principles (networking, storage, operating systems) at scale.
- Networking Principles: Strong understanding of networking fundamentals, including transport and application protocols, especially TCP/IP, BGP, DNS, TLS and HTTP/S.
- Distributed Analytics: Experience in using distributed analytic processing technologies such as Hive, Presto/Trino and Spark SQL.
- Programming Skills: Proficiency in a programming language like Python or Go.
- Collaborative Work Style: Ability to thrive in a highly collaborative environment and effectively communicate with both internal and external partners.
- Education: A Bachelor’s degree in Computer Science, Electrical Engineering, Computer Engineering or a related field (or equivalent professional experience) is preferred.
Compensation Structure
Our compensation framework consists solely of an annual salary; we do not provide bonuses. Each year, you have the flexibility to decide how much of your compensation you wish to allocate towards salary versus stock options. To determine your compensation within the market range, we rely on market indicators and consider your specific job family, background, skills and experience. The salary range for this role is between $100,000 and $720,000.
Netflix offers a comprehensive benefits package that includes health plans, mental health support, a 401(k) retirement plan with employer matching, a stock option program, disability programs, health savings and flexible spending accounts, family-forming benefits, as well as life and serious injury benefits. We also have paid leave of absence programs. Full-time hourly employees accrue 35 days of paid time off annually, which can be used for vacations, holidays and sick leave. Full-time salaried employees have immediate access to flexible time off. More details about our benefits can be found here.
Commitment to Diversity
We are an equal opportunity employer and celebrate diversity, acknowledging that a variety of perspectives and backgrounds contribute to building stronger teams. We take our approach to diversity and inclusion seriously and thoughtfully. Discrimination based on race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status or military service is not tolerated.