Do you want to help build some of the largest and most consequential enterprise and customer technology systems in the world? Join Apple’s Information Systems and Technology (IS&T) organization.IS&T is the engine behind everything Apple does for customers and for the people who build for them. It’s Apple’s central nervous system. Supporting 2.5 billion active Apple devices, processing billions of secure transactions, and keeping the technology that defines modern life running flawlessly, IS&T makes the impossible feel effortless.
Do you love building solutions to handle global complexity and immense scale? Imagine what you could do here.
Infrastructure Services is part of IS&T and the foundation of Apple's global network operations - managing data center equipment and systems to deliver compute, storage, and networking services for teams across Apple, including its internal developer community. From individual facilities to a worldwide network, Infrastructure Services ensures the technology underneath everything works without question
Description
The Apple Networking team builds software-defined cloud network infrastructure as a part of Apple Cloud. Our infrastructure is a critical foundation in delivering Apple’s services (such as iCloud, iTunes, Siri, Maps) to billions of customers. We are a fast paced organization where drive and collaboration are the keys to success. Teams across Apple rely critically on us for infrastructure that help them build services that scale globally, are highly available, and “just work”.
As a Operations Engineer you will be responsible day to day operations of K8 infra, Software Firewalls, VPC management, Incident management of the cloud network services to maintain high availability, scale and resilience. The successful candidate is expected to be highly self-motivated with a passion for excellence, quality and detail in driving solutions. This position requires you to be able to do on-call rotation on periodic basis","responsibilities":"Assist in managing and supporting SDN infrastructure across data centers, cloud, and hybrid environments.
Help with configuration and troubleshooting of basic networking (IP, DNS, firewalls)
Perform effective incident management and triage issues on P1 and P0
Extraordinary interpersonal communications and customer-service skills
Ability to maintain composure and customer-service focus in stressful situations
Position requires occasional on-call rotation support
Contribute to routine monitoring and incident response workflows
Support infrastructure automation and deployment using tools like Ansible or basic scripting
Work with Kubernetes for basic cluster maintenance and app deployments
Document processes and support runbooks for operational consistency
Collaborate with senior team members to resolve technical issues
Preferred Qualifications
BS or MS in Computer Science or equivalent industry experience
At least 2+ year experience in Network troubleshooting
Understanding of the following technologies: OSPF, BGP, STP, TCP/IP, 802.11ac/n/g/b, 802.1X
ACL management and implementation
Experience with SDN controllers and virtualized network infrastructure would be a Plus.
Experience supporting environments with thousands of servers and critical uptime requirements
Experience with IP network design and architecture, routing, firewalls, and switching configuration.
Experience with modern web-scale services including servers, vips, load balancers, proxies is big plus
Experience working with monitoring and metrics platforms like Splunk and Prometheus
Experience with 3 party cloud like AWS., GCP or AliCloud is a Big Plus
Minimum Qualifications
Strong experience with Linux systems administration, including configuration, automated provisioning and monitoring
Able to proficiently coordinate and troubleshoot with internal and external partners within a large scale enterprise
Experience evaluating & managing outages to business critical infrastructure
Ability to thrive under pressure and navigate ambiguous situations
Kubernetes experience, including cluster management as well as application deployment and configuration
Proficient in at least one of these languages: Python, Ansible, Shell
Solid grasp of fundamentals of modern technology stacks: containers, IP networking, HTTP, SSL
Understanding of TCP/IP, routing/switching, firewalls, and network protocols.
Familiarity with "infrastructure-as-code" approach and tools
Able to quickly learn and adapt to new technologies
Strong operational and troubleshooting skills