Are you a versatile engineer who excels at bridging the gap between innovative development and rock-solid operations? We are seeking a high-caliber Senior Full Stack & Operations Engineer to lead the end-to-end lifecycle of our next-generation services. In this role, you will not only architect and build robust frontend and backend systems but also take full ownership of their deployment, scaling, and long-term reliability across global cloud infrastructure. We are looking for a technical polyglot who thrives on learning new technologies, uses AI to solve complex engineering challenges, and remains calm under pressure during critical system incidents. If you are a proactive problem-solver who enjoys the intersection of software development and cloud-native orchestration, we want to hear from you.
Description
As a Senior Full Stack & Operations Engineer, you will be the cornerstone of our team's technical resilience and service delivery. Your responsibilities will span active development, live system operations, and in-depth problem triage:
- End-to-End Service Development: Design, build, and maintain scalable frontend and backend systems, ensuring seamless integration, high performance, and exceptional user experiences.
- Cloud Infrastructure & Deployment: Architect and manage robust deployment pipelines. Leverage your deep expertise in CI/CD, Kubernetes (K8s), and distributed cloud platforms (e.g., AWS, GCP, AliCloud) to ensure smooth, automated rollouts.
- Operational Excellence & Monitoring: Take ownership of system health and operational readiness. Create, manage, and optimize comprehensive monitoring dashboards and alerting systems to proactively identify anomalies before they impact users.
- Incident Response & Emergency Triage: Act as a critical responder during technical escalations. Demonstrate swift analytical skills and unwavering composure to troubleshoot, manage, and mitigate live system emergencies effectively.
- Deep Problem Analysis: Apply a highly proactive mindset to dive deep into system logs and complex, multi-tiered architectures to pinpoint the exact root causes of issues, driving them to permanent systemic fixes.
- AI-Driven Engineering: Proactively integrate AI-powered tools and methodologies to accelerate development, automate repetitive tasks, and perform intelligent root-cause analysis of system anomalies.
- Continuous Learning: Champion a culture of proactive learning, rapidly adapting to new frameworks, infrastructure paradigms, and emerging technologies to keep our tech stack at the cutting edge.
Preferred Qualifications
- Practical experience with iOS development technologies and strong familiarity with Swift.
- Understanding of mobile-to-cloud architectures and client-side profiling tools (e.g., Xcode, Instruments).
- Experience building custom internal developer tools or advanced automation frameworks for operational efficiency.
- Highly adaptable and resilient, with the proven ability to maintain composure and drive swift solutions during critical, time-sensitive software emergencies.
Minimum Qualifications
- BS/MS in Computer Science, Software Engineering, Information Systems, or equivalent practical experience.
- Solid foundation in Computer Science with proven experience in full-stack software development, encompassing both backend services and frontend interfaces.
- Strong proficiency in scripting and programming languages, specifically Python and Linux shell scripting.
- Hands-on experience managing and deploying to modern cloud environments (e.g., GCP, AWS, AliCloud) and deep familiarity with containerization and orchestration using Kubernetes (K8s).
- Demonstrated expertise in building, maintaining, and scaling CI/CD pipelines and automated deployment workflows.
- Proven experience in live incident response and operational monitoring, including the creation and management of observability dashboards.
- Excellent analytical and root-cause analysis skills, with a highly proactive, self-starting mentality and an eagerness to continuously learn.
- Demonstrated ability to effectively utilize AI tools to enhance productivity, streamline debugging, and optimize operational workflows.