AI/ML - Site Reliability Engineer (SRE), Siri Search, Knowledge & Platform

Portland, Oregon, United States

Machine Learning and AI

* Solid understanding of Linux fundamentals (filesystems, processes, memory, signals, etc.)

* Strong systems administration experience

* Understanding of standard networking protocols and principles (HTTP, TLS, DNS, UDP, TCP, MPTCP, etc.)

* Familiarity with version control

* Ability to write and review programs and scripts in Bash and one or more high-level languages such as Java, Ruby, Python, or Go

* Good grasp of distributed infrastructure and architecture, with an understanding of the CAP theorem, high-availability (HA) principles, etc.

* Experience with systems management and configuration management systems

* Strong troubleshooting principles for both systems and applications

* Working understanding of time-series database (TSDB) and logging basics, and of monitoring infrastructure

* Familiarity with containerization, resource management, and task scheduling systems

* Strong communication skills (written and verbal)

* Usage and understanding of basic CI/CD concepts

* Passion for eliminating toil through automation

**Description**

SREs in Siri are responsible for both infrastructure and the applications that run on top of it. We use a variety of open source and homegrown tooling to achieve our goals. We are a Linux-focused team running at scale, with regional deployments that serve our customers across the globe. We push for more automation, monitoring, QA, etc. at all parts of the development lifecycle to ensure that the code we push to production meets Apple’s high standards. As a member of our team, you are responsible for learning our internal tools, driving their future development, and implementing new processes of your own in order to increase automation and excellence. To this end, we perform the following work:

* On-call (rotating schedule)

* Code deployment to dev and production environments

* Automation

* Performance and scalability work

* Architectural improvements surrounding single points of failure (SPOF) and redundant systems

* Application and system troubleshooting

* Instrumentation, monitoring, and alerting

* Tooling

* Software updates and testing

* Operationalizing developmental features (sometimes referred to as "Launch Readiness" or "Launch Readiness Engineering")

**Education & Experience**

B.S. in Computer Science or relevant/equivalent experience in the field

**Apple Footer**

Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicants. Learn more from the United States Department of Labor.

Apple will consider for employment all qualified applicants with criminal histories in a manner consistent with applicable law. If you’re applying for a position in San Francisco, review the San Francisco Fair Chance Ordinance guidelines applicable in your area.

Apple participates in the E-Verify program in certain locations as required by law. Learn more about the E-Verify program.

Apple is committed to working with and providing reasonable accommodation to applicants with physical and mental disabilities. Apple is a drug-free workplace. Learn more about the Reasonable Accommodation and Drug Free Workplace policy.

Full time
Portland, OR

Published on 02/22/2021