Software Engineer, Systems ML

**Summary:**

Meta is looking for software engineers to play a pivotal role in enhancing our AI training and inference infrastructure. As a core member of our team, you will play a significant role in improving the performance of our AI models and in building user-facing APIs for our ML engineers. Your expertise will enable us to reach new heights in model training and inference. The position requires a combination of expertise in machine learning and software engineering.

**Required Skills:**

Software Engineer, Systems ML Responsibilities:

1. Analyze performance and power efficiency of deep learning inference workloads

2. Build user-facing frameworks for efficient training and inference

3. Identify and prototype opportunities for performance optimization

4. Partner with teams across Meta Reality Labs to optimize key training and inference workloads

**Minimum Qualifications:**

5. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.

6. Specialized experience in one or more of the following machine learning/deep learning domains: GPU architecture, ML systems, AI infrastructure, high performance computing, performance optimizations, machine learning frameworks (e.g. PyTorch), or numerics.

7. Experience developing AI system infrastructure or AI algorithms in C/C++ or Python.

**Preferred Qualifications:**

8. Master's or PhD degree in Computer Science or Computer Engineering.

9. 5+ years of experience in ML framework development or accelerating deep learning models on hardware architectures.

10. Experience with distributed systems development.

11. Experience optimizing machine learning model inference and training, specifically on NVIDIA GPU hardware.

12. Experience with CUDA.

13. Extensive experience in profiling GPU performance and distributed training.

14. Deep understanding of OpenAI Triton and of techniques for mixed-precision training.

**Public Compensation:**

$177,008/year to $251,000/year + bonus + equity + benefits

**Industry:** Internet

**Equal Opportunity:**

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.

Software Engineer, Systems ML

Full time
Salt Lake City, UT

Published on 04/10/2024