Software Development Engineer II, ML_AI Job at Amazon Development Center U.S., Inc., Bellevue, WA

MXMxc1RDMkQ5MXR1ZjF4cEdURGd4ZnByenc9PQ==
  • Amazon Development Center U.S., Inc.
  • Bellevue, WA

Job Description

DESCRIPTION

AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (Iot), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for customers who require specialized security solutions for their cloud services.

At Amazon SageMaker AI, we want to make it easy for our customers to train their deep learning workload in the cloud. With SageMaker HyperPod, AWS is building customer-facing services to empower data scientists and software engineers in their deep learning endeavors. As our customers rapidly adopt LLMs and Generative AI for their business, we’re building the next-generation AI platform to accelerate their development.

As an SDE II, you will be responsible for designing, developing, testing, and deploying distributed machine learning systems and large-scale solutions for our world-wide customer base. You will collaborate closely with a team of ML engineers/scientists and customers to train AGI and Amazon Q models. You'll assist in gathering and analyzing business and functional requirements, and translate requirements into technical specifications for robust, scalable, supportable solutions that work well within the overall system architecture. You will also drive the system architecture, spearhead best practices that enable a quality product, and help coach and develop junior engineers. A successful candidate will have an established background in engineering large scale software systems, a strong technical ability, great communication skills, and a motivation to achieve results in a fast paced environment.

About You:

You are passionate about building platform and products for large scale deep learning model training (100+ billion parameter GPT, 1000s of GPU devices). You have a proven track record of bringing innovative research to customers. You are able to thrive and succeed in an entrepreneurial environment and not be hindered by ambiguity or competing priorities. Ownership, delivering results, thinking big and analytical leadership are essential to success in this role.

You have solid experience in python, typescript and multi-threaded asynchronous C++/Go development. You have prior experience in one of: resource orchestrators like slurm/kubernetes, high performance computing, building scalable systems, experience in large language model training.

This is a great team to join to have a huge impact on AWS and the world's customers we serve!

A day in the life
Every day will bring new and exciting challenges on the job while you:

* Build and improve next-generation AI platform
* Collaborate with internal engineering teams, leading technology companies around the world and open source community - PyTorch, NVIDIA/GPU
* Create innovative products to run at scale on the AI platform, and see them launched in high volume production

About the team
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.

Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

EEO/Accommodations
AWS is committed to a diverse and inclusive workplace to deliver the best results for our customers. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status; we celebrate the diverse ways we work. For individuals with disabilities who would like to request an accommodation, please let us know and we will connect you to our accommodation team. You may also reach them directly by visiting

BASIC QUALIFICATIONS

- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language

Job Tags

Full time, Internship,

Similar Jobs

Cadre Technologies Services LLC

Senior ServiceNow Developer Job at Cadre Technologies Services LLC

 ...Role: Senior ServiceNow Developer Location: 100% Remote Duration: 6 months C2H An ideal candidate would be someone who has experience working with ITSM, ITOM, and ITAM, along with some event management and integrations experience as well. The role... 

Total Shape

Web Developer Job at Total Shape

 ...articles. The Role Your responsibilities will include: Design, code, test, and deploy web applications with HTML, CSS, and JavaScript, as well as other web technologies Develop responsive and interactive web applications using frameworks and libraries like... 

Kids R Kids Lilburn

Childcare Teacher/Certified CDA Evaluator Job at Kids R Kids Lilburn

 ...Childcare teacher trainer/Certified CDA evaluator (1099 contractor). Qualifications: At least 5 years trainer experience working in a daycare or similar childcare setting is preferred. Knowledge of child welfare guidelines and best practices. Strong communication... 

Mondo

ETL Developer Job at Mondo

Apply Now: ETL Developer (extract transfer load), this opportunity is 100% remote. The start date for this contract role is ASAP. Title: ETL Developer Location: 100% REMOTE (EST or CST ONLY!) Duration: 6-12 Months + Contract... 

Hoskbrew LLC

Retro Gaming Magazine Editor Job at Hoskbrew LLC

 ...friendly manner. Community Engagement: Actively participate in online retro gaming communities, forums, and social media to stay...  ...the retro gaming community. Excellent writing, editing, and proofreading skills. Strong research and interviewing skills....