Sponsorship
25 results
based on your profileServiceNow
United States
Hy-Vee, Inc.
Madison, WI
Hy-Vee, Inc.
Rochester, MN
Hy-Vee, Inc.
Ottumwa, IA
Spectrum Plastics Group, A DuPont Business
Sandy, UT
Hitachi Energy
Pittsburgh, PA
Valnet
New York, NY
Hy-Vee, Inc.
Madison, WI
Hy-Vee, Inc.
Ottumwa, IA
Amtex Systems Inc
Montvale, NJ
Evergreen Healthcare Group
Tacoma, WA
Athleta
Collegeville, PA
Family Dollar
Blackshear, GA
Evergreen Healthcare Group
Missoula, MT
Evergreen Healthcare Group
Woonsocket, SD
Family Dollar
Raleigh, NC
Good Greek Moving & Storage
Orlando, FL
Lacoste
Atlanta, GA
Global Payments Inc.
United States
HomeWorks Energy, Inc.
West Hartford, CT
HomeWorks Energy, Inc.
Chicopee, MA
J.McLaughlin
Providence, RI
Wendy's
Orlando, FL
Sylvan Learning
Bellevue, WA
Wendy's
Orlando, FL
ML Engineer
Apply now
About the job
About The Company
Founded in sunny San Diego, California in 2004, ServiceNow has established itself as a global leader in enterprise cloud computing, revolutionizing how organizations work by connecting people, systems, and processes through innovative technology. Under the visionary leadership of Fred Luddy, the company has grown exponentially, serving over 8,100 customers worldwide, including 85% of the Fortune 500®. Our intelligent cloud platform leverages AI-enhanced solutions to enable smarter, faster, and more efficient workflows. Committed to making the world work better for everyone, ServiceNow continues to push the boundaries of technology, fostering a culture of innovation, collaboration, and excellence.
About The Role
We are seeking a highly skilled Staff Machine Learning Engineer to join our Platform Engineering and AI Technology Organization (PLATO) at ServiceNow. This role is pivotal in advancing our AI platform, building end-to-end AI-powered work experiences, and supporting the deployment and operation of large-scale AI workloads. The successful candidate will collaborate with research teams, AI engineers, and infrastructure specialists to design, develop, and optimize infrastructure, deployment pipelines, and observability features that ensure high performance, scalability, and reliability of AI systems. This position requires presence in our Santa Clara office two days per week and offers an exciting opportunity to work at the forefront of AI and cloud technology, shaping the future of intelligent enterprise solutions.
Qualifications
- Experience in integrating AI into work processes, decision-making, or problem-solving, including automation and AI-driven insights.
- Proficiency in prompt engineering and developing features based on large language models (LLMs).
- Hands-on experience with training, fine-tuning, and deploying large language models, including methods like distillation, supervised fine-tuning, and policy optimization.
- Experience working with AI productivity tools such as Cursor, Windsurf, etc.
- Operational experience with LLMs on NVIDIA GPUs.
- At least 4 years of development experience with Python, GoLang, Java, or similar programming languages.
- Minimum of 4 years of experience managing highly available distributed workloads on Kubernetes following DevOps practices.
- Proficiency with DevOps tools such as Helm, Ansible, Kubernetes, Prometheus, Splunk, and GitLab CI.
- Strong background in operating distributed systems built on Linux and J2EE platforms.
- Experience with software-defined networking, infrastructure as code, and configuration management.
- Knowledge of building secure and compliant software for regulated environments.
- Ability to lead projects with significant technical risks and deliver tangible outcomes.
- Preferred: Over 4 years of experience in infrastructure and platform operations, deployments, SRE, and continuous platform improvement.
Responsibilities
- Design, develop, and implement infrastructure, platform, deployment, and observability features that support AI workloads.
- Collaborate with cross-functional teams including researchers, AI engineers, and infrastructure specialists to optimize GPU clusters for performance, scalability, and reliability.
- Enhance the Site Reliability Engineering (SRE) practices by translating operational use cases into actionable software tooling requirements.
- Support deployment activities for AI/ML developers, ensuring smooth integration and operation of AI models.
- Write high-quality, scalable, and reusable code while adhering to best practices such as code reviews and unit testing.
- Work closely with product owners to understand detailed requirements and take ownership of code from design through testing and deployment.
- Operate and optimize large language models on NVIDIA GPUs, ensuring high efficiency and performance.
- Mentor colleagues and promote knowledge sharing within the team to foster a culture of continuous learning and improvement.
Benefits
- Competitive base salary ranging from $173,100 to $303,000, commensurate with experience and location.
- Equity options (when applicable) and variable/incentive compensation programs.
- Comprehensive health plans, including medical, dental, and vision coverage.
- Flexible spending accounts and a 401(k) plan with company matching contributions.
- Employee Stock Purchase Plan (ESPP) and matching donations programs.
- Flexible time-off policies, family leave programs, and wellness initiatives.
- Opportunities for professional growth and development in a cutting-edge technological environment.
Equal Opportunity
ServiceNow is an equal opportunity employer. We are committed to fostering an inclusive environment where all qualified applicants receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, or any other protected category under law.
Group Referrals
Min 8 Jobs Required to Begin
You've selected: 0 Jobs
No recommended referral jobs available