Skip to main content
Back to jobs

AI Platform Engineer

External
xenergy logoXenergy · Rockville, MD
Full-timeHybrid2w ago
AWSCI/CDComplianceDatadogDockerDynamoDB
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Benefits

Vision insurance

Additional Information

X-energy LLC conducts a thorough recruiting process and will never issue offers without interview to discuss qualifications and responsibilities. All applications will be submitted via our company career page, www.x-energy.com/careers/ . We will never ask you to provide payment information as part of the recruiting process. If anyone claiming to represent X-energy directs you in a manner otherwise, please contact us at www.x-energy.com/contact-us . Job Description This role is responsible for designing, implementing, and maintaining the cloud infrastructure and CI/CD systems that power X-energy's AI-native application platform (APEX). The DevOps Engineer will work with the AI and Application Development team to accelerate deployment velocity, ensure system reliability, and enable the rapid delivery of AI-infused capabilities across engineering, manufacturing, regulatory, and deployment processes. This role requires expertise in containerization, cloud infrastructure automation, and modern DevOps practices to support X-energy's mission of becoming an AI-first organization and setting the industry standard for nuclear deployment speed and operational excellence. Job Profile Tasks/Responsibilities: Design, build, and maintain containerized applications using Docker, including image building, testing, versioning, and optimization for production deployment Develop and maintain GitLab CI/CD pipelines, including runner configuration, pipeline optimization, monitoring dashboards, and automated testing workflows Architect and manage AWS infrastructure using Terraform, including ECS (Elastic Container Service), ECR (Elastic Container Registry), VPC, EC2, ALB/NLB, and other cloud services Administer and optimize AWS data services including DocumentDB, OpenSearch, Redis, Aurora Postgres, DynamoDB, and S3 Implement and maintain comprehensive monitoring, alerting, and observability solutions using Datadog and CloudWatch Manage security and compliance requirements including ACM (AWS Certificate Manager) for certificate management and implementing security best practices across all infrastructure Lead release engineering efforts, including versioning strategies, deployment automation, and rollback procedures Collaborate with development teams to optimize application performance, troubleshoot production issues, and implement infrastructure improvements Leverage Claude Code and AI tools to accelerate infrastructure development and maintenance tasks Apply knowledge of LLMs and AI systems to support the platform's AI-native architecture Maintain professional demeanor and behavior at all times in all forms of communication Execute core tasks and responsibilities with minimal supervision in a fast-paced, team oriented environment Participate in on-call rotation to ensure system reliability and rapid incident response Maintain professional demeanor and behavior at all times in all forms of communication. Perform other duties as assigned by manager Job Profile Minimum Qualifications: Bachelor's degree in Computer Science, Information Technology, Engineering, or related field is required The skill required for this role are typically demonstrated by 10 plus years of relevant experience. 3+ years of hands-on experience with Docker containerization including building, testing, and deploying production applications Proven experience administering GitLab CI/CD systems, including runner setup, pipeline configuration, and troubleshooting Strong proficiency with Amazon Web Services, particularly ECS, ECR, VPC, and Terraform infrastructure-as-code Experience managing AWS data services such as DocumentDB, OpenSearch, Redis, Aurora Postgres, or DynamoDB Proficiency with Linux and/or macOS command-line environments Demonstrated experience with release engineering and deployment automation Knowledge of monitoring and observability tools (Datadog preferred) Understanding of AI/ML concepts and LLM architectures Proficiency with Claude Code or similar AI-assisted development tools Strong problem-solving skills and ability to work independently Excellent communication and collaboration skills Ability to work hybrid schedule in Rockville, MD office Tuesday, Wednesday, and Thursday Location: 530 Gaither Road, Rockville, MD. Work Site Expectations: 3 days in office. Travel Expectations: Up to 5% as needed. Hours: 8:00am-5:00pm, Mon-Fri.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at xenergy? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect