Site Reliability Engineer (Top Secret Clearance)
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Develop automation to deploy and manage compute resources both on-premises and in the cloud
- Build, maintain, and scale on-premises hardware systems designed to host GPU-accelerated machine learning workloads
- Deploy and manage core infrastructure such as databases, monitoring and storage
- Closely collaborate with software engineers to create highly scalable, operable and maintainable products
- Engage in and improve the whole lifecycle of services -- from inception and design, through deployment, operation and refinement
Requirements
- Bachelor's degree in computer science, information systems/IT, or an engineering discipline; OR 2+ years of professional experience in software, DevOps, or site reliability engineering in lieu of a degree
- 1+ year of experience with Kubernetes
- 1+ year of experience with Linux operating systems
- Experience in Bash, Python, and/or other scripting languages
- Experience building, maintaining, and scaling on-premises and/or cloud systems designed
- PREFERRED SKILLS AND EXPERIENCE:
- Active Top Secret, Top Secret SCI, or DOE Level Q clearance is highly desired
- Experience hosting and pushing the state of the art in inferential model benchmarks
- Experience with systems administration, site reliability engineering, or DevOps engineering
- Experience with Python and Python-based development frameworks
- Experience with virtualization and hypervisor technologies
- Experience with automatically managing dozens or hundreds of servers
- Knowledge of performance bottlenecks and performance improvement techniques
- Excellent communications skills with the ability to communicate with customers, peers, management etc. in both formal and informal situations
- Ability to quickly learn new tools and frameworks.
- ADDITIONAL REQUIREMENTS:
- An active clearance may provide the opportunity for you to work on sensitive SpaceX missions; if so, you will be subject to pre-employment drug and random drug and alcohol testing
- Must be willing to work extended hours and weekends as needed
- COMPENSATION AND BENEFITS:
- Pay Range:
- Level 3: $145,000.00 - $175,000.00
- Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience.
- Those with an active clearance will receive a 10% differential, up to an additional $20,000 annually, once officially briefed into a classified program.
- ITAR REQUIREMENTS:
- Applicants wishing to view a copy of SpaceX's Affirmative Action Plan for veterans and individuals with disabilities, or applicants requir
Benefits
Additional Information
SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER (TOP SECRET CLEARANCE) As a member of the Classified IT Systems Engineering team, the Site Reliability Engineer is involved in designing scalable systems capable of supporting a growing volume of data products being generated in mass. We build tools that enable us to work more efficiently, and that help us build software systems that are secure, reliable, and autonomous. Our engineers are responsible for the life cycle of the systems they create, including development, testing, and operational support.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at SpaceX? Share your experience