Software Engineering, MTS (SRE & Devops)
About the role
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category
Software Engineering

Job Details

About Salesforce
Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn't a buzzword - it's a way of life. The world of work as we know it is changing and we're looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce's core values at the heart of it all. Ready to level up your career at the company leading workforce transformation in the agentic era? You're in the right place! Agentforce is the future of AI, and you are the future of Salesforce.

About Salesforce Tech and Product Engineering
Our Tech and Product team is tasked with innovating and maintaining a massive distributed systems engineering platform that ships hundreds of features to production for tens of millions of users across all industries every day. Our users count on our platform to be highly reliable, lightning fast, supremely secure, and to preserve all of their customizations and integrations every time we ship. Our platform is deeply customizable to meet the differing demands of our vast user base, creating an exciting environment filled with complex challenges for our hundreds of agile engineering teams every day.

Role Description:
Salesforce is looking for Site Reliability Engineers to build and manage a multi-substrate Kubernetes and microservices platform that powers Core CRM and a growing set of applications across Salesforce. This platform provides the ability to develop and deploy microservices quickly and efficiently, accelerating their path to production.
In this role:
- You are responsible for the high availability of a large fleet of clusters running technologies such as Kubernetes, software load balancers, and service mesh.
- You'll gain valuable experience troubleshooting real production issues, which will expand your knowledge of the architecture and internals of Kubernetes ecosystem services.
- You will contribute code wherever possible to drive improvement.
- You will drive automation efforts in Python/Golang/Terraform/Spinnaker/Puppet/Jenkins to eliminate manual work in day-to-day operations.
- You will help improve the visibility of the platform by implementing the necessary monitoring and metrics.
- You'll implement self-healing mechanisms that proactively fix issues and reduce manual labor.
- You will get a chance to improve your communication and collaboration skills by working with various other Infrastructure teams across Salesforce.
- You will be interacting with a highly innovative and creative team of developers and architects.
- You will evaluate new technologies to solve problems as needed.

You are the ideal candidate if:
- You have a passion for live site service ownership.
- You have demonstrated a strong ability to manage large distributed systems.
- You are comfortable with troubleshooting complex production issues that span multiple disciplines.
- You bring a solid understanding of how infrastructure software components work.
- You are able to automate tasks using a modern high-level language.
- You have good written and spoken communication skills.

Required Skills:
- 4-7 years of experience in SRE/DevOps/Systems Engineering roles
- Experience operating large-scale distributed systems, especially in cloud environments
- Excellent troubleshooting skills with the ability to learn new technologies in complex distributed systems
- Strong working experience with Linux systems administration and good knowledge of Linux internals
- Good experience in any of the scripting/programming languages: Python, Golang, etc.
- Basic knowledge of networking protocols and components: TCP/IP stack, switches, routers, load balancers
- Experience with any of Puppet, Chef, Ansible, or other DevOps tools
- Experience with any of the monitoring tools such as Nagios, Grafana, Zabbix, etc.
- Experience with Kubernetes, Docker, or service mesh
- Experience with AWS, Terraform, Spinnaker
- A continuous learner and a critical thinker
- A team player with great communication skills

Unleash Your Potential
When you join Salesforce, you'll be limitless in all areas of your life. Our benefits and resources support you to find balance and be your best, and our AI agents accelerate your impact so you can do your best. Together, we'll bring the power of Agentforce to organizations of all sizes and deliver amazing experiences that customers love. Apply today to not only shape the future - but to redefine what's possible - for yourself, for AI, and the world.

Accommodations
If you need a reasonable accommodation during the application or the recruiting process, please submit a request via this Accommodations Request Form.

Please note that Salesforce uses artificial intelligence (AI) tools to help our recruiters assess and