Senior Network Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
As a Senior Network Engineer at Together, you are responsible for designing, implementing, and maintaining our network infrastructure to ensure seamless connectivity and optimal performance for all user-facing services and production systems. As both a strategic planner and a hands-on engineer, you apply sound networking principles, operational discipline, and advanced automation to our network environments. You specialize in networking systems-including routing, switching, network security, and protocols-implementing best practices for availability, reliability, and scalability. You have a keen interest in network design, optimization, and emerging technologies in HPC-based data center networking. Outstanding problem-solving abilities and a comprehensive understanding of fundamental network theory are also critical to your success.
Responsibilities
- Design, deploy, manage and maintain global multi-vendor, multi-protocol high performance compute networks.
- Analyze data to diagnose and identify root causes to network issues to minimize downtime
- Evaluate and recommend network technologies, hardware, and software solutions.
- Participate in design reviews to ensure the proposed network architecture aligns with business needs and is optimized for performance, scalability, and reliability.
- Manage relationships with external vendors and partners to test and verify hardware and software selections.
- Develop, and deploy systems and tools to keep all networks running reliably and efficiently
- Establish and implement industry best practices and contribute to the design of new scalable network solutions
- Ensure compliance with IT governance standards and best practices.
- Lead projects to address complex technical challenges, directly contributing to roadmaps and partner alongside the best engineers in the industry to develop world-class solutions
- Preferred
- Knowledge of RoCE and Infiniband protocols a plus
- Experience with Docker, Kubernetes, or Slurm a plus
- Understanding of AI training workloads and the demands they exert on networks a plus
- About Together AI
Requirements
- 8+ years of professional experience building, managing, and supporting large-scale hybrid data center networks (excluding enterprise networks).
- High level of proficiency with TCP/IP networking architecture and technologies such as BGP, OSPF, VXLAN, EVPN, and QoS.
- Experience developing network automation pipelines using Python, Ansible, or other languages/tools utilized in infrastructure automation.
- Proficient in using tools such as Wireshark, tcpdump, nmap, MTR, and curl to identify connectivity issues, latency problems, and network bottlenecks.
- Experience designing and supporting multi-tenant networks
- Hands-on experience deploying and supporting network devices from Cisco, Arista, Juniper, and Mellanox.
- Experience working with cloud networks such as AWS, GCP, and Azure.
- Solid experience working in and troubleshooting within a Linux environment.
Benefits
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Together AI? Share your experience