Additional Information
Job Summary:
Squarepoint is seeking a Platform Specialist to join our global Platform Compute (PLC) team. This role is ideal for experienced engineers with a strong software development background who are passionate about building scalable, resilient infrastructure systems. You will architect, develop, and optimize compute platforms that support our high-performance, data-intensive workloads. You'll work closely with global engineering teams to design and implement infrastructure-as-code, observability pipelines, and self-healing systems. This is a hands-on engineering role with a strong emphasis on automation, performance tuning, and developer enablement.
Main Duties & Responsibilities:
Architect and evolve scalable compute platforms using modern infrastructure engineering practices.
Design and implement automation frameworks for provisioning, configuration management, and lifecycle operations.
Develop internal tooling and APIs to abstract infrastructure complexity and improve productivity.
Take part in bulk server provisioning, which may include coordinating remote hands to complete tasks.
Drive observability initiatives by building and integrating telemetry pipelines (metrics, logs, traces).
Collaborate with software engineering teams to ensure infrastructure supports application scalability, reliability, and security.
Mentor junior engineers and contribute to engineering best practices across the team.
Participate in on-call rotations, incident response, and postmortems, driving long-term improvements through automation and architectural changes.
Qualifications/Skills Desired:
Expert-level Linux systems administration (RHEL/CentOS/Ubuntu)
Configuration management (e.g., Ansible, Chef, SaltStack)
Proficiency in Python, Go, or Rust for infrastructure tooling
Experience with AWS, GCP, or Azure (compute, storage, networking, IAM)
Infrastructure-as-Code (e.g., Terraform, Pulumi)
Monitoring and alerting (e.g., Prometheus, Grafana, Datadog, Zabbix)
Logging and tracing (e.g., ELK stack, Fluentd, OpenTelemetry, Jaeger)
Identity and access management (e.g., LDAP, Kerberos, OAuth2)
Cloud-native services (e.g., S3, EBS, GKE, EKS, Cloud Functions)
Git-based workflows and version control
Agile methodologies and DevOps culture
Test-driven infrastructure development and automated testing frameworks
Bachelor's or Master's degree in Computer Science, Engineering, or a related field