Senior Platform Engineer
ExternalFull-timeHybridToday
API DesignAWSAzureCI/CDClusteringDocumentation
Prepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Design, implement, and optimize the central AI Gateway MCP server - the single governed endpoint through which all AI client connections route, built on FastAPI + uvicorn for high-concurrency enterprise workloads
- Build and maintain the Redis ElastiCache session layer that binds Microsoft Entra identity to role-resolved MCP tool sets, including token lifecycle management, sliding TTL extension, per-user quota enforcement, and distributed rate limiting
- Implement the on-demand connector provisioning engine - a system that provisions compute containers with enterprise client drivers, establishes VPC-internal network paths, and retrieves credentials from AWS Secrets Manager automatically when a user's AI agent declares intent to access a data source
- Build enterprise system connectors as MCP tool sets: Oracle DB, SharePoint Graph API, Rally, ServiceNow, and a vendor connector approval pipeline with ECR container image scanning and an Aurora-backed connector registry
- Implement comprehensive automated testing: unit, integration, load testing (1,000+ concurrent users), and chaos testing for connector fault tolerance
- Build and maintain the full observability stack: structured logging, Prometheus metrics, Kinesis Firehose → OpenSearch indexing, and Grafana dashboards for per-user, per-tool, per-session audit trails
- Design and implement CI/CD pipelines for all platform components via GitHub Actions, with automated container image builds, ECS task definition updates, and blue/green deployments
- Own security controls: Entra OIDC token validation, PII masking on all tool responses, WAF rule management, Secrets Manager integration with autorotation, and OWASP-aligned secure API design
- Maintain and extend the existing Snowflake MCP codebase that forms the foundation of the platform, including session management, RBAC, PII masking, configuration management, and secrets integration modules
- Develop troubleshooting and diagnostic tools for production support
- Create documentation, runbooks, and operational playbooks for platform support and maintenance
Requirements
- Minimum Requirements:
- Bachelor's degree in a related discipline and 4 years' experience in a related field. The right candidate could also have a different combination, such as a master's degree and 2 years' experience; a Ph.D. and up to 1 year of experience; or 16 years' experience in a related field
- Python development (4+ years) with advanced async/await patterns, FastAPI, multiprocessing, and production performance optimization
- Model Context Protocol (MCP) - hands-on implementation experience with MCP servers, tool definitions, and client integration patterns; ability to read and extend the protocol specification independently
- AWS platform depth: ECS Fargate task lifecycle management, ElastiCache Redis (TLS, clustering, eviction policies), Secrets Manager, Route 53, ALB (sticky sessions, TLS termination), ECR, Aurora Postgres, SSM Parameter Store, CloudWatch, Kinesis Firehose
- Microsoft Entra ID / Azure AD integration: OIDC federation, group membership extraction via Graph API or JWT claims, RBAC pattern implementation
- Database integration and optimization: Oracle, PostgreSQL, Snowflake, SQL Server - including connection pooling, query optimization, and schema introspection
- Distributed systems patterns: circuit breakers, retry with e
Benefits
Job DescriptionVision insuranceRemote work optionsFlexible schedule
Additional Information
Company Cox Automotive - USA Job Family Group Engineering / Product Development Job Profile Sr Software Engineer Management Level Individual Contributor Flexible Work Option Hybrid - Ability to work remotely part of the week Travel % No Work Shift Day
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Cox Automotive? Share your experience