System Architect Director - AI Platform Engineering
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Benefits
Additional Information
You have a clear vision of where your career can go. And we have the leadership to help you get there. At CNA, we strive to create a culture in which people know they matter and are part of something important, ensuring the abilities of all employees are used to their fullest potential. The System Architect (SA) Director for AI Platforms Engineering serves as the technical owner for the enterprise AI platform which is the shared foundation powering all AI and GenAI products across the organization. This leader owns the platform's architecture, engineering standards, and delivery roadmap, translating strategic AI capabilities into reliable, scalable, and governed platform capabilities that accelerate every product team building on top of them. Working in close partnership with Enterprise Architects, Product Management, and Release Train Engineers (RTEs), the SA Director ensures that platform investments are tightly aligned to business outcomes, compliance requirements, and engineering excellence. This role combines the strategic depth of a principal architect with the hands-on leadership of a delivery-focused engineering director. JOB DESCRIPTION: Essential Duties & Responsibilities Performs a combination of duties in accordance with departmental guidelines: Own and continuously evolve the enterprise AI Platform reference architecture, encompassing all critical layers including model serving, orchestration engines, data and knowledge grounding pipelines, observability infrastructure, and ensuring the platform scales reliably to enterprise-grade workloads and usage patterns. Define and enforce platform-wide standards, reusable design patterns, and golden-path templates that enable product and feature teams to build, deploy, and operate AI solutions safely, consistently, and with significantly reduced time-to-production. Drive end-to-end delivery of new platform capabilities - from initial technical discovery and architecture design through prototyping, hardening, and full production rollout while maintaining meaningful hands-on involvement at critical technical milestones to ensure quality and coherence. Architect and operationalize the core platform service catalog, including LLM gateway and routing layers, prompt lifecycle management, agentic orchestration frameworks, Retrieval-Augmented Generation (RAG) pipelines, vector stores, model registries, and rigorous automated evaluation infrastructure. Build and maintain robust CI/CD and AIOps pipelines specifically designed for AI systems, incorporating automated evaluation gates, model and data versioning controls, staged deployment promotion, and continuous cost and performance optimization guardrails. Architect enterprise-grade multi-agent and single-agent workflow patterns for high-value business use cases, establishing clear standards for orchestration design, state and memory management, tool and API integration, and safe autonomy controls including human-in-the-loop approvals, permission scoping, and comprehensive audit trails. Design and implement knowledge grounding systems - spanning hybrid retrieval strategies, semantic reranking, ontology-driven entity modeling, and knowledge graph integration - to measurably improve AI output accuracy, traceability, and readiness for regulatory audit. Embed responsible AI and compliance-by-design principles into every layer of the platform, covering data privacy protections, enterprise secrets management, granular access controls, output leakage prevention, and model risk governance practices aligned to enterprise and regulatory standards. Actively shape PI Planning by authoring well-defined Enabler Epics and articulating architectural outcomes that anchor near-term delivery and long-horizon platform capability roadmaps, while contributing expert WSJF input to balance platform investment against feature team needs, risk reduction, and time-to-impact. Directly manage, mentor, and grow a high-performing team of platform engineers, solution architects, and technical specialists - hiring hands-on builders, coaching technical leadership skills, and sustaining a healthy innovation pipeline that continuously advances the organization's AI platform maturity. May perform additional duties as assigned. Skills, Knowledge & Abilities Deep AI Platform and AIOps engineering expertise, including hands-on experience designing, deploying, and operating shared AI platform capabilities such as model serving layers, LLM gateway and proxy services, prompt registries, vector databases, and automated evaluation harnesses at enterprise scale. Proven agentic system design capability, with hands-on experience architecting multi-agent and single-agent workflow systems using orchestration frameworks such as Lang Graph, Google ADK - including tool and function calling patterns, state and memory persistence strategies, and robust safe autonomy controls. Applied GenAI depth spanning LLM solution architecture patterns, model select