Software Developer (Agentic Evaluation)

External

Autodesk · Toronto, Canada

Full-time · On-site · Today

AWS · Azure · CI/CD · Machine Learning · Playwright · PyTest

Responsibilities

Develop and orchestrate multi-agent AI systems for automated test generation, test execution, and end-to-end development workflow optimization using frameworks like LangGraph, AutoGen, or the Anthropic Agent SDK (Claude Code)
Design and implement agentic workflows that coordinate multiple AI agents to autonomously drive test automation across UI, API, integration, and system levels, from test case synthesis to result evaluation, ensuring seamless integration with existing developer tools and MCP-compatible services
Build evaluation frameworks and custom benchmarks for agentic systems, including comparisons of AI agents against commercial solvers, using tools like AgentBench and Langfuse
Evaluate MCP server and tool performance across agentic pipelines, measuring latency, accuracy, context fidelity, and end-to-end task completion rates

Requirements

BS/MS in Computer Science, Machine Learning, or a related applied AI field
Expertise in Python and ML frameworks (PyTorch, Transformers, scikit-learn)
Experience with Large Language Models applied to software understanding or test generation
Knowledge of AI evaluation methodologies and metrics for agentic task completion and test quality
Strong foundation in statistical analysis and experimental design
Experience with developer workflow and productivity measurement frameworks
Background in software engineering or QA with close collaboration with development teams
Familiarity with test automation frameworks (e.g., Playwright, Selenium, Pytest, Appium) and CI/CD pipelines
Experience designing benchmarks that compare AI agents against commercial or domain-specific solvers
Hands-on experience with MCP (Model Context Protocol), including building, evaluating, and optimizing MCP servers and tool integrations within agentic pipelines
Experience with agentic AI frameworks including LangGraph, AutoGen, or the Anthropic Agent SDK / Claude Code
Knowledge of vision-language models or multi-modal AI for UI and system-level understanding and evaluation
Experience with Azure AI Foundry/ML or AWS cloud ML platforms
______________________________________________________________________________
Position Overview
Responsibilities
Develop and orchestrate multi-agent AI systems for automated test generation, test execution, and end-to-end development workflow optimization, using frameworks such as LangGraph, AutoGen, or the Anthropic Agent SDK (Claude Code)
Build evaluation frameworks and custom benchmarks for agentic systems, including comparisons between AI agents and commercial solvers, using tools such as AgentBench and Langfuse
Evaluate the performance of MCP servers and tools within agentic pipelines, measuring latency, accuracy, context fidelity, and end-to-end task completion rates
Minimum Qualifications
Bachelor's or master's degree in computer science, machine learning, or a related applied AI field
Expertise in Python and machine learning frameworks (PyTor

Benefits

Vision insurance

Additional Information

Job Requisition ID # 26WD96920

Position Overview

As a Software Developer on the Fusion platform services team within Product Development and Manufacturing Solutions (PDMS), you'll be part of a team of technologists dedicated to creating cutting-edge AI and generative AI solutions that enhance developer productivity and experience. You'll work closely with AI engineers, software architects, and product engineering teams to build and rigorously evaluate intelligent agentic systems, including benchmarking AI agents against commercial solvers, and develop MCP (Model Context Protocol)-based tooling that integrates seamlessly with IDEs such as VS Code and Cursor.
