Senior Data Architect (AI & AI-Assisted Development)
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
Business Area: IT Seniority Level: Mid-Senior level Job Description: At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world's largest enterprises. We are seeking an experienced Senior Data Architect (AI-First Data Architecture & AI-Assisted Development) with 5+ years of experience designing scalable enterprise data platforms and enabling modern AI-driven ecosystems. The ideal candidate will bring deep expertise in data warehousing, lakehouse architectures, combined with hands-on experience in AI governance, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), semantic data architectures, and AI-assisted development practices. This role extends beyond traditional data architecture by partnering with Business Intelligence, Data Science, Engineering, and AI teams to build AI-ready data foundations. The architect will lead the design of data models, metadata frameworks, and governance practices that optimize enterprise data for AI consumption, intelligent search, agentic workflows, and RAG-based applications. A key focus will be establishing robust metadata, business definitions, lineage, data tagging, and semantic structures to improve the accuracy, discoverability, and scalability of AI-powered solutions. The successful candidate will drive AI-first data acquisition, curation, and governance strategies that support business intelligence, advanced analytics, and AI-driven decision-making across Finance, Sales, and other strategic business domains. They will also champion AI-assisted architecture and documentation practices to accelerate delivery, improve productivity, and create reusable patterns that enable both users and AI systems to effectively discover, understand, and leverage enterprise data. This role will lead the evolution of intelligent, governed, and scalable data platforms that seamlessly integrate traditional data engineering with next-generation AI-powered capabilities, ensuring the organization's data ecosystem is optimized for the future of AI-enabled business operations. As a Sr. Data Architect you will: Design and implement scalable data warehouse and lakehouse architectures on the Cloudera platform. Define enterprise data models, governance frameworks, data stewardship processes, security standards, and data quality practices. Architect and optimize analytics solutions across SQL engines including Impala, Hive, and Iceberg. Design AI-powered analytics solutions leveraging LLMs, Retrieval-Augmented Generation (RAG), vector databases (such as PostgreSQL, Qdrant, Milvus) , and NLQ capabilities. Lead the integration of AI/ML capabilities into enterprise data platforms and data pipelines while establishing governance controls for AI models, data usage, and lifecycle management. Leverage vibe coding / AI-assisted development tools to accelerate development and improve productivity. Build and optimize batch and near real-time data pipelines. Collaborate with business stakeholders to translate business requirements into scalable data products and analytics solutions. Establish best practices for performance optimization, data architecture, and AI-assisted development. Mentor teams on modern data architecture and AI-enabled development methodologies. Ensure data security, governance, compliance, and responsible AI practices within enterprise data platforms and AI-enabled solutions. Collaborate with business stakeholders across FP&A, Sales, and Revenue Operations to translate business requirements into scalable data solutions that support financial forecasting, revenue optimization, budgeting, pipeline analysis, and sales forecasting We are excited about you if you have: Bachelor's degree in Computer Science or equivalent and 5-6 years of related experience; OR Master's degree and 3-5 years of related experience; OR PhD and 0-3 years of related experience Deep expertise in enterprise data warehousing, lakehouse architectures, and Cloudera-based data platforms. Strong experience with CDP, including HDFS, Hive, Impala, Kudu, and Cloudera data ingestion and processing frameworks. Strong understanding of distributed data systems and Hadoop-based architectures. Advanced SQL skills, including performance tuning and query optimization. Proficiency in Python and data engineering frameworks. Experience with dimensional and normalized data modeling. Strong understanding of data governance, lineage, metadata management, data cataloging, enterprise security, and compliance requirements. Experience implementing AI governance practices including model governance, AI risk management, explainability, monitoring, and responsible AI controls. Experience implementing AI/ML, LLM, vector d
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at cloudera? Share your experience