Data Scientist III - Lead Data Architect
External, Full-time, Remote, Today
AWS, Azure, BigQuery, Compliance, Cross-functional Collaboration, Data Modeling
Responsibilities
- Design AI-ready data models to support machine learning, advanced analytics, and real-time decisioning
- Build and maintain feature-ready datasets for data science teams (feature engineering support)
- Develop semantic and analytical data layers for BI, AI, and self-service analytics
- Collaborate with data scientists to translate ML use cases into scalable data structures
- Model and integrate high-volume time-series and IoT data (e.g., smart meters, sensors, grid telemetry)
- Enable real-time / near-real-time data pipelines for AI-driven insights
- Ensure data models support MLOps frameworks (model training, validation, deployment pipelines)
- Implement data lineage, observability, and quality frameworks to support trusted AI outcomes
- Optimize data structures for lakehouse architectures and distributed compute environments
- Align with data governance, privacy, and regulatory compliance requirements
AI/Analytics Use Case Alignment
- Predictive Maintenance: Asset failure prediction using sensor and maintenance data
- Wildfire Risk Modeling: Environmental and grid data modeling for risk forecasting
- Load Forecasting: Time-series modeling for energy demand prediction
- Customer 360 Analytics: Behavioral segmentation and usage insights
- Grid Intelligence: AI-driven outage prediction and response optimization
- Generative AI Enablement: Structuring enterprise data for LLM-based insights and copilots
Required Qualifications
- 8+ years in data modeling, data architecture, or analytics engineering
- 3+ years of utility/energy domain experience (smart grid, AMI, SCADA systems) supporting electric, gas, and/or water utilities
- Strong expertise in dimensional modeling for analytics (star/snowflake schemas), data modeling for machine learning pipelines, and SQL and data transformation frameworks (dbt preferred)
- Experience designing data models for data lakes / lakehouse architectures (Delta Lake, Iceberg, etc.) and for structured and semi-structured data (JSON, Parquet)
- Proven experience supporting AI/ML workloads in production environments
Requirements
- Experience with cloud AI ecosystems: AWS (SageMaker, Redshift), Azure (Synapse, Azure ML), GCP (BigQuery, Vertex AI)
- Familiarity with time-series and streaming platforms (Kafka, Spark Streaming)
- Knowledge of feature stores (Feast, Tecton)
- Experience with MLOps tools (MLflow, Kubeflow)
- Understanding of LLM data preparation, vector databases, and embeddings
Key Skills
- AI/ML Data Modeling & Feature Engineering
- Lakehouse & Modern Data Stack (dbt, Spark, Delta Lake)
- Time-Series & Streaming Data Modeling
- Data Governance for AI (quality, lineage, bias mitigation)
- Performance Optimization for Analytics Workloads
- Cross-functional collaboration (Data Science, Engineering, Business)
Salary Range
Astreya offers comprehensive benefits to all Regular, Full-Time Employees, including:
- Medical provided through UHC (PPO, HSA, Surest options) / Medical provided through Kaiser (HMO option only) for California employees only
- Dental provided through UHC
- Nationwide Vision provided by UHC
- Flexible Spending Account for Health & Dependent Care
- Pre-Tax Account for Commuter Benefit/Parking & Transit (location-specific)
- Continuing Education and Professional Development via various integrated platforms, e.g. Udemy and Coursera
- Corporate Wellness Program provided by Goomi Group
- Employee Assistance Program
- Wellness Days
- 401k Plan
- Basic and Supplemental Life Insurance
- Short Term & Long Term Disability
- Critical Illness, Critical Hospital, and Voluntary Accident Insurance
- Tuition Reimbursement (available 6 months after start date, capped)
- Paid Time Off (accrued and prorated, maximum of 120 hours annually)
- Paid Holidays
- Any o
Benefits
Health insurance, Dental insurance, Vision insurance, 401(k), Remote work options, Flexible schedule, Performance bonus
Additional Information
Job Title: Data Modeling Expert - AI & Analytics
Location: California (Hybrid/Remote)
We are seeking a Data Modeling Expert with a strong AI/Analytics focus to enable next-generation data platforms supporting predictive analytics, machine learning, and intelligent automation. This role will design and optimize data models that power use cases such as grid reliability, predictive maintenance, wildfire risk modeling, customer analytics, and AI-driven operations.