Research Engineer, AI Models

Enchargeai36 · India
Full-time · On-site · 1d ago
Caching · Move · Python · PyTorch · Routing · Transformers

About the role

Modern AI workloads, from large language models to diffusion-based generators to multimodal systems, represent some of the most compute-intensive frontiers in AI and some of the most promising applications for our hardware's energy-efficiency advantages. We're building a vertically integrated AI stack that will showcase the transformative potential of our silicon while delivering real value to customers today.

We are seeking a Research Engineer to push the boundaries of AI model capability, quality, and efficiency. You'll build fine-tuning and post-training pipelines, develop rigorous benchmarking frameworks, and work at the intersection of ML research and hardware-aware optimization, ensuring our models run beautifully on our silicon.

This is a role for someone who thrives at the boundary between research and engineering. You'll read papers, implement techniques, and ship production-quality code, all in service of making AI inference faster, cheaper, and better.

Responsibilities

  • Evaluation: Build profiling tools and comprehensive benchmarking frameworks to understand compute bottlenecks, measure model quality across standard and domain-specific evals, and track efficiency metrics.

Requirements

  • 5+ years of experience in ML research, applied ML, or ML systems
  • Strong fundamentals in Python and PyTorch
  • Hands-on experience with transformers, diffusion models, state space models, etc.
  • Experience fine-tuning large models and building training/evaluation pipelines
  • Deep understanding of transformers, attention mechanisms, and optimization techniques
  • Comfort reading and implementing techniques from research papers
  • Experience with efficient inference techniques (KV cache optimization, attention variants, MoE routing, flow matching)
  • Background in hardware-aware ML optimization or quantization
  • Familiarity with profiling tools (PyTorch Profiler, Nsight, custom instrumentation)
  • Publications in generative modeling, efficient inference, or ML systems
  • Contributions to open-source ML projects

Benefits

Remote work options

Additional Information

Research Engineer, Applied AI
Location: India (or Remote-friendly with travel)

About EnCharge AI: EnCharge AI is building the next generation AI platform. Our novel in-memory-computing architecture delivers a 10x step-function improvement in compute energy efficiency and performance for AI inference workloads. As the demands of artificial intelligence move beyond today's models, we believe fundamental underlying infrastructure must evolve. We are an experienced team of AI researchers, silicon & systems engineers, and architects backed by leading investors, poised to become the essential platform for the next wave of AI innovation.

