Skip to main content
Back to jobs

LLM Fine-Tuning Engineer (Open-Weight Models / Secure Environments)

External
Trenchant logoTrenchant · Worldwide
Full-timeOn-site1mo ago
Hugging FaceLLMsPythonPyTorchTransformers
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • Fine-tune and adapt open-weight LLMs for specialized use cases in secure/local environments
  • Design, run, and compare different post-training approaches, including:
  • supervised fine-tuning (SFT)
  • parameter-efficient fine-tuning (LoRA / QLoRA and related methods)
  • preference tuning approaches such as DPO where appropriate
  • full fine-tuning when justified by the use case
  • Build and improve high-quality datasets for training and evaluation
  • Generate synthetic data and use it responsibly to expand coverage, improve robustness, and accelerate iteration
  • Assess model quality beyond headline metrics
  • Work with engineering teams to operationalize tuned models for local or restricted deployments
  • Collaborate with domain experts to translate real operational needs into measurable model requirements

Requirements

  • Strong experience fine-tuning LLMs or adjacent foundation models in production or serious research environments
  • Practical experience with multiple tuning approaches, ideally including SFT, LoRA, QLoRA
  • Experience building and curating datasets for post-training
  • Experience generating and validating synthetic data for model training
  • Strong Python skills and solid experience with:
  • PyTorch
  • Hugging Face Transformers
  • tokenization, data preprocessing and training pipelines
  • Good understanding of GPU constraints, memory/performance tradeoffs, quantization-aware workflows and practical training optimization
  • Strong debugging mindset and ability to investigate why a model improved, regressed, or failed
  • Experience with secure, air-gapped, or otherwise restricted deployment environments
  • Experience deploying or serving tuned models
  • Background in cybersecurity, especially offensive security, vulnerability research
  • Experience with distributed training frameworks
  • How We Think About the Role
  • This is not a "prompt engineer" role, and it is not limited to running a few LoRA jobs. We want someone who can think deeply about:
  • when fine-tuning is the right tool versus prompting or orchestration
  • how to create data that improves behavior instead of just inflating metrics
  • how to evaluate models in ways that reflect real operational value
  • how to make open-weight models reliable in constrained environments

Additional Information

We are hiring an LLM Fine-Tuning Engineer to help us adapt and evaluate open-weight language models for use in secure environments. This role is focused on the full post-training lifecycle: dataset design, supervised fine-tuning, parameter-efficient tuning, synthetic data generation, evaluation, and deployment support . You will work closely with technical teams operating in advanced cybersecurity contexts, helping turn foundation models into reliable task-specific systems. IMPORTANT NOTE : We are not looking for prompt engineers or " LLM whisperers ." We are looking for someone who understands post-training, data evaluation, and deployment deeply enough to make open-weight models reliable in real operational environments.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Trenchant? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect