Already filled

Don't miss the next one. Get matching roles delivered to your inbox.

LLM Fine-Tuning Engineer

Job summary

Naperville
Software Developer

Work model

Fully remote
Only US
1 month ago
Job description

LLM Fine-Tuning Engineer

Location: 100% Remote (Continental United States) Position Type: In-house Bright Vision Technologies SOW engagement Experience: 6+ years Sponsorship: No new H1B sponsorship available. H1B transfers welcomed. Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party) Engagement: Long-term, multi-year

Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. This role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies --- there is no third-party client, vendor, or implementation partner involved.

We do not engage in C2C, 1099, or third-party arrangements for this role. BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE. Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.

No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates. For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary

Bright Vision Technologies is seeking an LLM Fine-Tuning Engineer to design, execute, and operationalize fine-tuning workflows for large language models across supervised, preference-based, and reinforcement learning approaches. The role requires deep practical experience with modern training stacks, careful dataset construction, rigorous evaluation methodology, and the engineering discipline to operate complex training pipelines reliably. The ideal candidate combines strong ML intuition with production-grade engineering practices, and is comfortable navigating the trade-offs between data quality, compute budget, evaluation rigor, and shipping velocity.

In this role, you will work closely with cross-functional partners --- product, design, engineering, operations, and business stakeholders --- to translate ambiguous requirements into well-engineered solutions. You will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.

Key Responsibilities

  • Design and execute fine-tuning experiments for large language models using supervised, DPO, RLHF, and related techniques
  • Lead dataset construction, curation, and quality assurance processes for instruction tuning and preference data
  • Build scalable training pipelines on top of modern distributed training frameworks
  • Tune hyperparameters, optimizer configurations, and training stability strategies for large-model fine-tuning
  • Implement parameter-efficient fine-tuning techniques such as LoRA, QLoRA, and adapter-based methods
  • Design rigorous evaluation suites including automated benchmarks, human evaluation, and capability-specific probes
  • Implement safety, refusal, and policy evaluations to track model behavior across releases
  • Operate large-scale training jobs on GPU clusters, diagnosing failures and recovering training state reliably
  • Optimize training throughput using mixed precision, sequence packing, and efficient attention implementations
  • Manage model artifacts, lineage tracking, and reproducibility across many concurrent experiments
  • Collaborate with product, research, and platform teams to align fine-tuning roadmaps with business needs
  • Document training methodology, results, and decisions clearly for technical and non-technical audiences
  • Mentor engineers on fine-tuning best practices, evaluation rigor, and responsible deployment
  • Stay current with LLM research and translate advances into production-ready fine-tuning recipes

Required Qualifications

  • Master's or PhD in Computer Science, Machine Learning, or a related field; or equivalent experience
  • Six or more years of combined ML research and engineering experience, with significant LLM exposure
  • Strong proficiency in Python and modern deep learning frameworks, especially PyTorch
  • Hands-on experience fine-tuning transformer-based language models at non-trivial scale
  • Familiarity with distributed training strategies including FSDP, ZeRO, and pipeline parallelism
  • Experience with RLHF, DPO, or other preference optimization techniques
  • Strong understanding of evaluation methodology, benchmarks, and human evaluation design
  • Experience operating training jobs on GPU clusters and recovering from failures
  • Strong written and verbal communication skills
  • Track record of shipping or publishing impactful LLM work

Preferred Qualifications

  • Publications at top-tier ML venues
  • Experience with multimodal model fine-tuning
  • Familiarity with synthetic data generation and dataset distillation
  • Open-source contributions to LLM training libraries
  • Exposure to responsible AI evaluation and red-teaming practices

How to Apply

For immediate consideration, please send your resume to [email protected] or contact us at +1 (908) 765-8199. Learn more about Bright Vision Technologies at www.bvteck.com.

Bright Vision Technologies is an equal opportunity employer and values diversity and inclusion. We do not discriminate on the basis of any protected attribute. We make reasonable accommodations for religious practices, beliefs, and disability needs.

Powered by JazzHR 5L7hnNY9K7