LLM Fine-Tuning Engineer

Job summary

Naperville

Software Developer

Work model

Fully remote

Only US

1 month ago

Job description

LLM Fine-Tuning Engineer

Location: 100% Remote (Continental United States) Position Type: In-house Bright Vision Technologies SOW engagement Experience: 6+ years Sponsorship: No new H1B sponsorship available. H1B transfers welcomed. Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party) Engagement: Long-term, multi-year

Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. This role is part of Bright Vision Technologies' in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies --- there is no third-party client, vendor, or implementation partner involved.

We do not engage in C2C, 1099, or third-party arrangements for this role. BUT STRICTLY NO C2C/1099/3RD PARTY COMPANIES. ALL OUR ROLES ARE W2 AND NO 3RD PARTY BROKERING PLEASE. Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.

No new H1B sponsorship is available for this role. However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates. For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary

Bright Vision Technologies is seeking an LLM Fine-Tuning Engineer to design, execute, and operationalize fine-tuning workflows for large language models across supervised, preference-based, and reinforcement learning approaches. The role requires deep practical experience with modern training stacks, careful dataset construction, rigorous evaluation methodology, and the engineering discipline to operate complex training pipelines reliably. The ideal candidate combines strong ML intuition with production-grade engineering practices, and is comfortable navigating the trade-offs between data quality, compute budget, evaluation rigor, and shipping velocity.

In this role, you will work closely with cross-functional partners --- product, design, engineering, operations, and business stakeholders --- to translate ambiguous requirements into well-engineered solutions. You will be expected to raise the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.

Key Responsibilities

Design and execute fine-tuning experiments for large language models using supervised, DPO, RLHF, and related techniques
Lead dataset construction, curation, and quality assurance processes for instruction tuning and preference data
Build scalable training pipelines on top of modern distributed training frameworks
Tune hyperparameters, optimizer configurations, and training stability strategies for large-model fine-tuning
Implement parameter-efficient fine-tuning techniques such as LoRA, QLoRA, and adapter-based methods
Design rigorous evaluation suites including automated benchmarks, human evaluation, and capability-specific probes
Implement safety, refusal, and policy evaluations to track model behavior across releases
Operate large-scale training jobs on GPU clusters, diagnosing failures and recovering training state reliably
Optimize training throughput using mixed precision, sequence packing, and efficient attention implementations
Manage model artifacts, lineage tracking, and reproducibility across many concurrent experiments
Collaborate with product, research, and platform teams to align fine-tuning roadmaps with business needs
Document training methodology, results, and decisions clearly for technical and non-technical audiences
Mentor engineers on fine-tuning best practices, evaluation rigor, and responsible deployment
Stay current with LLM research and translate advances into production-ready fine-tuning recipes

Required Qualifications

Master's or PhD in Computer Science, Machine Learning, or a related field; or equivalent experience
Six or more years of combined ML research and engineering experience, with significant LLM exposure
Strong proficiency in Python and modern deep learning frameworks, especially PyTorch
Hands-on experience fine-tuning transformer-based language models at non-trivial scale
Familiarity with distributed training strategies including FSDP, ZeRO, and pipeline parallelism
Experience with RLHF, DPO, or other preference optimization techniques
Strong understanding of evaluation methodology, benchmarks, and human evaluation design
Experience operating training jobs on GPU clusters and recovering from failures
Strong written and verbal communication skills
Track record of shipping or publishing impactful LLM work

Preferred Qualifications

Publications at top-tier ML venues
Experience with multimodal model fine-tuning
Familiarity with synthetic data generation and dataset distillation
Open-source contributions to LLM training libraries
Exposure to responsible AI evaluation and red-teaming practices

How to Apply

For immediate consideration, please send your resume to [email protected] or contact us at +1 (908) 765-8199. Learn more about Bright Vision Technologies at www.bvteck.com.

Bright Vision Technologies is an equal opportunity employer and values diversity and inclusion. We do not discriminate on the basis of any protected attribute. We make reasonable accommodations for religious practices, beliefs, and disability needs.

More Remote jobs in Software Developer

Mainframe CICS COBOL Developer - REMOTE

NTT DATA, Inc.

Join NTT DATA as a remote Mainframe CICS COBOL Developer. Design scalable logistics applications, work with DB2, and lead in an Agile environment.

Fully remote· Only US

3 days ago

Machine Learning Engineer, E-commerce Recommendation Foundation - USDS

TikTok USDS Joint Venture

Join our team to build next-gen generative recommendation systems using LLMs, multimodal learning, and foundation models. Apply your ML expertise t...

Fully remote

3 days ago

Senior Mobile Engineer, iOS

SeatGeek

Join the Fandom team at SeatGeek. Build beautiful, high-impact iOS features in a flexible, remote-friendly environment. Apply today!

Fully remote· Only United States

3 days ago

View all Remote jobs in Software Developer