Adversarial ML Engineer Remote

Remote
3 weeks ago
Berlin
Job Description

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Red Team Specialist

Type: Full-time or Part-time Contract Work

Compensation: $56/hour

Location: Remote

Commitment: 20 hours/week

Role Responsibilities

  • Red team conversational AI models and agents, focusing on jailbreaks, prompt injections, and bias exploitation.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Apply structure using taxonomies, benchmarks, and playbooks to maintain consistent testing.
  • Document findings reproducibly by producing reports, datasets, and attack cases that customers can act on.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have

  • Language skills: native-level fluency in English and German.
  • Prior experience in red teaming, AI adversarial work, cybersecurity, or socio-technical probing.
  • Ability to explain risks clearly to both technical and non-technical stakeholders.

Preferred

  • Experience in Adversarial ML, Cybersecurity, or socio-technical risk analysis.
  • Skills in jailbreak datasets, prompt injection, or RLHF/DPO attacks.

Compensation & Legal

  • Hourly contractor, paid weekly via Stripe Connect.

Application Process (Takes 20-30 minutes to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.