About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo , Larry Summers , and Jack Dorsey .

Position: Red Team Specialist

Type: Full-time or Part-time Contract Work

Compensation: $56/hour

Location: Remote

Commitment: 20 hours/week

Role Responsibilities

Red team conversational AI models and agents, focusing on jailbreaks, prompt injections, and bias exploitation.
Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
Apply structure using taxonomies, benchmarks, and playbooks to maintain consistent testing.
Document reproducibly by producing reports, datasets, and attack cases for customer action.
Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have

Fluent Language Skills Required: Native-level fluency in English & German.
Prior experience in red teaming, AI adversarial work, cybersecurity, or socio-technical probing.
Ability to explain risks clearly to both technical and non-technical stakeholders.

Preferred

Experience in Adversarial ML, Cybersecurity, or socio-technical risk analysis.
Skills in jailbreak datasets, prompt injection, or RLHF/DPO attacks.

Compensation & Legal

Hourly contractor, Paid weekly via Stripe Connect.

Application Process (Takes 20--30 mins to complete)

Upload resume
AI interview based on your resume
Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Adversarial ML Engineer Remote