Remote | Law Model Prompt Evaluator — $75–$100/hour

Job summary

Location: San Francisco
Category: Legal
Work model: Fully remote
Posted: 1 week ago

Job description

Job Opportunity: Remote Law Model Prompt Evaluator — $75–$100/hour

We are sharing a specialised part-time consulting opportunity for experienced legal professionals with strong legal reasoning ability, deep subject-matter expertise, and the ability to author and verify high-quality open-ended prompts for AI model evaluation.

This role supports a collaboration with leading AI companies to improve frontier language models through high-quality prompt authoring, verification, and evaluation workflows across core legal domains.

Selected professionals will create or review open-ended legal prompts, assess prompt clarity and difficulty, apply expert legal judgment to evaluate the depth of reasoning required, and help improve overall model quality. This opportunity is especially well-suited to detail-oriented legal experts who are comfortable working across statutory interpretation, case analysis, jurisprudence, and other structured legal reasoning tasks with precision and consistency.

Key Responsibilities

Professionals in this role may contribute to:

Prompt Authoring for AI Evaluation

  • Create original, open-ended legal prompts from assigned subdomains at varying difficulty levels
  • Develop prompts that require human judgment to evaluate the quality of AI responses
  • Help ensure that prompts are clear, technically rigorous, and suitable for model evaluation

Prompt Verification & Quality Review

  • Review authored prompts for clarity, scope alignment, difficulty accuracy, and uniqueness
  • Edit prompts and difficulty ratings where needed
  • Help maintain high standards for precision, quality, and consistency across evaluation tasks

Legal Reasoning Assessment

  • Apply expert judgment to assess the depth and quality of legal reasoning required
  • Work across areas such as legal reasoning, statutory interpretation, case analysis, and jurisprudential theory
  • Help improve model quality through carefully designed and verified legal prompts

Ideal Profile

Strong candidates may have:

  • A JD, LLM, SJD, or Master's degree or higher in Law or a closely related field
  • 2–6 years of professional experience in legal practice, academia, or policy
  • Strong command of legal reasoning, statutory interpretation, and jurisprudential theory
  • Excellent written English and the ability to craft precise, well-scoped legal questions

Preferred Qualifications

  • Bar admission, judicial clerkship, or legal research experience
  • Strong familiarity with legal subdomains such as professional and statutory law, jurisprudence and legal theory, international law, contract law, criminal law, constitutional law, and regulatory or administrative law
  • High attention to detail and strong consistency in technical evaluation workflows
  • Ability to edit prompts and difficulty assignments with sound legal judgment

Why This Opportunity

  • Contribute specialised legal expertise to a cutting-edge AI collaboration
  • Help establish rigorous evaluation standards for frontier language models
  • Work on high-impact prompt design and verification tasks with strong real-world relevance
  • Flexible remote work with competitive hourly compensation

Contract Details

  • Independent contractor role
  • Fully remote with flexible scheduling
  • Hourly compensation of $75–$100 per hour
  • Expected commitment of 10 hours per week
  • Asynchronous work format
  • Assignments may involve either authoring or verification tasks depending on project needs
  • Projects may be extended, shortened, or concluded early depending on project needs and performance
  • Weekly payments via Stripe or Wise
  • Work will not involve access to confidential or proprietary information from any employer, client, or institution

Please note: We are unable to support H-1B or STEM OPT candidates at this time.

Start date: Immediate

About The Platform

This opportunity is available through a leading AI-driven work platform that connects domain experts with frontier AI research projects.

Experts contribute to improving advanced AI systems by providing specialised expertise across real-world workflows, structured evaluation, model training support, and domain-specific content validation.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: https://www.24-mag.com/privacy-policy