Remote | Mathematics Assessment Specialist — $25–$60/hour

Job summary

San Francisco

Work model

Fully remote
Worldwide
1 week ago
Job description

Mathematics Assessment Specialist (Remote)

We are sharing a specialised part-time consulting opportunity for expert mathematicians with strong academic judgment, deep subject matter expertise, and the ability to author and review rigorous assessment content across advanced mathematics domains.

This role supports an exciting collaboration with leading AI companies focused on improving advanced AI systems through high-quality academic question design, solution verification, and benchmark development across core areas of mathematics.

Selected professionals will create or review challenging multiple-choice questions, assess clarity and rigor, rate difficulty, write step-by-step solutions, provide academic references, and help improve overall model quality. This opportunity is especially well-suited to detail-oriented mathematics experts who are comfortable translating advanced mathematical knowledge into precise, self-contained assessment content that supports frontier model evaluation.

Key Responsibilities

Professionals in this role may contribute to:

Question Authoring for AI Evaluation

  • Create original, challenging multiple-choice questions in assigned areas of mathematical expertise
  • Ensure that questions test deep conceptual understanding rather than surface-level recall
  • Help ensure that prompts are unambiguous, self-contained, and precisely defined

Question Verification & Quality Review

  • Review pre-written questions for accuracy, clarity, rigor, completeness, and solvability
  • Edit question content where needed and document any changes made
  • Help maintain high standards for precision, quality, and consistency across benchmark tasks

Solution Writing & Benchmark Support

  • Rate question difficulty across medium, hard, and expert levels
  • Provide one correct answer and nine plausible but subtly incorrect alternatives
  • Write step-by-step solutions in markdown format with clear, concise intermediate steps
  • Supply academic references from reputable sources to support question quality and correctness

Ideal Profile

Strong candidates may have:

  • A PhD or doctoral candidacy in Mathematics, Applied Mathematics, Statistics, or a closely related field
  • Strong command of graduate-level mathematical concepts and formal proof writing
  • Excellent written English and the ability to express complex ideas clearly and concisely
  • Deep expertise in one or more areas such as algebra, linear algebra, probability and statistics, analysis, calculus, discrete mathematics, combinatorics, graph theory, number theory, geometry, topology, ODE/PDE and dynamical systems, optimization, operations research, game theory, computational mathematics, numerical mathematics, logic, set theory, foundations, or related mathematics domains

Preferred Qualifications

  • A Master's degree with exceptional depth in a specific mathematics subdomain
  • Experience with rigorous academic problem design or mathematical competition writing
  • Strong consistency in writing rigorous academic content and verifying mathematical precision
  • Ability to evaluate both conceptual depth and assessment design quality across repeated tasks

Why This Opportunity

  • Contribute specialised mathematics expertise to a cutting-edge AI collaboration
  • Help establish gold-standard academic benchmarks used to advance AI capabilities
  • Work on high-impact assessment design and verification tasks with strong intellectual relevance
  • Flexible remote work with competitive hourly compensation

Contract Details

  • Independent contractor role
  • Fully remote with flexible scheduling
  • Hourly compensation of $25--$60 per hour
  • Expected commitment of 10 hours per week
  • Asynchronous work format
  • Assignments may involve either question authoring or question verification depending on project needs
  • Projects may be extended, shortened, or concluded early depending on project needs and performance
  • Weekly payments via Stripe or Wise
  • Work will not involve access to confidential or proprietary information from any employer, client, or institution
  • Please note: We are unable to support H1-B or STEM OPT candidates at this time
  • Start date: Immediate

About The Platform

This opportunity is available through a leading AI-driven work platform that connects domain experts with frontier AI research projects.

Experts contribute to improving advanced AI systems by providing specialised expertise across real-world workflows, structured evaluation, model training support, and domain-specific content validation.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: https://www.24-mag.com/privacy-policy