Remote | AI/ML Technical Evaluation Consultant -- $60-$90/hour

Job summary

Utica-Rome Area
Software Developer

Work model

Fully remote
Only US
2 days ago
Job description

Job Description

Job Title: AI/ML Technical Consultant (Part-Time)

Location: New York, NY (On-site/Remote)

Job Type: Independent Contractor, Part-Time (approx. 40 hrs/week)

Hourly Rate: $60--$90/hour

Key Responsibilities

  • Design challenging agentic tasks for AI/ML, data science, data engineering, software, and STEM workflows.
  • Write accurate, well-documented solutions as ground truth for evaluation.
  • Evaluate AI agent outputs for correctness, efficiency, reasoning quality, and technical rigor.
  • Provide detailed written feedback on technical issues and improvement areas.
  • Develop and refine evaluation frameworks and rubrics for agentic behavior.
  • Collaborate with subject-matter experts for consistency and accuracy.

Ideal Profile

  • 3+ years of experience in ML, data science, software engineering, statistics, mathematics, physics, chemistry, biology, materials science, or another STEM field.
  • Expertise in programming, data analysis, ML modeling, statistical methods, or computational methods.
  • Ability to design/evaluate complex technical tasks with strong judgment.
  • Strong written communication skills.
  • Prior experience with data annotation, evaluation, or human feedback collection (plus).
  • Experience with LLMs, AI systems, agentic workflows (plus).
  • Commitment to approx. 40 hours/week during weekdays.

Nice to Have

  • Familiarity with Python, R, SQL, data pipelines, ML workflows, software development, model evaluation.
  • Experience developing benchmark tasks, evaluation frameworks, rubrics.
  • Familiarity with AI agent behavior, tool use, multi-step reasoning, agentic task execution.
  • Comfort working across multiple technical domains.

Why This Opportunity

  • Apply AI/ML, data science, software, and STEM expertise to structured remote consulting.
  • Contribute to high-quality technical task design, agentic evaluation, ground truth solutions, and rubric creation.
  • Remote structure with competitive hourly compensation.

Contract Details

  • Independent contractor role.
  • Fully remote, weekday availability.
  • Must be based in the United States.
  • Expected commitment: approx. 40 hours/week.
  • Pay: $60--$90/hour, weekly via Stripe or Wise.
  • No access to confidential/proprietary information.
  • Projects may be extended or adjusted based on scope/performance.

About The Platform

Offered by 24-MAG LLC.