- Home
- Remote Jobs
- Remote | AI/ML Technical Evaluation Consultant -- $60-$90/hour
Remote | AI/ML Technical Evaluation Consultant -- $60-$90/hour
Job summary
Utica-Rome Area
Software Developer
Work model
Fully remote
Only US
Job description
Job Description
Job Title: AI/ML Technical Consultant (Part-Time)
Location: New York, NY (On-site/Remote)
Job Type: Independent Contractor, Part-Time (approx. 40 hrs/week)
Hourly Rate: $60--$90/hour
Key Responsibilities
- Design challenging agentic tasks for AI/ML, data science, data engineering, software, and STEM workflows.
- Write accurate, well-documented solutions as ground truth for evaluation.
- Evaluate AI agent outputs for correctness, efficiency, reasoning quality, and technical rigor.
- Provide detailed written feedback on technical issues and improvement areas.
- Develop and refine evaluation frameworks and rubrics for agentic behavior.
- Collaborate with subject-matter experts for consistency and accuracy.
Ideal Profile
- 3+ years of experience in ML, data science, software engineering, statistics, mathematics, physics, chemistry, biology, materials science, or another STEM field.
- Expertise in programming, data analysis, ML modeling, statistical methods, or computational methods.
- Ability to design/evaluate complex technical tasks with strong judgment.
- Strong written communication skills.
- Prior experience with data annotation, evaluation, or human feedback collection (plus).
- Experience with LLMs, AI systems, agentic workflows (plus).
- Commitment to approx. 40 hours/week during weekdays.
Nice to Have
- Familiarity with Python, R, SQL, data pipelines, ML workflows, software development, model evaluation.
- Experience developing benchmark tasks, evaluation frameworks, rubrics.
- Familiarity with AI agent behavior, tool use, multi-step reasoning, agentic task execution.
- Comfort working across multiple technical domains.
Why This Opportunity
- Apply AI/ML, data science, software, and STEM expertise to structured remote consulting.
- Contribute to high-quality technical task design, agentic evaluation, ground truth solutions, and rubric creation.
- Remote structure with competitive hourly compensation.
Contract Details
- Independent contractor role.
- Fully remote, weekday availability.
- Must be based in the United States.
- Expected commitment: approx. 40 hours/week.
- Pay: $60--$90/hour, weekly via Stripe or Wise.
- No access to confidential/proprietary information.
- Projects may be extended or adjusted based on scope/performance.
About The Platform
Offered by 24-MAG LLC.