- Home
- Remote Jobs
- Data Scientist – AI Training & Evaluation
AL
Data Scientist – AI Training & Evaluation
Job summary
Chicago
Work model
Fully remote
Worldwide
Job description
About The Role
AI is only as good as the experts who train it. We're looking for data scientists to help evaluate, refine, and improve next-generation AI systems — bringing your quantitative expertise directly to bear on how the world's most advanced models reason, analyze, and communicate.
This is a fully remote, flexible contract role. You set your hours and work at your own pace, contributing to projects that sit at the frontier of applied AI research.
- Organization: Alignerr
- Type: Hourly Contract
- Location: Remote
- Commitment: 10--40 hours/week
What You'll Do
- Evaluate AI model outputs for statistical soundness, reasoning quality, and analytical accuracy
- Design and apply data-driven evaluation criteria and scoring rubrics
- Analyze patterns in AI-generated responses to surface systematic errors or biases
- Create high-quality training data — including prompts, worked solutions, and expert annotations — across data science and ML domains
- Review AI-generated code, visualizations, and statistical analyses for correctness and best practices
- Provide structured, detailed feedback that directly improves model performance
- Work independently and asynchronously on your own schedule
Who You Are
- Degree in Data Science, Statistics, Computer Science, Mathematics, or a related quantitative field (MS or PhD preferred)
- Strong foundation in statistics, probability, and machine learning concepts
- Proficient in Python, R, SQL, or similar data analysis tools
- Experienced with data wrangling, exploratory data analysis, and model evaluation
- Sharp analytical thinker with excellent attention to detail
- Clear written communicator — able to explain complex technical concepts concisely
- Self-motivated and comfortable working independently in an async environment
Nice to Have
- Experience with deep learning frameworks such as PyTorch or TensorFlow
- Familiarity with NLP, large language models, or AI evaluation workflows
- Published research or hands-on industry experience in applied machine learning
- Background in A/B testing, causal inference, or experimental design
Why Join Us
- Work on cutting-edge AI projects alongside top research labs and AI teams globally
- Get rare, inside exposure to how state-of-the-art LLMs are trained and evaluated
- Fully remote and async — work when and where it suits you
- Complete autonomy over your schedule and workload (10--40 hrs/week)
- Join a growing community of expert contributors who are actively shaping the future of AI
- Potential for ongoing work and long-term contract extension