Maths is where reasoning is most exposed — a step is right or it isn’t. AI systems need mathematicians who can find the flawed step and explain it.
This isn’t advisory work and it isn’t labelling. You’re reviewing real mathematical work and explaining your judgement clearly enough that an AI can learn from it — paid for your expertise, by the hour.
Applied Clinical Judgement connects qualified people to vetted platforms, and Sean Key personally vouches for those he refers. We’re paid a referral fee by the platform on a successful placement — never by you. The roles below are live today.
16 live Mathematics & Statistics roles · updated daily
Computational Bayesian Statistics and Applied Mathematics Expert
Mercor is seeking a Computational Bayesian Statistics and Applied Mathematics Expert at $70–$100 per hour to design graduate-level scientific problems for an AI benchmark. You'll create original computational tasks using specialised libraries like PyMC, FEniCS, and others, testing whether advanced AI systems can execute research workflows, design experiments, and extract insights from data. The work requires hands-on expertise in Bayesian methods, numerical PDEs, or computational topology, plus strong problem-design thinking. Minimum 15–20 hours weekly, remote.
Mathematician Talent Network
$60–$80 per hour. Mercor's Mathematician Expert Network connects qualified professionals with AI labs and research companies on a project basis. Roles involve training and evaluating AI models, creating mathematical tasks, and providing domain expertise to advance frontier research. Requires professional experience in statistical analysis, mathematical modelling, proof, theory, or computational mathematics, plus strong communication and independent working ability.
Mathematics Specialist
Turing seeks a Mathematics Specialist to design advanced reasoning datasets for large language model evaluation. You'll develop novel mathematical systems, create multi-step reasoning tasks involving symbolic computation and proofs, and establish rigorous assessment rubrics. The role requires strong abstract mathematics grounding, 2+ years in mathematics or theoretical computer science, and proven ability to author structured problems that test both conceptual and procedural reasoning. This 8-week freelance position requires 4 hours daily overlap with Pacific time.
Maths STEM Expert
Turing seeks mathematicians with undergraduate, master's or doctoral qualifications to develop advanced mathematical models for leading LLM companies. The role involves solving complex problems across mathematical domains, reviewing cutting-edge research, and applying quantitative techniques. Candidates should demonstrate expertise across at least four areas including computable functions, algebra, linear algebra, topology, analysis, probability or applied mathematics. Python knowledge preferred. Contractor basis, minimum 20 hours weekly with PST overlap required.
Mathematics Expert (Competition & Olympiad-Level) - AI Training & Evaluation
Turing seeks mathematicians to design competition-level problems and evaluate large language model reasoning. The role involves creating original AIME, HMMT, and IMO-difficulty problems across algebra, number theory, combinatorics, and geometry, then rigorously assessing how models handle these challenges. You'll author detailed solutions in LaTeX, diagnose reasoning failures, and contribute to evaluation benchmarks. Suits those with strong competitive mathematics backgrounds and creative problem-design skills.
Mathematics Expert (Master’s/Ph.D.)
Turing seeks mathematics graduates and PhD candidates to design, solve, and evaluate mathematical problems for large language model improvement. Work involves creating rigorous multi-step problems, providing detailed solutions, reviewing AI-generated answers, and developing Python and Lean-based computational tasks. Requires strong analytical skills, clear communication, and ability to work independently in a remote, contractor-based role with flexible weekly hour commitments.
Research Analyst - Advanced Math
Turing seeks a Research Analyst for advanced mathematics to help fine-tune large language models. You'll solve analytical problems, create training scenarios, and explain mathematical concepts clearly to improve LLM performance. The role requires strong high school/college-level maths skills, excellent English, and systematic problem-solving ability. Ideal for those without formal qualifications but proven mathematical expertise. Contractor position, minimum 20 hours weekly with PST timezone overlap.
Computational Bayesian Statistics and Applied Mathematics Expert
Mercor is seeking a Computational Bayesian Statistics and Applied Mathematics Expert at $70–$100 per hour to design graduate-level scientific problems for an AI benchmark. You'll create original computational tasks using specialised libraries like PyMC, FEniCS, and others, testing whether advanced AI systems can execute research workflows, design experiments, and extract insights from data. The work requires hands-on expertise in Bayesian methods, numerical PDEs, or computational topology, plus strong problem-design thinking. Minimum 15–20 hours weekly, remote.
Math Expert (PhD)
$80–$90 per hour. micro1 seeks PhD mathematicians to develop high-quality training responses for AI systems. You'll create and refine comprehensive answers to complex mathematics problems, review peer content for precision, and collaborate with distributed experts. No AI background required—advanced mathematics expertise and strong communication skills are essential. Ideal for academics with graduate-level teaching or publishing experience.
IMO Experts
Mercor is recruiting experienced Olympiad mathematicians to create and evaluate IMO-standard problems for AI training. This short-term research engagement pays $54–$93 per hour on a task-based model with no weekly cap. Suited to competition medalists, problem writers, and selection committee members who can craft rigorous problems and assess AI reasoning at elite difficulty levels.
Mathematics Specialist
Turing seeks a Mathematics Specialist to design advanced reasoning datasets for large language model evaluation. You'll create novel mathematical systems, axioms, and multi-step reasoning problems requiring symbolic computation and logical proof construction. The role suits mathematicians, theoretical computer scientists, or data scientists with two years' analytical experience and strong communication skills. Eight-week contractor assignment requiring four hours PST overlap daily.
Mathematics Professor/Researcher (PhD)
micro1 is recruiting mathematics PhDs to train AI systems on $60–$90 per hour as remote contractors. You'll review mathematical content, develop problems and proofs, annotate datasets, and create educational materials. The role suits experienced academics and researchers with university teaching or advanced research backgrounds who can communicate complex concepts precisely. No prior AI experience required.
Computational Bayesian Statistics and Applied Mathematics Expert
Mercor is seeking a Computational Bayesian Statistics and Applied Mathematics Expert at $70–$100 per hour to design graduate-level scientific problems for an AI benchmark. You'll create original computational tasks using specialised libraries like PyMC, FEniCS, and others, testing whether advanced AI systems can execute research workflows, design experiments, and extract insights from data. The work requires hands-on expertise in Bayesian methods, numerical PDEs, or computational topology, plus strong problem-design thinking. Minimum 15–20 hours weekly, remote.
Mathematics Professor/Researcher (PhD)
micro1 is recruiting mathematics PhDs to train AI systems on $60–$90 per hour as remote contractors. You'll review mathematical content, develop problems and proofs, annotate datasets, and create educational materials. The role suits experienced academics and researchers with university teaching or advanced research backgrounds who can communicate complex concepts precisely. No prior AI experience required.
Scientific Computing SME (Physics)
Turing seeks a Physics subject matter expert to design and solve complex problems for large language model evaluation. You'll work with researchers to create clear step-by-step solutions, probe model limitations in areas like multi-step reasoning and symbolic manipulation, and help define evaluation benchmarks. Suited to those pursuing or holding Master's or PhD-level physics qualifications with strong Python and scientific computing skills. Six-week contract.
Mathematics Expert
Paying $20–$40 per hour, micro1 seeks mathematics experts to evaluate and generate solutions to advanced mathematical problems, develop rigorous explanations and proofs, and assess AI-generated mathematical outputs. You'll annotate datasets and collaborate with remote subject matter experts. A BSc or higher in mathematics or related field is preferred, along with experience writing research papers or technical reports and communicating complex ideas clearly. No AI background required.
No live roles match your search.
AI training work is organised by profession, task and software — not by topic or sector. Try your field (for example “nursing” or “Python”), clear the filters, or browse the categories further down the page. The always-open talent pools below are a good place to start.
What the work looks like
Reviewing the model's work
Read what an AI produced in your field and judge whether the reasoning holds — mark where it went wrong.
Setting hard problems
Write the realistic, demanding tasks that separate competent work from confident-but-wrong.
Judging AI answers
Compare two AI outputs and say which is stronger, and why — your written reasoning is what the model learns from.
Common questions
How much does it pay?
Hourly and contractor-based, varying with seniority and role. Every role card shows its pay band. You invoice as an independent contractor and choose your hours.
Can I do this alongside my current job?
Yes — the work is flexible and part-time by design. Check your employer's policy on outside work first; ACJ can't advise on that.
Who is ACJ, and what's your part in this?
Applied Clinical Judgement is run by Sean Key. We connect qualified people to vetted AI-training platforms (Mercor, micro1, Turing), and Sean personally vouches for the people he refers. We're paid a referral fee by the platform on a successful placement — never by you.
How do I get started?
Find a role below that fits, and apply through the link — it carries Sean's referral. If you'd like him to vouch for you or talk it through first, book a short call.
Sean Key vouches for the people he refers
I’m Sean Key, editor of Applied Clinical Judgement. After 29 years in the NHS I help qualified professionals find legitimate, well-paid AI-training work — and I’ll personally vouch for you when you apply.
Applied Clinical Judgement is a referral intermediary, not an employer or recruiter. We refer candidates to third-party platforms (Mercor, micro1, Turing) and may earn a referral fee on a successful placement. We never charge candidates. Pay rates are set by the platforms and may change. PRAG-DEL-SOL-ONE LTD · Co. 07204925 · VAT 987-3626-64 · ICO ZC086000.
