Research is judgement at the edge of what’s known. AI systems need researchers and PhDs who can review how a model reasons about hard, specialist problems.
This isn’t advisory work and it isn’t labelling. You’re reviewing real research work and explaining your judgement clearly enough that an AI can learn from it — paid for your expertise, by the hour.
Applied Clinical Judgement connects qualified people to vetted platforms, and Sean Key personally vouches for those he refers. We’re paid a referral fee by the platform on a successful placement — never by you. The roles below are live today.
55 live roles for Academic Researchers & PhDs · updated daily
Humanities / arts / culture Evaluator
Mercor seeks humanities, arts, and culture specialists to evaluate AI-generated documents, spreadsheets, and presentations at $80–$120 per hour. You'll assess outputs for accuracy, rigour, and domain quality, identifying factual and aesthetic errors whilst providing structured feedback. Requires 5+ years of relevant professional experience, native or professional English fluency, and proficiency in Microsoft Office and Google Workspace.
Government / public administration Evaluator
Mercor is recruiting Government / public administration Evaluators at $80–$120 hourly to assess AI-generated documents, spreadsheets and presentations for accuracy and quality. You'll apply five years' professional domain expertise to grade outputs against rubrics, spotting factual and presentation errors, then provide structured feedback. Requires native English fluency and strong Microsoft Office and Google Workspace skills.
Japanese Audio Generalist Evaluator Expert
Mercor offers £40 hourly on a short-term remote engagement for Japanese native speakers with professional English. You'll transcribe and evaluate audio content, develop evaluation standards for Japanese language models, and test AI outputs across consumer contexts. Suited to linguists, recent graduates, and those with transcription or localisation experience who understand dialects, keigo, and contemporary Japanese usage.
Biology Research Scientist (BA, MS, PhD)
Mercor is recruiting biology scientists to verify protein target assignments in major bioactivity databases for AI-driven drug discovery. The role pays $50–$70 per hour and suits experienced researchers with hands-on assay background (binding, kinase, or GPCR work) currently in bench roles. You'll assess UniProt accuracy against primary literature, flag errors systematically, and document findings to support computational pipelines. US-based, 10–20 hours weekly, minimum one month.
Research Physics Expert
Mercor is recruiting research physicists at $80–$135 per hour to author, audit, and adjudicate solutions for CritPt, a frontier physics benchmark spanning high-energy, condensed-matter, quantum, and astrophysics domains. Roles suit PhD-holders and postdocs with demonstrable expertise in their subfield, strong LaTeX and Python skills, and recent peer-reviewed publications. Work is asynchronous, approximately 10 hours weekly over 8–10 weeks per task.
STEM PhDs and Technical Domain Experts
Mercor pays $55–80 hourly for STEM PhDs and technical domain experts to contribute to a premier AI lab project. You'll create high-quality training data and formulate challenging problems within your specialism—medicine, statistics, AI/ML, computer science, game development, or aerospace engineering. The role suits those with advanced qualifications, meticulous attention to detail, and strong communication skills across written and verbal formats.
Academic Researchers
Paying $20–$55 per hour, this remote contractor role suits researchers with a master's, PhD, or JD and 3+ years' relevant experience in academic, legal, policy, or healthcare settings. You'll conduct in-depth research, prepare and review complex documents in Word and PDF, synthesise literature and industry data, and deliver client-ready reports to Fortune 500 standards. The work involves cross-functional collaboration and rigorous attention to detail. Opportunities available through micro1.
Linguistic Expert
Micro1 seeks a linguistic expert at $40–$95/hour for remote contract work analysing and transcribing English-language video content. The role demands native English proficiency and advanced knowledge of linguistic theory to assess tone, grammar, vocabulary nuance, and speaker intent. Suited to those with transcription or language-research experience and at least 15 hours' weekly availability. An advanced degree in linguistics or English is preferred.
Armenian Bilingual Expert
This remote contract role on micro1 suits Armenian-English bilinguals with strong language skills and transcription or linguistic analysis experience. You'll transcribe Armenian video content, analyse speaker emotion and tone, evaluate grammar and word choice, and provide written feedback to train AI systems. No AI background is required. Flexible full-time or part-time arrangements available.
Sinhala Bilingual Expert
A part-time remote role on micro1 for fluent Sinhala-English speakers to support AI training. You will transcribe bilingual video content, analyse grammatical structures and emotional tone, and provide linguistic insights. No AI background required; native or near-native fluency in both languages and transcription expertise are essential. Minimum 15 hours weekly commitment.
Shona Bilingual Experts
micro1 seeks bilingual Shona-English experts for remote AI training work with flexible full-time or part-time arrangements. The role involves transcribing video content with timestamps, analysing emotional tone and grammatical structures, and providing detailed linguistic explanations. Applicants must demonstrate professional fluency in both languages and prior experience in transcription, translation, or linguistic analysis. No AI background required.
Haitian Creole Bilingual Expert
Fluent Haitian Creole–English speakers are invited to support AI training on micro1. The role involves transcribing and translating video content, analysing linguistic nuances, emotional tone and grammar, then collaborating with the client team on language assignments. Requires proven transcription or translation experience, strong attention to detail, and a commitment to 15 hours weekly. Remote contract work.
Myanmar Bilingual Expert
Paying $45–95 per hour, this remote contract role suits bilingual Myanmar–English speakers with transcription or translation experience. You'll annotate video content, analyse emotional undertones and grammar, define complex terms, and support language evaluation tasks. No AI experience required; cultural and linguistic depth matter most. Work at least 15 hours weekly with a distributed team via micro1.
Galician Bilingual Expert
This part-time remote role on micro1 suits native or near-native Galician–English speakers with transcription or language-analysis experience. You'll transcribe video content, annotate timestamps, analyse speaker emotion and tone, and examine grammar and idiom within cultural context. The work requires at least 15 hours weekly commitment and collaboration with the client team on linguistic feedback.
Sanskrit Language Expert
Micro1 seeks Sanskrit language experts at $45–$95 per hour on a contractual basis. You'll translate and review Sanskrit texts, develop educational content, and advise on language use in digital platforms. The role suits those with advanced Sanskrit proficiency and experience in translation or editing. No AI background is required; your domain expertise drives the work.
Nepali Bilingual Expert
Contract role on micro1 training AI systems through Nepali-English bilingual expertise. You'll transcribe video content, annotate timestamps, analyse speaker emotion and tone, and perform advanced linguistic analysis. Requires native or near-native proficiency in both languages, proven transcription or translation experience, and strong grammatical knowledge. No AI background needed.
Latin Bilingual Expert
micro1 seeks a Latin–English bilingual expert to support AI training through transcription and linguistic analysis. The role involves transcribing video content, analysing grammar and syntax, and interpreting emotional tone and intent. Professional fluency in both languages and proven transcription or language expertise are essential. Applicants should commit at least 15 hours weekly.
Welsh Language Expert
Part-time contract role on micro1 for a Welsh Language Expert to review, edit and translate Welsh content, ensuring linguistic accuracy and cultural relevance. Requires native or near-native Welsh fluency, proven translation or localisation experience, and meticulous attention to detail. Candidates should work independently in a remote setting, collaborate across teams, and maintain high quality standards. Background in linguistics or experience with CAT tools preferred.
Malayalam Language Expert
micro1 seeks native or near-native Malayalam speakers for contract roles training AI systems. You'll translate and localise content, review Malayalam documents, develop language resources, and mentor team members. Strong English proficiency and translation experience required. Suits those with linguistics or translation backgrounds who can commit a minimum of 15 hours weekly to remote work.
Nynorsk Language Expert
micro1 seeks a Nynorsk Language Expert for remote contract work (full-time or part-time). This role suits native or near-native speakers with proven editing, translation, or proofreading experience. You will review and enhance Nynorsk content across projects, develop language standards, guide localisation efforts, and mentor language professionals. The position requires strong attention to detail, collaborative skills, and digital tool proficiency.
Commercial Real Estate Manager
Micro1 pays $40–65/hr for experienced commercial real estate professionals to train AI systems on property valuations, lease agreements, and market analysis. You'll review transactions, assess feasibility studies, and explain real estate concepts for model development. Suited to brokers, appraisers, and property managers with 5+ years' experience who can work remotely, contributing insights on US, Canadian, UK, and European markets.
Telecommunications Expert
Micro1 seeks a telecommunications expert to guide AI model development at $20–$75 per hour on a contractor basis. You'll analyse telecom systems and protocols, document complex concepts clearly, and review AI-generated content for accuracy. The role suits experienced telecom professionals comfortable collaborating remotely and translating technical knowledge for machine learning applications.
Spanish Language Expert (Spain)
Micro1 seeks Spanish Language Experts (Spain) on a contractor basis at $10–$20 per hour. You'll review and correct machine-generated Spanish transcriptions, apply metadata tags, and annotate audio and text datasets to train AI systems. The role demands fluent Castilian Spanish, demonstrated transcription and annotation experience, meticulous attention to detail, and the ability to work independently. Prior exposure to audio engineering, transcription tools, or linguistics is valued.
English Specialist
Part-time remote role on micro1 for English specialists to train AI language models. Requires C1 English proficiency or native-level skill, plus a degree in English, linguistics or related field, with proven experience in language editing or linguistic analysis. Work involves analysing content, refining datasets, providing feedback on language use, and designing annotation guidelines. No AI experience necessary.
Physics Research Expert (Solver)
Micro1 seeks physics researchers at $80–$110 per hour to train AI systems through advanced problem-solving. PhD physicists with active research in high-energy, biophysics, condensed matter, quantum, or astrophysics fields will formulate and solve publication-level problems, document derivations in LaTeX, and develop computational models using Python and SymPy. Requires 2–5 recent publications and independent execution of novel research tasks. Ten hours weekly, remote, US/UK/Canada based.
Computer Science - Domain experts
Turing seeks PhD-level computer science experts to design advanced conceptual problems for large language model evaluation and fine-tuning. The role involves developing rigorous solutions across algorithms, systems design, cybersecurity, and other advanced CS domains, then assessing AI-generated responses for correctness and reasoning quality. Contractor position requiring minimum 4 hours weekly with PST timezone overlap.
Ph.D./Researcher Workflow
Turing seeks Ph.D. researchers to define foundational questions and document workflows in their field, breaking complex research into structured tasks. Suited to self-motivated STEM doctoral holders with strong analytical and communication skills. This is a fully remote freelance assignment working on frontier AI projects with leading LLM companies, with potential for contract extension based on performance.
Ph.D./Researcher Workflow
Turing seeks Ph.D. holders or candidates in STEM fields to document and refine their research workflows for frontier AI projects. The role involves articulating foundational research questions, mapping workflows with timelines and success metrics, and identifying cross-domain expertise gaps. Suited to self-motivated researchers with strong analytical and communication skills who can work independently in a fully remote setting. Contractor engagement with potential extension.
Ph.D./Researcher Workflow
Turing seeks Ph.D. researchers to articulate foundational research questions and map detailed workflows for frontier AI projects. You'll document timelines, tools, outputs and success criteria, breaking complex research into specific tasks. Ideal for self-motivated researchers with strong analytical and communication skills, able to work independently in a fully remote setting. Contractor basis with potential extension.
Ph.D./Researcher Workflow
Turing seeks Ph.D. holders or candidates in STEM fields to articulate foundational research questions and document their workflows for frontier AI projects. The role involves describing research methodologies, timelines, tools and success criteria, then breaking these into specific tasks. Suited to self-motivated researchers with strong analytical and communication abilities who can work independently in a fully remote setting. Contract-based with potential extension.
Annotator - STEM
Turing seeks STEM annotators to design and write AI evaluation tasks for SkillsBench, their commercial evaluation pipeline. You'll create structured challenges testing whether LLM agents perform better with domain-specific knowledge, write reference solutions, and author skill files. Requires a bachelor's degree, 1–3 years' hands-on expertise in domains like coding, finance, or data science, and a strong ability to write precise instructions. Two-month contractor role based in specified countries.
Biology STEM Expert
Turing seeks a Biology expert to design and solve complex problems for large language model evaluation. You will develop rigorous step-by-step solutions, work with LLM researchers to identify model limitations, and contribute to new evaluation benchmarks spanning undergraduate to PhD-level biology. Suited to those with a Master's or PhD in Biology, Biotechnology, Biochemistry or related fields who can explain difficult concepts clearly and think creatively.
Ph.D./Researcher Workflow
Turing seeks Ph.D. holders or candidates in STEM fields to define research workflows and foundational questions in their discipline. You'll document key processes, timelines, tools and success criteria, identifying cross-domain expertise that accelerates progress. Strong analytical and communication skills essential for independent remote work. Contractor role with potential extension.
Ph.D./Researcher Workflow
Turing seeks Ph.D. holders or candidates in STEM fields to document foundational research workflows and identify key progress indicators in their discipline. The role involves breaking down complex workflows into discrete tasks, articulating success criteria, and collaborating remotely with frontier AI labs. Suited to self-motivated researchers with strong analytical and communication skills who can work independently.
Ph.D./Researcher Workflow
Turing seeks Ph.D. researchers to document foundational questions and key workflows in their STEM field. You'll map research processes (timelines, tools, outputs, success criteria) and identify cross-domain expertise opportunities. Requires strong analytical and communication skills, self-motivation, and a reliable home setup. Fully remote contractor role with potential extension.
Business Analyst (Danish Language)
Turing seeks a Danish-English speaker with strong analytical abilities to help train large language models. You'll analyse business scenarios, answer complex questions, and provide detailed feedback to refine AI systems. The role suits independent researchers, journalists, copywriters, or technical writers comfortable working remotely with US timezone overlap. No specialist domain experience required.
Business Analyst (Japanese Language)
Turing seeks a Business Analyst fluent in English and Japanese to help refine large language models. You'll analyse business scenarios, validate claims through research, and provide detailed feedback to improve AI reasoning capabilities. The role suits self-motivated analytical thinkers with strong research skills and professional writing experience. A bachelor's degree is preferred but not essential if you have a relevant background.
Spanish (Mexico) Audio Generalist Evaluator Expert
Mercor is hiring Spanish (Mexico) audio evaluators at $50/hour for short-term, remote work on AI language model training. The role involves transcribing and annotating Spanish audio, developing evaluation standards, testing model outputs, and supporting quality assurance. Suited to those with strong writing skills, native Mexican Spanish fluency, professional English, and 10–20 hours weekly availability. Academic or analytical backgrounds preferred.
Board Game Reasoning Expert (AI Training & Evaluation)
Turing seeks Board Game Reasoning Experts to develop and evaluate AI training datasets. You'll analyse game scenarios, assess AI reasoning quality, and create evaluation rubrics using expertise in board games, game mechanics, logic, and strategic systems. Requires a bachelor's degree in an analytical discipline and 2+ years' experience in game design, playtesting, or strategy communities. Fully remote contractor role, 20 hours weekly minimum with PST overlap.
English (New Zealand) Audio Generalist Evaluator Expert
Mercor is seeking a New Zealand English audio evaluator at $50/hour for a short-term AI research project. You'll transcribe and analyse audio content, develop evaluation standards, test language models, and support benchmarking work. The role suits native or near-native NZ English speakers with strong writing skills, ideally from linguistics, humanities, journalism or technical backgrounds, committed to 10–20 hours weekly.
Uyghur Bilingual Expert
Paying $40–$95 hourly, this remote contractor role on micro1 suits native or professional-level Uyghur–English speakers with transcription or linguistic analysis experience. You will annotate video content, analyse grammar and syntax, interpret speaker tone and emotion, and support AI model training. No AI background required, but reliability and attention to detail are essential.
Basque Audio Generalist Evaluator Expert
Mercor is recruiting a Basque Audio Generalist Evaluator Expert at $50/hour for short-term remote work. The role involves transcribing and annotating Basque audio content, developing evaluation standards for language models, and assessing AI-generated outputs. Suited to fluent Basque and English speakers with strong writing skills and familiarity with dialects and contemporary usage. Ideal for students or graduates with linguistics or humanities backgrounds. Commitment of 10–20 hours weekly.
Bambara Language Expert
Contractor role on micro1 for native or near-native Bambara speakers to support AI training. You'll translate, edit, and annotate Bambara language data, develop linguistic resources, and collaborate with AI teams. Background in linguistics or translation valued. No AI experience required—your language expertise is what matters.
Kinyarwanda Language Expert
Paying £32–£76 per hour, this remote contractor role suits native or near-native Kinyarwanda speakers with translation or editing experience. You'll translate, review and evaluate content for AI language model training on the micro1 platform, applying linguistic expertise to ensure cultural and grammatical accuracy. Prior AI experience is unnecessary; your domain knowledge in Kinyarwanda is what matters.
Tamang Language Expert
Paying $40–$95 per hour, this remote contractor role on micro1 invites bilingual Tamang–English speakers to contribute to AI training through language analysis work. You'll transcribe video content, assess grammar and tone, define idioms, and refine linguistic data. The role suits professionals with native or near-native Tamang fluency and prior translation or editing experience. Minimum commitment is 15 hours weekly.
Sherpa Language Expert
Contractor role with micro1 seeking Sherpa language experts to transcribe video content, analyse speaker emotion and intent, and perform linguistic analysis. Requires fluency in Sherpa and English, with experience in transcription, translation, or linguistic work preferred. Work involves detailed transcription with timestamps, tone assessment, grammar review, and semantic evaluation of Sherpa expressions. Minimum 15 hours weekly commitment.
Lingala Bilingual Expert
Micro1 seeks native or near-native bilingual Lingala-English speakers to train AI language models through transcription and linguistic analysis work. The role involves transcribing video content, evaluating grammar and tone, and performing complex language tasks to refine AI systems. Compensation ranges from $45 to $95 per hour. Minimum 15 hours weekly commitment required. Prior AI experience unnecessary; linguistic expertise and attention to detail are essential. Suited to those with transcription, translation, or localisation backgrounds.
Telecom/Network Expert
$50/hour. Telecom and network specialists create benchmark datasets for AI model evaluation via Mercor, authoring complex tasks focused on visual document understanding and instruction-following. Work involves designing grounded exercises with clear outputs and objective assessment criteria. Suits experienced professionals able to commit 15–20 hours weekly.
Government Backoffice — Visual Document Understanding
Mercor offers $50/hour for experts to author benchmark dataset tasks evaluating AI performance on visual document understanding in government backoffice contexts. Work involves designing complex, grounded tasks with objective rubrics and clear ground-truth outputs. Approximately 15–20 hours weekly, remote, available to US and Canadian applicants.
Logistics & Supply Chain — Visual Document Understanding
Mercor offers $50/hour for logistics and supply chain specialists to evaluate AI model performance on visual document understanding tasks. You'll design complex, grounded evaluation scenarios with clear ground-truth outputs and objective assessment criteria. The role suits domain experts capable of authoring rigorous benchmark datasets. Around 15–20 hours weekly, remote across US and Canada.
Government Backoffice — Visual Document Understanding
Mercor offers $50/hour for experts to author benchmark dataset tasks evaluating AI performance on visual document understanding in government backoffice contexts. Work involves designing complex, grounded tasks with objective rubrics and clear ground-truth outputs. Approximately 15–20 hours weekly, remote, available to US and Canadian applicants.
Real Estate Appraisal — Visual Document Understanding
Mercor is seeking real estate appraisal experts for a benchmark dataset project at $50/hour. You'll design complex evaluation tasks with objective criteria to assess how AI models understand visual documents and follow instructions within the appraisal domain. The work involves authoring grounded tasks and establishing clear ground-truth outputs. Suitable for practitioners with appraisal expertise who can articulate domain knowledge clearly.
Military Operations & IHL Expert
$50–$90 per hour. micro1 seeks military operations and international humanitarian law experts to develop AI training frameworks for defence and conflict scenarios. You'll create taxonomies, dual-use triage systems, and ethical evaluation rubrics. Requires 5+ years in military analysis, defence policy, or IHL work, with advanced academic credentials or equivalent operational experience. Ideal for defence professionals, legal specialists, and policy analysts.
Tibetan Language Expert
micro1 is seeking a Tibetan language expert on a contractor basis at $40–95/hr for remote work. The role involves translating, reviewing and annotating Tibetan content to train AI systems, alongside developing linguistic quality standards and analysing model output. Native Tibetan speakers with strong written communication and translation experience are suited to this position; no prior AI experience is required.
Music Expert — AI Evaluation & Annotation (Generative Music)
Mercor pays $30–$65 per hour for trained musicians and musicologists to evaluate and annotate AI-generated music. You'll judge track quality, write detailed music descriptions, and align lyrics to audio using structured rubrics. The role suits professionals with strong critical listening skills, production knowledge, and the ability to articulate subjective musical judgements clearly. Flexible hours with remote work.
What the work looks like
Reviewing the model's work
Read what an AI produced in your field and judge whether the reasoning holds — mark where it went wrong.
Setting hard problems
Write the realistic, demanding tasks that separate competent work from confident-but-wrong.
Judging AI answers
Compare two AI outputs and say which is stronger, and why — your written reasoning is what the model learns from.
Common questions
How much does it pay?
Hourly and contractor-based, varying with seniority and role. Every role card shows its pay band. You invoice as an independent contractor and choose your hours.
Can I do this alongside my current job?
Yes — the work is flexible and part-time by design. Check your employer's policy on outside work first; ACJ can't advise on that.
Who is ACJ, and what's your part in this?
Applied Clinical Judgement is run by Sean Key. We connect qualified people to vetted AI-training platforms (Mercor, micro1, Turing), and Sean personally vouches for the people he refers. We're paid a referral fee by the platform on a successful placement — never by you.
How do I get started?
Find a role below that fits, and apply through the link — it carries Sean's referral. If you'd like him to vouch for you or talk it through first, book a short call.
Sean Key vouches for the people he refers
I’m Sean Key, editor of Applied Clinical Judgement. After 29 years in the NHS I help qualified professionals find legitimate, well-paid AI-training work — and I’ll personally vouch for you when you apply.
Applied Clinical Judgement is a referral intermediary, not an employer or recruiter. We refer candidates to third-party platforms (Mercor, micro1, Turing) and may earn a referral fee on a successful placement. We never charge candidates. Pay rates are set by the platforms and may change. PRAG-DEL-SOL-ONE LTD · Co. 07204925 · VAT 987-3626-64 · ICO ZC086000.
