Live, remote AI-training roles relevant to Software Build Agent, updated daily. Applied Clinical Judgement is a UK-based referral intermediary: we point you to genuine openings on the major training platforms and are paid only when a referral succeeds. Pay rates are shown on each role; we never display our referral fee.
50 live Software Build Agent roles · updated daily
Manufacturing - Aerospace & Defense Expert
Mercor is recruiting aerospace and defence specialists to help train frontier AI agents through Project Atlas. Earn $1,150–$1,450 per completed task. You'll recreate authentic digital workspaces and design realistic multi-step scenarios based on your experience at Fortune 500 primes or major Tier-1 suppliers. The role suits professionals with 3+ years in A&D program management, design, manufacturing, supply chain, or quality assurance who can translate actual workflows into structured AI training tasks. US Person status required.
Open Source Applied Engineer Talent Network
Mercor is hiring an open-source engineer at $100/hour to design coding evaluations, develop test cases, and analyse system performance across Python, Java, C, JavaScript, and TypeScript. This suits experienced contributors with a strong GitHub presence and demonstrated expertise in core programming fundamentals. You'll work asynchronously with a research team, identifying improvements and executing contributions independently using Git and CI/CD workflows.
Software Engineer
Contractors earn $40–75 per hour on micro1, paid per completed task meeting specifications. This role suits experienced software engineers with 3+ years' hands-on backend or full-stack experience who can reason through unfamiliar codebases and explain technical decisions clearly. You'll build reinforcement learning environments that test AI systems' ability to identify and fix security vulnerabilities, create features, refactor code, and optimise performance. Cybersecurity or SecOps background is preferred.
Member of Technical Staff, Forward Deployed (US Gov)
micro1 seeks a Member of Technical Staff to develop and deploy agentic AI systems for U.S. Government missions at $40–60/hr. Based hybrid in Washington, D.C., the role spans model experimentation, infrastructure, and forward-deployed work with strategic partners. You'll build LLM applications, design data pipelines, and own systems across discovery through deployment. Requires strong Python expertise, LLM experience, and comfort in high-security environments. Security clearance eligibility and government mission background preferred.
Forward Deployed Engineer, U.S. Government
Micro1 seeks a Forward Deployed Engineer for full-time hybrid work in Washington, D.C., supporting U.S. Government missions. The role combines building agentic AI systems, LLM applications, and data pipelines with forward-deployed collaboration on mission-critical projects. You'll own systems across discovery, architecture, deployment, and iteration. Base salary $200,000–$260,000 USD plus equity and performance bonuses.
Fullstack Engineer (Python+React)
Fullstack engineer role paying $50–$100 per hour on micro1. Design and build production web applications using Python, React, Django and Flask, working with relational databases, Docker and CI/CD pipelines. No AI experience required. Suits developers with strong backend and frontend skills who can work independently in remote, collaborative teams and prioritise code quality and clear communication.
MLE Bench – ML Engineers
Turing seeks Machine Learning Engineers with 3+ years' experience for benchmark-driven evaluation work on production ML systems. You'll build and modify training pipelines, prepare datasets, debug complex codebases, and collaborate on real-world ML engineering tasks. Strong Python proficiency, hands-on pipeline experience, and understanding of ML fundamentals required. Minimum 20 hours weekly with 4-hour PST overlap; 3-month contractor role.
SWE Bench – Data Engineer/Data Scientist
Turing seeks experienced data engineers and data scientists to build and validate data pipelines for AI system evaluation projects. You'll work with production-like datasets, design benchmarking workflows, and collaborate with researchers to create real-world data engineering tasks. Requires 3+ years in data roles, strong Python skills, and ability to work 20+ hours weekly with 4-hour PST overlap. Three-month contractor assignment.
Senior Software Engineer – LLM Evaluation (US/Canada/WEU based)
Turing seeks senior software engineers to evaluate and improve large language models through code curation, review, and refinement across multiple languages. You'll assess AI-generated code for production readiness, design verification systems, and collaborate with research teams on frontier AI projects. Requires 3+ years' engineering experience and expertise in full-stack development.
Senior Software Engineer – C#(LLM Evaluation & Repository Validation)
Turing seeks an experienced senior software engineer to evaluate and validate LLM performance on realistic software engineering tasks. Working with public GitHub repositories, you'll analyse issues, set up development environments, assess test coverage, and help build synthetic training datasets. The role combines hands-on coding with AI research, requiring 3+ years' experience and strong C# proficiency. Based in specified regions only.
Senior Software Engineer – Go (LLM Evaluation & Repository Validation)
Turing seeks experienced Go engineers (3+ years) to evaluate how large language models interact with real software. The role involves analysing GitHub repositories, configuring development environments, assessing test coverage, and triaging issues to build datasets for LLM training. Work involves hands-on coding, Docker expertise, and collaboration with researchers. Based in India, Pakistan, Nigeria, Kenya, Egypt, Ghana, Bangladesh, Turkey, or Mexico; minimum 20 hours weekly with PST overlap.
Senior Software Engineer – Ruby (LLM Evaluation & Repository Validation)
Turing seeks a senior Ruby engineer to support LLM evaluation and repository validation work. You'll analyse GitHub issues, set up development environments, assess test coverage, and evaluate how language models perform on real software engineering tasks. The role involves hands-on development, environment automation, and collaboration with researchers on dataset design. Requires 3+ years experience, Ruby proficiency, Docker knowledge, and familiarity with open-source projects. Contractor basis, minimum 20 hours weekly with PST overlap.
Senior Software Engineer – Rust (LLM Evaluation & Repository Validation)
Turing seeks experienced senior software engineers with 3+ years' background and strong Rust skills for remote contractor work evaluating how large language models perform on real software engineering tasks. You'll analyse open-source repositories, set up development environments, assess test coverage, and collaborate with researchers on LLM evaluation datasets. The role suits engineers comfortable triaging issues and running complex codebases locally. Available in nine countries; 20–40 hours weekly with PST overlap required.
Senior Software Engineer – C++ (LLM Evaluation & Repository Validation)
Turing seeks experienced C++ engineers (3+ years) to evaluate LLM performance on real-world software tasks. You'll analyse GitHub repositories, set up development environments, assess test coverage, and help identify challenging coding problems for AI systems. Work involves hands-on coding, Docker configuration, and collaboration with researchers building evaluation datasets. Fully remote; minimum 20 hours weekly with PST overlap required.
Senior Software Engineer – Python (LLM Evaluation & Repository Validation)
Turing seeks experienced software engineers to evaluate how large language models perform on real coding tasks. Based on public repositories and GitHub issues, you'll set up environments, triage problems, assess test coverage and debug code to inform LLM training datasets. The role suits engineers with 3+ years' experience, Python proficiency, and familiarity with Git and Docker who can work flexibly across distributed teams.
Senior Software Engineer – LLM Evaluation & Repository Validation
Turing seeks a senior software engineer to evaluate LLM performance on realistic coding tasks. You'll analyse GitHub repositories, triage issues, configure development environments, and assess test coverage to build training datasets. This role suits experienced engineers proficient in languages like Python, Java, Go, or Rust, with strong Git and Docker knowledge. Three-month contractor position, 20 hours weekly.
LLM C/ C++ Developer
Turing seeks C++ developers to review AI-generated code and lead collaborative feature development for next-generation dialogue systems. You'll validate code quality, guide engineering teams, and contribute to public GitHub repositories whilst training LLM models with scalable back-end components. Requires a bachelor's degree in computer science or equivalent, demonstrated leadership experience, and strong C/C++ expertise. Prior exposure to AI code-creation systems desirable.
Python + Full-Stack (JS) Developer
Turing seeks Python and full-stack JavaScript developers to build AI training solutions for US-based companies. The contractor role involves designing code for AI model optimisation, conducting model evaluations, creating datasets for supervised fine-tuning, and collaborating on RLHF processes. Minimum 20 hours weekly with 4-hour PST overlap required. Bachelor's degree in engineering or computer science (or equivalent) and Docker proficiency mandatory.
Senior Python Developer
Turing seeks experienced Python developers to support foundational LLM companies in advancing their models. You'll generate high-quality training data, conduct model evaluations, and design SFT/RLHF datasets. The work involves writing production-grade Python code, benchmarking AI outputs, and collaborating with researchers. Minimum 3 years' Python experience required, plus strong testing and debugging expertise. Fully remote contractor roles with flexible 20–40 hour weekly commitments.
Senior Software Engineer
Micro1 seeks experienced software engineers to design reinforcement learning environments that test AI models on complex coding tasks including debugging, feature creation, and optimisation. Work remotely for approximately 15 hours weekly at $40–85 per hour, on a task-based payment model. Requires proficiency in Python3, Java, Rust, or TypeScript, with strong algorithmic knowledge and proven expertise in performance optimisation and collaborative development.
Legal Technology Expert
Mercor seeks legal-operations and legal-technology professionals from AmLaw 100 firms, Fortune 500 in-house departments, or major legal-tech vendors for Project Atlas. You'll reconstruct your digital workspace—files, workflows, and platforms—then design realistic multi-step tasks to benchmark frontier AI agents. Compensation ranges from $1,750–$2,150 per completed task, with performance bonuses available. Requires 3+ years' experience with tools like Relativity, Icertis, and contract-management systems.
Software Engineer – AI Code Evaluation & Benchmarking (SWE-Bench)
Turing seeks experienced software engineers to evaluate and benchmark AI-generated code for large language models. You'll assess coding solutions, identify correctness issues, debug implementations, and build evaluation datasets. The role suits engineers with strong code review experience and deep software engineering expertise. Minimum 20 hours weekly with 4-hour PST overlap; one-month contractor assignment.
Senior Software Engineer
Micro1 seeks experienced software engineers to design reinforcement learning environments that test AI models on complex coding tasks including debugging, feature creation, and optimisation. Work remotely for approximately 15 hours weekly at $40–85 per hour, on a task-based payment model. Requires proficiency in Python3, Java, Rust, or TypeScript, with strong algorithmic knowledge and proven expertise in performance optimisation and collaborative development.
Software Engineer
Contractors earn $40–75 per hour on micro1, paid per completed task meeting specifications. This role suits experienced software engineers with 3+ years' hands-on backend or full-stack experience who can reason through unfamiliar codebases and explain technical decisions clearly. You'll build reinforcement learning environments that test AI systems' ability to identify and fix security vulnerabilities, create features, refactor code, and optimise performance. Cybersecurity or SecOps background is preferred.
Competitive Coder
Earning $40–$80 per hour, this remote contractor role suits experienced competitive programmers. You'll design and implement checkers for programming problems, validate submissions against complex constraints, and develop robust C++ solutions. The work involves collaborating with platform teams, documenting logic clearly, and maintaining high code quality under tight deadlines on micro1.
Senior Frontend Developer (Javascript)
Micro1 seeks a Senior Frontend Developer at $30–$80 per hour on a contractor basis. You'll design and optimize sophisticated JavaScript applications using modern frameworks, mentor junior developers, and help train AI systems through high-quality real-world input. The role requires extensive web application experience, expertise in React or similar frameworks, and proven architectural leadership. Remote, distributed team environment.
Software Engineer Expert
Mercor is recruiting Software Engineers at $40–$50/hour to develop MCP servers and integrate applications into its RL Studio platform. You'll build backend systems using Python and FastMCP, manage Docker and Linux environments, and ensure apps meet production standards. The role suits engineers with solid Python skills, API experience, and familiarity with containerisation and debugging workflows.
Member of Technical Staff, Research Engineering
micro1 seeks a Research Engineer to develop reinforcement learning systems at scale. The full-time remote role, paying $140,000–$180,000 USD base salary, involves architecting RL environments, designing training pipelines, building synthetic data systems, and establishing evaluation frameworks. You'll fine-tune open-source models and contribute to benchmark releases. Suited to engineers with deep RL experience, proven track records scaling RL systems, and familiarity with automated evaluation and data generation workflows.
Software Engineer
Paying $40–$75 per hour on a task-based model, this remote contractor role suits experienced backend and full-stack engineers with cybersecurity exposure. Working via micro1, you'll design reinforcement learning environments that test AI systems' ability to identify and patch security vulnerabilities in code. Responsibilities include code review, debugging across multiple languages, evaluating solutions, and communicating technical reasoning. Requires 3+ years of production engineering experience and familiarity with at least one major backend language.
Software Engineer
Contractors earn $40–75 per hour on micro1, paid per completed task meeting specifications. This role suits experienced software engineers with 3+ years' hands-on backend or full-stack experience who can reason through unfamiliar codebases and explain technical decisions clearly. You'll build reinforcement learning environments that test AI systems' ability to identify and fix security vulnerabilities, create features, refactor code, and optimise performance. Cybersecurity or SecOps background is preferred.
Member of Technical Staff, Forward Deployed (US Gov)
micro1 seeks a Member of Technical Staff to develop and deploy agentic AI systems for U.S. Government missions at $40–60/hr. Based hybrid in Washington, D.C., the role spans model experimentation, infrastructure, and forward-deployed work with strategic partners. You'll build LLM applications, design data pipelines, and own systems across discovery through deployment. Requires strong Python expertise, LLM experience, and comfort in high-security environments. Security clearance eligibility and government mission background preferred.
Competitive Coder
Earning $40–$80 per hour, this remote contractor role suits experienced competitive programmers. You'll design and implement checkers for programming problems, validate submissions against complex constraints, and develop robust C++ solutions. The work involves collaborating with platform teams, documenting logic clearly, and maintaining high code quality under tight deadlines on micro1.
Software Engineer
Micro1 seeks experienced backend software engineers at $40–75/hour to build reinforcement learning environments that test AI model capabilities on realistic programming tasks. The role demands expertise in Java, Node.js, Go, Rust, or Python, with emphasis on secure coding, thorough testing, and code quality review. Cybersecurity or SecOps background is highly preferred. Work is output-based, asynchronous, and remote; candidates must commit 10–15+ hours weekly and start tasking within 48 hours of onboarding.
Game Developer (Java / libGDX)
Contractor role paying $50–120 per hour on micro1. Game developers with Java and libGDX experience needed to build 2D game features and help train AI systems through high-quality interactive data. Portfolio or demo projects preferred. Fully remote, no prior AI experience required.
Python Game Developer (Panda3D)
Micro1 seeks a Python game developer specialising in Panda3D to design and build 3D simulations for AI training. Paying $50–$120 per hour, the contract role suits experienced developers with proven hands-on Panda3D expertise and strong Python and C++ skills. You'll optimise game environments for diverse hardware, collaborate via GitHub, and document technical decisions clearly. Domain knowledge matters more than prior AI experience.
Software Engineer
Micro1 seeks experienced backend software engineers at $40–75/hour to build reinforcement learning environments that test AI model capabilities on realistic programming tasks. The role demands expertise in Java, Node.js, Go, Rust, or Python, with emphasis on secure coding, thorough testing, and code quality review. Cybersecurity or SecOps background is highly preferred. Work is output-based, asynchronous, and remote; candidates must commit 10–15+ hours weekly and start tasking within 48 hours of onboarding.
Member of Technical Staff, Forward Deployed (US Gov)
micro1 seeks a Member of Technical Staff to develop and deploy agentic AI systems for U.S. Government missions at $40–60/hr. Based hybrid in Washington, D.C., the role spans model experimentation, infrastructure, and forward-deployed work with strategic partners. You'll build LLM applications, design data pipelines, and own systems across discovery through deployment. Requires strong Python expertise, LLM experience, and comfort in high-security environments. Security clearance eligibility and government mission background preferred.
Software Engineer
Paying $40–$75 per hour on a task-based model, this remote contractor role suits experienced backend and full-stack engineers with cybersecurity exposure. Working via micro1, you'll design reinforcement learning environments that test AI systems' ability to identify and patch security vulnerabilities in code. Responsibilities include code review, debugging across multiple languages, evaluating solutions, and communicating technical reasoning. Requires 3+ years of production engineering experience and familiarity with at least one major backend language.
Senior Software Engineer
Micro1 seeks experienced software engineers to design reinforcement learning environments that test AI models on complex coding tasks including debugging, feature creation, and optimisation. Work remotely for approximately 15 hours weekly at $40–85 per hour, on a task-based payment model. Requires proficiency in Python3, Java, Rust, or TypeScript, with strong algorithmic knowledge and proven expertise in performance optimisation and collaborative development.
Software Engineer
Contractors earn $40–75 per hour on micro1, paid per completed task meeting specifications. This role suits experienced software engineers with 3+ years' hands-on backend or full-stack experience who can reason through unfamiliar codebases and explain technical decisions clearly. You'll build reinforcement learning environments that test AI systems' ability to identify and fix security vulnerabilities, create features, refactor code, and optimise performance. Cybersecurity or SecOps background is preferred.
Cocos2d-x Junior Game Developer
Remote contractor role at $50–$120 per hour. Develop game logic, mechanics and assets using Cocos2d-x for AI training projects. Suit developers with hands-on 2D game experience and strong troubleshooting skills. You'll collaborate with distributed teams, document code thoroughly, and refine gameplay based on feedback. No AI background required; domain expertise in game development is what matters.
Junior Python Game Developer (Panda3D)
Earning $50–$120 per hour, this remote contractor role suits junior Python developers with hands-on experience in Panda3D game engine development. You'll analyse game code, engine implementations, and development workflows, providing detailed feedback and domain expertise to train AI systems. The work involves code review, technical documentation, and iterative collaboration with a distributed team on micro1.
Game Developer (Java / libGDX)
Contractor role paying $50–120 per hour on micro1. Game developers with Java and libGDX experience needed to build 2D game features and help train AI systems through high-quality interactive data. Portfolio or demo projects preferred. Fully remote, no prior AI experience required.
Software Engineer
Paying $40–$75 per hour on a task-based model, this remote contractor role suits experienced backend and full-stack engineers with cybersecurity exposure. Working via micro1, you'll design reinforcement learning environments that test AI systems' ability to identify and patch security vulnerabilities in code. Responsibilities include code review, debugging across multiple languages, evaluating solutions, and communicating technical reasoning. Requires 3+ years of production engineering experience and familiarity with at least one major backend language.
UK Based Software Engineers (COBOL)
$130–$170 per hour. Mercor seeks UK-based software engineers with COBOL expertise and at least 5 years' experience, preferably from major tech firms. The role involves creating high-quality training data for advanced AI systems. Fully remote and project-based, with 40 hours weekly commitment. Strong performance may lead to future opportunities with Mercor.
Software Engineer
Micro1 seeks experienced backend software engineers at $40–75/hour to build reinforcement learning environments that test AI model capabilities on realistic programming tasks. The role demands expertise in Java, Node.js, Go, Rust, or Python, with emphasis on secure coding, thorough testing, and code quality review. Cybersecurity or SecOps background is highly preferred. Work is output-based, asynchronous, and remote; candidates must commit 10–15+ hours weekly and start tasking within 48 hours of onboarding.
Software Engineer
Paying $40–$75 per hour on a task-based model, this remote contractor role suits experienced backend and full-stack engineers with cybersecurity exposure. Working via micro1, you'll design reinforcement learning environments that test AI systems' ability to identify and patch security vulnerabilities in code. Responsibilities include code review, debugging across multiple languages, evaluating solutions, and communicating technical reasoning. Requires 3+ years of production engineering experience and familiarity with at least one major backend language.
Senior Software Engineer
Micro1 seeks experienced software engineers to design reinforcement learning environments that test AI models on complex coding tasks including debugging, feature creation, and optimisation. Work remotely for approximately 15 hours weekly at $40–85 per hour, on a task-based payment model. Requires proficiency in Python3, Java, Rust, or TypeScript, with strong algorithmic knowledge and proven expertise in performance optimisation and collaborative development.
Software Engineer
Contractors earn $40–75 per hour on micro1, paid per completed task meeting specifications. This role suits experienced software engineers with 3+ years' hands-on backend or full-stack experience who can reason through unfamiliar codebases and explain technical decisions clearly. You'll build reinforcement learning environments that test AI systems' ability to identify and fix security vulnerabilities, create features, refactor code, and optimise performance. Cybersecurity or SecOps background is preferred.
Java Developer
£16–£40 per hour. micro1 seeks expert Java developers to train AI systems through high-quality backend work. You'll design and maintain scalable Spring-based solutions, integrate APIs, and collaborate with cross-functional teams on enterprise projects. Ideal for experienced contractors comfortable working independently in remote settings with clear communication skills.
No live roles match your search.
AI training work is organised by profession, task and software — not by topic or sector. Try your field (for example “nursing” or “Python”), clear the filters, or browse the categories further down the page. The always-open talent pools below are a good place to start.
