Mindrift

Freelance Agent Evaluation Engineer - AI Projects on Mindrift

United Kingdom
English
up to $50 per hour

The task involves creating complex datasets and simulated environments to evaluate the performance of AI coding agents on real-world developer tasks. The company is looking for experienced software engineers or test automation specialists with over 5 years of experience, particularly in Python and full-stack development. This is a freelance, project-based role where contributors design tasks and write tests to challenge frontier AI models.

View on Mindrift

© 2025 AIJobList. All listings link to original sources.

Made with for humans training AI