Freelance Agent Evaluation Engineer - AI Projects on Mindrift at Mindrift

The task involves creating complex datasets and simulated environments to evaluate the performance of AI coding agents on real-world developer tasks. The company is looking for experienced software engineers or test automation specialists with over 5 years of experience, particularly in Python and full-stack development. This is a freelance, project-based role where contributors design tasks and write tests to challenge frontier AI models.