GenAI Engineer – Test Agent / CI Integration

Typical $25–60/hr Worldwide Remote · worldwide coding Contract / freelance

Pay rate · Typical $25–60/hr

Typical hourly range for this type of role — the exact rate is confirmed by the hiring company.

We are looking for a highly skilled GenAI Engineer – Test Agent / CI Integration to build and scale intelligent AI-powered testing systems for next-generation applications. The ideal candidate will work on automated test-agent frameworks, synthetic data generation, evaluation harnesses, and CI/CD-integrated AI testing pipelines.
This role requires strong expertise in Python backend development, LLM evaluation frameworks, retrieval-grounded testing systems, and modern DevOps practices.

AI Test Agent Development
• Design and develop autonomous AI-driven test agents for validating GenAI and LLM-powered applications

• Synthetic data generation
• Test-case synthesis
• Scenario generation
• Adversarial and edge-case testing
• Develop reusable evaluation harnesses for benchmarking model quality, accuracy, safety, and reliability

• Integrate test agents with BLK’s knowledge/context graph for retrieval-grounded testing
• Enable contextual test generation using RAG pipelines and graph-based retrieval systems
• Ensure generated tests align with enterprise knowledge sources and real-world workflows

• Integrate AI test agents into CI/CD pipelines as first-class pipeline jobs
• Automate regression testing, evaluation runs, and quality scoring during deployments
• Build scalable validation workflows for continuous model monitoring and release gating

This is a niche requirement and not a regular GenAI developer role. We are specifically looking for candidates with experience in:
• AI validation/testing
• QE automation
• Python backend
• CI/CD integration
• LLM evaluation frameworks
• RAG and retrieval-grounded systems

Fill in your name, country and email to proceed to next step.

Looking for something else? Browse all AI jobs →