AI Research Engineer (Multi-Modal Reinforcement Learning)

Apply for this job

You will be redirected to the employer’s website

Share this job

Netherlands
Competitive
22 May 2026

Full-Time
On-Site

Job Description

…Design and build scalable RL infrastructure supporting distributed training and evaluation across complex multi-modal environments.
Develop reward modeling strategies to improve alignment, training stability, and mitigate failure modes such as reward hacking.
Create…