News_27
Super excited to share DR Tulu - an open, end-to-end trained deep research agent for long-form, real-world research tasks. We introduce a new RL recipe, Reinforcement Learning with Evolving Rubrics (RLER), to tackle the inherently hard-to-verify nature of deep research. Check out our paper and a static demo. A live demo is coming soon so please stay tuned!