INTELLECT-3: A 100B+ MoE trained with Large-Scale RL
M. Senghaas, F. Obeid, S. Jaghouar, W. Brown, J. Ong, D. Auras, M. Sirovatka, J. Straube, A. Baker, S. Müller, J. Mattern, M. Basra, A. Ismail, D. Scherm, C. Miller, A. Patel, S. Kirsten, M. Sieg, C. Reetz, K. Erdem, V. Weisser, J. Hagemann
PRIME-RL: Async & Decentralized RL Training at Scale
M. Senghaas, F. Obeid, S. Jaghouar, W. Brown, J. Ong, A. Baker, J. Mattern, D. Auras, J. Straube, M. Basra, A. Ismail, J. Hagemann
SYNTHETIC-2: Scaling Distributed Synthetic Data Generation for Verified Reasoning
M. Senghaas, J. Ong, M. Basra, J. Mattern, J. Straube, S. Jaghouar, J. Hagemann
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning
S. Jaghouar, J. Mattern, J. Ong, J. Straube, M. Basra, A. Pazdera, K. Thaman, M. Di Ferrante, F. Gabriel, F. Obeid, K. Erdem, M. Keiblinger, M. Senghaas, J. Hagemann
DiLoCo-SWARM: Towards Scalable Decentralized Training Across the Globe
M. Senghaas, M. de Vos, R. Sharma, A. Dhasade