I am a machine learning researcher, currently at the SLAMPAI lab at JSC (Jülich Supercomputing Centre). I completed my M.Sc. thesis in Computer Science at Tel Aviv University under the supervision of Dr. Yair Carmon. Prior to that, I obtained my B.Sc. in Computer Science and Mathematics, also at Tel Aviv University.
I am interested in optimizing the scaling of ML systems to develop AI models more efficiently, predictably, and robustly – using empirical insights from small-scale experiments as well as theoretical foundations.
Publications
Resolving Discrepancies in Compute-Optimal Scaling of Language Models [code] [checkpoints]
with Mitchell Wortsman, Jenia Jitsev, Ludwig Schmidt, and Yair Carmon.
Spotlight at Conference on Neural Information Processing Systems (NeurIPS), 2024.