Muon tomography, or muography, is the practice of using muons generated by cosmic rays interacting with Earth’s atmosphere to ...
These muon particles are heavyweight cousins of electrons that travel close to the speed of light. They can penetrate through many meters of solid rock, including the limestone and granite blocks used ...
A tense particle-physics showdown will reach new heights in 2025. Over the past 25 years researchers have seen a persistent and growing discrepancy between the theoretical predictions and experimental ...
近日,月之暗面团队宣布其开源改进版的Muon优化器在算力需求上相较于传统优化器AdamW锐减48%。这一突破由OpenAI的技术人员提出的训练优化算法Muon演变而来,经过团队深入研究与优化,结果令人振奋。团队通过实验发现,Muon不仅在参数量最高达到1.5B的Llama架构模型上表现优异,其算力需求仅为AdamW的52%。这一进展标志着Muon的可扩展性得到了验证,为更大规模的训练奠定了基础。
最近,月之暗面团队再次发力,推出了对OpenAI的训练优化算法Muon的改进版本,算力需求较广泛使用的AdamW下降了48%!这一成就引发了广泛关注,团队也展现了Muon在大型模型中的强大适应性。
机器之心报道编辑:陈陈、佳琪省一半算力跑出2倍效果,月之暗面开源优化器Muon,同预算下全面领先。月之暗面和 DeepSeek 这次又「撞车」了。上次是论文,两家几乎前后脚放出改进版的注意力机制,可参考《撞车 DeepSeek NSA,Kimi 杨植麟署名的新注意力架构 MoBA 发布,代码也公开》、《刚刚!DeepSeek ...
In an effort to address these challenges, Moonshot AI in collaboration with UCLA has developed Moonlight—a Mixture-of-Expert (MoE) model optimized using the Muon optimizer. Moonlight is offered in two ...
High-resolution muon imaging and AI are being used to unlock the potential of one of the largest undeveloped primary zinc ...
Company invites individual and institutional investors, as well as advisors and analysts, to attend online at ...
Dr William Sharratt, a lecturer in Soft Materials in the University’s School of Engineering, has been named as an ISIS ...