Mixture of Experts

Mixture of Experts

Published on Oct 8
3286
Argmax
0:00
0:00
<p>In this episode we talk about the paper &quot;Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer&quot; by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean.</p>
Mixture of Experts - Argmax - 播刻岛