Layers / Decoder / Mlp
Classes
| Class | Description |
|---|---|
| MLPConfig | Configuration for a dense feedforward (MLP) layer with optional gating and activation recomputation |
| MoEMLPConfig | Configuration for a Mixture-of-Experts (MoE) feedforward layer with top-k token routing |
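
To make "top-k token routing" concrete, here is a minimal sketch in plain Python. It is not this library's API: the function name `top_k_route`, its arguments, and the choice to renormalize the selected weights are all assumptions made for illustration. For one token, the router's logits are turned into probabilities, the `k` most probable experts are kept, and their weights are rescaled to sum to 1.

```python
import math


def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts for one token.

    Hypothetical helper for illustration only; not part of MoEMLPConfig.
    Returns a list of (expert_index, weight) pairs, highest weight first.
    """
    if not 1 <= k <= len(router_logits):
        raise ValueError("k must be between 1 and the number of experts")

    # Softmax over experts, shifted by the max logit for numerical stability.
    max_logit = max(router_logits)
    exps = [math.exp(x - max_logit) for x in router_logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep the k most probable experts.
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]

    # Assumption: renormalize so the selected weights sum to 1.
    # Some implementations use the raw softmax probabilities instead.
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


# One token, four experts, route to the best two.
routes = top_k_route([2.0, 0.5, 1.0, -1.0], k=2)
```

In an MoE feedforward layer the token's output would then typically be the weighted sum of the selected experts' MLP outputs, using these weights.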