Alternatives to MoE
Options that appear in the same research papers as MoE, by co-occurrence.
| Alternative | Papers (with MoE) | Avg viability |
|---|---|---|
| Mixture-of-Experts | 2 | — |
| PyTorch | 1 | — |
| GPT-4 | 1 | — |
| GRPO | 1 | — |
| ReAct | 1 | — |
| Llama-3 | 1 | — |
| RAG | 1 | — |
| PPO | 1 | — |
| Transformer | 1 | — |
| LLaMA | 1 | — |
| DeepSeek-R1 | 1 | — |
| Llama 3 | 1 | — |
| RLHF | 1 | — |
| DPO | 1 | — |
| ORPO | 1 | — |
| ACT | 1 | — |
| multi-agent systems | 1 | — |
| o1 | 1 | — |
| MoE routing | 1 | — |
| Multi-head Latent Attention | 1 | — |
| Phi-4 | 1 | — |
| agentic systems | 1 | — |
| Action Chunking Transformer | 1 | — |
| Vision-Language-Action models | 1 | — |
| GPT-2 | 1 | — |
| LLaMA-2 | 1 | — |