Mixtral Outperforms Llama and GPT-3.5 Across Multiple Benchmarks
Table of Links
Abstract and 1. Introduction
2 Architectural details and 2.1 Sparse Mixture of Experts
3 Results
3.1 Multilingual benchmarks, 3.2 Long range performance, and 3.3 Bias Benchmarks
4 Inst...