Tag
Pull request adding architecture support for the cohere2-MoE model to llama.cpp, enabling inference with this Mixture-of-Experts (MoE) model.