2024-01-23

Open Llms

Open LLMs

Meta

[Llama 1-7

65B](https://ai.facebook.com/blog/large-language-model-llama-meta-ai/)

Mistral AI
- [Codestral-7 22B](https://mistral.ai/news/codestral/)
- Mistral-7B
- Mixtral-8x7B
- Mixtral-8x22B
Google
- [Gemma2-9 27B](https://blog.google/technology/developers/google-gemma-2/)
- [Gemma-2 7B](https://blog.google/technology/developers/gemma-open-models/)
- RecurrentGemma-2B
- T5
Apple
- [OpenELM-1.1 3B](https://huggingface.co/apple/OpenELM)
Microsoft
- Phi1-1.3B
- Phi2-2.7B
- [Phi3-3.8 7 14B](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct)
AllenAI
- OLMo-7B
xAI
- Grok-1-314B-MoE
Cohere
- Command R-35B

DeepSeek

[DeepSeek-Coder-1.3

6.7

33B](https://huggingface.co/collections/deepseek-ai/deepseek-coder-65f295d7d8a0a29fe39b4ec4)

[DeepSeek-VL-1.3 7B](https://huggingface.co/collections/deepseek-ai/deepseek-vl-65f295948133d9cf92b706d3)
DeepSeek-MoE-16B
DeepSeek-v2-236B-MoE
[DeepSeek-Coder-v2-16 236B-MOE](https://github.com/deepseek-ai/DeepSeek-Coder-V2)

Alibaba

[Qwen-1.8

72B](https://huggingface.co/collections/Qwen/qwen-65c0e50c3f1ab89cb8704144)

[Qwen1.5-1.8

110B](https://huggingface.co/collections/Qwen/qwen15-65c0a2f577b1ecb76d786524)

01-ai
- Yi-34B
- [Yi1.5-6 9 34B](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8)
- [Yi-VL-6B 34B](https://huggingface.co/collections/01-ai/yi-vl-663f557228538eae745769f3)
Baichuan
- [Baichuan-7 13B](https://huggingface.co/baichuan-inc)
- [Baichuan2-7 13B](https://huggingface.co/baichuan-inc)
Nvidia
- Nemotron-4-340B
BLOOM
- BLOOMZ&mT0
Zhipu AI
- [GLM-2 6 10 13 70B](https://huggingface.co/THUDM)
- CogVLM2-19B

OpenBMB

[CPM-Bee-1

10B](https://huggingface.co/collections/openbmb/cpm-bee-65d491cc84fc93350d789361)

RWKV Foundation
- [RWKV-v4 5 6](https://huggingface.co/RWKV)
ElutherAI
- [Pythia-1 1.4 2.8 6.9 12B](https://github.com/EleutherAI/pythia)
Stability AI
- StableLM-3B
- [StableLM-v2-1.6 12B](https://huggingface.co/collections/stabilityai/stable-lm-650852cfd55dd4e15cdcb30a)
- StableCode-3B

BigCode

[StarCoder-1

7B](https://huggingface.co/collections/bigcode/%E2%AD%90-starcoder-64f9bd5740eb5daaeb81dbec)

[StarCoder2-3 7 15B](https://huggingface.co/collections/bigcode/starcoder2-65de6da6e87db3383572be1a)

Shanghai AI Laboratory

[InternLM2-1.8

20B](https://huggingface.co/collections/internlm/internlm2-65b0ce04970888799707893c)

[InternLM-Math-7B 20B](https://huggingface.co/collections/internlm/internlm2-math-65b0ce88bf7d3327d0a5ad9f)

[InternLM-XComposer2-1.8

7B](https://huggingface.co/collections/internlm/internlm-xcomposer2-65b3706bf5d76208998e7477)

[InternVL-2

26](https://huggingface.co/collections/OpenGVLab/internvl-65b92d6be81c86166ca0dde4)