..
Open Llms
Open LLMs
- Meta
-
[Llama 3.1-8 70 405B](https://llama.meta.com/) -
[Llama 3-8 70B](https://llama.meta.com/llama3/) -
[Llama 2-7 13 70B](https://llama.meta.com/llama2/) -
[Llama 1-7 13 33 65B](https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) -
[OPT-1.3 6.7 13 30 66B](https://arxiv.org/abs/2205.01068)
-
- Mistral AI
-
[Codestral-7 22B](https://mistral.ai/news/codestral/) - Mistral-7B
- Mixtral-8x7B
- Mixtral-8x22B
-
- Google
-
[Gemma2-9 27B](https://blog.google/technology/developers/google-gemma-2/) -
[Gemma-2 7B](https://blog.google/technology/developers/gemma-open-models/) - RecurrentGemma-2B
- T5
-
- Apple
-
[OpenELM-1.1 3B](https://huggingface.co/apple/OpenELM)
-
- Microsoft
- AllenAI
- xAI
- Cohere
- DeepSeek
- DeepSeek-Math-7B
-
[DeepSeek-Coder-1.3 6.7 7 33B](https://huggingface.co/collections/deepseek-ai/deepseek-coder-65f295d7d8a0a29fe39b4ec4) -
[DeepSeek-VL-1.3 7B](https://huggingface.co/collections/deepseek-ai/deepseek-vl-65f295948133d9cf92b706d3) - DeepSeek-MoE-16B
- DeepSeek-v2-236B-MoE
-
[DeepSeek-Coder-v2-16 236B-MOE](https://github.com/deepseek-ai/DeepSeek-Coder-V2)
- Alibaba
-
[Qwen-1.8 7 14 72B](https://huggingface.co/collections/Qwen/qwen-65c0e50c3f1ab89cb8704144) -
[Qwen1.5-1.8 4 7 14 32 72 110B](https://huggingface.co/collections/Qwen/qwen15-65c0a2f577b1ecb76d786524) - CodeQwen-7B
- Qwen-VL-7B
-
[Qwen2-0.5 1.5 7 57-MOE 72B](https://qwenlm.github.io/blog/qwen2/)
-
- 01-ai
- Yi-34B
-
[Yi1.5-6 9 34B](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) -
[Yi-VL-6B 34B](https://huggingface.co/collections/01-ai/yi-vl-663f557228538eae745769f3)
- Baichuan
-
[Baichuan-7 13B](https://huggingface.co/baichuan-inc) -
[Baichuan2-7 13B](https://huggingface.co/baichuan-inc)
-
- Nvidia
- BLOOM
- Zhipu AI
-
[GLM-2 6 10 13 70B](https://huggingface.co/THUDM) - CogVLM2-19B
-
- OpenBMB
- MiniCPM-2B
- OmniLLM-12B
- VisCPM-10B
-
[CPM-Bee-1 2 5 10B](https://huggingface.co/collections/openbmb/cpm-bee-65d491cc84fc93350d789361)
- RWKV Foundation
-
[RWKV-v4 5 6](https://huggingface.co/RWKV)
-
- ElutherAI
-
[Pythia-1 1.4 2.8 6.9 12B](https://github.com/EleutherAI/pythia)
-
- Stability AI
- StableLM-3B
-
[StableLM-v2-1.6 12B](https://huggingface.co/collections/stabilityai/stable-lm-650852cfd55dd4e15cdcb30a) - StableCode-3B
- BigCode
-
[StarCoder-1 3 7B](https://huggingface.co/collections/bigcode/%E2%AD%90-starcoder-64f9bd5740eb5daaeb81dbec) -
[StarCoder2-3 7 15B](https://huggingface.co/collections/bigcode/starcoder2-65de6da6e87db3383572be1a)
-
- DataBricks
- Shanghai AI Laboratory
-
[InternLM2-1.8 7 20B](https://huggingface.co/collections/internlm/internlm2-65b0ce04970888799707893c) -
[InternLM-Math-7B 20B](https://huggingface.co/collections/internlm/internlm2-math-65b0ce88bf7d3327d0a5ad9f) -
[InternLM-XComposer2-1.8 7B](https://huggingface.co/collections/internlm/internlm-xcomposer2-65b3706bf5d76208998e7477) -
[InternVL-2 6 14 26](https://huggingface.co/collections/OpenGVLab/internvl-65b92d6be81c86166ca0dde4)
-