Model | Author | Size | Type | Open source? |
---|---|---|---|---|
LLaMA | Meta AI | 7B-65B | Decoder | open |
OPT | Meta AI | 125M-175B | Decoder | open |
T5 | Google | 220M-11B | Encoder-Decoder | open |
mT5 | Google | 235M-13B | Encoder-Decoder | open |
UL2 | Google | 20B | Encoder-Decoder | open |
PaLM | Google | 540B | Decoder | no |
LaMDA | Google | 2B-137B | Decoder | no |
FLAN-T5 | Google | same as T5 | Encoder-Decoder | open |
FLAN-UL2 | Google | same as UL2 | Encoder-Decoder | open |
FLAN-PaLM | Google | same as PaLM | Decoder | no |
FLAN | Google | same as LaMDA | Decoder | no |
BLOOM | BigScience | 176B | Decoder | open |
T0 | BigScience | 3B-11B | Encoder-Decoder | open |
BLOOMZ | BigScience | same as BLOOM | Decoder | open |
mT0 | BigScience | same as mT5 | Encoder-Decoder | open |
GPT-Neo | EleutherAI | 125M-2.7B | Decoder | open |
GPT-NeoX | EleutherAI | 20B | Decoder | open |
GPT-3 | OpenAI | 175B (davinci) | Decoder | no |
GPT-4 | OpenAI | unknown | Decoder | no |
InstructGPT | OpenAI | 1.3B-175B | Decoder | no |
Alpaca | Stanford | same as LLaMA | Decoder | open |
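
References:

- https://blog.csdn.net/jarodyv/article/details/129992142 A roundup of open-source large language models (LLMs), continuously updated
- https://zhuanlan.zhihu.com/p/611403556 A Tencent algorithm engineer's summary of currently available LLMs
- https://blog.csdn.net/bqw18744018044/article/details/128908060 BLOOM: a 176B-parameter, open-access multilingual model
- https://mp.weixin.qq.com/s/Q3BihZjpAonVIfNuFUOVmQ In the era of the "war of a hundred models": thoughts on the future landscape of large models
- https://mp.weixin.qq.com/s/SAlWXzdqc-wIyrFEc9ujJA 108 Chinese-made large models: which are the 36 Heavenly Spirits and which are the 72 Earthly Fiends? A ranking of the hundred-model contest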