A list of open-source LLMs with licenses that permit commercial use

9bow · May 9, 2023, 4:39 AM

Following MPT-7B, which I introduced earlier, LLMs released under commercially usable licenses keep appearing. Here is a GitHub repository that collects them (and is still being updated).

Below is the list of LLMs under the MIT, Apache 2.0, and OpenRAIL-M licenses as of today (May 9, 2023).

Open LLMs

These LLMs are all licensed for commercial use (e.g., Apache 2.0, MIT, OpenRAIL-M). Contributions welcome!

Language Model	Checkpoints	Paper/Blog	Size	Context Length	Licence
T5	T5 & Flan-T5, Flan-T5-xxl (HF)	Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer	60M - 11B	512	Apache 2.0
UL2	UL2 & Flan-UL2, Flan-UL2 (HF)	UL2 20B: An Open Source Unified Language Learner	20B	512, 2048	Apache 2.0
Cerebras-GPT	Cerebras-GPT	Cerebras-GPT: A Family of Open, Compute-efficient, Large Language Models (Paper)	111M - 13B	2048	Apache 2.0
Pythia	pythia 70M - 12B	Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling	70M - 12B	2048	Apache 2.0
Dolly	dolly-v2-12b	Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM	3B, 7B, 12B	2048	MIT
RWKV	RWKV, ChatRWKV	The RWKV Language Model (and my LM tricks)	100M - 14B	infinity (RNN)	Apache 2.0
GPT-J-6B	GPT-J-6B, GPT4All-J	GPT-J-6B: 6B JAX-Based Transformer	6B	2048	Apache 2.0
GPT-NeoX-20B	GPT-NEOX-20B	GPT-NeoX-20B: An Open-Source Autoregressive Language Model	20B	2048	Apache 2.0
Bloom	Bloom	BLOOM: A 176B-Parameter Open-Access Multilingual Language Model	176B	2048	OpenRAIL-M v1
StableLM-Alpha	StableLM-Alpha	Stability AI Launches the First of its StableLM Suite of Language Models	3B - 65B	4096	CC BY-SA-4.0
FastChat-T5	fastchat-t5-3b-v1.0	We are excited to release FastChat-T5: our compact and commercial-friendly chatbot!	3B	512	Apache 2.0
h2oGPT	h2oGPT	Building the World’s Best Open-Source Large Language Model: H2O.ai’s Journey	12B - 20B	256 - 2048	Apache 2.0
MPT-7B	MPT-7B, MPT-7B-Instruct	Introducing MPT-7B: A New Standard for Open-Source, Commercially Usable LLMs	7B	84k (ALiBi)	Apache 2.0, CC BY-SA-3.0
RedPajama-INCITE	RedPajama-INCITE	Releasing 3B and 7B RedPajama-INCITE family of models including base, instruction-tuned & chat models	3B - 7B	?	Apache 2.0
OpenLLaMA	OpenLLaMA-7b-preview-300bt	OpenLLaMA: An Open Reproduction of LLaMA	7B	2048	Apache 2.0

LLMs for code

Language Model	Checkpoints	Paper/Blog	Size	Context Length	Licence
SantaCoder	santacoder	SantaCoder: don't reach for the stars!	1.1B	2048	OpenRAIL-M v1
StarCoder	starcoder	StarCoder: A State-of-the-Art LLM for Code, StarCoder: May the source be with you!	15B	8192	OpenRAIL-M v1
Replit Code	replit-code-v1-3b	Training a SOTA Code LLM in 1 week and Quantifying the Vibes — with Reza Shabani of Replit	2.7B	infinity? (ALiBi)	CC BY-SA-4.0
CodeGen2	codegen2 1B-16B	CodeGen2: Lessons for Training LLMs on Programming and Natural Languages	1B - 16B	2048	Apache 2.0

Evals on open LLMs

LLM datasets for fine-tuning

PENDING

Want to contribute? Just add a row above.

What do the licences mean?

Apache 2.0: Allows users to use the software for any purpose, to distribute it, to modify it, and to distribute modified versions of the software under the terms of the license, without concern for royalties.
MIT: Similar to Apache 2.0 but shorter and simpler. Also, in contrast to Apache 2.0, does not require stating any significant changes to the original code.
CC BY-SA-4.0: Allows (i) copying and redistributing the material and (ii) remixing, transforming, and building upon the material for any purpose, even commercially. But if you do the latter, you must distribute your contributions under the same license as the original. (Thus, may not be viable for internal teams.)
OpenRAIL-M v1: Allows royalty-free access and flexible downstream use and sharing of the model and modifications of it, and comes with a set of use restrictions (see Attachment A).
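
To filter the tables by what each license asks of you, the notes above can be encoded as a small lookup table. The following is a minimal sketch, not a legal reading: the names `LicenseNote`, `LICENSE_NOTES`, `MODELS`, and `models_without` are hypothetical, the two flags cover only what the summaries above state, and only a handful of rows from the tables are included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LicenseNote:
    share_alike: bool       # derivatives must be distributed under the same license
    use_restrictions: bool  # license comes with a list of restricted uses


# Assumption: flags follow the summaries in this post only.
LICENSE_NOTES = {
    "Apache 2.0": LicenseNote(share_alike=False, use_restrictions=False),
    "MIT": LicenseNote(share_alike=False, use_restrictions=False),
    "CC BY-SA-4.0": LicenseNote(share_alike=True, use_restrictions=False),
    "OpenRAIL-M v1": LicenseNote(share_alike=False, use_restrictions=True),
}

# A few (model, license) pairs copied from the tables above.
MODELS = [
    ("Pythia", "Apache 2.0"),
    ("Dolly", "MIT"),
    ("StableLM-Alpha", "CC BY-SA-4.0"),
    ("Bloom", "OpenRAIL-M v1"),
    ("Replit Code", "CC BY-SA-4.0"),
    ("StarCoder", "OpenRAIL-M v1"),
]


def models_without(flag, models=MODELS):
    """Return model names whose license does not set the given flag."""
    return [name for name, lic in models if not getattr(LICENSE_NOTES[lic], flag)]


print(models_without("share_alike"))
print(models_without("use_restrictions"))
```

Calling `models_without("share_alike")` keeps the models whose license has no same-license requirement for derivatives, which is the case the CC BY-SA-4.0 note flags as a possible problem for internal teams.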

Disclaimer: The information provided in this repo does not, and is not intended to, constitute legal advice. Maintainers of this repo are not responsible for the actions of third parties who use the models. Please consult an attorney before using models for commercial purposes.