주제에 llm-distillation 태그가 달렸습니다

글	댓글	조회수	활동
DeepSeek-R1, 지도학습 기반 파인튜닝(SFT) 대신, 강화학습(RL)으로 추론 능력을 개선하여 추론 능력을 강화한 대규모 언어 모델 읽을거리&정보공유 deepseek , llm-distillation , deepseek-r1 , deepseek-r1-zero , distilled-models	2	5545	1월 29, 2025
minitron: 15B -> 8B -> 4B 더 작고 효율적으로 정제한 모델 (feat. NVIDIA)\ 읽을거리&정보공유 nvidia , knowledge-distillation , pruning , llm-distillation , minitron , depth-pruning , width-pruning , llama-3-1-minitron	0	583	8월 26, 2024
[2024/07/15 ~ 07/21] 이번 주의 주요 ML 논문 (Top ML Papers of the Week) 읽을거리&정보공유 prompt-engineering , paper , top-ml-papers-of-the-week , rag , survey-paper , llm-reasoning , spreadsheetllm , context-embeddings , context-compression-method , needlebench , llm-distillation , llmsuite , beyond-euclid , prover-verifier-game	0	436	7월 22, 2024

DeepSeek-R1, 지도학습 기반 파인튜닝(SFT) 대신, 강화학습(RL)으로 추론 능력을 개선하여 추론 능력을 강화한 대규모 언어 모델

읽을거리&정보공유

2

5545

1월 29, 2025

minitron: 15B -> 8B -> 4B 더 작고 효율적으로 정제한 모델 (feat. NVIDIA)\

읽을거리&정보공유

nvidia , knowledge-distillation , pruning , llm-distillation , minitron , depth-pruning , width-pruning , llama-3-1-minitron

0

583

8월 26, 2024

[2024/07/15 ~ 07/21] 이번 주의 주요 ML 논문 (Top ML Papers of the Week)

읽을거리&정보공유

prompt-engineering , paper , top-ml-papers-of-the-week , rag , survey-paper , llm-reasoning , spreadsheetllm , context-embeddings , context-compression-method , needlebench , llm-distillation , llmsuite , beyond-euclid , prover-verifier-game

0

436

7월 22, 2024