주제에 llm-in-production 태그가 달렸습니다

글	댓글	조회수	활동
Any-LLM: 하나의 API로 다양한 LLM을 제어하는 통합 프레임워크 (feat.Mozilla AI) 읽을거리&정보공유 llm-in-production , llm-interpretability , any-llm , mozilla , mozilla-ai , any-llm-gateway , any-suite , any-agent , any-guardrail , mcpd	0	339	11월 12, 2025
Dynamo: NVIDIA가 공개한 고성능 AI 추론 프레임워크 읽을거리&정보공유 nvidia , llm-in-production , llm-framework , llm-serving , dynamo	0	1276	3월 20, 2025
Humbug: 여러 모델을 사용할 수 있는 AI 중심의 오픈소스 개발 도구 읽을거리&정보공유 llm-for-code , llm-in-production , llm-for-software-engineering , humbug	0	242	1월 22, 2025
Moonlight: 연구자들을 위한 논문 AI PDF 뷰어 읽을거리&정보공유 tool , tldr-ai , paper , llm-in-production , gpt-researcher , pdf-extraction-tool , visualization-tool , pdf-reader , pdf-translation , latex	0	4686	1월 9, 2025
Semantic Reader: AI를 활용한 논문 읽기 도구 (feat. AI2) 읽을거리&정보공유 tool , paper , llm-in-production , allen-ai , visualization-tool , semantic-reader , semantic-scholar	2	961	10월 23, 2024
[2024/08/19 ~ 08/25] 이번 주의 주요 ML 논문 (Top ML Papers of the Week) 읽을거리&정보공유 paper , llm-in-production , tabular-llm , top-ml-papers-of-the-week , survey-paper , graphrag , adas , agentic-system , minitron , google-vizier , magicdec , controllable-text-generation , pedal	0	780	8월 26, 2024
[GN⁺] Postgres.new: AI 인터페이스를 갖춘 브라우저 내 Postgres 읽을거리&정보공유 vector-db , postgres , llm-in-production , pgvector , vector-search , ai , text-to-sql , postgres-new , pg-gateway , transformers-js	0	210	8월 14, 2024
[GN⁺] RouteLLM - LLM 라우터 서빙 및 평가를 위한 프레임워크 읽을거리&정보공유 geeknews , multi-llm-support , llm-in-production , llm-framework , llm-evaluation , llm-applications , anyscale , lmsys , routellm , llm-serving	0	765	7월 12, 2024
AI 알리바이(AI Alibis): LLM 및 멀티 에이전트 기반의 텍스트 기반 추리 게임 읽을거리&정보공유 llm-in-production , multi-agent , game-maker , ai-alibis , ai-game , webapp , mystery-game	0	357	7월 12, 2024
OpenPipe, MoA 기법 활용하여 25배 낮은 가격으로 GPT-4 성능을 뛰어넘는 모델 제공 읽을거리&정보공유 gpt-4 , llm-in-production , openpipe , mixture-of-agents , moa-gpt-4	0	798	6월 21, 2024
TokenCost: LLM 애플리케이션을 위한 사용 토큰 계산 및 비용 추정 도구 읽을거리&정보공유 llm-in-production , tool-for-llm , tokencost , cost-estimation	0	783	6월 18, 2024
Paddler, llama.cpp 서버 최적화를 위한 오픈소스 로드 밸런서 읽을거리&정보공유 llamacpp , llm-in-production , mit-license , paddler , load-balancer , golang	0	321	6월 14, 2024
LlamaNet: 1~2줄의 코드 변환만으로 OpenAI 기반 애플리케이션을 llama.cpp 기반 로컬 모델로 쉽게 변경 가능한 라이브러리 읽을거리&정보공유 python , llamacpp , opensource , llm-in-production , javascript , local-llm , llamanet , openai-api-compatibility	0	422	6월 12, 2024
[GN] 1년 동안 LLM과 함께 구축하며 배운 점 읽을거리&정보공유 geeknews , llm , prompt , llm-in-production , rag , ai-engineering	0	2001	6월 11, 2024
대규모 언어 모델을 위한 검색-증강 생성(RAG) 기술 현황 - 1/2편 읽을거리&정보공유 llm , paper , embedding , rag , prompt-compression , naive-rag , indexing , retrieve , generation , advanced-rag , modular-rag , chunking , rag-pipeline-optimization , llm-in-production , survey-paper	9	16155	6월 3, 2024
구조화된 출력에서 환각 현상을 줄이기 위한 RAG (feat. ServiceNow) 읽을거리&정보공유 paper , llm-in-production , rag , hallucination , servicenow , structured-data-generation , reducing-hallucination	0	700	5월 25, 2024
2024년 LLM 모델 개발 트렌드 관련 영상 [영어/유튜브] 읽을거리&정보공유 youtube , llm , llm-finetuning , llm-in-production , llm-library , llm-training	3	2318	5월 22, 2024
LLM 상용화 시, 비용을 낮추면서 성능 향상을 위한 3가지 전략 (feat. FrugalGPT by Portkey) 읽을거리&정보공유 prompt-engineering , frugalgpt , llm-finetuning , llm-in-production , prompt-compression , llm-efficiency-challenge , portkey , llm-cascade , llm-approximation , query-cache , prompt-adaptation	0	792	5월 4, 2024
OpenLIT: OpenTelemetry 기반 생성형 AI 및 LLM 모니터링 도구 읽을거리&정보공유 tool , genai , llm , llm-in-production , observability , opentelemetry , openlit	0	575	5월 2, 2024
AI에 관심이 있는 개발자라면 Embedding(임베딩)부터 시작해보세요! 😉 읽을거리&정보공유 vector-db , embedding , llm-in-production , pgvector , rag	0	1276	4월 18, 2024
상용 수준의 LLM 애플리케이션 구축하기 (무료/영어/온라인) 행사&이벤트 홍보 deeplearningai , webinar , llm-in-production , llm-applications	2	299	3월 5, 2024
[GN] Menlo Ventures가 공개한 최신 AI 스택 : 기업용 AI의 미래를 위한 설계 원칙 읽을거리&정보공유 geeknews , genai , llm-in-production , tech-stack , enterprise-ai , rag , observability , ai-stack , llm-deployment	0	392	1월 31, 2024
RadixAttention과 SGLang을 활용한 LLM 프로그래밍 혁신 (feat. LMSYS) 읽을거리&정보공유 llm , guidance , llm-in-production , vllm , lmsys , sglang , radixattention	0	2661	1월 20, 2024
[GN⁺] 스마트 홈 제어를 위한 완전 로컬 LLM 음성 비서 구축하기 읽을거리&정보공유 geeknews , llm , llm-in-production , personal-assistant , mixtral , glados , vllm , home-assistant	0	760	1월 15, 2024
메두사: 여러 디코딩 헤더를 사용한 대규모 언어 모델 추론 가속화 프레임워크 (Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads) 읽을거리&정보공유 framework , llm-in-production , llm-framework , medusa , llm-acceleration , speculative-decoding	0	1193	12월 26, 2023
LongLLMLingua: 중간 손실을 줄이고 프롬프트 압축을 통한 RAG 비용 절감 (LongLLMLingua: Bye-bye to Middle Loss and Save on Your RAG Costs via Prompt Compression 읽을거리&정보공유 llm , llm-agent , framework , llm-in-production , llm-framework , rag , llmlingua , longllmlingua , prompt-compression , middle-loss , reranking	0	1116	12월 22, 2023
[GN] Microsoft (Long)LLMLingua - 추론 가속 및 비용 절감을 위해 프롬프트 압축하기 읽을거리&정보공유 geeknews , prompt , microsoft , llm-in-production , llm-inference , llmlingua , longllmlingua	0	474	12월 22, 2023
[GN] PowerInfer - 소비자용 GPU를 사용해서 빠르게 LLM 서빙하기 읽을거리&정보공유 geeknews , ggml , llamacpp , falcon , llm-in-production , llm-framework , gguf , powerinfer	0	773	12월 21, 2023
PyTorchKR이 정리한 오늘의 주요 AI/ML 소식들 @ 2023-12-12: StripedHyena-7B, Zephyr-3B, FollowMe 등 읽을거리&정보공유 howto , stablelm , together , llm-in-production , optimizing , gpu-optimization , robot , small-llm , zephyr-llm , pytorchkr-news , followme	1	394	12월 31, 2023
상용 수준의 LLM 애플리케이션을 위한 개발자 가이드(The Developer's Guide to Production-Grade LLM Apps) 읽을거리&정보공유 howto , llm , prompt-engineering , llm-finetuning , llm-in-production , llm-evaluation , rag	0	5424	11월 24, 2023

Any-LLM: 하나의 API로 다양한 LLM을 제어하는 통합 프레임워크 (feat.Mozilla AI)

llm-in-production , llm-interpretability , any-llm , mozilla , mozilla-ai , any-llm-gateway , any-suite , any-agent , any-guardrail , mcpd

0

339

11월 12, 2025

Dynamo: NVIDIA가 공개한 고성능 AI 추론 프레임워크

읽을거리&정보공유

nvidia , llm-in-production , llm-framework , llm-serving , dynamo

0

1276

3월 20, 2025

Humbug: 여러 모델을 사용할 수 있는 AI 중심의 오픈소스 개발 도구

읽을거리&정보공유

llm-for-code , llm-in-production , llm-for-software-engineering , humbug

0

242

1월 22, 2025

Moonlight: 연구자들을 위한 논문 AI PDF 뷰어

읽을거리&정보공유

tool , tldr-ai , paper , llm-in-production , gpt-researcher , pdf-extraction-tool , visualization-tool , pdf-reader , pdf-translation , latex

0

4686

1월 9, 2025

Semantic Reader: AI를 활용한 논문 읽기 도구 (feat. AI2)

읽을거리&정보공유

tool , paper , llm-in-production , allen-ai , visualization-tool , semantic-reader , semantic-scholar

2

961

10월 23, 2024

[2024/08/19 ~ 08/25] 이번 주의 주요 ML 논문 (Top ML Papers of the Week)

읽을거리&정보공유

paper , llm-in-production , tabular-llm , top-ml-papers-of-the-week , survey-paper , graphrag , adas , agentic-system , minitron , google-vizier , magicdec , controllable-text-generation , pedal

0

780

8월 26, 2024

[GN⁺] Postgres.new: AI 인터페이스를 갖춘 브라우저 내 Postgres

읽을거리&정보공유

vector-db , postgres , llm-in-production , pgvector , vector-search , ai , text-to-sql , postgres-new , pg-gateway , transformers-js

0

210

8월 14, 2024

[GN⁺] RouteLLM - LLM 라우터 서빙 및 평가를 위한 프레임워크

읽을거리&정보공유

geeknews , multi-llm-support , llm-in-production , llm-framework , llm-evaluation , llm-applications , anyscale , lmsys , routellm , llm-serving

0

765

7월 12, 2024

AI 알리바이(AI Alibis): LLM 및 멀티 에이전트 기반의 텍스트 기반 추리 게임

읽을거리&정보공유

llm-in-production , multi-agent , game-maker , ai-alibis , ai-game , webapp , mystery-game

0

357

7월 12, 2024

OpenPipe, MoA 기법 활용하여 25배 낮은 가격으로 GPT-4 성능을 뛰어넘는 모델 제공

읽을거리&정보공유

gpt-4 , llm-in-production , openpipe , mixture-of-agents , moa-gpt-4

0

798

6월 21, 2024

TokenCost: LLM 애플리케이션을 위한 사용 토큰 계산 및 비용 추정 도구

읽을거리&정보공유

llm-in-production , tool-for-llm , tokencost , cost-estimation

0

783

6월 18, 2024

Paddler, llama.cpp 서버 최적화를 위한 오픈소스 로드 밸런서

읽을거리&정보공유

llamacpp , llm-in-production , mit-license , paddler , load-balancer , golang

0

321

6월 14, 2024

LlamaNet: 1~2줄의 코드 변환만으로 OpenAI 기반 애플리케이션을 llama.cpp 기반 로컬 모델로 쉽게 변경 가능한 라이브러리

읽을거리&정보공유

python , llamacpp , opensource , llm-in-production , javascript , local-llm , llamanet , openai-api-compatibility

0

422

6월 12, 2024

[GN] 1년 동안 LLM과 함께 구축하며 배운 점

읽을거리&정보공유

geeknews , llm , prompt , llm-in-production , rag , ai-engineering

0

2001

6월 11, 2024

대규모 언어 모델을 위한 검색-증강 생성(RAG) 기술 현황 - 1/2편

읽을거리&정보공유

llm , paper , embedding , rag , prompt-compression , naive-rag , indexing , retrieve , generation , advanced-rag , modular-rag , chunking , rag-pipeline-optimization , llm-in-production , survey-paper

9

16155

6월 3, 2024

구조화된 출력에서 환각 현상을 줄이기 위한 RAG (feat. ServiceNow)

읽을거리&정보공유

paper , llm-in-production , rag , hallucination , servicenow , structured-data-generation , reducing-hallucination

0

700

5월 25, 2024

2024년 LLM 모델 개발 트렌드 관련 영상 [영어/유튜브]

읽을거리&정보공유

youtube , llm , llm-finetuning , llm-in-production , llm-library , llm-training

3

2318

5월 22, 2024

LLM 상용화 시, 비용을 낮추면서 성능 향상을 위한 3가지 전략 (feat. FrugalGPT by Portkey)

읽을거리&정보공유

prompt-engineering , frugalgpt , llm-finetuning , llm-in-production , prompt-compression , llm-efficiency-challenge , portkey , llm-cascade , llm-approximation , query-cache , prompt-adaptation

0

792

5월 4, 2024

OpenLIT: OpenTelemetry 기반 생성형 AI 및 LLM 모니터링 도구

읽을거리&정보공유

tool , genai , llm , llm-in-production , observability , opentelemetry , openlit

0

575

5월 2, 2024

AI에 관심이 있는 개발자라면 Embedding(임베딩)부터 시작해보세요! 😉

읽을거리&정보공유

vector-db , embedding , llm-in-production , pgvector , rag