[TLDR] 오늘의 AI 뉴스, 2023-08-04: Coreweave, 23억달러 대출 💰, AI 및 헬스케어 분야의 a16z 🏥, 로봇 내비게이션 개선 🤖

9bow · 8월 5, 2023, 12:15오후

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

Coreweave, 엔비디아 칩을 담보로 23억 달러 규모의 부채 조달 / Coreweave raises $2.3B in debt collateralized by Nvidia chips (3 minute read)

대부분의 AI 기업이 모방할 수 없는 영리한 자금 조달 전략은 기존 자산을 부채 보증으로 사용하는 것입니다. 코어위브는 하드웨어를 소유하고 있기 때문에 채권 시장에 접근하고 지분 매각을 피할 수 있는 현명한 방법을 가지고 있습니다.

A clever strategy for financing that most AI companies cannot replicate is to use existing assets as a guarantee on debt. Since Coreweave owns hardware, they have a clever way to access debt markets and avoid equity sales.

IBM과 NASA, 허깅페이스에서 지리공간 기반 모델 출시 / IBM and NASA launch geospatial foundation model on HuggingFace (6 minute read)

이 모델은 주로 위성 이미지에서 작동하면서 홍수 예측 작업에서 성능을 15% 향상시켰습니다. 이 모델은 작물 분류와 같이 건축업자에게 유용할 수 있는 몇 가지 작업을 수행할 수 있습니다.

This model improved performance by 15% on a flood prediction task while mostly operating on satellite images. It is able to perform a few tasks that could be useful to folks building, such as crop classification.

[GN] IBM과 NASA가 최대의 Geospatial AI 파운데이션 모델을 오픈소스로 공개

AI가 가장 큰 영향을 미칠 분야는 어디일까요? 헬스케어입니다 / Where Will AI Have The Biggest Impact? Healthcare (5 minute read)

의료 산업은 미국 경제의 20%를 차지하지만 기술 도입이 더딘 분야입니다. AI를 통해 의료 기술 기업은 더 이상 소프트웨어에 대한 직원 교육을 위해 힘겨운 싸움을 벌일 필요가 없습니다. 대신 사람처럼 행동하고 의료 전문가의 업무에서 점점 더 많은 부분을 덜어주는 AI를 판매하여 의료 전문가가 더 흥미로운 문제를 해결하고 면허의 최고 수준에서 진료할 수 있도록 지원할 수 있습니다.

Healthcare is 20% of the US economy, but the industry has been slow to adopt technology. With AI, healthtech companies no longer need to fight the uphill battle of training people on software. Instead, they can sell AI that acts like a person and takes more and more of the work off healthcare professionals’ plates, enabling them to work on more interesting problems and practice at the top of their licenses.

연구 & 혁신 관련 소식 / Research & Innovation

LLM 기반 시스템 및 제품 구축을 위한 패턴 / Patterns For Building LLM-Based Systems And Products (65 minute read)

이 게시물에서는 대규모 언어 모델을 시스템과 제품에 통합하기 위한 실용적인 패턴에 대해 설명합니다. 이 게시물은 학술 연구, 업계 리소스 및 실무자의 노하우를 바탕으로 작성되었습니다. 이 게시물에서는 성능 향상과 비용/리스크 감소, 데이터에 더 가까이 다가가는 것과 사용자에 더 가까이 다가가는 것의 스펙트럼을 포괄하는 7가지 주요 패턴을 살펴봅니다.

This post discusses practical patterns for integrating large language models into systems and products. It draws from academic research, industry resources, and practitioner know-how. The post looks at seven key patterns that cover a spectrum of improving performance vs reducing cost/risk and closer to the data vs closer to the user.

PointAnywhere: 로봇 내비게이션 개선 / Improving Robot Navigation (GitHub Repo)

이 연구는 특정 위치 및 방향 제약을 극복하기 위해 전방위 카메라로 캡처한 포인팅 제스처를 사용하여 로봇 내비게이션에 대한 혁신적인 접근 방식을 제안합니다.

This research proposes an innovative approach for robot navigation using pointing gestures, captured by an omnidirectional camera to overcome certain position and direction constraints.

Zep (GitHub Repo)

Zep을 사용하면 관련 문서, 채팅 기록 메모리 및 풍부한 사용자 데이터를 LLM 앱의 프롬프트에 쉽게 추가할 수 있습니다.

Zep allows you to easily add relevant documents, chat history memory, and rich user data to your LLM app’s prompts.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

SoftMoE (28 minute read)

SoftMoE는 희소 전문가 혼합(MoE; Mixture-of-Expert) 모델을 개선한 새로운 모델 아키텍처입니다. 각 전문가가 입력 토큰의 하위 집합을 처리하는 소프트 할당 방법을 사용하여 더 낮은 비용으로 더 큰 모델 용량을 제공합니다. SoftMoE는 시각 인식 작업에서 표준 트랜스포머 및 기타 MoE 모델보다 성능이 뛰어납니다. ViT-Huge/14 모델과 비교했을 때 성능은 동등하지만 추론 비용은 훨씬 낮습니다. 확장 시 SoftMoE는 모델 파라미터를 크게 늘리면서도 추론 시간 비용을 최소화하여 더 나은 성능을 제공합니다. 게다가 완전히 차별화됩니다!

SoftMoE is a new model architecture that improves upon sparse Mixture of Expert (MoE) models. It offers larger model capacity with lower costs by using a soft assignment method where each expert processes a subset of input tokens. SoftMoE outperforms standard Transformers and other MoE models in visual recognition tasks. It performs equally well compared to the ViT-Huge/14 model, but with a much lower inference cost. When scaled up, SoftMoE significantly increases model parameters while keeping inference time cost minimal, resulting in better performance. Plus, it's fully differentiable!

SelfCheck: 오류 인식을 위한 대규모 언어 모델 개선 / Improving Large Language Models for Error Recognition (15 minute read)

이 연구는 대규모 언어 모델이 특히 다단계 추론 문제에서 외부 리소스 없이도 자체적으로 오류를 인식할 수 있는 방법을 탐구합니다. 연구진은 제로 샷 검증 방법을 개발하여 오류를 식별하고 질문-답변 과제에서 성능을 향상시켰습니다.

This research explores how large language models can recognize their own errors, particularly in multi-step reasoning problems, without needing external resources. The researchers developed a zero-shot verification method to identify errors and enhance performance in question-answering tasks.

학습이 필요 없는 인스턴스 세분화 / Training-Free Instance Segmentation (14 minute read)

이 연구에서는 집중적인 학습이나 상세한 이미지 주석 없이도 이미지 내에서 인스턴스를 세분화할 수 있는 새로운 접근 방식인 '학습 없는 인스턴스 세분화'를 소개합니다. 이 방법은 기존의 시맨틱 분할 모델을 사용하고 학습 가능한 객체 경계 분기를 통해 출력을 개선함으로써 추가 리소스나 계산 비용 증가 없이 경쟁력 있는 결과를 제공합니다.

This study introduces a new approach called Training-Free Instance Segmentation that can segment instances within images without the need for intensive training or detailed image annotations. The method provides competitive results without additional resources or increased computation costs by using pre-existing semantic segmentation models and refining the output through a learnable object boundary branch.

그 외 소식 / Miscellaneous

2023년에 무슨 일이 벌어질까? / WTF Happened In 2023? (12 minute read)

최근 AI, 초전도체, 원자력, 인프라 분야의 발전으로 2023년은 기술 발전이 다시 도약하기 시작한 해로 기억될 것입니다.

With recent advancements in AI, superconductors, nuclear energy, and infrastructure, 2023 will likely be looked back on as the year that technological progress really began taking off again.

의회가 고려 중인 AI 규칙 설명 / The AI Rules That Congress Is Considering, Explained (10 minute read)

A full explanation of the various AI rules and regulations that Congress is considering, including those pertaining to copyrights, privacy, and algorithmic bias.

더 읽어보기 / Quick Links

OpenAI, GPT-5 상표 출원 / OpenAI Applies For GPT-5 Trademark (2 minute read)

OpenAI는 언어 모델을 사용하기 위한 새로운 다운로드 가능한 컴퓨터 소프트웨어라고 설명하며 GPT-5에 대한 상표를 출원했습니다.

OpenAI has applied for a trademark for GPT-5, describing it as a new downloadable computer software for using language models.

Color-Diffusion (GitHub Repo)

흑백 이미지를 컬러화하기 위한 디퓨젼 모델입니다.

A diffusion model to colorize black and white images.

Vizly (Product)

Vizly는 AI를 사용하여 몇 초 만에 데이터를 분석합니다.

Vizly uses AI to analyze your data in seconds.