[TLDR] 오늘의 AI 뉴스, 2023-08-09: 빅테크의 생성형 AI 🧑‍💻, Stability AI, StableCode 출시 💻, 에이전트로서의 LLM 🤖

9bow · 8월 10, 2023, 5:57오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

Stability AI, StableCode 출시 / Stability AI Launches StableCode (4 minute read)

Stability AI는 사용자가 프로그래밍 언어 코드를 생성할 수 있도록 설계된 새로운 개방형 대규모 언어 모델인 StableCode의 첫 번째 공개 릴리스를 발표했습니다.

Stability AI announced the first public release of StableCode, its new open large language model designed to help users generate programming language code.

[GN] StabilityAI, 코드를 위한 LLM 생성형 AI "StableCode" 릴리즈 stablecode stability-ai

빅테크의 생성형 AI : 요약 / Roundup of Generative AI in Big Tech (10 minute read)

2분기 실적 발표에서 논의된 내용을 바탕으로 빅 테크 기업들이 생성형 AI 분야에서 어떤 일을 하고 있는지에 대한 멋진 요약입니다. AI에 대한 빅 테크의 반응은 매우 빨랐으며, 이 기회를 활용하는 것이 단기적으로 얼마나 중요한지를 보여줍니다.

An awesome roundup covering what the big tech companies are up to in Generative AI, based on some of the discussions in their earnings calls for Q2. The reaction from big tech to AI has been swift, demonstrating the near term importance of capitalizing on this opportunity.

HuggingFace의 모델, AWS 또는 NVIDIA 클라우드에서 학습 지원 / Train models on HuggingFace backed by AWS or Nvidia cloud (2 minute read)

허깅페이스는 AWS 및 엔비디아와 협력하여 원클릭 트레이닝을 플랫폼에 도입하고 있습니다. 데이터를 업로드하기만 하면 허브에서 바로 최첨단 모델을 미세 조정할 수 있습니다.

HuggingFace is working with AWS and Nvidia to bring one click training to the platform. You can fine-tune state-of-the-art models directly from the hub just by uploading your data.

연구 & 혁신 관련 소식 / Research & Innovation

Sweep (GitHub Repo)

Sweep은 이슈를 풀 리퀘스트로 전환하는 오픈소스 AI 주니어 개발자입니다. "윈도우에서 운영체제에 구애받지 않는 임시 디렉터리 사용"과 같은 GitHub 이슈를 만들면 Sweep이 풀 리퀘스트를 작성하여 "/tmp"를 모두 "tempfile.gettempdir()"로 바꾸어 줍니다.

Sweep is an open-source AI junior developer that turns issues into pull requests. You make a GitHub Issue like "use os agnostic temp directory for windows" and Sweep writes a pull request to replace all occurrences of "/tmp" with "tempfile.gettempdir()".

미니 SDXL 구현 / Mini SDXL implementation (GitHub Repo)

새로운 SDXL 모델의 구현을 자세히 들여다보고 싶으신가요? 이 디퓨저 호환 저장소는 몇 백 줄의 코드에 불과합니다. 학습용으로 적합합니다.

Ever wanted to peer into the depths of implementation of the new SDXL model? This diffusers compatible repo is just a few hundred lines of code. Perfect for learning.

sdxl mini-sdxl

휴먼스크립트 / Humanscript (GitHub Repo)

휴먼스크립트는 대규모 언어 모델을 사용하여 자연어로 작성된 명령어의 의미를 유추하는 스크립트 인터프리터입니다.

Humanscript is a script interpreter that infers the meaning behind commands written in natural language using large language models.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

DAT: 이미지 품질을 향상시키는 새로운 방법 / A New Way to Enhance Image Quality (GitHub Repo)

연구원들은 공간과 채널 주의력을 모두 사용하여 이미지 초고해상도를 향상시키는 듀얼 어그리게이션 트랜스포머(DAT; Dual Aggregation Transformer)라는 새로운 방법을 개발했습니다. DAT는 적응형 상호 작용 모듈 및 공간 게이트 피드 포워드 네트워크와 같은 특수 도구를 사용하여 현재의 다른 방법보다 더 나은 성능을 발휘합니다.

Researchers have created a new method called the Dual Aggregation Transformer (DAT) that uses both space and channel attention to make image super-resolution better. DAT performs better than other current methods by using special tools like the adaptive interaction module and spatial-gate feed-forward network.

학습의 예시가 최종 모델에 미치는 영향 / How do examples in training influence a final model (4 hour read)

이 논문은 "영향력 함수(influence function)"라는 사실과 반대되는 기법을 사용하여 학습 예제가 최종 모델 성능에 어떤 영향을 미치는지에 대한 질문을 탐구하는 Anthropic의 거물급 논문입니다. 이 접근법의 많은 문제점과 몇 가지 놀라운 발견을 설명하는 꽤 기술적인 논문입니다. 주로 핵심 문구의 순서가 뒤바뀌면 예제 영향력이 0으로 감소한다는 사실을 발견했습니다. 언어 모델은 민감합니다!

This behemoth from Anthropic explores the question of how training examples affect the final model performance using a counterfactual technique called "influence functions". It is a pretty technical paper that outlines many challenges with this approach and some surprising findings. Primarily, they find that instance influence decays to zero when the order of key phrases is flipped. Language models are sensitive!

AgentBench: 에이전트로서 대규모 언어 모델의 성능 평가 / Evaluating the Capabilities of Large Language Models as Agents (GitHub Repo)

이 프로젝트에서는 다양한 대화형 환경에서 대규모 언어 모델(LLM)을 테스트할 수 있는 벤치마크 도구인 AgentBench를 소개합니다. 25개의 LLM에 대한 초기 테스트 결과 상용 모델이 오픈소스 모델보다 성능이 뛰어난 것으로 나타났습니다.

This project introduces AgentBench, a benchmark tool to test Large Language Models (LLMs) in various interactive settings. Initial tests across 25 LLMs showed that commercial models outperformed open-sourced ones.

그 외 소식 / Miscellaneous

야옹-학습 / MeowLearning (Online Course)

스타트업의 경영진부터 ML 엔지니어까지 모든 사람에게 프로덕션 환경에서 AI로 작업하는 방법을 교육하기 위해 만들어진 문제 선택형 스타일 가이드입니다. 대부분의 ML/AI 과정에서는 배울 수 없는 내용입니다.

A pick-your-problem style guide that was created to educate everyone, from the leadership team to the ML engineers in a startup, on how to work with AI in production settings. This is the stuff you won't learn in most ML/AI courses.

Apple 기기를 위한 Swift-Transformers / Swift Transformers for Apple devices (9 minute read)

Swift를 사용하여 HuggingFace Hub에서 언어 모델을 배포하는 방법에 대한 훌륭한 튜토리얼입니다.

A really nice tutorial on how to use Swift to deploy language models from the HuggingFace Hub.

Glaive, 소형 합성 모델을 위한 3백만 달러 시드 라운드 모집 / Glaive raises $3.5m seed round for small synthetic models (4 minute read)

Glaive는 합성 데이터 생성 시스템의 도움을 받아 사용 사례에 맞는 초집중 소규모 언어 모델을 학습할 수 있는 플랫폼입니다. 이러한 소규모 모델은 2~20배 더 빠르고, 최소 2배 더 저렴하며, 학습된 작업에서 범용 LLM API를 능가하는 성능을 발휘합니다.

Glaive is a platform that lets people train hyper-focused small language models for their use case with the help of a synthetic data generation system. These small models are 2-20x faster, at least 2x cheaper, and outperform general purpose LLM APIs on the task they were trained for.

더 읽어보기 / Quick Links

Agentflow (GitHub Repo)

에이전트플로는 대규모 언어 모델 기반의 워크플로를 생성하고 실행하기 위한 강력하면서도 사용자 친화적인 도구로 설계되었습니다.

Agentflow is designed to be a powerful yet user-friendly tool for creating and executing workflows powered by large language models.

Google, 프로젝트 IDX 출시 / Google Launches Project IDX (2 minute read)

Google은 풀스택 웹 및 멀티플랫폼 앱을 구축하기 위한 AI 지원 브라우저 기반 개발 환경인 Project IDX를 출시한다고 발표했습니다.

Google announced the launch of Project IDX, an AI-enabled browser-based development environment for building full-stack web and multiplatform apps.

(광고) 설문조사 결과: 기업이 실제로 생성형 AI를 얼마나 빠르게 도입하고 있나요? / Survey results: how fast are enterprises actually adopting generative AI? (Sponsor)

AI 인프라 연합은 매출이 10억 달러 이상인 1000개 이상의 기업을 대상으로 생성형 AI를 얼마나 빠르게 도입하고 있는지, 어떤 과제가 있는지 설문조사를 실시했습니다. 결과 읽기

The AI Infrastructure Alliance surveyed over 1000 companies with >1B USD in revenue to see how fast they’re adopting generative AI and what their challenges are. Read the results

Google, 음반사와 딥페이크 음악 거래 추진 / Google, Record Labels Working On Deepfake Music Deal (2 minute read)

구글과 유니버설 뮤직은 인공지능으로 생성된 노래에 아티스트의 멜로디와 목소리를 라이선스하기 위해 협의 중입니다.

Google and Universal Music are in talks to license artists’ melodies and voices for songs generated by artificial intelligence.