[TLDR] 오늘의 AI 뉴스, 2023-06-07: 다국어 AI🇺🇳, 마크 안드르센이 말하는 AI가 세상을 구하는 이유🌎, 시각적 학습 개선👀

9bow · 6월 8, 2023, 4:31오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

영어 외 다른 언어 기반 AI를 만들기 지원 / Help make AI that's not just English based (4 minute read)

언어 모델링 스타트업인 Cohere의 연구팀은 영어뿐만 아니라 더 많은 언어에 최첨단 생성 모델링 성능을 제공하기 위한 인스트럭션 튜닝 노력을 주도하고 있습니다. 이들은 이러한 조직 간 노력에 커뮤니티와 함께 참여하고 있습니다.

Language modeling startup Cohere's research arm is leading an instruction tuning endeavor to bring state of the art generative modeling performance to more languages than just English. They're engaging with the community in this cross organization effort.

RedPajama, 7B 모델 훈련 완료 / RedPajama 7B model finished training (6 minute read)

부분적으로 학습된 모델과 함께 RedPajama 데이터 세트가 몇 주 전에 출시되었습니다. 7B 모델은 이제 막 3,000개의 V100을 사용하여 학습을 마쳤습니다. 인스트럭션 튜닝된 버전은 동급 규모의 오픈 모델 중 가장 성능이 뛰어난 모델입니다. 팀은 지속적인 모델 개발을 위한 많은 계획을 가지고 있습니다.

The RedPajama dataset along with partially trained models was released a few weeks ago. The 7B model just finished its training on 3000 V100s. The instruction tuned version is the most performant open model of its size. The team has lots of plans for continued model development.

AI가 세상을 구할 이유 / Why AI Will Save the World (12 minute read)

마크 안데르센은 포스팅을 자주 하지는 않지만, 포스팅을 할 때면 항상 주옥같은 내용이 가득합니다. 마크 안드르센은 AI가 인간 삶의 모든 측면을 개선하는 데 사용될 수 있기 때문에 AI가 역사상 가장 위대한 발명품이라는 주장을 펼칩니다.

He doesn’t post often, but when he does, Andreessen is always full of gems. Marc Andreessen makes the case that AI is the greatest invention of all time, as the technology can be used to make every single aspect of human life better.

연구 & 혁신 관련 소식 / Research & Innovation

Aviary (GitHub Repo)

Aviary는 한 곳에서 다양한 대규모 언어 모델(LLM)과 상호작용할 수 있는 앱입니다. 다양한 모델의 결과물을 직접 비교하고, 품질별로 순위를 매기고, 비용 및 지연 시간 추정치를 얻는 등의 작업을 수행할 수 있습니다.

Aviary is an app that lets you interact with a variety of large language models (LLMs) in a single place. You can compare the outputs of different models directly, rank them by quality, get a cost and latency estimate, and more.

HQ-SAM: 더 나은 객체 세분화를 위한 '세그먼트 애니씽 모델' 개선 / HQ-SAM: Improving 'Segment Anything Model' for Better Object Segmentation (GitHub Repo)

연구원들은 기존의 장점을 유지하면서 복잡한 구조를 가진 물체를 포함한 모든 물체의 윤곽을 그릴 수 있는 능력을 향상시키는 최신 '세그먼트 애니씽 모델'(SAM)의 업그레이드 버전인 HQ-SAM을 개발했습니다.

Researchers developed HQ-SAM, an upgrade to the recent 'Segment Anything Model' (SAM), which enhances its ability to outline any object, even those with complex structures, while keeping its original benefits.

InstructZero: 소프트 프롬프트를 사용하여 LLM에 대한 명령어 제공 개선 / InstructZero: Improving Instruction Giving for LLMs Using Soft Prompts (GitHub Repo)

이 연구에서는 대규모 언어 모델('블랙박스' 모델이라고도 함)을 직접 조정할 수 없는 경우에도 명령어를 더 잘 따르도록 만드는 새로운 방법인 InstructZero를 소개합니다. 이 방법은 더 나은 명령어를 생성하도록 최적화된 '소프트 프롬프트'를 사용하며, 테스트 결과 Vicuna 및 ChatGPT를 비롯한 다양한 LLM을 사용하는 여러 작업에서 현재 최고의 방법보다 더 잘 작동하는 것으로 나타났습니다.

The research introduces InstructZero, a new method for making large language models (LLMs) better at following instructions, even when you can't directly tweak them (known as "black-box" models). This method uses "soft prompts" which are optimized to create better instructions, and our tests show that it works better than current top methods in different tasks with various LLMs, including Vicuna and ChatGPT.

(광고) 제로 플레이크의 엔드투엔드 테스트 커버리지 / End-to-end test coverage with zero flakes (Sponsor)

AI는 높은 자동화된 테스트 커버리지를 빠르게 달성할 수 없지만, QA Wolf는 가능합니다. 모든 것을 오픈 소스 Playwright로 작성하고(코드를 소유하고 있습니다!) 모든 배포에서 전체 테스트 스위트를 병렬로 실행할 수 있는 인프라를 제공합니다. 또한 유지 관리(전체 리팩터까지!)를 처리하므로 팀은 기능 출시에만 집중할 수 있습니다. 마법 같지만 QA Wolf입니다.

AI can't get you to high automated test coverage fast - but QA Wolf can. They write everything in open-source Playwright (you own the code!) and provide the infra to run your entire test suite in parallel on every deploy. Plus, they take care of maintenance (even full refactors!) so your team can stay focused on shipping features. It’s like magic but it’s QA Wolf.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

MultiLegalPile: 거의 1,000억 개에 달하는 다국어 법률 토큰 / Almost 100B multilingual legal tokens (19 minute read)

24개 언어로 된 방대한 법률 문서 말뭉치(코퍼스)가 새로 도착했습니다. 또한 학습된 모델과 학습 코드도 제공됩니다. 연구진은 이 689GB의 말뭉치로 훈련된 모델이 법률 언어 모델에 대한 새로운 최첨단 상태를 달성한다는 사실을 발견했습니다.

A huge new corpus of legal documents in 24 languages just arrived. Also, trained models and training code are available. They found that models trained on this 689GB corpus achieve a new state of the art for legal language models.

MotionDiffuser: 여러 에이전트의 미래 움직임을 예측하는 기술 / MotionDiffuser: A Technique for Predicting Future Movements of Multiple Agents (22 minute read)

MotionDiffuser는 디퓨전 프로세스를 사용하여 차량이나 로봇과 같은 여러 개체의 다양한 미래 움직임을 예측하는 새로운 기술입니다. 이러한 움직임을 효과적으로 예측하는 방법을 학습하고, 간단한 설계로 다양한 시나리오를 처리합니다. 샘플링 기능을 통해 다양한 규칙이나 시뮬레이션에 따라 이러한 예측을 안내할 수 있으며, Waymo 오픈 모션 데이터세트에서 놀라운 결과를 보여줍니다.

MotionDiffuser is a new technology that uses diffusion processes to predict different possible future movements of multiple entities, such as vehicles or robots. It learns to forecast these movements effectively, handles various scenarios with a straightforward design. Its sampling feature allows us to guide these forecasts under different rules or simulations, showing remarkable results on the Waymo Open Motion Dataset.

StableRep: 인공 이미지를 사용하여 시각 학습 개선하기 / StableRep: Using Artificial Images to Improve Visual Learning (23 minute read)

이 연구는 StableDiffusion 기반의 텍스트-이미지 모델에서 인공적으로 생성된 이미지를 사용하여 시각적 개념을 학습할 때의 효과를 조사합니다. 저자는 동일한 텍스트 프롬프트에서 생성된 이미지를 연관성이 있는 것으로 간주하여 합성 이미지만 사용해도 SimCLR 및 CLIP과 같은 기존 방법보다 향상된 성능을 이끌어내는 StableRep을 소개합니다.

This research investigates the effectiveness of learning visual concepts using artificially generated images from a text-to-image model called Stable Diffusion. The authors introduce StableRep, a method that treats images produced from the same text prompt as related, which leads to improved performance over existing methods like SimCLR and CLIP, even with synthetic images alone.

그 외 소식 / Miscellaneous

GGML, 회사 설립 / GGML forms a company (2 minute read)

대규모 언어 모델은 로컬에서 실행하는 데 비용이 많이 드는 경우가 많습니다. GGML은 MacBook 컴퓨터에서 언어 모델 대신 쉽게 실행할 수 있는 순수 C로 작성된 프레임워크입니다. 현재는 취미로 사용하는 커뮤니티에서 사용되지만, 기업에서 모델을 배포하는 데에도 많이 활용되고 있습니다.

Large language models are often expensive to run locally. GGML is a framework written in pure C that makes it easy to run instead of their language models on MacBook computers. It is used by the hobbyist community currently, but has many applications for Enterprise deployment of models.

OpenAI는 비공개로 유지됩니다 / OpenAI Is Staying Private (1 minute read)

샘 알트먼은 OpenAI가 더욱 강력해짐에 따라 기술에 대한 완전한 통제권을 유지하고 싶기 때문에 공개하는 데 관심이 없다고 말했습니다.

Sam Altman said he’s not interested in taking OpenAI public because he wants to maintain full control over the technology as it becomes more powerful.

더 읽어보기 / Quick Links

Instagram, AI 챗봇 개발 중 / Instagram Working On An AI Chatbot (1 minute read)

인스타그램은 더 재미있고 매력적인 경험을 제공하기 위해 인공지능 챗봇을 개발 중입니다.

Instagram is working on an AI chatbot to provide a more fun and engaging experience.

Apple, WWDC에서 AI를 피하다 / Apple Avoids AI At WWDC (5 minute read)

Apple은 WWDC 기조연설에서 'AI'라는 용어를 사용하지 않았지만, 새로운 제품과 기능에는 여전히 머신러닝이 적용되었습니다. 예를 들어, 새로운 iOS 17의 자동 수정 기능은 최근 차세대 AI 혁신의 원동력이 되고 있는 AI의 한 유형인 트랜스포머 언어 모델을 사용합니다.

Apple avoided using the term "AI" at its WWDC keynote, but the company's new products and features were still powered by machine learning. For example, the new iOS 17 autocorrect feature uses a transformer language model, which is a type of AI that has been powering recent generative AI innovations.

Sivi AI (Product)

텍스트를 시각적 디자인으로 전환하는 생성형 AI.

Generative AI to turn text to visual designs for your company.

ChatGPT 플러그인 크리에이터 / ChatGPT Plugin Creator (Product)

2분만에 모든 API를 ChatGPT 플러그인으로 전환해보세요.

Turn any API into a secured ChatGPT plugin in 2 minutes.