[TLDR] Today's AI News, 2023-06-05: Asana Intelligence 🤖, the Story of Stable Diffusion's Founder 🤔, a New Approach to Facial Modeling 😃

9bow · June 6, 2023, 8:25 AM

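With the permission of the TLDR newsletter, the PyTorch Korea User Group translates its AI news with DeepL and shares it here.

If you would like to share more AI news and information and grow together, please visit the PyTorch Korea User Group!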

Headlines & Launches

Stable Diffusion’s Founder Has a History of Exaggeration (15 minute read)

A deep dive into misleading claims made in the past by Emad Mostaque, the founder of Stability AI, the company behind Stable Diffusion.

Asana Intelligence (Product Launch)

Built on the Asana Work Graph, which Asana describes as the most complete and connected map of an organization’s work data, Asana’s new set of AI products is intended to help organizations make better decisions and accelerate productivity. Features include goal-based resource management, health checks, self-optimizing workflows, a writing assistant, instant summaries, and more.

Research & Innovation

CodeTF: Python Library for Code Intelligence (GitHub Repo)

CodeTF is an all-in-one Python library for code intelligence that makes it easier to train models for tasks such as code generation, translation, and summarization. It simplifies the process by offering user-friendly tools for code manipulation, fast inference, model fine-tuning, performance evaluation, and working with preprocessed datasets across many programming languages.

AWQ: Making Big AI Models Smaller and Faster (GitHub Repo)

This paper presents Activation-aware Weight Quantization (AWQ), a new method that compresses large language models (LLMs) more efficiently to address high hardware requirements and slow token generation. AWQ selectively protects the most important weights in the model and generalizes better across domains. According to the paper, it outperforms existing methods and leads to faster, more efficient model deployment.
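To make "selectively protects the most important weights" more concrete, here is a minimal pure-Python sketch of the general idea: input channels that see large activations are scaled up before round-to-nearest quantization, and the exponent `alpha` is chosen by a small grid search on calibration inputs. This is an illustration under assumptions, not the AWQ repository's code: the function names (`quantize_row`, `scaled_quantize`, `output_error`, `search_alpha`), the per-row symmetric 4-bit scheme, and the 11-point grid are all hypothetical choices made for this example.

```python
def quantize_row(row, n_bits=4):
    """Symmetric round-to-nearest quantization; returns dequantized floats."""
    qmax = 2 ** (n_bits - 1) - 1
    max_abs = max(abs(w) for w in row)
    if max_abs == 0.0:
        return list(row)
    step = max_abs / qmax
    return [round(w / step) * step for w in row]


def scaled_quantize(weights, act_scale, alpha, n_bits=4):
    """Scale input channel j by act_scale[j] ** alpha, quantize, then undo the scale."""
    # Hypothetical guard: avoid a zero scale for channels that never activate.
    s = [max(a, 1e-8) ** alpha for a in act_scale]
    out = []
    for row in weights:
        q = quantize_row([w * sj for w, sj in zip(row, s)], n_bits)
        out.append([qw / sj for qw, sj in zip(q, s)])
    return out


def output_error(weights, approx, inputs):
    """Sum of squared differences between exact and approximate layer outputs."""
    err = 0.0
    for x in inputs:
        for row, arow in zip(weights, approx):
            exact = sum(w * xi for w, xi in zip(row, x))
            got = sum(w * xi for w, xi in zip(arow, x))
            err += (exact - got) ** 2
    return err


def search_alpha(weights, inputs, n_bits=4, grid=11):
    """Pick alpha in [0, 1] that minimizes output error on calibration inputs."""
    n_in = len(weights[0])
    # Mean absolute activation per input channel, from the calibration inputs.
    act_scale = [sum(abs(x[j]) for x in inputs) / len(inputs) for j in range(n_in)]
    best = None
    for i in range(grid):
        alpha = i / (grid - 1)  # alpha = 0 reduces to plain round-to-nearest
        approx = scaled_quantize(weights, act_scale, alpha, n_bits)
        err = output_error(weights, approx, inputs)
        if best is None or err < best[1]:
            best = (alpha, err, approx)
    return best
```

Because `alpha = 0` is part of the grid, the selected result is never worse on the calibration inputs than plain round-to-nearest quantization. In this sketch the scale is divided back out of the weights for simplicity; a real deployment would more likely keep the scaled integer weights and apply the inverse scale on the activation side.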

AITemplate (GitHub Repo)

AITemplate is a Python framework that transforms deep neural networks into CUDA (NVIDIA GPU)/HIP (AMD GPU) C++ code for lightning-fast inference serving.

Engineering & Resources

Brainformer: Trading Simplicity for Efficiency (22 minute read)

Transformers are remarkably powerful and underpin much of AI's recent success. They follow a fairly simple recipe of alternating fully connected and attention blocks. Using a genetic search algorithm and a very large number of TPUs, researchers at Google found a model that converges 5x faster and runs inference 2x faster. It uses Mixture of Experts (MoE) blocks and some other clever tricks.
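For readers unfamiliar with MoE blocks, the following is a minimal pure-Python sketch of generic top-k expert routing: a gate scores every expert for the input, only the best `top_k` experts run, and their outputs are mixed with softmax weights. It is not Brainformer's actual routing scheme, which this summary does not describe; the names `softmax` and `moe_forward`, the linear gate, and the default `top_k=2` are assumptions made for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(x, gate_weights, experts, top_k=2):
    """Route x to the top_k highest-scoring experts and mix their outputs.

    gate_weights has one row per expert; each expert is a callable that maps
    a list of floats to a list of the same length (a hypothetical interface).
    """
    # One gate logit per expert: dot product of its gate row with the input.
    logits = [sum(g * xi for g, xi in zip(row, x)) for row in gate_weights]
    # Indices of the top_k experts, highest logit first.
    top = sorted(range(len(experts)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Renormalize over the selected experts only.
    probs = softmax([logits[i] for i in top])
    out = [0.0] * len(x)
    for p, i in zip(probs, top):
        y = experts[i](x)
        out = [o + p * yi for o, yi in zip(out, y)]
    return out, top
```

The appeal of this design is that only `top_k` experts are evaluated per input, so the parameter count can grow with the number of experts while the compute per input stays roughly constant.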

The Rise of ArtGPT-4 and its Artistic Vision (12 minute read)

Recent advancements in AI have led to ArtGPT-4, a new language and image understanding model trained specifically to understand and create artistic images. Despite being smaller and requiring less data, it outperforms other models at artistic image understanding and creation, and in some tests it comes close to matching human artists.

BlendFields: A Novel Approach for Facial Modeling (14 minute read)

Creating realistic visualizations of human faces is a complex task that demands capturing both large and minute details. A new method, inspired by traditional computer graphics techniques, has been developed to accurately model unseen expressions by using a few extreme poses and then recreating their appearance for new expressions, resulting in finer, more realistic facial details.

Miscellaneous

Apple Now Seeking Generative AI Engineers (2 minute read)

Apple is calling on experts with backgrounds in generative AI to work with the company's "most advanced technologies," including augmented and virtual reality, according to new job listings posted by the company.

Google’s Generative AI course (2 minute read)

While the field is moving quickly, the foundations of generative machine learning are fairly well established. This 9-part course covers a lot of ground for people interested in the field. The ultimate goal is to funnel new users into Google's ML cloud product, Vertex. Even so, there is a lot of good information for people looking to build a deeper understanding of this technology.

Is AI-Written Code Good Or Bad For Companies? (3 minute read)

Generative AI coding tools promise huge efficiency gains for developers, but some tech leaders fear the consequences of producing too much code too fast. Lowering the barrier to code creation could also increase complexity, technical debt, and confusion as companies try to manage a ballooning pile of software.

Quick Links

GPT-4 Dominated Minecraft (5 minute read)

Voyager, an experimental AI bot, was let loose in Minecraft and successfully played the game, showcasing AI’s potential to automate workplace tasks in the future.

StyleDrop (GitHub Repo)

StyleDrop provides text-to-image generation in any style using just a single reference image. It is the next step in the DreamBooth line of research.

Poe - Midjourney Bot (Product)

A simple AI app that helps improve your prompts for Midjourney.

Zeda (Product)

Zeda is an AI tool designed to improve product discovery for customer-focused teams.