[TLDR] Today's AI News, 2023-09-07: Siri AI improvements 🤖, Falcon 180B model released 🦅, Simulating an order book in JAX 💹

9bow · September 8, 2023, 3:00 AM

With permission from the TLDR newsletter, the PyTorch Korea User Group shares AI news translated with DeepL.
Want to share more AI news and information and grow together? Visit the PyTorch Korean community today!

Headlines & Launches

Apple Is Pouring Money Into Siri Improvements With Generative AI (2 minute read)

Apple has increased its budget for developing artificial intelligence, with an emphasis on building conversational chatbot features for Siri. The company is reportedly spending millions of dollars a day on research and development.

Falcon 180B model released (6 minute read)

The Falcon models from the UAE have long been the best open models available. The newest 180B-parameter model slightly outperforms Llama 2 70B and has a 2k context window. Falcon models have historically been quite tunable. However, given the resource requirements of a model this size, it is not clear whether the community will adopt it.

(Read more: [GN] TII releases the Falcon-180B model)

OpenAI's first developer conference (2 minute read)

OpenAI will hold its inaugural DevDay conference in San Francisco on November 6, 2023. The event will preview new tools and foster the exchange of ideas, and is expected to draw hundreds of attendees from around the world. Currently, over 2 million developers use tools like GPT-4 and DALL·E through OpenAI's continuously updated API.

Research & Innovation

Comgra (GitHub Repo)

A library for use with PyTorch that makes it easier to inspect the internals of your neural networks.

TokenFlow (GitHub Repo)

Editing videos with a pre-trained text-to-image model tends to produce strange, dream-like results. TokenFlow's output is much smoother and preserves many of the semantic and structural features of the original video. It almost seems to outperform Runway Gen-2.

ReliTalk: Making Video Avatars Adapt to Different Lights and Backgrounds (3 minute read)

ReliTalk is a technique that keeps video avatars looking natural even when the lighting or background changes. It uses a single video and the sound of the speaker's voice to create a 3D face model.

Engineering & Resources

Enhancing Speech-Driven 3D Face Animation (14 minute read)

This study examines the complexities of speech-driven 3D face animation, highlighting two important aspects: global factors that change how the face moves over time (its composite nature) and the way different parts of the face move independently, driven by local muscles (its regional nature).

Efficient RLHF: Reducing PPO memory usage in RLHF (25 minute read)

PPO is a pain in RLHF because you need to keep 3 models around at once. However, the models all stay close to one another and differ only by small updates. Enter LoRA: if you use adapters instead, you can dramatically reduce memory costs without losing performance. The idea is beautiful in its simplicity.
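
The memory argument can be made concrete with a rough parameter count. The sketch below is not taken from the linked paper: it assumes a single hypothetical 4096 x 4096 weight matrix, a LoRA rank of 8, and that the three models share one frozen base and differ only in their adapters. It counts weights only and ignores optimizer state and activations.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Number of weights in a dense d_out x d_in matrix."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Number of weights in a LoRA adapter W + B @ A,
    where A is (rank, d_in) and B is (d_out, rank)."""
    return rank * (d_in + d_out)


# Hypothetical sizes chosen for illustration only.
D_IN, D_OUT, RANK, N_MODELS = 4096, 4096, 8, 3

# Baseline: every model keeps its own full copy of the matrix.
separate_copies = N_MODELS * full_params(D_IN, D_OUT)

# Adapter setup: one shared base matrix plus one small adapter per model.
shared_base = full_params(D_IN, D_OUT) + N_MODELS * lora_params(D_IN, D_OUT, RANK)

print(f"separate copies: {separate_copies:,} weights")
print(f"shared base + adapters: {shared_base:,} weights")
print(f"reduction: {separate_copies / shared_base:.2f}x")
```

Under these assumptions the adapters add well under 2% to the size of a single base matrix, so the total stays close to the cost of one model instead of three.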

Simulating an order book in JAX (28 minute read)

Financial institutions use something called a limit order book to maintain all of the transaction information for trades on their platforms. This is useful, but order books usually run on the CPU, which makes it hard to run RL on them. This paper walks through building an order book in JAX that runs on the GPU.
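
For readers unfamiliar with the data structure, here is a minimal limit order book with price-time priority. It is an illustrative sketch in plain CPU Python, not the paper's JAX implementation: the class name, method names, and integer tick prices are all assumptions made for this example. A JAX version would likely replace the heaps with fixed-size arrays, since jit-compiled code needs static shapes.

```python
import heapq
from itertools import count


class LimitOrderBook:
    """Toy limit order book (hypothetical API, for illustration only)."""

    def __init__(self):
        self._bids = []      # heap of [-price, seq, qty]: highest bid on top
        self._asks = []      # heap of [price, seq, qty]: lowest ask on top
        self._seq = count()  # arrival counter gives time priority at equal prices

    def submit(self, side: str, price: int, qty: int) -> list:
        """Match an incoming limit order against the opposite side, rest any
        unfilled remainder, and return the trades as (price, qty) tuples."""
        if side not in ("buy", "sell"):
            raise ValueError("side must be 'buy' or 'sell'")
        is_buy = side == "buy"
        own, opposite = (self._bids, self._asks) if is_buy else (self._asks, self._bids)
        trades = []
        while qty > 0 and opposite:
            key, _, resting_qty = opposite[0]
            resting_price = key if is_buy else -key
            crosses = resting_price <= price if is_buy else resting_price >= price
            if not crosses:
                break
            fill = min(qty, resting_qty)
            trades.append((resting_price, fill))  # trade at the resting order's price
            qty -= fill
            if fill == resting_qty:
                heapq.heappop(opposite)
            else:
                opposite[0][2] -= fill  # key is unchanged, so the heap stays valid
        if qty > 0:
            heapq.heappush(own, [-price if is_buy else price, next(self._seq), qty])
        return trades

    def best_ask(self):
        """Return (price, qty) of the lowest resting ask, or None if empty."""
        if not self._asks:
            return None
        price, _, qty = self._asks[0]
        return (price, qty)


book = LimitOrderBook()
book.submit("sell", 101, 5)
book.submit("sell", 100, 3)
trades = book.submit("buy", 101, 6)
print(trades)           # [(100, 3), (101, 3)]
print(book.best_ask())  # (101, 2)
```

The incoming buy order takes the cheapest ask first, then fills the rest at the next price level, leaving 2 units resting at 101.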

Miscellaneous

Can LLMs Learn From A Single Example? (10 minute read)

It appears that AI models were able to rapidly memorize examples from the dataset after seeing them just once. This surprising result contradicts most conventional wisdom about neural network sample efficiency.

Doppelgangers: A New Way to Tell if Two Similar Pictures Are Really the Same in 3D (3 minute read)

Doppelgangers is a new tool that helps determine whether two pictures that look almost the same actually show the same 3D object. It is smart enough to avoid mistakes that even people might make.

How The AI Revolution Will Reshape The World (4 minute read)

An argument that the impending technological wave, largely driven by AI, will herald a historic redistribution of power.

Quick Links

Buzzy AI Startup For Generating 3D Models Used Cheap Human Labor (5 minute read)

Kaedim, a 3D generation startup, often used human artists to make its models, at one point having workers produce the 3D designs from whole cloth themselves.

Seona (Product)

Seona is an AI assistant designed to optimize your SEO.

OpenAI Will Host Its First Developer Conference On November 6th (2 minute read)

OpenAI will host a developer conference, OpenAI DevDay, on November 6, the company announced today.