[TLDR] 오늘의 AI 뉴스, 2023-06-01: 바이든 행정부의 AI 규제🏛️, AI 모델을 더 작게 만들려는 경쟁🤏, OpenAI - "프로세스 감독" 도입🦺

9bow · 6월 2, 2023, 6:25오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으시면 파이토치 한국 사용자 모임에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

샘 알트먼이 말하는 OpenAI의 계획 / OpenAI’s Plans According To Sam Altman (4 minute read)

샘 알트먼은 2023년에는 더 저렴하고 빠른 GPT-4, 더 긴 컨텍스트 윈도우, 파인튜닝 API, 상태 유지 API, 2024년에는 멀티모달리티를 도입할 예정이라고 밝히며 OpenAI의 로드맵을 공유했습니다.

Sam Altman has shared his roadmap for OpenAI, with highlights for the rest of 2023 including a cheaper and faster GPT4, longer context windows, a fine-tuned API, and a stateful API, while 2024 will bring multimodality.

Falcon : 무료 사용 가능한 세계 최고의 개방형 언어 모델 / World's best open language model is now royalty free (2 minute read)

새로운 Falcon 모델은 매우 강력하지만 제한적인 수익 공유 모델을 따르도록 공개되었습니다. 이제 이러한 제약이 사라져 자유롭게 모델을 사용할 수 있습니다. 현재 허깅페이스 리더보드에서 1위를 차지하고 있습니다.

The new Falcon models are extremely powerful but were released under a restrictive revenue sharing model. This restriction has now been waived and the models can be used freely. They rank #1 on the HuggingFace leaderboard.

AI 규제로 갈등 중인 바이든 행정부 / Biden Administration Torn On AI Regulation (5 minute read)

바이든 행정부 관리들은 새로운 인공 지능 도구를 얼마나 적극적으로 규제해야 하는지에 대해 분열되어 있으며, 일부 백악관과 상무부 관리들은 EU와 같은 강력한 규제를 선호하는 반면 국가 안보 관리들은 국가 경쟁력을 유지하기 위해 덜 규제하는 것을 선호합니다.

Biden administration officials are divided over how aggressively new artificial intelligence tools should be regulated, with some White House and Commerce Department officials favoring strong regulation like those in the EU while those in national security preferring less regulation in order to keep the nation competitive.

연구 & 혁신 관련 소식 / Research & Innovation

멋진 3D 아바타 만들기 / Creating Stylish 3D Avatars (GitHub Repo)

사전 학습된 이미지-텍스트 디퓨전 모델과 학습을 위한 적대적 생성 신경망(GAN)의 조합을 사용하여 고품질의 개인화된 3D 아바타를 만드는 혁신적인 방법을 제시합니다. 이러한 고급 모델을 사용하면 다양한 스타일의 다양한 멀티-뷰 아바타 이미지를 만들 수 있습니다.

The authors present an innovative method to create high-quality, personalized 3D avatars, using a combination of pre-trained image-text diffusion models and a Generative Adversarial Network (GAN) for training. By using these advanced models, we can create diverse, multi-view avatar images in various styles.

Langchain Course (GitHub Repo)

이 과정은 ChatGPT와 같은 대규모 언어 모델(LLM)을 사용하여 애플리케이션을 개발하기 위한 강력한 오픈 소스 프레임워크인 LangChain을 시작하는 데 도움이 되도록 설계되었습니다.

This course is designed to help you get started with LangChain, a powerful open-source framework for developing applications using large language models (LLMs) like ChatGPT.

슈퍼 언어 AI: 도구 사용자에서 도구 제작자로 / Super Language AIs: From Tool Users to Tool Makers (GitHub Repo)

이 최근 연구에서는 대규모 인공지능(AI) 언어 시스템, 즉 LLM을 단순히 도구를 사용하는 데 그치지 않고 직접 만들어 문제를 더 효율적으로 해결할 수 있도록 훈련시키는 획기적인 방법을 소개합니다. 이 LLM은 기존 도구에 의존하는 대신 작은 소프트웨어 툴킷처럼 다양한 작업에 사용할 수 있는 자체 '유틸리티 기능'을 개발하여 향후 문제 해결 요청에 도움을 줄 수 있습니다.

This recent study introduces a groundbreaking method where big artificial intelligence (AI) language systems, or LLMs, are trained not just to use tools but to create their own to solve problems more efficiently. Rather than depending on existing tools, these LLMs develop their own 'utility functions' - like little software toolkits - which can be used for a variety of tasks and help future problem-solving requests.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

프로세스 감독을 통한 수학적 추론 능력 향상 / Improving Mathematical Reasoning With Process Supervision (5 minute read)

OpenAI는 AI 모델의 수학적 추론 능력을 향상시키기 위해 "프로세스 감독"이라는 새로운 방법을 도입했습니다. 이 기술은 추론 과정에 초점을 맞춰 단계별로 문제를 해결하도록 모델을 훈련하고 솔루션에 대한 설명을 제공합니다. 이 기법은 이전 작업보다 크게 개선되어 대규모 언어 모델의 기능을 확장하여 더 복잡한 수학적 문제를 처리할 수 있습니다

OpenAI has introduced a new method known as "process supervision" to improve mathematical reasoning capabilities in AI models. This technique focuses on the reasoning process, training the model to solve problems step by step and providing explanations for its solutions. It has shown significant improvements over previous work, extending the capabilities of large language models to handle more complex mathematical problems

포이즌 공격과 이것이 LLM 성능에 미치는 영향 / Poisoning Attacks and Their Impact on Performance of LLMs (6 minute read)

어떤 사람들은 훈련 데이터에 유해한 예시를 추가하여 특정 단어나 문구가 언급될 때 모델이 이상하게 작동하도록 함으로써 ChatGPT와 같은 언어 모델을 속일 수 있습니다. 이로 인해 모델의 유용성이 떨어지고 신뢰성과 안전성에 대한 우려가 제기됩니다.

Some people can trick language models like ChatGPT by adding harmful examples to the training data, causing the models to behave strangely when certain words or phrases are mentioned. This makes the models less useful and raises concerns about their reliability and safety.

가능도 기반 디퓨전 언어 모델 / Likelihood based Diffusion language models (26 minute read)

가능도(likelihood)는 단어가 생성될 확률을 측정하는 방법입니다. 현대 언어 모델의 핵심 중 하나입니다. 지금까지는 디퓨전 언어 모델에는 적용되지 않았습니다. 디퓨전 과정(점수 매칭)에 몇 가지 영리한 조정을 추가하면 10배 더 작은 언어 모델만큼 성능이 좋은 확률 기반 스케일링 법칙을 가진 1B 매개변수 모델을 얻을 수 있습니다. 아직 거기까지는 이르지 못했지만 올바른 방향으로 나아가는 중입니다.

Likelihood is a way of measuring the probability of words being generated. It's one of the keys of modern language models. It hasn't worked yet for diffusion language models until now. It turns out if you add some clever tweaks to the diffusion process (score matching) you can get a 1B parameter model with likelihood based scaling laws that performs as well as 10x smaller language models. We're not there yet, but this is a step in the right direction.

그 외 소식 / Miscellaneous

더 작은 AI 모델을 만들기 위한 경쟁 / The Race To Make AI Smaller (5 minute read)

BabyLM 챌린지는 대형 LLM 모델의 단점(예: 대형 모델에는 소수의 회사만 보유한 처리 능력이 필요하다는 사실)으로 인해 더 작지만 여전히 효과적인 AI 모델을 개발하기 위한 노력입니다.

The BabyLM challenge is a push to develop smaller but still effective AI models due to the drawbacks of large LLM models, for example, the fact that bigger models require processing power that few companies possess.

역대 최대 규모의 튜링 테스트 실험 / The Largest Turing Test Experiment To Date (7 minute read)

AI21 연구소는 150만 명 이상의 참가자가 1,000만 건 이상의 대화를 수행한 역대 최대 규모의 튜링 테스트 실험을 완료했습니다. 사람들은 73%의 시간 동안 사람을 정확하게 식별할 수 있었지만, 봇과 대화할 때는 60%의 시간 동안만 정확하게 식별할 수 있었습니다. 이는 AI 챗봇이 아직 인간과 완전히 구별되지는 않지만 점점 더 가까워지고 있음을 보여줍니다.

AI21 Labs has concluded the largest Turing Test experiment to date, with over 10 million conversations conducted by more than 1.5 million participants. People were able to correctly identify a human 73% of the time, but only 60% of the time when talking to a bot. This suggests that AI chatbots are still not quite indistinguishable from humans, but they are getting closer.

더 읽어보기 / Quick Links

Shoggoth 밈이 AI를 점령한 방법 / How The Shoggoth Meme Has Taken Over AI (4 minute read)

AI 업계 종사자들 사이에서 인기를 끌고 있는 쇼고스 밈의 인기에 대한 탐구.

An exploration into the popularity of the Shoggoth meme, a favorite among workers in AI.

Macaw-LLM (GitHub Repo)

Macaw-LLM은 이미지, 비디오, 오디오 및 텍스트 데이터를 원활하게 결합하여 다중 모드 언어 모델링을 개척하는 탐색적 노력의 일환으로, CLIP, Whisper 및 LLaMA의 기반으로 구축되었습니다.

Macaw-LLM is an exploratory endeavor that pioneers multi-modal language modeling by seamlessly combining image, video, audio, and text data, built upon the foundations of CLIP, Whisper, and LLaMA.

Siit (Product Launch)

GPT-4로 구동되는 Siit AI는 내부 지식 기반인 Notion 및 Confluence를 기반으로 모든 직원의 질문에 Slack 및 Teams를 통해 직접 즉시 답변합니다. 이는 직원 경험의 혁명입니다.

Siit AI, powered by GPT-4, instantly answers all employees' questions based on your internal knowledge bases, Notion and Confluence, directly via Slack and Teams. This is a revolution in your employee experience.

Recraft.AI (Product Launch)

Recraft는 웹 사이트, 인쇄 및 마케팅에 적합한 다양한 스타일의 벡터 아트, 아이콘, 3D 이미지 및 일러스트레이션을 생성하고 편집할 수 있는 무한한 AI 아트보드입니다. 누구나 무료로 사용할 수 있으며, 생성된 이미지를 상업적으로 사용할 수 있습니다.

Recraft is an infinite AI artboard, where you can generate and edit vector art, icons, 3D images and illustrations in a wide range of styles suitable for websites, print, and marketing. It is free for everyone and allows commercial use of the generated images.