[TLDR] Today's AI News, 2023-06-12: Meta's Open-Source AI 🌐, Khan Academy's Khanmigo Chatbot 🤖, Making Language Models More Truthful 🥸

9bow · June 13, 2023, 4:32 AM

The PyTorch Korea User Group shares this AI news, translated with DeepL, with the permission of the TLDR newsletter.

Want to share more AI news and grow together? Visit the PyTorch Korean community now!

Headlines & Launches

Stable Diffusion Takes Stereotypes To The Extreme (15 minute read)

Bloomberg journalists found that Stable Diffusion takes racial and gender disparities to extremes that are even worse than those found in the real world, stoking fears that generative AI will perpetuate biases.

Khan Academy Khanmigo (6 minute read)

Khan Academy has introduced an AI-powered chatbot, Khanmigo, which aims to provide personalized tutoring to students. Powered by GPT-4, the bot offers guidance on a broad range of subjects and uses the Socratic method to encourage learners to solve problems independently. Proponents believe such bots could revolutionize education, but concerns persist about the technology's potential for error and its impact on teachers' roles. Its effectiveness is currently being studied ahead of a broader rollout in U.S. schools.

Meta leads open-source AI with new text-to-music model (4 minute read)

CEO Mark Zuckerberg recently discussed his goal of open-sourcing significant, expensive AI tools in a podcast. This collection of music generation code, model weights, and evaluations is an example of just that. It allows you to generate both melody-conditioned and text-conditioned music. The code is fully open source, but the model weights are not open for commercial use.

Research & Innovation

Matte Anything (GitHub Repo)

Advances in computer vision seemed to slow after autonomous vehicle research lost some steam, but things are picking up again. This novel technique uses a hydra of three image models to enhance the performance of natural image matting. The results are quite compelling.

Breakthrough in dense pixel-wise tracking, even with occlusions (4 minute read)

In 2015, there was a breakthrough in representing continuous values with a 6D number. This work rhymes with that idea: the authors propose representing videos as a 3D volume. By matching between that representation and pixel space, they can perform quite long-term tracking of arbitrary pixels in a video.

Lanarky (GitHub Repo)

Lanarky is an open-source framework to deploy LLM applications in production.

Engineering & Resources

New multimodal text/vision model from Singapore (7 minute read)

A lot of new acronyms are introduced in this work. The gist is that researchers collected a new dataset for visual instruction tuning, trained a model, and made some algorithmic tweaks along the way. The demo is really impressive, especially after the AR reveal from Apple. It is an exciting peek at what AI assistants might look like in the future.

Making language models more truthful at test time (28 minute read)

Language models are well known for confidently confabulating (sometimes called hallucinating). It turns out that these models have an internal representation for truthfulness, and by tweaking the model activations you can elicit a truthful response. The researchers found that by using this technique on the instruction-tuned Alpaca model, they improved TruthfulQA performance from 32.5% to 65.1%.
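To make "tweaking the model activations" a little more concrete, here is a minimal sketch of one way such an intervention could look: shifting an activation vector a fixed distance along a chosen direction. It is an illustration under stated assumptions, not the researchers' procedure. The vector `truthful_direction`, the strength `alpha`, and the helper names are all hypothetical; in a real model the direction would have to be estimated from the model itself.

```python
import math


def unit(vector):
    """Return the vector scaled to unit length."""
    norm = math.sqrt(sum(x * x for x in vector))
    if norm == 0:
        raise ValueError("direction must be non-zero")
    return [x / norm for x in vector]


def projection(activation, direction):
    """Length of the activation's component along the direction."""
    d = unit(direction)
    return sum(a * di for a, di in zip(activation, d))


def steer(activation, direction, alpha):
    """Shift the activation by alpha along the unit direction (hypothetical intervention)."""
    d = unit(direction)
    return [a + alpha * di for a, di in zip(activation, d)]


# Toy values, chosen only for illustration.
activation = [0.5, -1.0, 2.0]
truthful_direction = [3.0, 0.0, 4.0]  # assumption: a made-up "truthfulness" direction
steered = steer(activation, truthful_direction, alpha=2.0)

# The component along the direction grows by alpha; other components are untouched.
shift = projection(steered, truthful_direction) - projection(activation, truthful_direction)
```

In this toy setup the shift along the direction equals `alpha`, which is the only knob the sketch exposes; how strongly to intervene, and where in the network, are design choices the summary above does not specify.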

SpQR: A Technique for High-Efficiency LLM Compression (19 minute read)

The paper presents Sparse-Quantized Representation (SpQR), a new format and technique that allows almost lossless compression of large language models (LLMs), overcoming the usual accuracy losses from quantization. SpQR enables powerful LLMs to run on common devices like laptops and mobile phones without performance degradation, offering over 4x memory compression gains and faster inference than traditional methods.
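The general idea behind combining a sparse part with a quantized part can be sketched as follows: keep a few hard-to-quantize weights at full precision in a sparse table, and quantize everything else to a handful of bits. This is a toy illustration, not the SpQR format itself. Choosing outliers by magnitude, using a single uniform grid, and the names `compress`, `decompress`, and `max_error` are all assumptions made for the example.

```python
def compress(weights, bits=3, num_outliers=1):
    """Quantize weights to `bits` bits, keeping the largest ones at full precision."""
    if not 0 <= num_outliers < len(weights):
        raise ValueError("num_outliers must be smaller than the number of weights")
    # Assumption: outliers are simply the largest-magnitude weights.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]), reverse=True)
    outliers = {i: weights[i] for i in order[:num_outliers]}
    dense = [w for i, w in enumerate(weights) if i not in outliers]
    lo, hi = min(dense), max(dense)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    # Outlier slots get a placeholder code; their true value lives in the sparse table.
    codes = [0 if i in outliers else round((w - lo) / scale)
             for i, w in enumerate(weights)]
    return {"codes": codes, "lo": lo, "scale": scale, "outliers": outliers}


def decompress(packed):
    """Rebuild approximate weights from the codes plus the sparse outlier table."""
    restored = [packed["lo"] + c * packed["scale"] for c in packed["codes"]]
    for i, w in packed["outliers"].items():
        restored[i] = w
    return restored


def max_error(original, restored):
    """Largest absolute difference between original and restored weights."""
    return max(abs(a - b) for a, b in zip(original, restored))


weights = [0.1, -0.2, 0.05, 8.0, 0.3, -0.1]
plain = decompress(compress(weights, bits=3, num_outliers=0))
packed = compress(weights, bits=3, num_outliers=1)
sparse = decompress(packed)
```

With the single large weight stored separately, the quantization grid only has to cover the small weights, so `max_error(weights, sparse)` comes out much smaller than `max_error(weights, plain)` in this toy case.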

Miscellaneous

LLMs Are Good At Playing You (5 minute read)

An exploration of the eerie human-like features of LLMs and how they hold promise for some fields and trouble for others.

Congress To Consider Two New AI Bills (2 minute read)

U.S. senators introduced two separate bipartisan artificial intelligence bills on Thursday amid growing interest in addressing issues surrounding the technology. One would require the U.S. government to be transparent when using AI to interact with people, and the other would establish an office to determine whether the United States is remaining competitive in the latest technologies.

Quick Links

ChatGPT Only Knows 25 Jokes (3 minute read)

Two German researchers, Sophie Jentzsch and Kristian Kersting, released a paper that examines the ability of OpenAI's ChatGPT-3.5 to understand and generate humor. They discovered that ChatGPT's knowledge of jokes is fairly limited: during a test run, 90 percent of 1,008 generations were the same 25 jokes.

Hey, Alexa, What Should Students Learn About AI? (8 minute read)

Tech giants, universities, and nonprofits are working to provide lessons on artificial intelligence to students in schools. There is currently no national consensus on what should be taught about the technology. Current curriculums cover a range of topics, from the basics of AI to how to program Alexa.

ClientZen (Product)

ClientZen uses AI to transform your customer feedback into actionable insights.

Gong Engage (Product)

Gong Engage is an AI sales engagement solution powered by customer interactions.