[TLDR] Today's AI News @ 2023-05-11

9bow · May 12, 2023, 5:20 AM

With permission from the TLDR newsletter, the PyTorch Korea User Group shares its AI news, translated using DeepL.

Headlines & Launches

HuggingFace Agents (11 minute read)

Agents are language models that can use tools. These tools are digital, such as calculators or other AI models. HuggingFace has an interface that makes agents quick and easy to use, and it provides a curated set of models that work well for many tasks. The recent release of StarCoder also improves the usability of agents considerably.
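To make the "language model that can use tools" idea concrete, here is a minimal sketch in plain Python. It is not the HuggingFace interface: `stub_model`, `calculator`, `word_count`, and `run_agent` are hypothetical names, and the "model" is a hard-coded rule standing in for a real language model that would choose the tool.

```python
import operator
from typing import Callable, Dict, Tuple

# Supported arithmetic operators for the toy calculator tool.
OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul, "/": operator.truediv}


def calculator(expression: str) -> str:
    """Tool: evaluate a simple 'a op b' expression such as '6 * 7'."""
    left, op, right = expression.split()
    return str(OPS[op](float(left), float(right)))


def word_count(text: str) -> str:
    """Tool: count whitespace-separated words."""
    return str(len(text.split()))


# Registry of tools the agent is allowed to call (hypothetical).
TOOLS: Dict[str, Callable[[str], str]] = {
    "calculator": calculator,
    "word_count": word_count,
}


def stub_model(request: str) -> Tuple[str, str]:
    """Stand-in for a language model: returns (tool name, tool input)."""
    if request.startswith("compute "):
        return "calculator", request[len("compute "):]
    return "word_count", request


def run_agent(request: str) -> str:
    """Ask the 'model' which tool to use, then run that tool."""
    tool_name, tool_input = stub_model(request)
    return TOOLS[tool_name](tool_input)
```

In a real agent, the rule inside `stub_model` would be replaced by a model call whose output names the tool and its arguments; the dispatch step stays the same.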

Google Releases MusicLM (3 minute read)

Google today released MusicLM, a new experimental AI tool that can turn text descriptions into music. Available in the AI Test Kitchen app on the web, Android, or iOS, MusicLM lets users type in a prompt like “soulful jazz for a dinner party” or “create an industrial techno sound that is hypnotic” and have the tool create several versions of the song.

Google's AI Chatbot Bard Gets Multilingual with New Upgrades (6 minute read)

Google is boosting the abilities of its AI chatbot Bard with added features such as multi-language support, Google Docs and Gmail integration, and visual search, while also making it globally available. Bard, now upgraded to Google's new PaLM 2 language model, will offer improved responses, better coding assistance including debugging, and the ability to generate visuals using AI in the future, keeping it competitive with rivals such as OpenAI's ChatGPT and Microsoft's Bing chatbot.

Research & Innovation

Pre-training without attention (GitHub Repo)

Attention is how a language model performs computation over words, and it is fairly expensive. What if we could use 1D convolutional state space models for BERT training? It turns out to perform quite well. All the code, weights, etc. are available here in JAX.
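As a rough illustration of why a state space model can be applied as a 1D convolution, the sketch below uses a scalar linear state space model, x_k = a·x_{k-1} + b·u_k and y_k = c·x_k. This is an assumption chosen for simplicity, not the architecture from the repository. Unrolling the recurrence gives a kernel K_j = c·a^j·b, and convolving the input with that kernel reproduces the recurrent output.

```python
def ssm_recurrent(u, a, b, c):
    """Run the scalar SSM step by step: x_k = a*x_{k-1} + b*u_k, y_k = c*x_k."""
    x = 0.0
    outputs = []
    for u_k in u:
        x = a * x + b * u_k
        outputs.append(c * x)
    return outputs


def ssm_kernel(length, a, b, c):
    """Convolution kernel obtained by unrolling the recurrence: K_j = c * a**j * b."""
    return [c * (a ** j) * b for j in range(length)]


def ssm_convolution(u, a, b, c):
    """Compute the same outputs as a causal 1D convolution: y_k = sum_j K_j * u_{k-j}."""
    kernel = ssm_kernel(len(u), a, b, c)
    return [
        sum(kernel[j] * u[k - j] for j in range(k + 1))
        for k in range(len(u))
    ]


# Toy input and parameters (illustrative values only).
signal = [1.0, 2.0, 3.0]
recurrent_out = ssm_recurrent(signal, a=0.5, b=1.0, c=2.0)
convolution_out = ssm_convolution(signal, a=0.5, b=1.0, c=2.0)
```

The convolutional form matters because the whole sequence can be processed at once, instead of one step at a time as in the recurrent form.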

Chain-Of-Thought-Hub (GitHub Repo)

Chain-Of-Thought-Hub benchmarks large language models' complex reasoning ability with chain-of-thought prompting.
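A benchmark built on chain-of-thought prompting typically needs three pieces: a prompt that shows worked reasoning, a way to pull the final answer out of the model's completion, and a score. The sketch below shows one possible shape for these. The function names, the prompt template, and the "The answer is" convention are assumptions for illustration and are not taken from the Chain-Of-Thought-Hub code.

```python
import re


def build_cot_prompt(question, exemplars):
    """Build a few-shot prompt; each exemplar is (question, reasoning, answer)."""
    parts = []
    for ex_question, ex_reasoning, ex_answer in exemplars:
        parts.append(
            f"Question: {ex_question}\n"
            f"Let's think step by step. {ex_reasoning}\n"
            f"The answer is {ex_answer}."
        )
    # The final block leaves the reasoning open for the model to complete.
    parts.append(f"Question: {question}\nLet's think step by step.")
    return "\n\n".join(parts)


def extract_answer(completion):
    """Return the number following 'The answer is', or None if absent."""
    match = re.search(r"The answer is\s+(-?\d+(?:\.\d+)?)", completion)
    return match.group(1) if match else None


def accuracy(completions, gold_answers):
    """Fraction of completions whose extracted answer matches the gold answer."""
    correct = sum(
        extract_answer(completion) == gold
        for completion, gold in zip(completions, gold_answers)
    )
    return correct / len(gold_answers)
```

The completions would come from whichever model is being evaluated; that call is deliberately left out here.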

LLM Chain (GitHub Repo)

LLM Chain is a powerful Rust crate for building chains with large language models, allowing you to summarize text and complete complex tasks.

AI is getting smarter. Are you? (Sponsor)

Will a robot be sitting at your desk tomorrow? Unlikely. But the person competing for your next job will probably know the concepts behind AI and how to put them to work. Luckily, Brilliant is the easiest, most effective way to level up on core concepts in math, data, CS, and more. Try it free for 30 days + get 20% off an annual premium subscription.

Engineering & Resources

Building Affordable Businesses with Large Language Models (5 minute read)

This issue of The Batch, the deeplearning.ai newsletter, examines how using large language models (LLMs), like GPT-3.5-turbo, can be a cost-effective solution for automating tasks or generating content. It highlights that LLMs can generate enough text to engage a user for an hour for as low as $0.08, making them significantly cheaper than human labor for many tasks, and that the cost-effectiveness improves further when the generated content is consumed by a large audience.
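The $0.08 figure can be reproduced with back-of-the-envelope arithmetic. The inputs below are assumptions chosen for illustration and are not stated in the summary above: a reading speed of 250 words per minute, roughly 0.75 words per token, a price of $0.002 per 1,000 tokens, and a factor of 2 to account for prompt tokens.

```python
def cost_per_reading_hour(
    words_per_minute=250,      # assumed adult reading speed
    words_per_token=0.75,      # assumed average words per token
    usd_per_1k_tokens=0.002,   # assumed model price per 1,000 tokens
    prompt_overhead=2.0,       # assumed multiplier for prompt tokens
):
    """Estimated cost in USD of generating one hour of reading material."""
    words = words_per_minute * 60
    tokens = words / words_per_token
    return tokens / 1000 * usd_per_1k_tokens * prompt_overhead


def cost_per_reader(total_cost_usd, audience_size):
    """Generation cost amortized over everyone who reads the same text."""
    return total_cost_usd / audience_size


hourly_cost = cost_per_reading_hour()
shared_cost = cost_per_reader(hourly_cost, audience_size=1000)
```

Under these assumptions, one hour of text is 15,000 words, or about 20,000 tokens, which comes to about $0.04 and doubles to about $0.08 with the prompt overhead. Dividing by the audience size shows why content read by many people costs even less per reader.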

Exploring the Power and Potential of Data Augmentation with Diffusion Models (16 minute read)

This paper systematically evaluates methods for generating images from diffusion models and their benefits for data augmentation in downstream tasks. The study reveals that personalizing diffusion models towards target data is effective, but using the training data of the diffusion model alone through a simple nearest neighbor retrieval procedure results in stronger downstream performance, highlighting the potential and limitations of diffusion models in data augmentation.
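The nearest neighbor retrieval baseline mentioned above can be pictured as follows: embed a target example, rank a pool of candidate images by similarity, and add the closest ones to the training set. This is a minimal sketch with toy two-dimensional embeddings and hypothetical image names. The paper's actual embedding model, pool, and similarity measure are not given in this summary, so cosine similarity is an assumption.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def retrieve_neighbors(target_embedding, pool, k):
    """Return the names of the k pool items most similar to the target."""
    ranked = sorted(
        pool.items(),
        key=lambda item: cosine_similarity(target_embedding, item[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]


# Toy pool of (hypothetical) image names mapped to made-up embeddings.
pool = {
    "cat_1": [1.0, 0.1],
    "cat_2": [0.9, 0.2],
    "truck_1": [0.0, 1.0],
}
augmentation_set = retrieve_neighbors([1.0, 0.0], pool, k=2)
```

The retrieved items would then be added to the downstream training data in place of, or alongside, generated images.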

Enhancing Language Models with Algorithms for Superior Performance (13 minute read)

Recent research shows that incorporating algorithms into large pre-trained language models (LLMs) can significantly improve their capabilities, without the need for fine-tuning. An example of evidence-supported question-answering demonstrates a 6.4% improvement over the standard baseline, suggesting the potential advantages of this innovative, cost-effective method.

Miscellaneous

The AI Takeover Of Google Search Starts Now (7 minute read)

The article focuses on Google's plan to improve search results using AI-generated experiences. Google is working on integrating AI into its search engine to create more personalized and relevant search results for users. This approach aims to provide users with a richer and more interactive experience, including features like AI-generated images, videos, and text. However, the article also highlights potential challenges, such as maintaining user privacy and addressing biases in AI-generated content.

The Leverage Of LLMs For Individuals (4 minute read)

The article discusses the potential of LLM (Large Language Model) technology for individual users. The author explores various use cases, such as aiding in creative writing, generating code, and answering questions on various topics. The article also addresses potential drawbacks, including concerns about AI-generated misinformation, ethical issues, and the accessibility of LLM technology for personal use due to its resource-intensive nature.

Quick Links

Sam Altman To Testify Before Congress (2 minute read)

OpenAI CEO Sam Altman will testify to Congress for the first time next week, the latest sign that policymakers in Washington are ratcheting up scrutiny of artificial intelligence as the technology booms in Silicon Valley.

India’s Religious Chatbots Are Condoning Violence (5 minute read)

The article discusses the rise of religious chatbots in India, like GitaGPT and Krishna, which provide spiritual guidance and support to users. These AI-driven chatbots are gaining popularity as they cater to the cultural and religious needs of the Indian population, but some critics raise concerns about their potential to promote misinformation, lawbreaking, and violence.

PrivateGPT (GitHub Repo)

PrivateGPT allows you to interact privately with your documents using the power of GPT.

Raycast Pro (Product Launch)

Raycast Pro is the productivity tool for professionals, with AI, cloud synchronization, custom themes, unlimited clipboard history, Pro APIs, and more coming soon.