[TLDR] 오늘의 AI 뉴스 @ 2023-05-09

9bow · 5월 10, 2023, 2:25오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터 의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

백만 달러 규모의 AI 미세 조정 대회 / Million dollar AI fine-tuning challenge (3 minute read)

언어 모델을 미세 조정하기 위한, Kaggle과 비슷한 대회가 열립니다. 저희는 이러한 모델의 안전성을 개선하고 싶기 때문에 현재 커뮤니티에 의존하여 진전을 이룰 수 있는지 살펴보고 있습니다. 사용할 수 있는 컴퓨팅 파트너십과 보상 모델이 있습니다. 대회는 6월 10일부터 시작됩니다.

Think of this like Kaggle but for fine tuning a language model. We want to improve the safety of these models, and so we are relying on the community now to see if we can make progress. There are compute partnerships available and a reward model that you can use. The competition starts on June 10th.

Google, PaLM 2 발표 / Google to announce PaLM 2 (2 minute read)

구글이 범용 대규모 언어 모델(LLM)을 출시하는 등 여러 가지 생성형 AI 업데이트를 발표할 계획인 가운데, 이번 수요일 구글의 연례 개발자 컨퍼런스에서 인공 지능이 주요 주제가 될 것이라고 CNBC가 보도했습니다.

Artificial intelligence is going to be a central theme at Google’s annual developer conference on Wednesday, as the company is planning to announce a number of generative AI updates, including launching a general-use large language model (LLM), CNBC has learned.

공정 사용 및 모델 훈련, 게티 이미지 vs. Stable Diffusion / Fair Use and Model Training, Getty Images vs. Stable Diffusion (14 minute read)

유니버설 뮤직 그룹(UMG)은 스포티파이나 애플 같은 스트리밍 플랫폼에 AI 서비스가 저작권이 있는 노래의 멜로디와 가사를 스크랩하지 못하도록 차단해 달라고 요청했습니다. UMG는 AI 봇이 인기 아티스트의 곡을 사용하여 인기 아티스트와 유사한 음악을 생성함으로써 잠재적으로 오리지널 아티스트의 시장에 영향을 미칠 수 있다고 우려하고 있습니다. 그러나 저작권이 있는 자료를 모델 학습에 사용하는 것이 저작권의 공정 사용 예외에 해당할 수 있으므로 불법인지 여부는 불분명합니다. 한편, Adobe는 대규모 스톡 사진 라이브러리를 소유하고 있고, 리스크를 회피하는 기업들과 관계를 맺고 있기 때문에 AI 모델을 제외하는 좁은 의미의 공정 사용 정의를 지지하고 있습니다.

Universal Music Group (UMG) has requested streaming platforms like Spotify and Apple to block AI services from scraping melodies and lyrics from copyrighted songs. UMG is concerned about AI bots using their songs to generate music that resembles popular artists, potentially affecting the market for the original artists. However, it is unclear if using copyrighted material for model training is illegal, as it may fall under the fair use exception to copyright. Meanwhile, Adobe is betting on a narrow fair use definition that excludes AI models, as it owns a large stock photo library and has relationships with risk-averse corporations.

연구 & 혁신 관련 소식 / Research & Innovation

RasaGPT (GitHub Repo)

RasaGPT는 Rasa와 랭체인(LangChain)을 기반으로 구축된 최초의 헤드리스 LLM 챗봇 플랫폼입니다. 이는 보일러플레이트 코드이며, 색인, 검색 및 컨텍스트 삽입을 위해 Langchain과 같은 LLM 라이브러리를 활용하는 Rasa 및 Telegram의 참조 구현입니다.

RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. It is boilerplate code and a reference implementation of Rasa and Telegram utilizing an LLM library like Langchain for indexing, retrieval and context injection.

MLC LLM (GitHub Repo)

MLC LLM은 다양한 언어 모델을 다양한 하드웨어 백엔드 및 네이티브 애플리케이션에 직접 배포할 수 있는 범용 솔루션으로, 누구나 자신의 사용 사례에 맞게 모델 성능을 더욱 최적화할 수 있는 생산적인 프레임워크를 제공합니다.

MLC LLM is a universal solution that allows any language models to be deployed natively on a diverse set of hardware backends and native applications, plus a productive framework for everyone to further optimize model performance for their own use cases.

PandasAI (GitHub Repo)

Pandas AI는 널리 사용되는 데이터 분석 및 조작 도구인 Pandas에 생성형 인공 지능 기능을 추가하는 Python 라이브러리입니다.

Pandas AI is a Python library that adds generative artificial intelligence capabilities to Pandas, the popular data analysis and manipulation tool.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

경사도 하강 방식으로 Decision Tree 학습하기 / Learning Decision Trees with Gradient Descent (14 minute read)

의사 결정 트리(Decision Tree)는 데이터를 기반으로 의사 결정을 내리는 인공지능의 일종이지만, 현재 의사 결정 트리를 생성하는 방식은 때때로 완벽하지 않은 결과를 가져올 수 있습니다. 이 논문에서는 경사 하강이라는 새로운 방법을 사용하여 더 나은 의사 결정 트리를 구축하고 AI가 사물을 분류해야 하는 작업에서 더 정확한 결과를 얻을 수 있는 방법에 대해 설명합니다.

Decision Trees are a type of artificial intelligence that make decisions based on data, but the current method of creating these trees can sometimes give us less than perfect results. This paper talks about a new method using something called gradient descent, which helps us to build better Decision Trees and get more accurate results for tasks where the AI needs to categorize things.

Compose Objects Localized with Attributes (29 minute read)

구성 추론은 객체와 객체의 특징을 결합하여 복잡한 시각적 요소를 이해하는 데 도움이 되지만, 현재 AI 모델은 이 작업에 어려움을 겪고 있습니다. 연구원들은 이러한 AI 모델을 테스트하고 개선하기 위해 Cola라는 벤치마크를 만들었으며, 경량 멀티모달 어댑터가 모델이 이미지와 언어를 더 잘 추론하는 데 도움이 된다는 사실을 발견했습니다.

Compositional reasoning helps us understand complex visuals by combining objects and their features, but current AI models struggle with this. Researchers created a benchmark called Cola to test and improve these AI models, finding that a lightweight multi-modal adapter helps the models reason about images and language better.

새로운 어텐션 메커니즘으로 웹 페이지 이해도 향상 / Enhancing Webpage Understanding with a Novel Attention Mechanism (17 minute read)

웹 페이지는 AI 작업에 많은 정보를 제공하지만, 그 잠재력을 최대한 활용하지는 못했습니다. 새로운 Wikipedia 웹페이지 제품군(WikiWeb2M)은 2백만 개의 페이지와 설명 생성, 섹션 요약, 문맥 이미지 캡션 생성 같은 작업의 성능을 향상시키는 특별한 어텐션 메커니즘을 도입하여 멀티모달 웹페이지 이해 연구에 도움을 줍니다.

Webpages offer a lot of information for AI tasks, but they haven't been used to their full potential. The new Wikipedia Webpage suite (WikiWeb2M) helps study multimodal webpage understanding by introducing 2 million pages and a special attention mechanism that improves performance on tasks like generating descriptions, summarizing sections, and creating contextual image captions.

그 외 소식 / Miscellaneous

펜타곤(미국방부)에서 ChatGPT 사용할 수 있을까? / Can The Pentagon Use ChatGPT? (5 minute read)

온라인 미디어인 The Intercept에서 미 국방부의 ChatGPT 사용과 AI의 군사적 사용으로 인해 발생하는 윤리적 문제, 특히 OpenAI가 군의 기술 사용을 명시적으로 금지했기 때문에 발생하는 윤리적 문제에 대해 파헤쳐보았습니다.

The Intercept dives into the use of ChatGPT by the Pentagon and the ethical questions that arise from the military use of AI, especially because OpenAI has explicitly banned the military from using its technology.

기본소득(UBI)이 우리를 AI로부터 구할 수 있을까? / Will UBI Save Us From AI? (4 minute read)

이 글에서는 AI와 자동화로 인한 일자리 대체에 대한 해결책으로 보편적 기본소득(UBI)의 가능성을 살펴봅니다. 이 글에서는 보편적 기본소득의 잠재적 혜택, 도전 과제, 시행을 둘러싼 지속적인 논쟁 등 보편적 기본소득에 대한 다양한 관점에 대해 설명합니다. 또한 저자는 AI가 노동 시장을 계속 재편함에 따라 소득 불평등과 일자리의 미래를 해결해야 할 필요성에 대해서도 살펴봅니다.

This article explores the potential of universal basic income (UBI) as a solution to job displacement caused by AI and automation. It discusses various perspectives on UBI, including its potential benefits, challenges, and the ongoing debate surrounding its implementation. The author also examines the need to address income inequality and the future of work as AI continues to reshape the labor market.

중국, 미국보다 앞서가는 AI 규제 / China Races Ahead Of US On AI Regulation (2 minute read)

Axios의 이 기사는 글로벌 AI 경쟁에서 주도권을 잡기 위해 AI에 대한 규제 프레임워크를 구축하려는 중국의 노력에 대해 설명합니다. AI 혁신을 장려하는 것과 잠재적인 위험 및 윤리적 우려를 해결하는 것 사이에서 균형을 맞추려는 중국 정부의 시도를 살펴봅니다. 또한 데이터 프라이버시, 감시, 기술 거버넌스에 대한 여러 국가의 관점을 연결하는 데 따르는 어려움을 인정하면서 AI 규제에 있어 국제적인 협력과 조화의 중요성을 강조합니다.

The article from Axios discusses China's efforts to establish regulatory frameworks for AI as it tries to lead in the global AI race. It examines the Chinese government's attempt to strike a balance between encouraging AI innovation and addressing potential risks and ethical concerns. The article also highlights the importance of international collaboration and harmonization in AI regulation while acknowledging the challenges of bridging different countries' perspectives on data privacy, surveillance, and technology governance.

더 읽어보기 / Quick Links

OpenAI는 고객 데이터로 학습하지 않는다 / OpenAI Won’t Train On Customer Data (1 minute read)

OpenAI는 AI 모델 학습을 위해 고객의 API 데이터 사용을 중단합니다.

OpenAI is to stop using customer’s API data to train its AI models.

Metering.ai (Product Launch)

제품 사용 데이터를 수동으로 계량하고 계신가요? metering.ai로 전환하고 단 3번의 클릭으로 수백만 행의 사용량 데이터를 Stripe 인보이스로 전환하세요. 이제 더 이상 청구 오류나 수작업으로 인한 번거로움이 없습니다. 여러분의 제품에 사용량 기반 요금제를 적용해서 몇 일 안에 출시하세요.

Metering your product usage data manually? Switch to metering.ai & turn millions of rows of usage data into Stripe invoices in just 3 clicks. This means no more billing errors & no more manual grunt work. Launch usage based pricing for your product in days.

미국은 IBM 왓슨을 잊었다. 다음은 ChatGPT인가? / America Forgot About IBM Watson. Is ChatGPT Next? (3 minute read)

이 글은 IBM의 왓슨이 차세대 인공지능의 대명사였던 시절로 거슬러 올라간 다음, ChatGPT가 그와 같은 운명을 맞이할 것인지에 대한 질문에 대해 재미있게 설명합니다.

This article is a fun dive back into the days when IBM’s Watson was the next big AI thing, and then dives into the question of whether ChatGPT will meet the same fate of irrelevance.

ChatGPT는 교육을 파괴하는 것이 아니라 변화시킬 것이다 / ChatGPT Is Going To Change Education, Not Destroy It (7 minute read)

이 글에서는 교사들이 ChatGPT가 교육을 향상시킬 것이라고 믿는 방법에 대해 자세히 설명합니다.

This article dives into how teachers believe ChatGPT will enhance education.