[TLDR] Today's AI News, 2023-08-03: Meta's AudioCraft 🔈, White Castle brings AI to its drive-thrus 🍔, benchmarking YOLO-based models 🪑

9bow · August 4, 2023, 10:06 AM

With permission from the TLDR newsletter, the PyTorch Korea User Group translates AI news with DeepL and shares it here.

Want to share more AI news and information and grow together? Visit the PyTorch Korean community today!

Headlines & Launches

Meta released AudioCraft: a code base for generative audio needs (3 minute read)

AudioCraft is a one-stop code base for all your generative audio needs: music, sound effects, and compression.

[GN] Meta open-sources "AudioCraft", a generative AI for audio

White Castle will bring more AI to its drive-thrus (2 minute read)

White Castle plans to roll out AI-enabled voice assistance at over 100 drive-thrus by 2024, using technology from speech recognition company SoundHound. The system promises quicker order processing and lets customers speak with human staff if errors occur. The initiative follows a successful proof-of-concept and will not lead to layoffs.

Generative AI services pulled from Apple App Store in China ahead of new regulations (3 minute read)

Apple has removed numerous AI apps from its China App Store, including OpenCat, a ChatGPT client, ahead of new generative AI regulations in China. The upcoming regulations require AI apps to secure an administrative license to operate in China. The new framework could deter smaller developers, potentially leaving the market to larger firms capable of managing compliance.

Research & Innovation

Efficient Vision Transformers with Focused Linear Attention (GitHub Repo)

This research introduces a new 'Focused Linear Attention' (FLatten) method that makes Transformers more efficient and powerful. The researchers devised a new mapping function and a rank restoration module that enhance model performance while keeping computational demands low.
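For readers new to linear attention in general, the sketch below shows where the savings come from. It is a minimal pure-Python illustration under stated assumptions, not the FLatten implementation: `feature_map` (ReLU plus a small constant) is an assumed stand-in for the paper's mapping function, the rank restoration module is left out, and every function name here is hypothetical.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of a (n x m) and b (m x p)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a: Matrix) -> Matrix:
    return [list(col) for col in zip(*a)]


def feature_map(a: Matrix) -> Matrix:
    # Assumption: a simple non-negative map (ReLU + epsilon), NOT the paper's function.
    return [[max(x, 0.0) + 1e-6 for x in row] for row in a]


def quadratic_attention(q: Matrix, k: Matrix, v: Matrix) -> Matrix:
    """Kernel attention computed the expensive way: builds the full N x N matrix."""
    fq, fk = feature_map(q), feature_map(k)
    scores = matmul(fq, transpose(fk))        # N x N similarities
    out = matmul(scores, v)                   # N x d_v
    return [[x / sum(s) for x in o] for s, o in zip(scores, out)]


def linear_attention(q: Matrix, k: Matrix, v: Matrix) -> Matrix:
    """Same result, reordered: phi(Q) @ (phi(K)^T @ V), no N x N matrix."""
    fq, fk = feature_map(q), feature_map(k)
    kv = matmul(transpose(fk), v)             # d x d_v, independent of N^2
    k_sum = [sum(col) for col in zip(*fk)]    # column sums of phi(K), length d
    out = matmul(fq, kv)                      # N x d_v
    norms = [sum(a * b for a, b in zip(row, k_sum)) for row in fq]
    return [[x / n for x in o] for o, n in zip(out, norms)]


if __name__ == "__main__":
    q = [[1.0, -2.0], [0.5, 3.0], [2.0, 1.0]]
    k = [[0.3, 1.0], [-1.0, 2.0], [1.5, 0.2]]
    v = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
    print(linear_attention(q, k, v))
```

Both functions return the same values up to floating-point error. The linear form only ever materializes a d x d_v matrix, so its cost grows linearly with the number of tokens N rather than quadratically.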

LISA: Introducing Reasoning Segmentation, a New Task in AI (GitHub Repo)

This study introduces a new AI task called 'reasoning segmentation', in which a model must generate a segmentation mask from a complex, implicit textual instruction. It presents a tool called LISA (Large-language Instructed Segmentation Assistant) as a demonstration. LISA combines the language generation capabilities of large language models with the ability to create segmentation masks.

Engineering & Resources

Enhancing Brain Tumor Classification (6 minute read)

This study presents a novel classification network that uses L2-normalized spatial attention to improve accuracy when identifying different types of brain tumors in MRI images.

YOLOBench: Benchmarking YOLO-Based Object Detection Models (18 minute read)

This study introduces "YOLOBench", a benchmark covering over 550 object detection models based on the YOLO (You Only Look Once) method, tested across four unique datasets and hardware systems.

Miscellaneous

How MIT’s Liquid Neural Networks can solve AI problems from robotics to self-driving cars (9 minute read)

MIT CSAIL’s Liquid Neural Networks (LNNs) are compact AI models that excel in robotics and autonomous vehicles. LNNs are adaptable, less computationally intensive, and outperform standard models in changing environments, but they struggle with static databases.

Agentized LLMs will change the alignment landscape (4 minute read)

The development of agentized LLMs like Auto-GPT and Baby AGI could rapidly advance AGI. These LLMs, which emulate human cognitive functions, pose new challenges for alignment and interpretability, yet they are also unusually easy to inspect because they process information in English.

Quick Links

BrainyPDF (Product Launch)

ChatGPT, but for PDFs. Summarize your documents and get answers to questions about them using ChatGPT.

Fluent 2.0 (Product Launch)

An AI language tutor in your browser.

Kickstarter requires generative AI projects to disclose additional info (5 minute read)

Kickstarter now requires every project that uses AI tools for content generation, or that develops AI, to be transparent about its AI use and data sources. The move aims to protect original content creators and enhance accountability in AI projects.