[TLDR] Today's AI News, 2023-10-25: AI Risks and the Climate Crisis 🤖, Meta's Habitat 3.0 🏠, Video Diffusion with Noise Scheduling 📹

9bow · October 26, 2023, 3:30 AM

The PyTorch Korea User Group shares AI news from the TLDR newsletter, with the newsletter's permission, translated with DeepL.
Want to share more AI news and information and grow together? Visit the PyTorch Korean community.

Headlines & Launches

OpenAI plans major updates to lure developers with lower costs (5 minute read)

OpenAI is planning significant updates to help developers build AI-based applications more cheaply and quickly. The updates include memory storage, which could greatly reduce development costs, and new vision capabilities for developers. The enhancements are expected to be announced at OpenAI’s first-ever developer conference on November 6th.

Leading AI Expert Advocates for Urgent Global Response to AI Risks Comparable to Climate Crisis (4 minute read)

Demis Hassabis, CEO of Google's AI unit, warns that the world needs to address AI risks as urgently as the climate crisis. He suggests an oversight body like the Intergovernmental Panel on Climate Change to tackle dangers from AI. Hassabis believes AI could be highly beneficial, but its potential risks require international attention.

Nightshade: This new data poisoning tool lets artists fight back against generative AI (5 minute read)

Researchers from the University of Chicago have developed Nightshade, a tool that lets artists add invisible changes to their artwork. AI models trained on the altered pieces malfunction. The tool aims to deter AI firms from using artists' works without permission. Nightshade will be integrated into Glaze, a tool artists use to mask their personal style, and will be made open-source for wider use.

Research & Innovation

Cola: Enhanced Visual Reasoning with Vision-Language Models (GitHub Repo)

The repo introduces Cola, a system that uses a large language model to coordinate various vision-language models (VLMs) for improved visual reasoning.

DeepSparse (GitHub Repo)

DeepSparse is a CPU inference runtime that takes advantage of sparsity to accelerate neural network inference.

AgentTuning (GitHub Repo)

Instruction-tune LLMs using interaction trajectories across multiple agent tasks.

Engineering & Resources

Meta’s Habitat 3.0 Simulates Real-World Environments For Intelligent AI Robot Training (2 minute read)

Meta’s FAIR team has launched Habitat 3.0, an enhanced AI simulation environment for training robots to navigate real-world scenarios.

FreeNoise: Improved video diffusion with noise scheduling (25 minute read)

Generating a single image from a single prompt generally works well, but the same approach fails for video because of the temporal changes between frames. Maintaining consistency when the text prompt changes is also quite challenging. This work tackles both of these problems and enables diffusion-based generations up to 512 frames long.

SAM-Med3D: Using Segment Anything Model for 3D Medical Imaging (GitHub Repo)

SAM-Med3D is an upgraded version of the Segment Anything Model (SAM) tailored specifically for 3D medical imaging. While the original SAM struggled with 3D medical images, SAM-Med3D, trained on a large dataset of over 131K 3D masks, delivers superior results by capturing 3D spatial details with fewer input prompts.

Miscellaneous

Emotional labor is how we differentiate from AI (1 minute read)

Showing up with a smile and a handshake is something AI can’t do, for now.

Leaked Google AI Innovations: Multimodal Gemini and Revolutionary App Prototyping Feature, Stubbs (6 minute read)

Google is replacing PaLM 2 in Makersuite with Gemini, a multimodal AI model that will offer image and text recognition features. The company also has a hidden tool called Stubbs that enables users to build and launch AI-generated app prototypes. Makersuite will soon fully support language translation.

Can OpenAI Win Consumer And Enterprise? (2 minute read)

OpenAI has so far won both consumers and enterprises, with ChatGPT and its APIs respectively, but whether it can continue to do so remains to be seen.

Quick Links

Apple Could Spend $4.75B On AI Servers In 2024 (1 minute read)

Apple is expected to spend billions of dollars on AI servers in 2023 and 2024.

Claude available in 95 countries (1 minute read)

Anthropic’s Claude chatbot is now available in 95 countries around the world.

Zaplify (Product Launch)

Zaplify uses AI to personalize LinkedIn and email outreach.