[TLDR] 오늘의 AI 뉴스, 2023-10-06: 구글 포토 AI 📷, Arc 브라우저의 AI 기능 💻, 뇌파에서 음성 해독 🧠

9bow · 10월 7, 2023, 11:14오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터 의 승인을 받아 AI 소식을 DeepL로 번역 하여 전합니다.
더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

Arc 브라우저, AI 기반 기능 도입 / Arc Browser Introduces AI-Powered Features (2 minute read)

Arc 브라우저는 "Arc Max"라는 이름으로 AI 강화 기능을 도입했습니다. 이 기능은 고정된 탭 이름 바꾸기, 링크 콘텐츠 요약, 다운로드한 파일의 콘텐츠에 따라 이름 바꾸기 등의 기능에 OpenAI의 GPT-3.5 및 Anthropic의 모델을 활용합니다. anthropic

The Arc browser has introduced AI-enhanced features under the name "Arc Max". These features utilize OpenAI's GPT-3.5 and Anthropic’s models for functions such as renaming pinned tabs, summarizing link content, and renaming downloaded files based on their content.

AI의 가까운 미래 / The shape of the shadow of The Thing (15 minute read)

곧 출시될 구글의 제미니 모델은 GPT-4를 능가할 수 있습니다. AI가 시각과 음성 기능을 확보함에 따라 그 응용 분야와 잠재적 우려도 커지고 있습니다. AI의 미래는 인간의 책임감 있는 사용과 의사 결정에 달려 있습니다.

Google's upcoming Gemini model may outshine GPT-4. As AI gains vision and voice capabilities, its applications and potential concerns grow. The future of AI hinges on responsible human use and decision-making.

Google 포토 AI / Google Photos AI (4 minute read)

구글의 픽셀 8과 픽셀 8 프로는 AI로 강화된 사진 편집 기능을 제공합니다. 일각에서는 이 기술에 찬사를 보내는 반면, 다른 한편에서는 사진의 변화하는 환경을 강조하는 윤리적 우려를 표명하고 있습니다. Google은 AI로 편집된 이미지에 메타데이터를 표시할 계획입니다.

Google's Pixel 8 and 8 Pro offer AI-enhanced photo edits. While some laud the technology, others voice ethical concerns that underscore the changing landscape of photography. Google plans to flag AI-edited images with metadata.

연구 & 혁신 관련 소식 / Research & Innovation

Ctoc (GitHub Repo)

Ctoc은 토큰 수준에서 코드베이스를 분석할 수 있는 경량 도구를 제공합니다.

Ctoc provides a lightweight tool for analyzing codebases at the token level.

심층 분석: 분산 교육 및 효율적인 미세 조정에 대한 모든 것 / Deep dive: everything about distributed training and efficient fine-tuning (39 minute read)

멀티 GPU 및 멀티 노드 시스템에서 언어 모델을 학습할 때는 여러 가지 어려움이 있습니다. 이러한 경우 경험을 쌓는 것도 매우 어렵습니다. 이 블로그 게시물에서는 대규모로 모델을 훈련하거나 미세 조정할 때 발생하는 ZeRO, FSDP 및 문제점에 대해 매우 자세히 설명합니다. llm

There are different challenges when training language models on multi-GPU and multi-node systems. It is also very hard to gain experience in those cases. This blog post is extremely detailed and dives into ZeRO, FSDP, and gotchas when training or fine-tuning models at scale.

brainmagik: 뇌파에서 음성 해독하기 / Decoding Speech from Brain Waves (GitHub Repo)

메타 과학자들은 수술 없이도 뇌파를 연구하여 사람들이 생각하거나 듣는 단어와 문구를 이해할 수 있는 새로운 방법을 개발했습니다.

Meta Scientists have developed a new method to understand the words and phrases people think or hear by studying their brain waves - without needing any surgery.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

GET: 이벤트 카메라를 더 잘 이해하는 새로운 방법 / A New Way to Understand Event Cameras Better (17 minute read)

이벤트 카메라는 일반 카메라와 다르게 작동하는 특수 센서입니다. 이 새로운 방법인 그룹 이벤트 트랜스포머(GET)는 이벤트의 공간과 시간 세부 정보를 모두 살펴봄으로써 이러한 카메라를 더 잘 이해합니다.

Event cameras are special sensors that work differently from regular cameras. This new method, the Group Event Transformer (GET), better understands these cameras by looking at both the space and time details of events.

CoDA: 보이지 않는 3D 물체 발견 및 식별 / Spot and Identify Unseen 3D Objects (4 minute read)

3D 장면에서 새로운 물체를 감지하고 이름을 지정하는 것은 어려운 과제입니다. 이 프로젝트에서는 이러한 새로운 물체의 위치를 스마트하게 추측한 다음 이미지 및 텍스트와 특징을 일치시켜 하나의 시스템에서 이를 수행하는 새로운 방법인 CoDA를 소개합니다.

Detecting and naming new objects in 3D scenes is a tough challenge. This project introduces CoDA, a new method that does this by smartly guessing where these new objects might be and then matching their features with images and texts, all in one system.

해석 가능성을 향한 Anthropic의 다음 단계 / Anthropic’s next step towards interpretability (45 minute read)

기계적 해석 가능성은 신경망을 더 해석 가능한 하위 조각으로 분해하여 이해하는 과정입니다. 안타깝게도 뉴런은 그 자체로는 해석할 수 없는 경향이 있습니다. 중첩과 같은 여러 가지 깊은 이유가 이 문제를 야기합니다. 앤트로픽의 이 작업은 스파스 자동 인코더를 사용해 단층 트랜스포머에서 해석 가능한 표현을 생성합니다. 이는 이러한 모델에 대한 이해에 있어 큰 진전입니다. anthropic

Mechanistic interpretability is the process of understanding neural networks by breaking them down into more interpretable sub-pieces. Unfortunately, neurons tend to not be interpretable by themselves. There are a number of deep reasons like superposition that lead to this challenge. This work by Anthropic uses a sparse autoencoder to generate interpretable representations from a one-layer Transformer. It’s a big step forward in our understanding of these models.

그 외 소식 / Miscellaneous

생각을 위한 멈춤: AI 일시 정지 토론 / Pause For Thought: The AI Pause Debate (17 minute read)

AI 연구를 단순히 6개월 동안 일시 중지하거나, 적절한 시기에 일시 중지했다가 다시 시작하거나, 좋은 규제를 개발할 수 있을 만큼 충분히 오래 일시 중지하거나, 정렬 문제가 완전히 해결될 때까지 일시 중지하거나, 아예 일시 중지하지 않는다면 어떤 일이 일어날지 고려하면서 AI 일시 중지 논쟁을 들여다봅니다.

A peek into the AI pause debate, considering what might happen if AI research was simply paused for 6 months, paused and restarted at exactly the right time, paused long enough to develop good regulation, paused until alignment was completely solved, or not paused at all.

AI가 컴퓨터 과학의 가장 큰 미스터리를 풀 수 있을까요? / Can AI Solve Computer Science’s Greatest Mystery? (6 minute read)

컴퓨터 과학자들은 GPT-4를 사용하여 오랫동안 지속되어 온 P=NP 문제를 해결하고 있습니다. Microsoft와 북경대학교의 연구원 그룹은 소크라테스적 방법을 사용하여 GPT-4에 일련의 프롬프트를 통해 문제를 탐색하도록 유도한 결과, P가 NP와 같지 않다는 인사이트를 얻었습니다. 이 방법은 프롬프트 엔지니어링의 새로운 지평을 열었으며, 잠재적으로 AI와의 채팅 세션을 더욱 집중적이고 심도 있게 만들 수 있습니다. gpt4

Computer scientists are using GPT-4 to address the longstanding P=NP problem. A group of researchers from Microsoft and Peking University guided GPT-4 using a Socratic Method with a series of prompts to explore the problem, leading to insights that suggested P does not equal NP. The method represents a new frontier in prompt engineering, potentially making chat sessions with AI more focused and in-depth.

6,800만 달러 규모의 AI 해커 집에서의 생활 / Life in a $68M AI hacker house (11 minute read)

다른 AI 창업자 및 빌더들과 함께 6,800만 달러 규모의 AI 해커 하우스에서 생활하는 것이 어떤 것인지 자세히 알아보세요.

Get an inside look at what it’s like to live in a $68M AI hacker house with other AI founders and builders.

더 읽어보기 / Quick Links

Artifact, 게시물에 어울리는 이미지를 AI로 생성 / Artifact will let you generate images with AI for your posts (2 minute read)

원래 Instagram의 공동 창립자들이 만든 AI 기반 뉴스 앱인 Artifact는 이제 사용자가 AI를 활용하여 게시물의 커버 아트를 만들 수 있습니다. 사용자가 입력한 키워드를 기반으로 '눈길을 사로잡는 이미지'를 생성하는 이 기능의 향상으로 Artifact는 Instagram 및 X와 같은 플랫폼의 경쟁자로서의 입지를 더욱 공고히 할 수 있게 되었습니다.

Artifact, originally an AI-driven news app by Instagram’s co-founders, now lets users leverage AI to create cover art for their posts. This enhancement, which crafts “eye-catching images” based on user-inputted keywords, further positions Artifact as a competitor to platforms like Instagram and X.

색상 AI / Colors AI (Product)

고객 피드백을 자동으로 수집, 통합, 분류하여 제품 개발의 우선순위를 정할 수 있습니다.

Automatically gather, consolidate, and categorize customer feedback to prioritize product development.

Open X-Embodiment, 딥마인드가 만든 로봇 동작 분야의 이미지넷 같은 데이터셋 / Google DeepMind Creates An ImageNet Of Robot Actions (1 minute read)

구글의 딥마인드는 33개 연구 기관과 협력하여 컴퓨터 비전의 이미지넷 데이터베이스와 유사한 데이터베이스인 오픈 X-임베디먼트를 개발했습니다. 22개 로봇 유형에서 500개 이상의 기술과 150,000개 이상의 작업으로 구성된 이 데이터베이스는 보다 다양한 로봇 모델을 훈련할 수 있는 공유 리소스를 제공하여 로봇 연구를 촉진하는 것을 목표로 합니다. deepmind google

Google’s DeepMind, in collaboration with 33 research institutes, has developed Open X-Embodiment, a database akin to the ImageNet database in computer vision. This database, comprising over 500 skills and 150,000 tasks from 22 robot types, aims to propel robotics research by offering a shared resource to train more versatile robot models.