[TLDR] Today's AI News, 2023-11-08: OpenAI Keynote 🎤, YouTube Generative AI Features 📹, Instant 3D Search 🔍

9bow · November 10, 2023, 4:12 AM

With the approval of the TLDR newsletter, the PyTorch Korea User Group shares its AI news, translated with DeepL.
Want to share more AI news and information and grow together? Visit the PyTorch Korean community now!

Headlines & Launches

The OpenAI Keynote (14 minute read)

OpenAI CEO Sam Altman showcased new features and improvements to the company's AI models at its first developer conference, emphasizing a future in which AI integration plays a central role in consumer technology. The event highlighted OpenAI's shift toward a product-centric approach, with new tools made available immediately, and hinted at potential forays into hardware to reduce friction in AI accessibility.

ChipNeMo: Nvidia Is Piloting A Generative AI For Its Engineers (3 minute read)

In a keynote at the IEEE/ACM International Conference on Computer-Aided Design, Nvidia revealed that it is testing a large language model named ChipNeMo to improve chip designers' productivity. Although not yet fully proven, ChipNeMo helps with scripting for design tools, summarizing bug reports, and serving as an experienced chatbot for designers.

YouTube To Test Generative AI Features (2 minute read)

YouTube is testing new generative AI features as part of its premium subscription service. They include a conversational tool for asking questions about content and getting recommendations, and another tool that summarizes comment topics.

Research & Innovation

SCT: Efficient Learning for Sentence Embedding (21 minute read)

Self-Supervised Cross-View Training (SCT) improves small language models, enabling them to produce sentence embeddings that were previously achievable only with larger models, optimizing both performance and computational efficiency.
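
As a rough illustration of what sentence embeddings are used for, the sketch below compares toy embedding vectors with cosine similarity. The vectors and the `cosine_similarity` helper are made up for this example; they are not produced by SCT and are not taken from its code.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical 4-dimensional embeddings; real models use hundreds of dimensions.
emb_cat = [0.9, 0.1, 0.0, 0.2]
emb_kitten = [0.8, 0.2, 0.1, 0.3]
emb_invoice = [0.0, 0.9, 0.8, 0.1]

# Sentences with related meanings should score higher than unrelated ones.
sim_related = cosine_similarity(emb_cat, emb_kitten)
sim_unrelated = cosine_similarity(emb_cat, emb_invoice)
print(f"related: {sim_related:.3f}, unrelated: {sim_unrelated:.3f}")
```

A smaller model that yields embeddings with this kind of separation is cheaper to run for search and clustering, which is the efficiency argument the summary makes.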

PixArt: Transformer-based diffusion model is 90% cheaper to train than UNet-based models (5 minute read)

PixArt is a new text-to-image model that uses T5 text encodings, cross attention, and a diffusion transformer to achieve great results at a fraction of the compute cost of comparable models.

DARE: Enhancing Language Models Without Retraining (GitHub Repo)

The DARE method streamlines the enhancement of language models like BERT, allowing new functions to be integrated into a single unified model and increasing efficiency on various linguistic tasks.
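
The summary does not describe the mechanism. A minimal sketch, assuming the drop-and-rescale formulation usually associated with DARE: take the delta between fine-tuned and base weights, randomly zero out a fraction `drop_rate` of it, and rescale the survivors by `1 / (1 - drop_rate)` so the expected delta is unchanged. The function name `dare_sparsify`, the flat weight lists, and the toy values are all hypothetical and are not taken from the repository.

```python
import random


def dare_sparsify(base, finetuned, drop_rate, rng):
    """Randomly drop delta parameters and rescale the ones that remain."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    scale = 1.0 / (1.0 - drop_rate)
    result = []
    for b, f in zip(base, finetuned):
        delta = f - b
        keep = rng.random() >= drop_rate  # kept with probability 1 - drop_rate
        result.append(b + (delta * scale if keep else 0.0))
    return result


# Toy weights: every fine-tuned parameter moved by 0.01 from the base.
rng = random.Random(0)
base = [0.0] * 10000
finetuned = [0.01] * 10000

sparse = dare_sparsify(base, finetuned, drop_rate=0.9, rng=rng)
kept = sum(1 for w in sparse if w != 0.0)
mean_delta = sum(sparse) / len(sparse)
print(f"kept {kept} of {len(sparse)} deltas, mean delta {mean_delta:.4f}")
```

Because most of the delta is zeroed, sparsified deltas from several fine-tuned models can be added to one base model with less interference, which is how new abilities could be combined without retraining.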

Engineering & Resources

OVIR-3D: Instant 3D Search (GitHub Repo)

OVIR-3D retrieves 3D objects from text prompts using 2D image fusion, bypassing the need for training on 3D data and enabling instant, real-time search well suited to robotics.

CogVLM-17B open vision language model (3 minute read)

With 10B vision parameters and 7B language parameters, this multimodal model excels on many standard benchmarks and performs well in human evaluation.

Langroid multi-agent programming framework (GitHub Repo)

Inspired by the Actor framework, this lightweight Python library makes it easy to create LLM-powered agents.

Miscellaneous

Are Language Models Good At Making Predictions? (5 minute read)

In a study using 5,000 questions from Manifold Markets, GPT-4 was found to be consistently overconfident in its predictions.
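
Overconfidence here means that stated probabilities are more extreme than the outcomes justify. A minimal sketch of one way to check for it, using made-up forecasts rather than the study's data: compare the average confidence in the predicted side with how often that side was right, alongside the Brier score. The helper names and the sample values are hypothetical.

```python
def brier_score(probs, outcomes):
    """Mean squared error between forecast probabilities and 0/1 outcomes."""
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)


def confidence_gap(probs, outcomes):
    """Mean confidence in the predicted side minus that side's accuracy.

    A positive value indicates overconfidence.
    """
    confidences = [max(p, 1.0 - p) for p in probs]
    hits = [int((p >= 0.5) == bool(o)) for p, o in zip(probs, outcomes)]
    return sum(confidences) / len(probs) - sum(hits) / len(probs)


# Hypothetical forecasts (probability of resolving YES) and actual outcomes.
probs = [0.95, 0.90, 0.95, 0.10, 0.05, 0.90]
outcomes = [1, 0, 1, 0, 1, 1]

gap = confidence_gap(probs, outcomes)
score = brier_score(probs, outcomes)
print(f"confidence gap: {gap:+.3f}, Brier score: {score:.3f}")
```

A forecaster who says 90% and is right about 90% of the time would show a gap near zero; a consistently positive gap across many questions is the pattern the summary describes.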

OpenAI Is A Lot More Vulnerable Than You Think (4 minute read)

On the heels of OpenAI releasing functionality that decimated half of the GPT-wrapper companies, here is a contrarian view of OpenAI's blind spots and weaknesses as this market continues to evolve.

Almost An Agent: What GPTs Can Do (8 minute read)

OpenAI's latest GPTs edge closer to the vision of autonomous AI agents. They can carry out tasks such as drafting academic papers with minimal human input, despite current constraints like occasional inaccuracies. This leap in AI autonomy promises significant advances, but it also underscores the urgency of vigilant development to mitigate potential vulnerabilities as AI begins to take on more complex and independent roles.

Quick Links

Automatic sports narration (Jupyter Notebook)

This notebook uses many new technologies from OpenAI to convincingly narrate a sporting event.

Luminance: AI Negotiates Contract For The First Time Ever (3 minute read)

British AI firm Luminance has developed an AI system that can autonomously negotiate contracts, aiming to reduce lawyers' workload by handling routine negotiations.

Orba (Product)

Convert your web visitors with personalized AI conversations.