[TLDR] 오늘의 AI 뉴스, 2023-10-27: OpenAI 재난 대응 팀 🦺, 엔비디아와 레노버의 파트너십 🤝, LLM 4비트 양자화 🌐

9bow · 10월 28, 2023, 4:00오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터 의 승인을 받아 AI 소식을 DeepL로 번역 하여 전합니다.
더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

OpenAI, 재난 대비 팀을 신설 / OpenAI creates Preparedness team (5 minute read)

OpenAI는 사이버 보안 및 핵 재난과 같은 AGI의 최전선 위험에 대처하기 위해 새로운 팀을 만들었습니다. ai-security

OpenAI has created a new team to deal with frontier risks of AGI such as cyber security and nuclear disasters.

월요일 시행 예정인 AI 행정 명령 / AI Executive Order Expected Monday (2 minute read)

바이든 행정부는 연방 기관의 AI 채택에 대한 표준을 설정하고 AI 발전과 관련된 우려를 해결하기 위해 인공 지능에 대한 포괄적인 행정 명령을 발표할 준비를 하고 있습니다. ai-act ai-regulation

The Biden Administration is preparing to announce a comprehensive executive order on artificial intelligence, aiming to set standards for AI adoption in federal agencies and address concerns related to AI advancements.

레노버와 엔비디아, 하이브리드 AI 솔루션을 위한 파트너십 확대 / Lenovo and NVIDIA Expand Partnership for Hybrid AI Solutions (3 minute read)

레노버와 엔비디아는 엣지부터 클라우드까지 생성형 AI를 활용하는 통합 시스템을 제공하기 위해 파트너십을 확대하고 있습니다. 양사는 기업이 비즈니스 혁신을 위해 맞춤형 AI 애플리케이션을 신속하게 배포할 수 있도록 지원하는 것을 목표로 합니다. 레노버 AI 프로페셔널 서비스 프랙티스가 지원하는 솔루션은 하이브리드 클라우드 접근 방식과 엔비디아의 최신 하드웨어를 사용하여 생성형 AI를 구현합니다. generative

Lenovo and NVIDIA are expanding their partnership to deliver integrated systems that utilize generative AI from edge to cloud. They aim to help enterprises swiftly deploy tailored AI applications for business innovation. Their solutions, supported by Lenovo AI Professional Services Practice, will use a hybrid cloud approach and NVIDIA's latest hardware for generative AI.

연구 & 혁신 관련 소식 / Research & Innovation

GPT-4V의 광학 문자 인식(OCR) 기능 평가 / Evaluating GPT-4V's Optical Character Recognition Abilities (18 minute read)

이 논문에서는 대형 멀티모달 모델인 GPT-4V가 장면 및 필기 텍스트 읽기부터 복잡한 문서 구조 이해에 이르기까지 다양한 광학 문자 인식(OCR) 작업에서 얼마나 우수한 성능을 발휘하는지 살펴봅니다. easy-ocr gpt-4v multimodal

The paper examines how well GPT-4V, a Large Multimodal Model, performs in various Optical Character Recognition (OCR) tasks, from reading scenes and handwritten texts to understanding complex document structures.

CommonCanvas: 오픈 데이터로 학습된 이미지 디퓨젼 모델 / Image diffusion model trained on open data (24 minute read)

안정적 확산과 같이 널리 사용되는 확산 모델은 다양한 데이터로 학습되었지만, 이러한 데이터의 대부분은 다양한 저작권 제한의 적용을 받습니다. MosaicML의 이 새로운 모델은 크리에이티브 커먼즈 데이터로만 학습되며, 학습 데이터의 일부만 필요로 하면서도 SD 2와 유사한 성능을 달성합니다.

Most popular diffusion models like Stable Diffusion have been trained on a wide variety of data, but much of that data is subject to different copyright restrictions. This new model from MosaicML is trained purely on Creative Commons data and achieves similar performance to SD 2 while requiring only a fraction of the training data.

lLM의 4비트 양자화 / 4-bit Quantization in LLMs (17 minute read)

이 연구에서는 학습 후 가중치와 액션을 4비트 부동소수점 값으로 변환하여 대규모 언어 모델을 압축하는 새로운 방법인 LLM-FP4를 소개합니다. quantized-llm

This research introduces LLM-FP4, a novel method to compress large language models by converting their weights and actions to 4-bit floating-point values after training.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

ToRA (GitHub Repo)

ToRA는 도구와 상호 작용하여 까다로운 수학적 추론 문제를 해결하도록 설계된 일련의 도구 통합 추론 에이전트입니다.

ToRA is a series of Tool-integrated Reasoning Agents designed to solve challenging mathematical reasoning problems by interacting with tools.

UWDepth: 수중 차량의 실시간 수심 추정 / Real-Time Depth Estimation for Underwater Vehicles (GitHub Repo)

이 연구는 희박한 수심 세부 정보를 결합하여 단일 이미지에서 실시간 수심을 추정할 수 있는 수중 차량용 맞춤형 모델을 제시합니다.

This study presents a model tailored for underwater vehicles that can estimate real-time depth from single images by combining sparse depth details.

GPT 파일럿 / GPT Pilot (GitHub Repo)

GPT 파일럿은 확장 가능한 앱을 처음부터 작성하는 AI 기반 개발 도구입니다. 개발자는 만들고자 하는 앱의 종류에 대한 설명을 사용하여 앱을 만들 수 있습니다. GPT 파일럿은 단계별로 앱을 만들면서 막히는 부분이 있으면 질문을 통해 설명이나 도움을 요청할 수 있습니다. 비디오 데모는 리포지토리에서 확인할 수 있습니다.

GPT Pilot is an AI-powered dev tool that writes scalable apps from scratch. It enables developers to create apps using a description of the kind of app they want to build. GPT Pilot creates apps step-by-step while asking questions for clarification or help when it gets stuck. A video demo is available in the repository.

그 외 소식 / Miscellaneous

AI 시대의 데이터와 소프트웨어 엔지니어링의 융합 / The convergence of data and software engineering in the age of AI (2 minute read)

AI가 작동하려면 데이터가 필요하며, 이는 데이터 기능이 내부 비즈니스 인텔리전스에서 벗어나 제품 구축 프로세스의 핵심에 가까워지고 있음을 의미합니다.

AI needs data to work, meaning data functions are getting closer to the core of the product-building process and moving away from internal business intelligence.

RGM: 이미지의 픽셀 매칭을 위한 다목적 접근 방식 / A Versatile Approach for Pixel Matching in Images (GitHub Repo)

이 연구에서는 이미지의 희소 및 고밀도 픽셀 매칭을 위해 설계된 딥 러닝 모델인 RGM(Robust Generalist Matching)을 소개합니다. 이 방법은 고유한 캐스케이드 GRU 모듈과 새로운 대규모 데이터 세트를 활용합니다.

The study introduces RGM (Robust Generalist Matching), a deep learning model designed for both sparse and dense pixel matching in images. The method leverages a unique cascaded GRU module and a new large-scale dataset.

Google을 차단하겠다고 위협하는 Reddit / Reddit threatens to block Google (3 minute read)

Reddit은 데이터 비용을 지불하기 위해 AI 회사와 계약을 체결하지 못하면 검색 크롤러를 차단할 수 있습니다. 535개 이상의 뉴스 조직이 AI 회사가 콘텐츠를 스크랩하지 못하도록 차단하기로 결정했습니다. X의 새로운 읽기 제한은 AI 스타트업의 데이터 스크래핑 때문입니다. ai-copyright

Reddit may block search crawlers if it can't reach deals with generative AI companies to pay for its data. More than 535 news organizations have opted to block their content from being scraped by AI companies. X's new reading limits are due to data scraping by AI startups.

더 읽어보기 / Quick Links

ResolveAI (Product)

서포트 티켓의 50%를 즉시 해결하세요.

Resolve 50% of your support tickets instantly.

구글, 버그 바운티 프로그램을 확대하여 생성적 AI 공격을 표적으로 삼다 / Google Expands Its Bug Bounty Program To Target Generative AI Attacks (1 minute read)

Google은 취약점 보상 프로그램(VRP)을 확장하여 AI 관련 보안 문제를 포함하도록 했습니다.

Google has expanded its Vulnerability Rewards Program (VRP) to include AI-specific security concerns.

허깅페이스 데이터셋을 통한 분산 지도 / Distributed map over HuggingFace dataset (Github Gist)

데이터 세트에 매핑하는 것은 모든 데이터 포인트에 트랜스폼을 적용하는 표준 방법입니다. 하지만 여러 GPU에 걸쳐 매핑하는 것은 어렵습니다. 이 요점은 바로 그 작업을 위한 멋진 함수를 제공합니다.

Mapping over a dataset is the standard way to apply a transform to every data point. Doing so across multiple GPUs is hard. This gist provides nice functions for doing just that.