[TLDR] 오늘의 AI 뉴스, 2023-09-13: 구글의 2천만달러 규모의 responsible AI 펀드 💸, 학술 문서용 OCR 📃, 대규모 RAG 🌎

9bow · 9월 14, 2023, 3:00오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터 의 승인을 받아 AI 소식을 DeepL로 번역 하여 전합니다.
더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

구글, responsible AI 기금에 2천만 달러 기부 / Google pledges $20 million for responsible AI fund (2 minute read)

Google의 자선 부서는 책임감 있는 AI 개발을 촉진하기 위해 디지털 퓨처스 프로젝트에 2천만 달러를 투자하고 있습니다. 이 이니셔티브는 AI의 사회적 영향에 대응하는 글로벌 기관을 지원합니다. Google.org는 AI의 잠재적 혜택을 위해 협업을 강조합니다.

Google's philanthropic arm is investing $20 million in the Digital Futures Project to promote responsible AI development. The initiative supports global institutions, addressing AI's societal impacts. Google.org stresses collaboration for AI's potential benefit.

백악관 AI 안전 협정에 동참하는 기업이 늘고 있습니다 / More companies commit to the White House AI safety accord (2 minute read)

Adobe, IBM, Nvidia 등이 백악관의 AI 안전 및 신뢰성에 관한 협약에 참여하기로 약속했으며, 이는 앞서 메타 및 구글의 서약을 반영한 것입니다. 이러한 자발적 협약은 출시 전 테스트와 위험 분담을 강조합니다. AI 규제는 바이든 행정부 하에서 여전히 혁신에 뒤처져 있습니다.

Adobe, IBM, Nvidia, and others have committed to the White House’s accord on AI safety and trustworthiness, echoing earlier pledges by Meta and Google. These voluntary agreements stress pre-release testing and risk-sharing. AI regulation remains a step behind innovation under the Biden administration.

Nougat: 학술 문서용 OCR / OCR for academic documents (7 minute read)

광학 문자 인식(OCR)은 이미지에서 텍스트를 추출하는 프로세스입니다. 전문 용어가 많거나 수학 같은 특수 문자가 많은 문서에서는 실패할 수 있습니다. Facebook 연구팀의 이 연구는 학문적 영역에서 강력한 성능을 보여주며 많은 오래된 텍스트를 디지털화할 수 있게 해줍니다.

Optical character recognition (OCR) is the process of extracting text from images. It can fail on documents with lots of jargon or special characters like mathematics. This work from Facebook research showcases strong performance across academic domains, enabling the digitization of many old texts.

(더 읽어보기 [2023/08/28 ~ 09/03] 이번 주의 주요 ML 논문 (Top ML Papers of the Week))

연구 & 혁신 관련 소식 / Research & Innovation

행성 규모의 RAG / Retrieval Augmented Generation at Planet Scale (8 minute read)

Arcus는 계층적 검색기를 사용해 RAG를 행성 규모로 확장합니다. 시맨틱 콘텐츠에 따라 문서를 그룹으로 클러스터링한 후, 이러한 그룹을 점진적으로 필터링하여 검색 공간을 좁힐 수 있습니다. 그 결과, 행성 규모의 데이터 코퍼스에서 훨씬 더 관련성이 높은 컨텍스트가 검색되고 환각이 줄어들며 신뢰성이 향상됩니다. rag

Arcus scales RAG to planet scale by using a hierarchical retriever. After clustering documents into groups based on their semantic content, you can progressively filter over these groups to narrow the search space. This results in much more relevant context being retrieved, fewer hallucinations, and better reliability over planet-scale corpuses of data.

위스퍼 터보 / Whisper Turbo (GitHub Repo)

Rust로 작성된 20배 빠른 트랜스 크립 션을 제공하는 OpenAI Whisper API의 대체제입니다. whisper whisper-cpp

A drop-in replacement for the OpenAI Whisper API that offers 20x faster transcription written in Rust.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

후각을 얻는 AI / AI gets a sense of smell (29 minute read)

연구원들은 수천 개의 수작업 라벨이 부착된 분자로 구성된 맞춤형 데이터 세트에 그래프 신경망을 훈련시켜 냄새를 정확하게 인식하는 모델을 훈련시킬 수 있었습니다.

Researchers were able to train a model to accurately recognize smells by training a graph neural network on a custom data set of thousands of hand-labeled molecules.

글쓰기의 고유성을 잃고 있나요? AI 사용이 콘텐츠 다양성에 미치는 영향 / Is Your Writing Losing Its Uniqueness? The Impact of Using AI on Content Diversity (18 minute read)

이 연구는 InstructGPT와 같은 고급 AI 글쓰기 도우미가 사람들의 글쓰기를 너무 비슷하게 만들어 공개 대화를 덜 다양하게 만드는 것은 아닌지 살펴봤습니다.

This study looked at whether advanced AI writing assistants like InstructGPT were making people's writing sound too similar and potentially making public conversations less diverse.

Scenimefy: 실제 사진을 멋진 애니메이션 장면으로 바꾸는 방법 / Turning Real-World Pictures into Stunning Anime Scenes (3 minute read)

일상적인 사진을 애니메이션 스타일의 디테일한 이미지로 변환할 수 있는 새로운 도구인 Scenimefy를 소개합니다.

Researchers have introduced Scenimefy, a new tool to transform everyday photos into detailed anime-style images.

그 외 소식 / Miscellaneous

가상 세계를 구축하는 데 도움이 되는 Roblox의 새로운 AI 챗봇 / Roblox’s new AI chatbot will help you build virtual worlds (3 minute read)

유니티는 2023 RDC에서 크리에이터가 가상 경험을 디자인하는 데 도움이 되는 AI 어시스턴트를 소개했습니다. 올해 말 또는 내년 초에 출시될 예정인 이 툴은 고급 게임플레이와 3D 모델 생성을 지원합니다.

Roblox introduced an AI assistant to help creators design virtual experiences at the 2023 RDC. Set for release late this year or early next, the tool will enable advanced gameplay and 3D model generation.

AI 규제가 어려운 이유 / Why Regulating AI Is So Difficult (17 minute read)

잠재적인 피해를 제한하면서 AI 개발을 장려하고자 하는 열망과 혁신의 빠른 속도가 맞물려 AI를 규제하는 것은 특히 까다로운 상황입니다.

The desire to encourage AI development while limiting potential harm, combined with the breakneck pace of innovation, makes regulating AI an especially tricky situation.

a16z의 새 글: AI가 온라인 소비자 서비스를 위한 대규모 시장을 여는 이유 / Let’s Get Personal: Why AI Will Unlock a Massive Market for Online Consumer Services (4 minute read)

쇼핑을 넘어선 개인 맞춤형 서비스라는 아이디어는 정말 흥미롭습니다. 이 포스팅에서는 이 분야의 기회와 이미 이 분야에서 활약하고 있는 몇몇 기업에 대해 자세히 살펴봅니다. a16z

The idea of personalized services beyond what we have around shopping is really interesting. This post breaks down the opportunities and some of the companies already playing in this space.

더 읽어보기 / Quick Links

실행 중인 AI 모델의 속도를 테스트하는 새로운 벤치마크 / New Benchmark Tests Speed Of Running AI Models (1 minute read)

MLCommons는 최고급 하드웨어가 AI 모델을 얼마나 빠르게 실행할 수 있는지에 대한 새로운 벤치마크를 공개했으며, 엔비디아가 1위, 인텔이 2위를 차지했습니다.

MLCommons has unveiled a new benchmark for how quickly top-of-the-line hardware can run AI models, with Nvidia coming in first and Intel coming in second.

멀린 / Merlin (Product)

AI가 수행, 분석 및 요약한 정성적 연구.

Qualitative research conducted, analyzed, and summarized by AI.