[TLDR] 오늘의 AI 뉴스, 2023-07-11: 구글의 대규모 AI 셔플 들여다보기 👀, 코미디언: OpenAI와 Meta 고소 🧑‍⚖️, 첫 번째 원칙에서 바라본 AGI🥇

9bow · 7월 12, 2023, 12:02오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

사라 실버먼, OpenAI와 메타 고소 / Sarah Silverman Sues OpenAI And Meta (2 minute read)

코미디언 사라 실버맨은 금요일에 OpenAI와 Meta를 상대로 소송을 제기한 세 명의 작가 중 한 명으로, 두 회사의 인공지능 모델이 저작권이 있는 자료에 대해 불법적으로 학습했다고 주장했습니다.

Comedian Sarah Silverman is among a trio of writers who filed a lawsuit against OpenAI and Meta on Friday, claiming the companies’ artificial intelligence models were illegally trained on copyrighted material.

CodeGen2.5: Salesforce, 작지만 강력한 코드 모델 출시 / Salesforce releases small by mighty code models (8 minute read)

Salesforce는 오픈소스 코드 모델을 공개한 최초의 그룹 중 하나입니다. 그 이후로 동등한 크기의 모든 모델(7억 개의 매개변수)을 능가하고 심지어 2배 더 큰 모델을 능가하는 등 최신 기술에 발맞춰 왔습니다.

Salesforce was one of the first groups to release open source code models. Since then, they have kept pace with the state of the art, surpassing all equivalently sized models (7B params) and even beating out models 2x as large.

구글의 빅 AI 셔플 들여다보기 / Inside Google’s Big AI Shuffle (30 minute read)

구글 딥마인드 CEO 데미스 하사비스와 구글의 AI 노력에 대한 인터뷰.

An interview with Google DeepMind CEO Demis Hassabis on Google’s AI efforts.

연구 & 혁신 관련 소식 / Research & Innovation

T-MARS: 더 나은 시각적 표현 학습하기 / Learning Better Visual Representations (3 minute read)

이 프로젝트는 더 나은 시각적 표현을 학습하고 비전 작업에서 최첨단 제로샷 정확도를 달성하기 위해 CLIP 학습에 사용되는 웹 데이터셋을 필터링하는 알고리즘을 제안합니다.

This project proposes an algorithm to filter web datasets used for training CLIP in order to learn better visual representations and achieve state-of-art zero-shot accuracy on vision tasks.

(광고) 어디서나 시간을 되찾으세요 / Get back your time everywhere you work (Sponsor)

Typedesk의 AI 지원 타이핑은 기존 자동 완성 기능보다 한 단계 개선된 기능입니다. 반복적인 타이핑은 이제 그만. 복사 및 붙여넣기는 잊으세요. 메시지 전반의 일관성을 개선하세요. 모든 앱과 웹사이트에서 작동하는 텍스트 바로가기를 만드세요! 무료로 시작하세요.

Typedesk's AI assisted typing is a step function improvement over traditional autocomplete. Ditch repetitive typing. Forget copy & paste. Improve consistency across your messages. Create text shortcuts that work on all your apps and websites! Get started for free.

Danswer (GitHub Repo)

Danswer를 사용하면 내부 문서에 대해 자연어 질문을 하고 소스 자료의 인용문과 참조가 뒷받침된 신뢰할 수 있는 답변을 얻을 수 있습니다.

Danswer allows you to ask natural language questions against internal documents and get back reliable answers backed by quotes and references from the source material.

Alibaba Chat2DB (GitHub Repo)

알리바바는 OpenAI의 ChatGPT 기능을 통합한 데이터베이스용 다목적 범용 SQL 클라이언트 및 보고 도구인 Chat2DB를 출시했습니다. 이 도구는 데이터베이스와 상호 작용하는 방식을 혁신하여 데이터베이스에 더 쉽게 접근하고 사용자 친화적으로 만드는 것을 목표로 합니다.

Alibaba has launched Chat2DB, a versatile general-purpose SQL client and reporting tool for databases that integrates OpenAI's ChatGPT capabilities. This tool aims to revolutionize the way databases are interacted with, making it more accessible and user-friendly.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

첫 번째 원칙에서 바라본 AGI / AGI from first principles (31 minute read)

최근 OpenAI가 리소스의 상당 부분을 "슈퍼 얼라인먼트"에 할애할 것이라고 발표함에 따라, 잠시 시간을 내어 얼라인먼트의 필요성과 AGI가 실제로 위험을 초래할 수 있는 이유에 관한 얼라인먼트 커뮤니티의 중요한 게시물을 다시 살펴보는 것이 좋습니다.

With the recent announcement that OpenAI will be dedicating a significant portion of their resources to “superalignment”, it is worthwhile for us to take a moment and revisit an important post from the alignment community around the need for alignment and why AGI might truly pose a risk.

개인 디바이스에서 LLM 실행하기 / Running LLMs on Personal Devices (11 minute read)

이 논문에서는 다양한 수준의 수치 정밀도에서 대규모 언어 모델과 비전 트랜스포머가 얼마나 잘 작동하는지 테스트하는 오픈 소스 시뮬레이터인 INT-FP-QSim을 소개합니다.

This paper introduces INT-FP-QSim, an open-source simulator that tests how well large language models and vision transformers can work at different levels of numerical precision.

사진 복원을 더 빠르고 효율적으로 만들기 / Making Picture Restoration Better and Faster (22 minute read)

이 연구에서는 압축 감지(CS; Compressive Sensing)에서 이미지 재구성을 최적화하는 혁신적인 모델인 동적-경로 제어-가능 딥-언폴딩 네트워크(DPC-DUN; Dynamic Path-Controllable Deep Unfolding Network)를 소개하여 성능과 복잡성의 균형을 맞춰 효율성과 결과를 향상시킵니다.

This research introduces the Dynamic Path-Controllable Deep Unfolding Network (DPC-DUN), an innovative model that optimizes image reconstruction in Compressive Sensing (CS), balancing performance and complexity for enhanced efficiency and results.

그 외 소식 / Miscellaneous

데이비슨 이륙 속도 / Davidson On Takeoff Speeds (10 minute read)

이 글에서는 인공 일반 지능(AGI)이 "이륙"하거나 인간보다 훨씬 더 지능적이 될 수 있는 다양한 가능한 속도에 대해 논의하며, AGI 이륙을 촉진할 수 있는 두 가지 주요 요인인 컴퓨팅 리소스의 가용성과 더 나은 알고리즘의 개발에 초점을 맞춥니다.

This article discusses the different possible speeds at which artificial general intelligence (AGI) could "take off," or become significantly more intelligent than humans, focusing on the two main factors that could drive AGI takeoff: the availability of compute resources and the development of better algorithms.

우리는 AI 규제로 매우 어두운 길을 가고 있다 / We’re Going Down A Very Dark Path With AI Regulation (4 minute read)

Techdirt는 EU와 미국이 혁신을 억제하고 대기업을 고착화할 수 있는 위험한 AI 규제로 나아가고 있다고 주장합니다. 저자는 지속적인 혁신을 허용하면서 투명성과 책임에 초점을 맞춘 더 나은 접근 방식을 제안합니다.

Techdirt argues that the EU and US are moving towards dangerous AI regulation that will stifle innovation and lock in the big players. The author proposes a better approach that would focus on transparency and accountability while allowing for continued innovation.

과학 분야의 상징과 구조 회귀 / Symbolic and structure regression for science (25 minute read)

과학 연구의 대부분은 회귀를 기반으로 합니다. 그러나 회귀는 연구 중인 문제의 근본적인 구조를 밝혀내지 못합니다. 구조적 회귀와 상징적 회귀를 활용하면 이전에는 이해하지 못했던 문제의 일부를 밝혀낼 수 있습니다.

Much of scientific research rests on the back of regression. However, regression doesn’t uncover the underlying structure of the problems being studied. If you reach for structured and symbolic regression, it can illuminate parts of a problem previously not understood.

더 읽어보기 / Quick Links

라즈베리 파이에서 LLaMA-65B 실행? / Llama 65B on Raspberry Pis? (GitHub Issue)

GGML은 리소스가 적은 기기에서 언어 모델을 실행하기 위한 오픈 소스 라이브러리입니다. 최근 MPI와 추론을 병렬화하기 위한 작업이 진행되었습니다. 이러한 변경 사항을 적용하여 이제 팀은 라즈베리 파이 클러스터에서 라마 65B를 추론하기 위해 작업하고 있습니다.

GGML is an open source library for running language models on low resource devices. Work has been done recently to parallelize inference with MPI. With these changes in effect the team is now working to inference llama 65B on a cluster of Raspberry Pis.

영어가 모국어가 아닌 사용자를 차별하는 AI를 감지하는 프로그램 / Programs To Detect AI Discriminate Against Non-Native English Speakers (2 minute read)

새로운 연구에 따르면 AI 차별을 감지하도록 설계된 프로그램이 영어가 모국어가 아닌 사용자를 불균형적으로 표적으로 삼는다는 사실이 밝혀졌습니다. 이 연구는 이러한 시스템이 의도치 않게 편견을 고착화하고 언어 능력에 따라 특정 개인에게 불이익을 줄 수 있으며, AI 기술의 공정성과 포용성에 대한 우려를 불러일으킨다고 지적합니다.

A new study reveals that programs designed to detect AI discrimination disproportionately target non-native English speakers. The research indicates that these systems can inadvertently perpetuate bias and disadvantage certain individuals based on language proficiency, raising concerns about the fairness and inclusivity of AI technology.

Coframe, 생성형 A/B 테스트로 웹사이트 최적화 / Coframe optimizes websites with generative A/B testing (Product Launch)

오늘날의 사용자 인터페이스는 대부분 정적이고 지능적이지 않습니다. 5년 후에는 어떤 모습일까요? 어제 출시된, 사용자 반응에 따라 콘텐츠를 생성하고 A/B 테스트를 거쳐 반복하는 제품인 Coframe을 소개합니다. 현재는 카피만 지원하지만, Coframe은 사용자 인터페이스와 흐름 전반에 대한 폭넓은 비전을 가지고 있어 사용자 인터페이스에 지능을 부여하고 스스로 개선할 수 있는 기능을 제공합니다.

User interfaces today are largely static and unintelligent. What will they look like 5 years from now? Enter Coframe, a product launched yesterday that generates content based on user reception, A/B tests it, and repeats. While it only supports copy today, Coframe has a broader vision for user interfaces and flows at large, giving them their own sense of intelligence and ability to self-improve.