[TLDR] 오늘의 AI 뉴스, 2023-06-14: OpenAI, 함수 호출 기능 출시 🚀, 미스트랄 AI, 1억 1,300만 달러 시드 모금 💰, 얀 르쿤의 비전을 기반으로 한 메타의 첫 번째 AI 모델 👀

9bow · 6월 15, 2023, 2:38오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

미스트랄 AI, 1억 1,300만 달러 시드 라운드 유치 / Mistral AI Raises $113M Seed Round (3 minute read)

딥마인드와 메타 퇴직자들이 공동 설립한 파리에 본사를 둔, 설립 4주차 스타트업인 미스트랄 AI(Mistral AI)는 주로 기업용 오픈소스 접근 방식을 목표로 대규모 언어 모델과 생성 AI를 개발 및 적용하기 위해 1억 1,300만 달러의 시드 펀딩을 모금했습니다. 라이트스피드(Lightspeed) 벤처 파트너스가 주도한 이번 투자 라운드에서 미스트랄 AI의 기업 가치는 2억 6천만 달러로 평가되었습니다. 미스트랄 AI는 공개적으로 사용 가능한 데이터를 사용하고 사용자가 직접 데이터 세트를 제공할 수 있도록 하여 투명성과 유용성에 중점을 둔 모델을 개발할 계획입니다. 2024년에 텍스트 기반 생성형 AI를 위한 첫 번째 모델을 출시하는 것이 목표입니다.

Mistral AI, a four-week-old Paris-based startup co-founded by DeepMind and Meta alumni, has raised $113 million in seed funding to develop and apply large language models and generative AI, targeting an open-source approach primarily for enterprises. The investment round, led by Lightspeed Venture Partners, valued Mistral AI at $260 million. Mistral AI plans to create models with a focus on transparency and utility using publicly available data and allowing users to contribute their own datasets. It aims to release its first models for text-based generative AI in 2024.

메타, I-JEPA 공개 / Meta Unveils I-JEPA (2 minute read)

메타 AI 연구원들은 인간처럼 학습하는 새로운 컴퓨터 비전 모델인 I-JEPA를 공개했습니다. 이 모델은 세계가 어떻게 작동하는지에 대한 내부 모델을 학습할 수 있으며(이를 통해 더 빠르게 학습할 수 있음), 복잡한 작업을 수행하는 방법을 계획하고, 낯선 상황에 쉽게 적응할 수 있습니다.

Meta AI researchers have unveiled a new computer vision model called I-JEPA, which learns more like humans do. The model is able to learn internal models of how the world works (which could help it to learn faster), plan how to accomplish complex tasks, and readily adapt to unfamiliar situations.

OpenAI, 새로운 생성 텍스트 기능 도입 및 가격 인하 / OpenAI Intros New Generative Text Features And Reduce Pricing (2 minute read)

OpenAI는 gpt-3.5-turbo 및 GPT-4의 새로운 버전과 함수 호출이라는 기능을 출시하여 개발자가 프로그래밍 함수를 gpt-3.5-turbo 및 GPT-4에 설명하고 모델이 해당 함수를 실행하는 코드를 생성하도록 할 수 있습니다. Gpt-3.5-turbo의 가격이 25% 인하되었습니다.

OpenAI has released new versions of gpt-3.5-turbo and GPT-4 and a capability called function calling that allows developers to describe programming functions to gpt-3.5-turbo and GPT-4 and have the models create code to execute those functions. The pricing for gpt-3.5-turbo has been reduced by 25%.

연구 & 혁신 관련 소식 / Research & Innovation

LLaMA-Tokenizer-JS (GitHub Repo)

브라우저(및 노드)에서 클라이언트 측에서 작동하는 LLaMA용 자바스크립트 토큰화 도구입니다.

A JavaScript tokenizer for LLaMA that works client-side in the browser (and also in Node).

detrex: 객체 탐지 트랜스포머 벤치마킹 / detrex: Benchmarking Detection Transformers (GitHub Repo)

detrex는 최첨단 Transformer 기반 객체 탐지 알고리즘을 제공하는 오픈소스 툴박스입니다. Detectron2를 기반으로 구축되었으며 모듈 설계는 MMDetection 및 DETR에서 부분적으로 차용했습니다.

detrex is an open-source toolbox that provides state-of-the-art Transformer-based detection algorithms. It is built on top of Detectron2 and its module design is partially borrowed from MMDetection and DETR.

AlgoPerf: 효율적인 학습을 위한 새로운 벤치마크 / AlgoPerf: A Novel Benchmark for Training Efficiently (GitHub Repo)

이 연구는 데이터 처리 속도를 높이고 리소스를 절약하며 정확한 모델을 생성하는 데 필수적인 도구인 딥러닝 학습 알고리즘을 비교하고 개선하는 문제를 해결합니다. 연구진은 다양한 학습 알고리즘을 공정하게 평가하고 향후 개선의 기반을 마련하기 위해 설계된 경쟁력 있는 벤치마크 시스템인 AlgoPerf를 제시합니다.

This study tackles the challenge of comparing and improving deep learning training algorithms, which are vital tools to speed up data processing, save resources, and create accurate models. The researchers present AlgoPerf, a competitive benchmark system, designed to fairly assess various training algorithms and establish a basis for future improvements.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

메타, 얀 르쿤의 원대한 비전에서 첫 번째 모델 학습 / Meta trains first model in Yann LeCun’s grand vision (7 minute read)

Meta의 수석 과학자는 현재 AI 연구 경로에 대해 회의적인 입장을 표명해 왔습니다. 그는 새로운 모델 유형이 필요하다고 (강력한 증거와 함께) 믿습니다. 이 새로운 모델 유형의 첫 번째 버전은 이미지의 일부를 조건부로 예측하여 시각적으로 세계를 강력하게 내부적으로 표현하는 방법을 학습합니다. 시각 자료와 함께 자세한 설명을 확인할 수 있습니다.

Meta's chief scientist has been vocal about his skepticism around the current path of research in AI. He believes (with strong evidence) that a new model type is required. This first version of this new model type learns a strong internal representation of the world visually by conditionally predicting pieces of images. An in depth explanation with visuals is available.

개선된 언어 모델 GAP / Improved Language Models with GAP (9 minute read)

이 연구는 태그가 지정되지 않은 무작위 텍스트에 대해 그라디언트 상승 사후 학습(GAP; Gradient Ascent Post-training)이라는 몇 단계의 기술을 사용하여 사전 학습된 언어 모델을 업데이트하면 다양한 언어 처리 작업을 일반화하고 해결하는 모델의 능력을 크게 향상시킬 수 있음을 보여줍니다. 즉, GAP를 사용하면 작업별 조정 없이도 작은 언어 모델도 훨씬 큰 언어 모델만큼 잘 수행할 수 있습니다.

This research shows that updating pre-trained language models with a few steps of a technique called Gradient Ascent Post-training (GAP) on random, untagged text can significantly improve the models’ ability to generalize and solve varied language processing tasks. In other words, by using GAP, smaller language models can perform as well as much larger ones, even without any task-specific adjustments.

그 외 소식 / Miscellaneous

Microsoft와 OpenAI, 어색한 파트너십 체결 / Microsoft And OpenAI Forge Awkward Partnership (6 minute read)

Microsoft와 OpenAI의 파트너십은 기술 업계에서 가장 핫한 팀 중 하나이지만, 그 이면에는 상당한 갈등과 혼란을 초래한 팀이기도 합니다.

Microsoft and OpenAI partnership isn’t just one of the hottest teams in tech, but also one that has produced a fair amount of conflict and confusion behind the scenes.

AI 스타트업을 위한 2K H100 GPU / 2K H100 GPUs for AI startups (2 minute read)

일부 유명 엔젤 투자자들은 스타트업이 모델을 훈련하는 데 사용할 수 있는 거대한 컴퓨팅 클러스터를 구축했습니다.

Some well known angel investors have set up a huge compute cluster for startups to use to train their models.

구글, 새로운 AI 규제안 촉구 / Google Bucks Calls For A New AI Regulator (8 minute read)

거대 기술 기업인 Google은 연방 정부에 이 문제를 전담하는 단일 규제 기관을 설립하는 대신 여러 기관에 인공 지능 도구에 대한 감독을 분할할 것을 촉구하고 있어 Microsoft 및 OpenAI와 같은 라이벌과 대조를 이룹니다.

Tech giant Google is urging the federal government to divvy up oversight of artificial intelligence tools across agencies rather than setting up a single regulator dedicated to the issue, striking a contrast with rivals like Microsoft and OpenAI.

더 읽어보기 / Quick Links

폴 매카트니, AI를 사용해 비틀즈 노래 녹음 / Paul McCartney Used AI To Make A New Beatles Song (1 minute read)

폴 매카트니는 AI를 사용하여 데모 녹음에서 존 레논의 보컬을 분리하여 새로운 비틀즈 노래를 만들었습니다.

Paul McCartney used AI to create a new Beatles song by isolating John Lennon's vocals from a demo recording.

Roop (GitHub Repo)

루프는 클릭 한 번으로 딥페이크 제작이 가능합니다.

Roop allows for the creation of deepfakes with one click.

ChatCSV (Product)

ChatCSV는 개인 AI 데이터 과학자입니다.

ChatCSV is your personal AI data scientist.

어셈블리, 초능력을 가진 마케팅 캘린더 / Assembly, A Marketing Calendar with Superpowers (Product Launch)

많은 마케팅 팀들이 콘텐츠 캘린더를 노션이나 시트와 같은 제품 안에서 만들어 사용합니다. 어셈블리는 마케터에게 필요한 기능을 바로 사용할 수 있을 뿐 아니라 몇 단계 더 나아간 제품입니다. 관리자 승인을 통합하고, AI 콘텐츠 생성을 포함하며, 모든 분석을 가져옵니다.

Many marketing teams hack together a content calendar, generally inside a product like Notion or Sheets. Assembly is exactly what marketers need right out of the box, plus takes it a few steps further. It integrates manager approvals, includes AI content generation, and pulls in all your analytics.