[TLDR] 오늘의 AI 뉴스, 2023-09-01: 메타, FACET 데이터세트 출시 💾, 바이두, 어니(Ernie) 챗봇 출시 🤖, 컴퓨터 비전 모델의 공정성 👓

9bow · 9월 2, 2023, 2:49오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

Meta, FACET 데이터세트 출시 / Meta Releases FACET Dataset (2 minute read)

Meta는 사람을 포함한 사진 및 동영상 속 사물을 분류하고 감지하는 AI 모델의 '공정성'을 평가하기 위해 설계된 새로운 AI 벤치마크인 FACET을 출시했습니다.

Meta has released a new AI benchmark, FACET, designed to evaluate the “fairness” of AI models that classify and detect things in photos and videos, including people.

바이두, 어니 챗봇 출시 / Baidu Launches Ernie Chatbot (1 minute read)

중국의 거대 기술 기업인 바이두가 중국 정부의 승인을 받아 어니 챗봇을 공식 출시했습니다.

Chinese tech giant Baidu officially launched its Ernie chatbot after the government approved its application.

Google Cloud, TPU v5 및 H100 지원 가상 머신 출시 / Google launches TPU v5 and H100 enabled VMs (6 minute read)

구글은 엔비디아를 상대할 수 있는 하드웨어를 보유한 몇 안 되는 대기업 중 하나입니다. 과거에는 파이토치 지원이 다소 열악하여 사용하기 어려웠습니다. 이제 더 빠른 칩과 더 나은 소프트웨어 지원으로 이 검색 대기업의 매력적인 제품처럼 보입니다.

Google is one of the few large organizations with hardware that can take on Nvidia. It has been challenging to use in the past with somewhat poor PyTorch support. Now, with even faster chips, and better software support - it seems like a compelling offering from the search giant.

연구 & 혁신 관련 소식 / Research & Innovation

DUIC: 스마트 디코딩 기법으로 이미지 품질 개선 / Improving Image Quality with Smart Decoding Techniques (GitHub Repo)

이 프로젝트는 특히 익숙하지 않은 데이터셋에 대해 이미지 디코딩 방식을 조정하는 방법을 개발했습니다. 이 프로세스는 디코더의 특정 설정을 업데이트하여 많은 데이터를 사용하지 않고도 고품질 이미지를 보장합니다. 또한 이 시스템은 이미지 유형에 따라 즉시 적응하여 최적의 결과를 얻을 수 있습니다.

This project developed a method that adjusts how images are decoded, especially for unfamiliar datasets. This process updates certain settings in the decoder, ensuring high-quality images without using much data. The system also adapts on the fly, depending on the type of image, for optimal results.

Sparse Feedback: 피드백은 LLM에 어떤 영향을 미칠까요? / How Feedback Affects LLMs? (GitHub Repo)

인간의 값을 이해하고 일치시키기 위해 대규모 언어 모델을 학습시킬 때 피드백으로 점수(등급)를 사용할까요, 아니면 비교(순위)를 사용할까요? 이 연구는 선택이 매우 중요하다는 것을 발견했습니다. 예를 들어, 순위로 학습된 모델은 비교를 통해 평가할 때 선호되지만 항상 점수로 평가하는 것은 아닙니다. 이는 피드백을 제공하는 방식이 실제 상황에서 모델이 얼마나 잘 작동하는지에 큰 영향을 미친다는 것을 보여줍니다.

When training large language models to understand and match human values, do we use scores (rating) or comparisons (ranking) as feedback? This study found that the choice matters a lot. For instance, models trained with rankings are preferred when evaluated by comparison but not always by scoring. This shows that how we choose to give feedback has big implications for how well the model works in real-life situations.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

Meta AI, 컴퓨터 비전 모델의 공정성 평가하기 / Evaluating the fairness of computer vision models (6 minute read)

AI를 위한 오픈소스 기여자가 되기 위한 지속적인 노력의 일환으로, Meta는 커뮤니티가 그 위에 구축할 수 있는 보다 허용적인 라이선스 하에 놀라운 DINO v2 모델을 출시했습니다. 또한 다양한 컴퓨터 비전 모델의 편향을 파악하고 벤치마킹하는 데 도움이 되는 데이터 세트도 공개했습니다.

In its continued push to be an open source contributor for AI, Meta released their amazing DINO v2 model under a more permissive license that allows the community to build on top of it. In addition, they released a data set that helps determine and benchmark bias in various computer vision models.

MMVP: 동작과 외관을 분리하여 비디오 콘텐츠 예측하기 / Predict Video Content by Separating Motion and Appearance (18 minute read)

비디오 콘텐츠를 예측하는 것은 시스템이 물체의 움직임을 추측하고 일관성을 유지해야 하기 때문에 까다로운 작업입니다. 이 새로운 시스템인 MMVP는 움직임에만 초점을 맞춘 특수한 "모션 매트릭스"를 사용하여 움직임과 모양을 별도로 처리합니다. 이 방식은 기존 최고 시스템보다 정확도가 높을 뿐만 아니라 효율적이고 컴팩트합니다.

Predicting video content is tricky because the system must guess object movements and keep them looking consistent. This new system, MMVP, separately handles motion and appearance using special "motion matrices" that focus just on movement. This method is not only more accurate but also more efficient and compact than current top systems.

TexMod: 텍스처 변조로 이미지 생성 향상 / Enhancing Image Generation with Textural Modulation (14 minute read)

몇 가지 예시만으로 다양한 이미지를 생성하는 것은 어려운 일입니다. 텍스모드는 고유한 기술을 사용하여 다양한 이미지의 디테일을 혼합하고 생성된 이미지가 논리적인 레이아웃을 갖도록 합니다.

Producing diverse images from just a few examples is challenging. TexMod uses unique techniques to blend in details from various images and makes sure the created images have a logical layout.

그 외 소식 / Miscellaneous

OpenAI 기본 모델은 사이코패틱하지 않습니다 / OpenAI base models are not sycophantic (5 minute read)

역 스케일링 문제는 모델 크기가 커질수록 악화되는 문제입니다. 시코판시는 모델이 진실과 상관없이 인간과 일치하는 경향을 보이는 것을 말합니다. Anthropic은 최근에 모델이 클수록 더 많은 동조성을 보인다는 훌륭한 연구를 발표했습니다. 그러나 최근 실험에 따르면 OpenAI 모델은 이와 같은 문제를 겪지 않는 것으로 나타났습니다.

Inverse scaling problems are things that get worse as model size increases. Sycophancy is when a model tends to agree with a human regardless of the truth. Anthropic released great work recently that showed larger models are more sycophantic. However, recent experiments show that OpenAI models don’t suffer from this same problem.

대부분의 사람들은 정말 AI의 속도를 늦추고 싶어할까요? / Do Most People Really Want To Slow Down AI? (6 minute read)

미디어에서 볼 수 있는 것과는 달리, 대다수의 사람들이 AI 개발 속도를 늦추기를 원한다는 여론조사 및 설문조사에는 여러 가지 문제가 있습니다.

Despite what you may see in the media, there are a number of issues with the polls and surveys saying that a majority of people want to slow down AI development.

더 읽어보기 / Quick Links

Swifto (Product)

Swifto는 AI를 사용하여 사이트의 A/B 테스트를 자동화합니다.

Swifto uses AI to automate your site’s A/B testing.

고속 AI 드론, 세계 챔피언 레이서들을 제압하다 / High-Speed AI Drone Beats World Champion Racers (3 minute read)

취리히 대학교와 인텔의 연구팀이 1인칭 시점(FPV) 드론 레이싱에서 인간 챔피언을 이길 수 있는 자율 드론 시스템 '스위프트'를 개발했다고 발표했습니다.

A team of researchers from the University of Zürich and Intel announced the development of an autonomous drone system named Swift that can beat human champions in first-person view (FPV) drone racing.

앵귈라에 .AI 금광이 있다 / Anguilla Has An .AI Goldmine (1 minute read)

카리브해의 작은 영국령인 앵귈라는 '.ai' 도메인 이름 덕분에 올해 최대 3천만 달러의 수익을 올릴 수 있을 것으로 예상됩니다.

Anguilla, a tiny British island territory in the Caribbean, may bring in up to $30 million in revenue this year thanks to its ".ai" domain name.