[TLDR] 오늘의 AI 뉴스, 2023-09-06: 허깅페이스, 학습용 클러스터를 서비스 형태로 제공 💻, LLM의 수학 능력 향상 ➕, 제로 샷 모델 미세 조정 0️⃣

9bow · 9월 6, 2023, 3:09오후

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터 의 승인을 받아 AI 소식을 DeepL로 번역 하여 전합니다.
더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

X(Twitter)의 개인정보 보호정책은 공개 데이터를 사용하여 AI 모델을 학습할 것임을 확인시켜줍니다 / X’s privacy policy confirms it will use public data to train AI models (3 minute read)

X*(Twitter)*는 사용자로부터 생체 인식, 직업 및 교육 데이터를 수집하기 위해 개인정보 처리방침을 업데이트했습니다. 특히, X는 이 데이터와 기타 공개 정보를 사용하여 AI 모델을 학습시킬 예정입니다. 알렉스 이바노프스는 엘론 머스크가 자신의 AI 벤처인 xAI에 X의 데이터를 활용할 수 있다고 제안합니다. 머스크는 공개 데이터만 사용할 것이라고 밝혔습니다.

X has updated its privacy policy to collect biometric, job, and education data from users. Notably, X will use this data and other public info to train its AI models. Alex Ivanovs suggests Elon Musk might leverage X's data for his AI venture, xAI. Musk clarifies only public data will be used.

허깅 페이스, 학습용 클러스터를 서비스 형태로 제공 / Hugging Face training cluster as a service (3 minute read)

허깅페이스는 내부적으로 복잡한 프로세스를 관리할 필요 없이 간편하게 모델을 학습할 수 있는 새로운 서비스를 출시했습니다.

Hugging Face launched a new service that allows you to simply train a model without managing the complicated process internally.

Kindo, AI 생산성 향상 도구를 위해 700만 달러 투자 / Kindo raises $7m for AI enhanced productivity tools (4 minute read)

라이엇 벤처스가 이끄는 Kindo는 복잡한 엔터프라이즈 AI 프로세스를 관리하기 위한 중앙 집중식 도구 세트를 만드는 것을 목표로 합니다.

Led by Riot Ventures, Kindo aims to create a central set of tools for managing complex enterprise AI processes.

연구 & 혁신 관련 소식 / Research & Innovation

BLoRA: 동일한 GPU에서 LoRA 일괄 처리 / Batch LoRAs on the same GPU (GitHub Repo)

LoRA는 특정 레이어에 연결하여 기본 모델의 동작을 변경하는 작은 매개변수 어댑터입니다. 기본 모델에 대한 일괄 생성을 통해 성능을 개선할 수 있습니다. 또한 모델의 LoRA 수정 버전을 일괄 생성하여 다양한 사용자에게 서로 다른 성능의 모델을 제공할 수 있습니다. lora qlora

LoRAs are small parameter adapters that change the behavior of the underlying model by plugging into certain layers. You can batch generation for the base model to improve performance. Additionally, you can batch LoRA-modified versions of the model and serve models with different performance to different users.

Topical-Chat: 실제 대화에서 더 나은 AI 봇 만들기 / Making AI Bots Better at Real-Life Conversations (GitHub Repo)

많은 AI 챗봇이 인간처럼 자연스럽고 폭넓은 대화를 나누는 데 어려움을 겪고 있기 때문에, 이번 프로젝트에서는 8가지 큰 주제에 대한 인간 대화 데이터 세트인 '토픽-챗'을 도입하여 정해진 역할 없이 사람과 더 비슷하게 대화하는 방법을 AI에 학습시켰습니다.

Many AI chatbots struggle with having natural wide-ranging chats like humans do, so this project introduced 'Topical-Chat', a new dataset from human conversations on 8 big topics, to teach AI how to talk more like us without fixed roles.

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

SQLdepth: 자율주행차와 로봇이 더 선명하게 볼 수 있는 더 스마트한 방법 / A Smarter Way for Self-Driving Cars and Robots to See More Clearly (14 minute read)

연구자들은 자율 주행 자동차와 로봇이 주변 환경을 매우 상세하게 이해하는 데 도움이 되는 SQLdepth라는 새로운 도구를 만들었습니다.

The authors have created a new tool called SQLdepth that helps self-driving cars and robots understand their surroundings in great detail.

WiSE-FT: 제로 샷 모델의 강력한 미세 조정 / Robust fine-tuning of zero shot models (28 minute read)

제로 샷 모델은 광범위한 분포에서 강력한 성능을 발휘하는 AI 모델입니다. 그러나 좁은 작업으로 튜닝하면 해당 능력을 잃는 동시에 훈련된 작업을 훨씬 더 잘 수행하는 경향이 있습니다. 이 점은 좋지만 개선될 수 있습니다. 이 백서에서는 CLIP과 같은 모델을 튜닝하여 일반화 가능성을 유지하면서 특정 성능을 개선하는 방법을 제시합니다.

Zero shot models are AI models that have robust performance on a broad range of distributions. However, if you tune them on your narrow task, they tend to lose that ability while simultaneously getting much better at what they're trained for. This is good but could be improved. This paper presents a method to tune such models, like CLIP to maintain their generalizability while improving specific performance.

RFT: 대형 언어 모델에서 수학 능력 향상하기 / Improving Math Skills in Big Language Models (21 minute read)

이 연구는 LLM이 수학 문제를 얼마나 잘 풀 수 있는지, 그리고 다양한 요인이 그 능력에 어떤 영향을 미치는지 살펴봅니다. 연구진은 거부 샘플링 미세 조정(RFT; Rejection sampling Fine-Tuning)이라는 방법이 특히 여러 모델의 샘플을 사용할 때 이러한 모델이 수학을 더 잘하는 데 도움이 된다는 사실을 발견했습니다.

This research looks into how well LLMs can solve math problems and how different factors affect their abilities. They’ve found that a method called Rejection sampling Fine-Tuning (RFT) helps these models get better at math, especially when we use samples from several models.

그 외 소식 / Miscellaneous

AI의 윤리적 복잡성 풀기 / Unraveling The Ethical Complexities Of AI (18 minute read)

인공지능이 제기하는 핵심 윤리적 딜레마에 대한 탐구로, 인공지능이 우리의 공동 운명을 어떻게 형성할 수 있는지, 그리고 인공지능을 인간의 가치에 부합하는 길로 인도할 수 있는 힘을 조명합니다.

An exploration of the central ethical dilemmas posed by artificial intelligence, with the goal of illuminating how AI may shape our collective destiny and the power we have to guide it down a path aligned with human values.

금지할 것인가, 포용할 것인가? 인공지능으로 생성된 입학 에세이와 씨름하는 대학들 / Ban or Embrace? Colleges Wrestle With AI-Generated Admissions Essays (7 minute read)

대학들은 인공지능으로 작성된 입학 에세이의 등장으로 고민하고 있습니다. 대학 관계자들은 ChatGPT와 같은 챗봇이 대학 에세이를 작성할 수 있게 되면서 일반적이거나 표절된 콘텐츠가 나올 가능성을 우려하고 있습니다. 일부 학교에서는 AI 사용을 권장하지 않지만, 조지아 공대는 학생들에게 브레인스토밍과 개선에 책임감 있게 AI를 사용하되 직접적인 콘텐츠 제작에는 사용하지 말라고 조언합니다.

Colleges grapple with the rise of AI-generated admissions essays. With chatbots like ChatGPT able to produce college essays, officials fear a potential for generic or plagiarized content. While some schools discourage AI use, Georgia Tech advises students to use it responsibly for brainstorming and refining, but not for direct content creation.

50개 주 모두 의회에 AI로 생성된 CSAM 문제를 다룰 것을 촉구합니다 / All 50 states call on Congress to address AI-generated CSAM (2 minute read)

미국 50개 주의 법무장관이 의회에 아동 착취에 대한 AI의 역할을 조사하는 위원회를 구성할 것을 촉구했습니다. 이들은 AI가 아동 성적 학대 자료(CSAM)와 딥페이크 생성에 사용되는 것에 대해 우려하고 있습니다. 사우스캐롤라이나주의 앨런 윌슨 법무장관이 주도하는 이 움직임은 AI로 생성된 콘텐츠에 대해서도 CSAM 규제를 확대하는 것을 목표로 합니다.

US attorneys general from all 50 states urge Congress to create a commission investigating AI's role in child exploitation. They're concerned about AI being used to generate child sexual abuse material (CSAM) and deepfakes. Led by South Carolina's Attorney General Alan Wilson, the push aims to expand CSAM restrictions to cover AI-generated content.

더 읽어보기 / Quick Links

(광고) Guidde: AI로 11배 더 빠르게 매력적인 비디오 문서 만들기(스폰서) / Guidde: Create engaging video documentation 11x faster with AI (Sponsor)

Guidde는 가장 복잡한 작업도 몇 초 만에 설명할 수 있도록 도와주는 GPT 기반 도구입니다. 지루한 문서를 멋진 시각적 가이드로 바꿔보세요. 어디에나 삽입하세요.
브라우저 확장 프로그램은 100% 무료입니다 - 여기에서 사용해 보세요 →

Guidde is a GPT-powered tool that helps you explain the most complex tasks in seconds. Turn boring documentation into stunning visual guides. Embed them anywhere.
The browser extension is 100% free — try it here →

Refact Code LLM: 1.6B 매개변수 모델, HumanEval에서 32%에 도달 / 1.6B parameter model reaches 32% on HumanEval (8 minute read)

동급 최강의 초고속 코드 생성 모델. 하프 코드 하프 언어 데이터로 구성된 1.2T 토큰을 처음부터 학습합니다. humaneval

Best in class and lightning-fast code generation model. Trained from scratch on 1.2T tokens of half code half language data.

SAM.cpp / SAM.cpp (GitHub Repo)

GGML로 구동되는 순수 C++로 실행되는 메타에서 무엇이든 세분화하세요. ggml segment-anything sam

Segment anything from Meta running in pure C++ powered by GGML.

쿨라 / Kula (Product)

최고의 인재를 채용할 수 있도록 돕는 AI.

Your AI assistant to hire top talent.