[TLDR] 오늘의 AI 뉴스, 2023-06-13: 영국, 파운데이션 모델에 조기 접근🇬🇧, LLM의 편향성 해독 🚩, 개인 데이터 어시스턴트 🧑‍💻

9bow · 6월 14, 2023, 2:22오전

파이토치 한국 사용자 모임에서는 TLDR 뉴스레터의 승인을 받아 AI 소식을 DeepL로 번역하여 전합니다.

더 많은 AI 소식 및 정보를 공유하고 함께 성장하고 싶으신가요? 지금 파이토치 한국어 커뮤니티에 방문해주세요!

주요 뉴스 & 신규 출시 소식 / Headlines & Launches

워드프레스, 블로그 글을 작성해주는 AI 도구 출시 / WordPress Has A New AI Tool That Will Write Blog Posts For You (2 minute read)

워드프레스에서 제트팩 AI라는 새로운 AI 글쓰기 도구를 출시했습니다. 이 도구는 블로그 글, 소셜 미디어 글 및 기타 유형의 콘텐츠를 생성하는 데 사용할 수 있습니다. 젯팩 AI는 OpenAI에서 개발한 대규모 언어 모델인 GPT-3로 구동됩니다. 이 도구는 아직 베타 버전이지만 사람들이 웹사이트를 위한 콘텐츠를 더 쉽게 만들 수 있는 잠재력을 가지고 있습니다.

WordPress has released a new AI writing tool called Jetpack AI. The tool can be used to generate blog posts, social media posts, and other types of content. Jetpack AI is powered by GPT-3, a large language model developed by OpenAI. The tool is still in beta, but it has the potential to make it easier for people to create content for their websites.

AI 거인들, 영국에 안전 연구용 모델에 대한 조기 접근 제공 / AI Giants To Give UK Early Access To Models For Safety Research (4 minute read)

OpenAI, 구글 딥마인드, 앤트로픽은 평가 및 안전에 대한 연구를 지원하기 위해 영국에 자사의 AI 모델에 대한 "조기 또는 우선 액세스"를 제공하기로 약속했습니다.

OpenAI, Google DeepMind, and Anthropic have committed to provide “early or priority access” to their AI models to the UK to support research into evaluation and safety.

LlamaIndex, 850만 달러 시드 라운드 모금 / LlamaIndex raises $8.5M seed round (8 minute read)

LLM과 개인 데이터를 연결하는 것을 목표로 하는 혁신적인 플랫폼인 LlamaIndex가 Greylock이 주도한 라운드에서 850만 달러의 시드 자금을 확보했습니다. 이 회사의 솔루션은 개발자가 자신의 데이터에 대해 LLM의 추론 기능을 더 쉽게 활용할 수 있도록 하는 것을 목표로 합니다. 이 자금은 LlamaIndex의 오픈 소스 데이터 프레임워크를 확장하고 기업의 데이터 관련 문제를 대규모로 해결하는 데 사용될 예정입니다.

LlamaIndex, an innovative platform aiming to connect LLMs and private data, has secured $8.5 million in seed funding in a round led by Greylock. The company’s solution aims to make it easier for developers to leverage the reasoning capabilities of LLMs over their own data. The funding is set to expand LlamaIndex’s open-source data framework and tackle data-related challenges for enterprises at scale.

연구 & 혁신 관련 소식 / Research & Innovation

DataDM (GitHub Repo)

DataDM은 단 한 줄의 코드 없이 데이터를 로드, 정리, 변환, 시각화할 수 있는 대화형 데이터 인터페이스인 개인 데이터 도우미입니다. DataDM은 오픈소스이며 완전히 로컬에서 실행할 수 있어 보안이 필요한 데이터를 완전히 비공개로 유지할 수 있습니다.

DataDM is your private data assistant - a conversational interface for your data where you can load, clean, transform, and visualize without a single line of code. DataDM is open source and can be run entirely locally, keeping your data secrets fully private.

FinGPT: 금융 혁신을 위한 오픈소스 언어 모델 / FinGPT: An Open-Source Language Model for Financial Innovation (GitHub Repo)

이 논문에서는 금융 부문을 위해 설계된 오픈소스 인공지능 모델인 FinGPT를 소개합니다. 이 모델은 금융 연구 및 개발을 위한 접근 가능한 도구를 제공하는 것을 목표로 하며, 자동화된 데이터 관리와 같은 기능과 로보 어드바이징 및 알고리즘 트레이딩과 같은 애플리케이션의 잠재력을 제공합니다.

This paper introduces FinGPT, an open-source artificial intelligence model designed for the finance sector. It aims to provide an accessible tool for financial research and development, offering features like automated data management and the potential for applications such as robo-advising and algorithmic trading.

(광고) 기업이 개인 정보를 악용하지 못하게 하세요 / Don't let companies exploit your personal information (Sponsor)

데이터 브로커는 개인 정보를 수집하여 개인 정보를 희생시키면서 이익을 얻습니다. 인코그니는 데이터베이스를 자동으로 옵트아웃하여 데이터에 대한 통제권을 되찾고, 스팸을 줄이고, 사기 공격을 방지할 수 있도록 도와줍니다.️ 50% 절약: 지금 개인 데이터를 삭제하세요!

Data brokers collect your personal information and profit off it at the expense of your privacy. Incogni helps you take back control of your data, reduce spam, and prevent scam attacks by opting you out of their databases automatically. Save 50%: Delete your personal data now!

엔지니어링 및 리소스 관련 소식 / Engineering & Resources

파운데이션 모델도 사람처럼 데이터에 레이블을 붙일 수 있나요? / Can foundation models label data like humans? (18 minute read)

최근 언어 모델에 대한 과대 광고로 인해 "우리 모델이 ChatGPT보다 N% 더 선호된다"와 같은 문구가 등장했습니다. 이 문구는 보통 GPT-4가 선호되는 모델이라는 사실을 숨기고 있습니다. 그 이면에는 보정, 신뢰성에 대한 모든 종류의 질문과 GPT-4가 응답당 더 많은 토큰을 출력하는 모델을 선호한다는 명백한 사실이 숨겨져 있습니다. 이 블로그에서는 이러한 세부 사항과 Open LLM 리더보드에 대해 자세히 살펴봅니다.

The recent hype in language models has led to statements like "our model is preferred to ChatGPT N% of the time". This statement usually hides the fact that GPT4 was the model doing the preferring. Wrapped up in that is all sorts of questions about calibration, reliability, and the plain fact that GPT4 prefers models that output more tokens per response. This blog dives into these details and the Open LLM leaderboard.

대규모 언어 모델의 디코딩 편향 / Decoding Biases in Large Language Models (12 minute read)

AI 언어 모델이 입력의 미세한 변화에 어떻게 반응하는지 이해하는 것은 공정성과 유용성을 보장하는 데 필수적이지만 쉬운 일은 아닙니다. 약간 다른 두 입력의 고유한 특성을 반영하는 텍스트를 생성하여 이러한 미묘한 차이를 밝히고, AI의 응답을 더 이해하기 쉽고 관리하기 쉽도록 하기 위해 Contrastive Input Decoding(CID)이라는 새로운 방법이 도입되었습니다.

Understanding how AI language models react to slight changes in their inputs is vital for ensuring fairness and usefulness, but it's not an easy task. A new method called Contrastive Input Decoding (CID) has been introduced to shed light on these subtle differences by generating text that reflects the unique characteristics of two slightly different inputs, thus making the AI's responses more understandable and manageable.

RL4F: 협업 피드백을 통한 AI 성능 향상( / RL4F: Improving AI Performance with Collaborative Feedback (16 minute read)

RL4F는 인간이 피드백을 통해 개선하는 방식과 마찬가지로, 비평을 생성하는 소규모 AI를 사용하여 GPT-3와 같은 훨씬 더 큰 AI가 실수를 수정하도록 돕는 새로운 방법입니다. 연구자들은 계획 세우기, 요약하기, 알파벳순 나열하기와 같은 다양한 작업을 연구한 결과, 이 접근 방식이 대형 AI의 성능을 평균 5% 가량 향상시킨다는 사실을 발견했습니다.

RL4F is a new method that uses a smaller, critique-generating AI to help a much larger AI like GPT-3 fix its mistakes, much like how humans improve from feedback. By studying different tasks like planning, summarizing, and alphabetizing, researchers found that this approach improved the big AI's performance by about 5% on average.

그 외 소식 / Miscellaneous

근로자를 보호하기 위해 AI 혁신을 '제어'하려는 시도가 나쁜 생각인 이유 / Why Trying To “Shape” AI Innovation To Protect Workers Is A Bad Idea (8 minute read)

근로자를 보호하기 위해 AI 혁신을 제어하려는 시도는 나쁜 생각입니다. 대신 모든 사람이 자동화된 경제에서 번영할 수 있도록 보장하는 제도를 만드는 데 집중해야 합니다.

Trying to shape AI innovation to protect workers is a bad idea. Instead, we should focus on creating institutions that ensure that everyone can thrive in an automated economy.

암시적 코드 실행으로 더 스마트해진 Bard / Implicit code execution makes Bard smarter (5 minute read)

Bard에 두 가지 멋진 새 기능이 곧 출시됩니다. 첫 번째는 응답을 Google 스프레드시트로 내보내는 기능이고, 두 번째는 백그라운드에서 코드를 실행하여 출력의 신뢰성과 정확성을 향상시키는 기능입니다. Bard는 질문에 수학, 논리 또는 코드 추론이 필요한지 여부를 감지하고 백엔드에서 코드를 실행하여 답을 얻을 수 있습니다.

Two cool new features are launching in Bard soon. The first is the ability to export responses into Google sheets and the second enables code to be run in the background to improve the reliability and accuracy of outputs. Bard will be able to detect if a question requires math, logic, or code reasoning and execute code in the backend to get answers.

더 읽어보기 / Quick Links

Composer (Product Launch)

AI로 트레이딩 알고리즘을 구축하세요. 코드가 필요 없는 드래그 앤 드롭 편집기로 사용자 지정하고 전체 전략을 백테스트한 다음 실행하는 모든 과정을 하나의 플랫폼에서 할 수 있습니다. 코딩 기술이 필요하지 않습니다.

Build trading algorithms with AI. Customize with a no-code drag-and-drop editor, backtest the whole strategy, then execute - all in one platform. No coding skills required.

Salesforce, AI 전략 소개 / Salesforce Touts AI Strategy (2 minute read)

Salesforce는 매출 성장에 도움이 되는 새로운 기술을 활용하기 위해 자사 제품에 새로운 생성 인공 지능 기능을 추가하고 AI 스타트업에 대한 투자를 두 배로 늘리고 있습니다.

Salesforce is elevating new generative artificial intelligence features in its products and doubling its investment in AI startups as the company banks on the emerging technology to help resuscitate sales growth.

LocalAI (GitHub Repo)

자체 호스팅, 커뮤니티 중심, 로컬 OpenAI 호환 API.

Self-hosted, community-driven, local OpenAI-compatible API.

ChatALL (GitHub Repo)

모든 챗봇과 동시에 대화할 수 있는 도구.

A tool to converse with all chatbots simultaneously.