Topics tagged text-to-speech

Topic	Replies	Views	Activity
Supertonic: An ultra-lightweight, on-device multilingual TTS system built on ONNX Runtime (feat. Supertone AI) Reading & Info Sharing text-to-speech, on-device, flow-matching, supertonic, onnx-runtime, multilingual-tts, supertone	0	979	May 18, 2026
VoxCPM2: A tokenizer-free, high-quality AI speech synthesis model with 2B parameters and support for 30 languages Reading & Info Sharing speech-synthesis, multilingual, text-to-speech, diffusion, openbmb, voice-cloning, voxcpm, voxcpm2	0	1815	April 14, 2026
PersonaPlex: NVIDIA's real-time, two-way voice conversation model that controls persona through text prompts and voice conditioning Reading & Info Sharing text-to-speech, nvidia, speech-to-speech, moshi, personaplex, full-duplex, voice-ai	0	276	April 9, 2026
VibeVoice: Microsoft's open-source voice AI model family combining 60-minute long-form speech recognition (ASR) with real-time TTS Reading & Info Sharing speech-recognition, text-to-speech, microsoft, tts, asr, vibevoice	0	485	April 3, 2026
Qwen3-TTS: An open-source Omni-Audio model built on 5 million hours of training data and an ultra-low-latency 12Hz tokenizer Reading & Info Sharing alibaba, text-to-speech, qwen, multilingual, multi-token-prediction, qwen3-tts, qwen-tts, omni-audio, korean-tts, flow-matching, dual-track-representation, residual-vector-quantization, voice-design, custom-voice, 25hz-tokenizer, 12hz-tokenizer	0	2594	January 25, 2026
VoxCPM: A tokenizer-free, 0.5B-scale English/Chinese TTS model for high-quality AI speech generation and voice cloning Reading & Info Sharing tts, text-to-speech, voice-cloning, voxcpm, tokenizer-free	1	464	January 11, 2026
Dia2: Nari Labs' open-source, ultra-low-latency TTS model that converses like a human (1B/2B) Reading & Info Sharing text-to-speech, voice-cloning, dia, nari-labs, dia-2, input-streaming, non-verbal-cues	0	434	December 4, 2025
NeuTTS Air: An on-device TTS (text-to-speech) model that can clone a voice from just 3 seconds of audio Reading & Info Sharing text-to-speech, instant-voice-cloning, on-device, neutts, neutts-air, neuphonic	0	548	October 11, 2025
Chatterbox: A commercial-quality open-source TTS model released by Resemble AI Reading & Info Sharing text-to-speech, chatterbox, resemble-ai	0	959	May 29, 2025
RealtimeVoiceChat: An open-source project for real-time (~500ms) AI voice chat Reading & Info Sharing text-to-speech, realtimevoicechat, koljab, voice-chat	0	792	May 6, 2025
Audibit: An open-source TTS podcast platform for developers Reading & Info Sharing podcast, tts, text-to-speech, audibit	0	260	May 3, 2025
Dia: A 1.6B-scale open-source TTS model that can generate emotional expression and non-verbal elements (feat. Nari Labs) Reading & Info Sharing tts, text-to-speech, dia, nari-labs, dia-16b	2	2253	April 25, 2025
OpenAI launches OpenAI.fm, a demo site for text-to-speech (TTS) synthesis Reading & Info Sharing openai, text-to-speech, webapp, demo, openai-fm	0	957	April 17, 2025
Fish Speech, an open-source multilingual TTS model supporting 8 languages including Korean Reading & Info Sharing text-to-speech, multilingual, fish-speech	2	4633	January 5, 2025
OuteTTS, a 350M-scale English-only TTS model Reading & Info Sharing text-to-speech, outetts, voice-cloning, oute-ai	0	347	November 10, 2024
Amphion: An open-source toolkit for audio, music, and speech generation 🎤🗣️🛠️ Reading & Info Sharing opensource, text-to-speech, mit-license, text-to-audio, singing-voice-conversion, vocoder, vingvisio	0	317	October 28, 2024
MARS5: A new speech model with innovative prosody support Reading & Info Sharing text-to-speech, instant-voice-cloning, mars5, camb-ai, prosody	1	428	June 16, 2024
Sonic: A low-latency voice model for real-time conversational AI, based on state-space models (SSM) Reading & Info Sharing text-to-speech, state-space-models, sonic, low-latency	0	375	June 4, 2024
eSpeak NG, an improved fork of the open-source speech synthesizer eSpeak, supporting more than 100 languages and dialects Reading & Info Sharing opensource, text-to-speech, espeak-ng, espeak	0	1542	May 4, 2024
MetaVoice: An open-source TTS model for human-level speech (1.2B model released, commercial use permitted) Reading & Info Sharing opensource, text-to-speech, metavoice	0	1163	February 8, 2024
OpenVoice: A TTS model that can clone a voice from a short audio sample Reading & Info Sharing text-to-speech, openvoice, instant-voice-cloning, cross-lingual, myshell-ai, myshell-tts	0	2279	January 10, 2024
[GN] Project S.A.T.U.R.D.A.Y - Building J.A.R.V.I.S, a voice-operated personal AI assistant Reading & Info Sharing geeknews, saturday, jarvis, webrtc, speech-to-text, text-to-text, text-to-speech	0	714	July 19, 2023
