주제에 speech-to-speech 태그가 달렸습니다

글	조회수	활동
PersonaPlex: NVIDIA가 공개한, 텍스트 프롬프트와 음성 컨디셔닝으로 페르소나를 제어하는 실시간 양방향 음성 대화 모델 읽을거리&정보공유 nvidia , text-to-speech , speech-to-speech , moshi , personaplex , full-duplex , voice-ai	160	4월 9, 2026
Step-Audio-R1: 오디오 분야에서의 추론 시 연산 시간 확장(Test-time Scaling) 기법 적용에 대한 연구 및 모델 읽을거리&정보공유 paper , speech-to-text , speech-to-speech , test-time-compute , test-time-scaling , rl-with-verifiable-rewards , step-audio , stepfun , mgrd , inverted-scaling , stepfun-audio , format-reward , self-cognition-error , textual-surrogate-reasoning , audio-llm	215	11월 28, 2025
SeamlessM4T: Meta AI에서 공개한, 번역을 위한 멀티모달에서의 파운데이션 모델 읽을거리&정보공유 meta , multimodal , mms , asr , meta-ai , speech-to-text , seamlessm4t , translation , sonar , speech-to-speech , stopes , fairseq2 , speechmatrix , seamlessalign , nllb	1665	8월 23, 2023

PersonaPlex: NVIDIA가 공개한, 텍스트 프롬프트와 음성 컨디셔닝으로 페르소나를 제어하는 실시간 양방향 음성 대화 모델

읽을거리&정보공유

nvidia , text-to-speech , speech-to-speech , moshi , personaplex , full-duplex , voice-ai

0

160

4월 9, 2026

Step-Audio-R1: 오디오 분야에서의 추론 시 연산 시간 확장(Test-time Scaling) 기법 적용에 대한 연구 및 모델

읽을거리&정보공유

paper , speech-to-text , speech-to-speech , test-time-compute , test-time-scaling , rl-with-verifiable-rewards , step-audio , stepfun , mgrd , inverted-scaling , stepfun-audio , format-reward , self-cognition-error , textual-surrogate-reasoning , audio-llm

0

215

11월 28, 2025

SeamlessM4T: Meta AI에서 공개한, 번역을 위한 멀티모달에서의 파운데이션 모델

읽을거리&정보공유

meta , multimodal , mms , asr , meta-ai , speech-to-text , seamlessm4t , translation , sonar , speech-to-speech , stopes , fairseq2 , speechmatrix , seamlessalign , nllb

0

1665

8월 23, 2023