Welcome to the Speech-to-Speech (S2S) Model Evaluation!
In this evaluation, you will assess the performance of different S2S models, such as
ChatGPT-4o, FunAudioLLM, SpeechGPT, Mini-Omni, Cascade, and LLaMA-Omni.
Speech: Partially followed the instruction on speed.
Semantics: Accurately followed the instruction, with no semantic deviation or
missing information.
Speech: Partially followed the instruction on speed.
Semantics: Accurately followed the instruction, with no semantic deviation or
missing information.
Speech: Did not follow the instruction on speed.
Semantics: Partially followed the instruction, with minor semantic deviation and
missing information.
Speech: Did not follow the instruction on speed.
Semantics: Did not follow the instruction, with significant semantic deviation
and missing information.
After making your choice, you'll proceed to the next round.
Click the button below to start the evaluation!
Goal: Test how well these models handle speech tasks across different domains.
How It Works
Once you select a specific domain and task (e.g., Educational Tutoring and Rhythm Control),
you will proceed to the evaluation stage. In each round, you will be presented with an audio input.
Example: