[New Model] Aero-1-Audio #658
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
π Introducing Aero-1-Audio β a compact yet mighty audio model.
π§ Built on Qwen-2.5-1.5B
β‘ Trained in <24h on just 16ΓH100
π§ Handles 15+ min audio seamlessly
π‘ Outperforms bigger models like Whisper, Qwen-2-Audio & commercial services from ElevenLabs/Scribe
Aero shows: smart data > massive scale.
Github Repo: https://github.com/EvolvingLMMs-Lab/Aero-1
Model Checkpoints: https://huggingface.co/lmms-lab/Aero-1-Audio-1.5B
Evaluation Results: https://github.com/EvolvingLMMs-Lab/lmms-eval/tree/dev/aero
Cookbook: https://www.lmms-lab.com/posts/lmms-lab-docs/aero_audio/
Evaluation Result
20250424_092927_results.json
20250421_203304_results.json
20250421_202840_results.json
20250421_170326_results.json
*Note: for some benchmarks, we use gpt-4o-2024-11-20 as judge model
Examples
We supports batch evaluation for faster inference. Notice that the result might be slightly difference for different batch size