Skip to content

Conversation

@kcz358
Copy link
Collaborator

@kcz358 kcz358 commented Nov 23, 2024

Before you open a pull-request, please check if a similar issue already exists or has been closed before.

When you open a pull-request, please be sure to include the following

  • A descriptive title: [xxx] XXXX
  • A detailed description

If you meet the lint warnings, you can use following scripts to reformat code.

pip install pre-commit
pre-commit install
pre-commit run --all-files

Thank you for your contributions!

@kcz358 kcz358 changed the title Add mix_evals audio2text [Feat] Add mix_evals audio2text Nov 23, 2024
@Luodian Luodian merged commit 1cc17b9 into feat-dev Nov 23, 2024
2 checks passed
@Luodian Luodian deleted the mixeval/audio branch November 23, 2024 14:35
kcz358 added a commit that referenced this pull request Nov 27, 2024
* Add mix_evals audio2text

* Fix task tags in datasets
kcz358 added a commit that referenced this pull request Nov 27, 2024
* [Feat] Add qwen2_audio model support and Automatic speech recognition task with LibriSpeech dataset (#289)

* "add qwen2_audio model, asr librispeech eval task"

* lint update for PR #289

---------

Co-authored-by: Pengyun <[email protected]>

* add clotho_aqa task

* Apply black formatting

* formatting

* excluding xl due to downloading issue.

* [Feat] add audiobench version of clothoaqa (#302)

* add clothoaqa task

* formatting

* minor fixes

* minor fixes

* Add AIR_bench task (#315)

* add air_bench

* minor changes

* add common_voice_15 and people_speech tasks (#316)

Co-authored-by: Pengyun <[email protected]>

* add indent to yaml

* Add openhermes task (#323)

* add openhermes task

* formatting

* [Refactor] Fixing doc to audio return type, qwen_audio revise (#329)

* Add downsample function for audio array

* Batch support for qwen2 and use apply chat template

* Return sr for common voice

* Doc to audio to return the whole dict

* add muchomusic and vocalsound task (#331)

* add alpaca audio task (#333)

* [feat] added gigaspeech config (#334)

* fix xl yaml

* Fixed config for gigaspeech_xl. gigaspeech_xl_test has intermittent problem.

* add alpaca audio task (#333)

* pre-committed utils.py

---------

Co-authored-by: Cong <[email protected]>

* add tedlium_long_form and tedlium_dev_test tasks (#345)

Co-authored-by: Pengyun <[email protected]>

* [Feat] add-wavcaps (#349)

* fix xl yaml

* Fixed config for gigaspeech_xl. gigaspeech_xl_test has intermittent problem.

* add alpaca audio task (#333)

* pre-committed utils.py

* add wavcaps

* add wavcaps

---------

Co-authored-by: Cong <[email protected]>

* Update dep and fix log samples for audio (#355)

* Update dep

* Fix saved audio OOM error

* Fix typing

* Fix librispeech dataset name

* Add add_generation_prompt as option for Qwen audio

* Add add system propmt as optional

* fix vocalsound (#362)

* Add using simple prompt for Qwen2 Audio to align (#360)

* Add retry for gpt api call and improve air_bench aggregation function (#376)

* add retry for api calls and change air_bench_foundation aggregation function

* make azure default api

* minor changes

* [Feat] Add mix_evals audio2text (#420)

* Add mix_evals audio2text

* Fix task tags in datasets

* Gemini Audio (#421)

* gemini audio

* better variable naming

* Revise prompt

* delete redundant tasks in gigaspeech

* Fix wavcaps bugs

* Add lmms-eval-0.3 docs

Update lmms-eval-0.3.md

fix errors in markdown and add hyperlinks

proofread markdown and fix errors

rewrite some parts to fix errors

rewrite some parts to fix errors

rewrite some parts to fix errors

try optimize the table format using html

try optimize the table format using html

try optimize the table 2 format

final proofread

final proofread

final proofread

add explanantion for AIF and ASR

standardize WER to WER(↓)

final proofread

final proofread

final proofread

final proofread

correct hyperlink errors

modify readme to support lmms-eval0.3.0 release

modify icon

fix typos

Co-Authored-By: KairuiHu <[email protected]>

---------

Co-authored-by: Pengyun Wang <[email protected]>
Co-authored-by: Pengyun <[email protected]>
Co-authored-by: pbcong <[email protected]>
Co-authored-by: Li Bo <[email protected]>
Co-authored-by: Cong <[email protected]>
Co-authored-by: Yingluo <[email protected]>
Co-authored-by: Totoluo <[email protected]>
Co-authored-by: Pu Fanyi <[email protected]>
Co-authored-by: KairuiHu <[email protected]>
ZhaoCinyu pushed a commit to ZhaoCinyu/lmms-eval that referenced this pull request Dec 9, 2024
* [Feat] Add qwen2_audio model support and Automatic speech recognition task with LibriSpeech dataset (EvolvingLMMs-Lab#289)

* "add qwen2_audio model, asr librispeech eval task"

* lint update for PR EvolvingLMMs-Lab#289

---------

Co-authored-by: Pengyun <[email protected]>

* add clotho_aqa task

* Apply black formatting

* formatting

* excluding xl due to downloading issue.

* [Feat] add audiobench version of clothoaqa (EvolvingLMMs-Lab#302)

* add clothoaqa task

* formatting

* minor fixes

* minor fixes

* Add AIR_bench task (EvolvingLMMs-Lab#315)

* add air_bench

* minor changes

* add common_voice_15 and people_speech tasks (EvolvingLMMs-Lab#316)

Co-authored-by: Pengyun <[email protected]>

* add indent to yaml

* Add openhermes task (EvolvingLMMs-Lab#323)

* add openhermes task

* formatting

* [Refactor] Fixing doc to audio return type, qwen_audio revise (EvolvingLMMs-Lab#329)

* Add downsample function for audio array

* Batch support for qwen2 and use apply chat template

* Return sr for common voice

* Doc to audio to return the whole dict

* add muchomusic and vocalsound task (EvolvingLMMs-Lab#331)

* add alpaca audio task (EvolvingLMMs-Lab#333)

* [feat] added gigaspeech config (EvolvingLMMs-Lab#334)

* fix xl yaml

* Fixed config for gigaspeech_xl. gigaspeech_xl_test has intermittent problem.

* add alpaca audio task (EvolvingLMMs-Lab#333)

* pre-committed utils.py

---------

Co-authored-by: Cong <[email protected]>

* add tedlium_long_form and tedlium_dev_test tasks (EvolvingLMMs-Lab#345)

Co-authored-by: Pengyun <[email protected]>

* [Feat] add-wavcaps (EvolvingLMMs-Lab#349)

* fix xl yaml

* Fixed config for gigaspeech_xl. gigaspeech_xl_test has intermittent problem.

* add alpaca audio task (EvolvingLMMs-Lab#333)

* pre-committed utils.py

* add wavcaps

* add wavcaps

---------

Co-authored-by: Cong <[email protected]>

* Update dep and fix log samples for audio (EvolvingLMMs-Lab#355)

* Update dep

* Fix saved audio OOM error

* Fix typing

* Fix librispeech dataset name

* Add add_generation_prompt as option for Qwen audio

* Add add system propmt as optional

* fix vocalsound (EvolvingLMMs-Lab#362)

* Add using simple prompt for Qwen2 Audio to align (EvolvingLMMs-Lab#360)

* Add retry for gpt api call and improve air_bench aggregation function (EvolvingLMMs-Lab#376)

* add retry for api calls and change air_bench_foundation aggregation function

* make azure default api

* minor changes

* [Feat] Add mix_evals audio2text (EvolvingLMMs-Lab#420)

* Add mix_evals audio2text

* Fix task tags in datasets

* Gemini Audio (EvolvingLMMs-Lab#421)

* gemini audio

* better variable naming

* Revise prompt

* delete redundant tasks in gigaspeech

* Fix wavcaps bugs

* Add lmms-eval-0.3 docs

Update lmms-eval-0.3.md

fix errors in markdown and add hyperlinks

proofread markdown and fix errors

rewrite some parts to fix errors

rewrite some parts to fix errors

rewrite some parts to fix errors

try optimize the table format using html

try optimize the table format using html

try optimize the table 2 format

final proofread

final proofread

final proofread

add explanantion for AIF and ASR

standardize WER to WER(↓)

final proofread

final proofread

final proofread

final proofread

correct hyperlink errors

modify readme to support lmms-eval0.3.0 release

modify icon

fix typos

Co-Authored-By: KairuiHu <[email protected]>

---------

Co-authored-by: Pengyun Wang <[email protected]>
Co-authored-by: Pengyun <[email protected]>
Co-authored-by: pbcong <[email protected]>
Co-authored-by: Li Bo <[email protected]>
Co-authored-by: Cong <[email protected]>
Co-authored-by: Yingluo <[email protected]>
Co-authored-by: Totoluo <[email protected]>
Co-authored-by: Pu Fanyi <[email protected]>
Co-authored-by: KairuiHu <[email protected]>
MichalCiesiolka pushed a commit to MichalCiesiolka/lmms-eval-llmzszl that referenced this pull request Apr 3, 2025
* [Feat] Add qwen2_audio model support and Automatic speech recognition task with LibriSpeech dataset (EvolvingLMMs-Lab#289)

* "add qwen2_audio model, asr librispeech eval task"

* lint update for PR EvolvingLMMs-Lab#289

---------

Co-authored-by: Pengyun <[email protected]>

* add clotho_aqa task

* Apply black formatting

* formatting

* excluding xl due to downloading issue.

* [Feat] add audiobench version of clothoaqa (EvolvingLMMs-Lab#302)

* add clothoaqa task

* formatting

* minor fixes

* minor fixes

* Add AIR_bench task (EvolvingLMMs-Lab#315)

* add air_bench

* minor changes

* add common_voice_15 and people_speech tasks (EvolvingLMMs-Lab#316)

Co-authored-by: Pengyun <[email protected]>

* add indent to yaml

* Add openhermes task (EvolvingLMMs-Lab#323)

* add openhermes task

* formatting

* [Refactor] Fixing doc to audio return type, qwen_audio revise (EvolvingLMMs-Lab#329)

* Add downsample function for audio array

* Batch support for qwen2 and use apply chat template

* Return sr for common voice

* Doc to audio to return the whole dict

* add muchomusic and vocalsound task (EvolvingLMMs-Lab#331)

* add alpaca audio task (EvolvingLMMs-Lab#333)

* [feat] added gigaspeech config (EvolvingLMMs-Lab#334)

* fix xl yaml

* Fixed config for gigaspeech_xl. gigaspeech_xl_test has intermittent problem.

* add alpaca audio task (EvolvingLMMs-Lab#333)

* pre-committed utils.py

---------

Co-authored-by: Cong <[email protected]>

* add tedlium_long_form and tedlium_dev_test tasks (EvolvingLMMs-Lab#345)

Co-authored-by: Pengyun <[email protected]>

* [Feat] add-wavcaps (EvolvingLMMs-Lab#349)

* fix xl yaml

* Fixed config for gigaspeech_xl. gigaspeech_xl_test has intermittent problem.

* add alpaca audio task (EvolvingLMMs-Lab#333)

* pre-committed utils.py

* add wavcaps

* add wavcaps

---------

Co-authored-by: Cong <[email protected]>

* Update dep and fix log samples for audio (EvolvingLMMs-Lab#355)

* Update dep

* Fix saved audio OOM error

* Fix typing

* Fix librispeech dataset name

* Add add_generation_prompt as option for Qwen audio

* Add add system propmt as optional

* fix vocalsound (EvolvingLMMs-Lab#362)

* Add using simple prompt for Qwen2 Audio to align (EvolvingLMMs-Lab#360)

* Add retry for gpt api call and improve air_bench aggregation function (EvolvingLMMs-Lab#376)

* add retry for api calls and change air_bench_foundation aggregation function

* make azure default api

* minor changes

* [Feat] Add mix_evals audio2text (EvolvingLMMs-Lab#420)

* Add mix_evals audio2text

* Fix task tags in datasets

* Gemini Audio (EvolvingLMMs-Lab#421)

* gemini audio

* better variable naming

* Revise prompt

* delete redundant tasks in gigaspeech

* Fix wavcaps bugs

* Add lmms-eval-0.3 docs

Update lmms-eval-0.3.md

fix errors in markdown and add hyperlinks

proofread markdown and fix errors

rewrite some parts to fix errors

rewrite some parts to fix errors

rewrite some parts to fix errors

try optimize the table format using html

try optimize the table format using html

try optimize the table 2 format

final proofread

final proofread

final proofread

add explanantion for AIF and ASR

standardize WER to WER(↓)

final proofread

final proofread

final proofread

final proofread

correct hyperlink errors

modify readme to support lmms-eval0.3.0 release

modify icon

fix typos

Co-Authored-By: KairuiHu <[email protected]>

---------

Co-authored-by: Pengyun Wang <[email protected]>
Co-authored-by: Pengyun <[email protected]>
Co-authored-by: pbcong <[email protected]>
Co-authored-by: Li Bo <[email protected]>
Co-authored-by: Cong <[email protected]>
Co-authored-by: Yingluo <[email protected]>
Co-authored-by: Totoluo <[email protected]>
Co-authored-by: Pu Fanyi <[email protected]>
Co-authored-by: KairuiHu <[email protected]>
dadwadw233 pushed a commit to dadwadw233/lmms-eval that referenced this pull request Apr 28, 2025
* [Feat] Add qwen2_audio model support and Automatic speech recognition task with LibriSpeech dataset (EvolvingLMMs-Lab#289)

* "add qwen2_audio model, asr librispeech eval task"

* lint update for PR EvolvingLMMs-Lab#289

---------

Co-authored-by: Pengyun <[email protected]>

* add clotho_aqa task

* Apply black formatting

* formatting

* excluding xl due to downloading issue.

* [Feat] add audiobench version of clothoaqa (EvolvingLMMs-Lab#302)

* add clothoaqa task

* formatting

* minor fixes

* minor fixes

* Add AIR_bench task (EvolvingLMMs-Lab#315)

* add air_bench

* minor changes

* add common_voice_15 and people_speech tasks (EvolvingLMMs-Lab#316)

Co-authored-by: Pengyun <[email protected]>

* add indent to yaml

* Add openhermes task (EvolvingLMMs-Lab#323)

* add openhermes task

* formatting

* [Refactor] Fixing doc to audio return type, qwen_audio revise (EvolvingLMMs-Lab#329)

* Add downsample function for audio array

* Batch support for qwen2 and use apply chat template

* Return sr for common voice

* Doc to audio to return the whole dict

* add muchomusic and vocalsound task (EvolvingLMMs-Lab#331)

* add alpaca audio task (EvolvingLMMs-Lab#333)

* [feat] added gigaspeech config (EvolvingLMMs-Lab#334)

* fix xl yaml

* Fixed config for gigaspeech_xl. gigaspeech_xl_test has intermittent problem.

* add alpaca audio task (EvolvingLMMs-Lab#333)

* pre-committed utils.py

---------

Co-authored-by: Cong <[email protected]>

* add tedlium_long_form and tedlium_dev_test tasks (EvolvingLMMs-Lab#345)

Co-authored-by: Pengyun <[email protected]>

* [Feat] add-wavcaps (EvolvingLMMs-Lab#349)

* fix xl yaml

* Fixed config for gigaspeech_xl. gigaspeech_xl_test has intermittent problem.

* add alpaca audio task (EvolvingLMMs-Lab#333)

* pre-committed utils.py

* add wavcaps

* add wavcaps

---------

Co-authored-by: Cong <[email protected]>

* Update dep and fix log samples for audio (EvolvingLMMs-Lab#355)

* Update dep

* Fix saved audio OOM error

* Fix typing

* Fix librispeech dataset name

* Add add_generation_prompt as option for Qwen audio

* Add add system propmt as optional

* fix vocalsound (EvolvingLMMs-Lab#362)

* Add using simple prompt for Qwen2 Audio to align (EvolvingLMMs-Lab#360)

* Add retry for gpt api call and improve air_bench aggregation function (EvolvingLMMs-Lab#376)

* add retry for api calls and change air_bench_foundation aggregation function

* make azure default api

* minor changes

* [Feat] Add mix_evals audio2text (EvolvingLMMs-Lab#420)

* Add mix_evals audio2text

* Fix task tags in datasets

* Gemini Audio (EvolvingLMMs-Lab#421)

* gemini audio

* better variable naming

* Revise prompt

* delete redundant tasks in gigaspeech

* Fix wavcaps bugs

* Add lmms-eval-0.3 docs

Update lmms-eval-0.3.md

fix errors in markdown and add hyperlinks

proofread markdown and fix errors

rewrite some parts to fix errors

rewrite some parts to fix errors

rewrite some parts to fix errors

try optimize the table format using html

try optimize the table format using html

try optimize the table 2 format

final proofread

final proofread

final proofread

add explanantion for AIF and ASR

standardize WER to WER(↓)

final proofread

final proofread

final proofread

final proofread

correct hyperlink errors

modify readme to support lmms-eval0.3.0 release

modify icon

fix typos

Co-Authored-By: KairuiHu <[email protected]>

---------

Co-authored-by: Pengyun Wang <[email protected]>
Co-authored-by: Pengyun <[email protected]>
Co-authored-by: pbcong <[email protected]>
Co-authored-by: Li Bo <[email protected]>
Co-authored-by: Cong <[email protected]>
Co-authored-by: Yingluo <[email protected]>
Co-authored-by: Totoluo <[email protected]>
Co-authored-by: Pu Fanyi <[email protected]>
Co-authored-by: KairuiHu <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants