Cannot Reproduce the Performance on VGGSound #52

@wjc2830

Hi there, thanks for contributing a good repo for the community.

When I tried to reproduce the CLAP score with caption_cot on the VGGSound test set (14k samples), I got 33.97 with the GT audio and 30.65 with the ThinkSound-generated audio. It seems that something is wrong with either caption_cot or my CLAP script. Could you please double-check the released AudioCoT, or provide the script you used to measure the CLAP score?
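For reference, here is a minimal sketch of how a CLAP score of this kind is commonly computed: the mean cosine similarity between paired audio and text embeddings. It assumes the embeddings have already been extracted with some CLAP checkpoint. The function name `clap_score` and the `scale=100.0` factor are assumptions for illustration (the numbers above look like cosine similarity times 100), not the script used by the repo authors.

```python
import numpy as np


def clap_score(audio_emb, text_emb, scale=100.0):
    """Mean cosine similarity between paired audio/text embeddings.

    audio_emb, text_emb: arrays of shape (n_samples, dim), where row i of
    each array belongs to the same sample. `scale` is an assumed
    convention (cosine * 100), not confirmed by the repo.
    """
    audio_emb = np.asarray(audio_emb, dtype=np.float64)
    text_emb = np.asarray(text_emb, dtype=np.float64)
    if audio_emb.shape != text_emb.shape:
        raise ValueError("audio and text embeddings must have the same shape")

    # L2-normalise each row so the row-wise dot product is a cosine similarity.
    a = audio_emb / np.linalg.norm(audio_emb, axis=1, keepdims=True)
    t = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)

    # Similarity of each audio clip with its own caption, then the average.
    per_pair = np.sum(a * t, axis=1)
    return scale * float(per_pair.mean())
```

Differences in the CLAP checkpoint, audio resampling, clip length, or caption truncation can each shift the score by several points, so those are worth comparing alongside the captions themselves.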
