Skip to content

Feature Request: more nuanced score for models #10

@paoloricciuti

Description

@paoloricciuti

Right now, the models are scored solely based on the amount of test that they pass...a more nuanced score that also involves how the tests are passing would be wonderful. This could involve:

  • Whether the model is using the MCP server or not
  • Whether the model is using the Test tool or not
  • The amount of step it took to complete
  • The number of tokens it took to complete
  • Possibly cost (?)

Other ideas?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions