Will you release the benchmark dataset samples, evaluation metrics and methods? #9

SilasTHU · 2024-04-02T06:23:18Z

Currently we can only see the scores of these models, but I'm very interested in how you evaluate these agents.

SilasTHU changed the title from ~~Will you release the benchmark dataset, evaluation metrics and methods?~~ to "Will you release the benchmark dataset samples, evaluation metrics and methods?" on Apr 2, 2024
