Skip to content

WebArena Shopping evaluator issues #366

@imenelydiaker

Description

@imenelydiaker

There is an issue with some WebArena shopping tasks:

  • On task 275: it's a search task where the agent is asked to search for "xbox". So the reference URL is __SHOPPING__/catalogsearch/result/?q=xbox. The agent (GenericAgent) gets to that URL correctly but is rewarded 0.
  • Same thing for task 274 and probably other tasks.

@xhluca do you have other task IDs with the same failure?

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions