feat: Document automation run configuration options #378
Co-authored-by: openhands <openhands@all-hands.dev>
enyst
left a comment
I realize this is just documentation for the skill, but I'm a bit confused: why A/B testing, and why is it part of this skill?
First, running the reviewer agent on GitHub with a different LLM profile seems to me like stretching the term "A/B testing".
(1) Idk, what the user (PR author, PR reviewer) wants is for the reviewer agent to work. It's not really testing for user preferences or whatnot.
(2) It would be nice to take into account writing the LLM profile into the comment: Slack proposal
(3) For different LLMs, we can often intuit what kinds of problems they're likely to see and what they are less likely to see. This intuition, and the problems posed by bypassing it, are maybe not as strong in this case as they are in regular conversations... but still 🤔. I feel it would be more useful if we could control which LLM gets used. But I don't know how, so I'll make an issue on that...
@OpenHands Read the entire skill, and clone the automation/ repo from the same org for yourself to see exactly how it works:
Respond as a comment here. Be CONCISE.
I'm on it! enyst can track my progress at all-hands.dev
Read the full skill +
A/B testing is currently plugin preset only:
Each variant has:
Service design: validates exactly one of
One doc mismatch I found: runtime conversation tags are
This comment was created by an AI agent (OpenHands) on behalf of the user.
Posted the concise PR comment: #378 (comment)
@malhotra5 is this maybe something to fix?
Why
The automation service recently added clearer support for configurable run options, including execution timeouts and sandbox cleanup behavior. The OpenHands automation skill should surface those options so agents can create and update automations with the current API.
Summary
- `model`, `timeout`, `keep_alive`, and `repos` support.
- `keep_alive` sandbox cleanup behavior.
Issue Number
N/A
How to Test
- uv run --with pyyaml python scripts/sync_extensions.py --check
- plugins/issue-duplicate-checker.
- git diff --check
Video/Screenshots
N/A - documentation-only change.
Notes
This PR was created by an AI agent (OpenHands) on behalf of the user.