-
Notifications
You must be signed in to change notification settings - Fork 652
FEAT: More Informative Attack Exceptions #1318
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FEAT: More Informative Attack Exceptions #1318
Conversation
jbolor21
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
minor comments!
hannahwestra25
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice change! my only concern is that it's easy to forget to add the with_execution_context when adding new attacks that don't inherit from prompt_sending or multi_prompt_sending but i'm also unsure how frequently we will be adding those
This has something that has bugged me for a while, but I hadn't thought of a good solution for it. I think the solution here is decent, and it definitely improves usability. And I don't want perfect to get in the way of good...
Old way
The exceptions take some digging and often don't help.
e.g. here is a retry for a target. Which target? You'll never know...
Now, here is an error in objective_target, not that you would know that either....
New way
It still takes wrapping critical pieces in a handler, but I think it's already quite good. And will improve as we update identifiers to be better.
Here is a new error in objective_scorer:
Here is an error in a converter
Similar errors are available for both adversarial and objective chats. Here are new retry messages which now contain the component