Thread

D
Dieter_be5:50 PMOpen in Slack
@user as a prevention against sneaky tiny-font/invisible instructions in emails etc, would it make sense to render the html of an email and do OCR?
:halo_wave:1

9 replies
IK
Innokentii Konstantinov (archestra team)5:58 PMOpen in Slack
Hi @user, nice to see you!
D
Dieter_be5:59 PMOpen in Slack
👋 hey !
IK
Innokentii Konstantinov (archestra team)6:02 PMOpen in Slack
My 2 cents is that it's definitely an interesting idea.
I think this hidden instructions anyway will be passed to the LLM input and archestra should detect them when trying to evaluate an unathorized tool call, but extra layer of protection before sounds solid anyway.
MK
Matvey Kukuy (archestra team)6:04 PMOpen in Slack
white text on a white background is not the biggest issue, there may be just a plain text prompt injection in the middle of 60-pages doc 😞
MK
Matvey Kukuy (archestra team)6:04 PMOpen in Slack
But idea is nice!
J(
joey (archestra team)6:37 PMOpen in Slack
nice to see you Dieter 🙂
yeah multi-modal security is.. tricky, If you're bringing it up as a hackathon idea - I think it's a great/ambitious idea!
D
Dieter_be6:41 PMOpen in Slack
It's just where my brain went when I saw the "look how easily matvey can hack clawdbot" post
D
Dieter_be6:42 PMOpen in Slack
especially since OCR is a solved problem now with several models doing this well (I think ?)
J(
joey (archestra team)6:59 PMOpen in Slack