(((ل()(ل() 'yoav))))👾@yoavgo·Reply
@srush_nlp (also i do think it is important to distinguish "policies" where the actions are tokens, from policies where the actions are, say, tool calls or reasoning steps. so i do not think we should abstract over these things..)