Callable policy type directly (like alg4dm) #42
Comments
Yeah, I often think about this, e.g. JuliaPOMDP/POMDPs.jl#252. Changing the way people interact with policies would be a big change, so we would want to make sure that there is a good plan for transitioning. The other related issue is whether policies should return distributions of actions. If I were redesigning POMDPs.jl today, I think I would say (p::Policy)(s) = rand(action(p, s)). I have not thought very much about the utility of having the function call syntax as just syntactic sugar like you suggest.
Happy to consider proposals.
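For readers following along, here is a minimal sketch of the distribution-style call syntax mentioned above. It is an illustration under stated assumptions, not the POMDPs.jl API: `Policy` and `action` are local stand-ins for the package's names so the snippet runs on its own, `UniformPolicy` is a hypothetical type, and a plain `Vector` stands in for a real distribution object. Adding a call method to an abstract type requires Julia 1.3 or later.

```julia
# Stand-ins for POMDPs.Policy and POMDPs.action (assumption: defined locally
# so the sketch runs without the package installed).
abstract type Policy end
function action end

# Hypothetical policy whose `action` returns a collection of equally likely
# actions; the Vector plays the role of a distribution here.
struct UniformPolicy <: Policy
    actions::Vector{Symbol}
end
action(p::UniformPolicy, s) = p.actions

# The call syntax from the comment: sample one action from what `action` returns.
(p::Policy)(s) = rand(action(p, s))

p = UniformPolicy([:left, :right])
a = p(1)  # one of :left or :right, chosen at random
```

Under this design `action` describes the policy's action distribution and the call syntax is what draws a concrete action from it.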
I like the distribution idea, but haven't thought too much about the impact of that conceptual change. My original proposal was simply some syntactical sugar to mimic the I'll keep thinking about this.
I would expect the opposite.
@rejuvyesh Good to know. That would be much easier to transition to from our current semantics. The reason I thought It would also be nice for
But I would be open to having
I think in [PO]MDP literature, policy
Has it been considered to allow for callable policies directly? The algorithmsbook uses this syntax extensively.
For POMDPPolicies, we'd only need to define:
then we can do things like
Thoughts?
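The code snippets from the original post did not survive in this copy of the issue. As a rough illustration only, the following sketch shows what such syntactic sugar could look like. `Policy` and `action` are local stand-ins for the POMDPs.jl names, `ConstantPolicy` is a hypothetical policy type, and the forwarding definition is an assumption about the proposal rather than the author's exact code. With the real package one would presumably dispatch on `POMDPs.Policy` and forward to `POMDPs.action` instead. Adding a call method to an abstract type requires Julia 1.3 or later.

```julia
# Stand-ins for POMDPs.Policy and POMDPs.action (assumption: defined locally
# so the sketch runs without the package installed).
abstract type Policy end
function action end

# Hypothetical policy that always returns the same action.
struct ConstantPolicy{A} <: Policy
    a::A
end
action(p::ConstantPolicy, s) = p.a

# Syntactic sugar: calling a policy on a state forwards to `action`.
(p::Policy)(s) = action(p, s)

policy = ConstantPolicy(:up)
same = policy(3) == action(policy, 3)   # both spellings give the same action
actions = [policy(s) for s in 1:3]      # call syntax inside a comprehension
```

Because the call method only forwards to `action`, existing code that calls `action(policy, s)` keeps working unchanged.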