Agents and environment implementation #2081
Rakesh-123-cryp
started this conversation in
Ideas
Replies: 1 comment
-
Thanks for reaching out! I see some nice ideas, but I'm missing a bit of context here. Is this part of GSoC? What problem are you trying to solve? Do you have specific questions or ideas you want to discuss?
-
Below is the wrapper class.
This class can be used with any TensorFlow or Keras-RL agent, and could also be extended to cover OpenAI Baselines.
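The wrapper class attachment itself is not reproduced in this thread, so the following is only a minimal sketch of what such a wrapper might look like. The class name `AgentWrapper`, the method name `act`, and the assumption that the wrapped agent exposes a Keras-RL-style `forward(observation)` method are all hypothetical; libraries differ, which is why the action method name is configurable.

```python
class AgentWrapper:
    """Hypothetical adapter giving TensorFlow / Keras-RL / Baselines-style
    agents a single uniform interface for the environment loop."""

    def __init__(self, agent, action_fn_name="forward"):
        self.agent = agent
        # The method that maps an observation to an action differs between
        # libraries (e.g. Keras-RL uses `forward`), so it is configurable.
        self._act = getattr(agent, action_fn_name)

    def act(self, observation):
        # Single entry point the visualiser / environment loop calls.
        return self._act(observation)
```

A Baselines-style agent could then be wrapped with a different `action_fn_name` without changing the calling code.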
As for the agent's environment, we can either let the user integrate a Gym environment and project it onto the web-based visualiser, or provide Mesa's built-in functionality for creating environments, which could also lower memory usage and possibly speed up visualisation. When projecting an OpenAI Gym environment onto the visualiser, we could additionally provide a hyper-parameter tuner so the user can observe its effect on the environment.
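One way the projection could work is a driver loop that steps a Gym-style environment and hands each transition to a visualiser callback. This is a sketch under assumptions: `run_episode` and the `render_fn` hook are hypothetical names, and the environment is assumed to follow the classic Gym `reset()` / `step(action)` API.

```python
def run_episode(env, policy, render_fn, max_steps=100):
    """Step a Gym-style environment and push each state to a visualiser.

    `render_fn` stands in for the web-based visualiser hook; it receives
    (step, observation, reward) after every transition.
    """
    observation = env.reset()
    total_reward = 0.0
    for step in range(max_steps):
        action = policy(observation)
        observation, reward, done, _info = env.step(action)
        total_reward += reward
        render_fn(step, observation, reward)  # push this frame to the visualiser
        if done:
            break
    return total_reward
```

A hyper-parameter tuner could then re-run this loop with different policy settings and compare the returned episode rewards.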
We could also create an experience-replay buffer library that holds each episode's actions, observations, and rewards, along with end-of-episode metrics such as success rate, average reward, cumulative reward, and short- and long-term risks.
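A minimal sketch of such a buffer, storing whole episodes with summary metrics computed at episode end. The class and method names are hypothetical, the success criterion is a caller-supplied flag, and only the metrics mentioned above that can be computed from rewards alone are shown.

```python
from collections import deque


class EpisodeReplayBuffer:
    """Hypothetical buffer holding whole episodes plus end-of-episode metrics."""

    def __init__(self, capacity=1000):
        # Oldest episodes are evicted once capacity is reached.
        self.episodes = deque(maxlen=capacity)

    def add_episode(self, transitions, success=False):
        # `transitions` is a list of (action, observation, reward) tuples.
        rewards = [r for _action, _obs, r in transitions]
        metrics = {
            "success": success,
            "cumulative_reward": sum(rewards),
            "avg_reward": sum(rewards) / len(rewards) if rewards else 0.0,
        }
        self.episodes.append({"transitions": transitions, "metrics": metrics})

    def success_rate(self):
        # Fraction of stored episodes flagged as successful.
        if not self.episodes:
            return 0.0
        return sum(e["metrics"]["success"] for e in self.episodes) / len(self.episodes)
```

Risk-style metrics would need additional inputs (e.g. per-step reward variance or domain-specific signals), so they are left out of this sketch.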