issue when legal actions mask is dependant on current player #39

AdamLang96 · 2023-09-24T04:30:08Z

I have a custom environment where the legal actions depend on the state of the board and the current player , and when I try to train my first agent the legal_actions mask isn't computed correctly for the agent, but it is for the opponent. Im guessing the issue comes from the code below (found in SelfPlayWrapper). Since the legal_actions depend on current_player_num and agent_player_num != current_player_num it can not calculate the correct mask for the agent. Please let me know if you have any ideas on how to fix this

  def continue_game(self):
            observation = None
            reward = None
            done = None
            while self.current_player_num != self.agent_player_num:
                action = self.current_agent.choose_action(self, choose_best_action = False, mask_invalid_actions = True)
                observation, reward, done, _ = super(SelfPlayEnv, self).step(action)
                logger.debug(f'Rewards: {reward}')
                logger.debug(f'Done: {done}')
                if done:
                    break

            return observation, reward, done, None

The text was updated successfully, but these errors were encountered:

laymelek · 2023-11-08T22:31:06Z

Did you found a solution to this? I have the same problem when running Test. On the other hand while running Train, my agent does not care about the legal_actions what so ever... it doesnt call it at all and just chooses a random action num

AdamLang96 · 2023-11-10T04:26:26Z

Did you found a solution to this? I have the same problem when running Test. On the other hand while running Train, my agent does not care about the legal_actions what so ever... it doesnt call it at all and just chooses a random action num

Yeah this is my exact issue. Haven't found a solution yet

sakapadia · 2025-01-16T06:33:01Z

anyone find a solution?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

issue when legal actions mask is dependant on current player #39

issue when legal actions mask is dependant on current player #39

AdamLang96 commented Sep 24, 2023

laymelek commented Nov 8, 2023

Uh oh!

AdamLang96 commented Nov 10, 2023

Uh oh!

sakapadia commented Jan 16, 2025

Uh oh!

issue when legal actions mask is dependant on current player #39

issue when legal actions mask is dependant on current player #39

Comments

AdamLang96 commented Sep 24, 2023

laymelek commented Nov 8, 2023

Uh oh!

AdamLang96 commented Nov 10, 2023

Uh oh!

sakapadia commented Jan 16, 2025

Uh oh!