Wrong info in Gym tutorial #239

njustesen · 2022-06-10T11:26:54Z

I don't believe this is true in https://njustesen.github.io/botbowl/gym.html:

"The action space is discrete, the action is an int in the range 0 <= action_idx < len(action_mask)."

mrbermell · 2022-06-10T11:36:34Z

Can you elaborate? From what I can tell it's correct.

njustesen · 2022-06-10T11:44:30Z

If I can choose between blocking player A or player B, the sentence in the tutorial says that my action integer can be 0 or 1 but since it is a spatial action it has to be higher than the number of number of non-spatial actions to get past if action_idx < len(self.env_conf.simple_action_types): in _compute_action(self, action_idx: Optional[int], flip: Optional[bool] = None) -> List[Optional[Action]]:.

Instead, the integer is in the range [0, len(action_space)] which is implicit.

njustesen · 2022-06-10T11:47:23Z

Unless len(action_mask)=len(action_space) but that's just confusing, right?

mrbermell · 2022-06-10T11:52:38Z

Thanks for the clarification, I see your point and agree. We should explain how the action mask works here. I'll see what I can do!

mrbermell · 2022-06-13T21:12:33Z

How about something along these lines?

Action space

In botbowl's core engine all actions have a type, and some of the types also require a position. Read more about actions in the scripted bot tutorials. The gym environment has unrolled the spatial dimension into a one dimensional action space (see picture below). By doing so it becomes easy to use state-of-the-art algorithms, but it's worth considering that compared to many of the standard reinforcement learning benchmarks we have orders of magnitude larger action space.

The action of the environment in an integer, let's say action_idx = 352. You call env.step(action_idx) to step the environment with your action. But not all actions are legal at all times, this is where the action mask comes in. The action_mask is a vector of booleans that represents the legal actions, to check if your action action_idx is legal simply check if action_mask[action_idx] is true.

njustesen · 2022-06-21T11:27:11Z

This is better!

If the scripted bot tutorials contain important info about the action space, I think it should be included here. What are the paragraphs you are thinking of?

njustesen added the documentation label Jun 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Wrong info in Gym tutorial #239

Wrong info in Gym tutorial #239

njustesen commented Jun 10, 2022

mrbermell commented Jun 10, 2022

Uh oh!

njustesen commented Jun 10, 2022 •

edited

Loading

Uh oh!

njustesen commented Jun 10, 2022

Uh oh!

mrbermell commented Jun 10, 2022

Uh oh!

mrbermell commented Jun 13, 2022

Uh oh!

njustesen commented Jun 21, 2022

Uh oh!

Wrong info in Gym tutorial #239

Wrong info in Gym tutorial #239

Comments

njustesen commented Jun 10, 2022

mrbermell commented Jun 10, 2022

Uh oh!

njustesen commented Jun 10, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

njustesen commented Jun 10, 2022

Uh oh!

mrbermell commented Jun 10, 2022

Uh oh!

mrbermell commented Jun 13, 2022

Action space

Uh oh!

njustesen commented Jun 21, 2022

Uh oh!

njustesen commented Jun 10, 2022 •

edited

Loading