Skip to content

Coop-mining environment bug #293

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
linfangu opened this issue Mar 9, 2025 · 0 comments
Open

Coop-mining environment bug #293

linfangu opened this issue Mar 9, 2025 · 0 comments

Comments

@linfangu
Copy link

linfangu commented Mar 9, 2025

I have been using the cooperative mining environment and training agents to perform the task. I encountered two potential issues related to multiple agents shooting at gold:

  1. Limited Shot Registration:

When more than two agents attempt to shoot at gold simultaneously, only the first two agents receive rewards. Although multiple agents can execute the shooting action at the same time, only two shots are registered and counted as a mining event (registering mining event in lua code).

  1. Gold Persistence and Excess Rewards:

When more than two agents shoot at the gold, the gold does not disappear immediately, even though two agents receive rewards. This results in agents seemingly receiving double or triple rewards for mining the same gold.

I have recorded a video with three agents, where I display each agent's actions and rewards at each time step. In the video, action index 7 corresponds to the shooting action. Reward for iron mining grants 0.9, and gold mining grants 5.9 (after applying a -0.1 penalty per shooting action).

Video Link: https://drive.google.com/file/d/1PgsrAodoNAD5wrQrW6d9IjnGyIXkWDm7/view?usp=sharing
The issues occur frequently in the last 30 seconds of the video.

I appreciate any insights on why this might happen, thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant