The code has been thoroughly tested on Ubuntu, which is the recommended operating system for this project.
Running under WSL on Windows has not been formally tested, but it should work correctly as well.
- Make sure that Python 3.8 is installed and active (via a virtual environment or conda environment).
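- For example, with conda (the environment name here is arbitrary):
conda create -n polycraft python=3.8
conda activate polycraft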
- Make sure Java 8 is installed and active.
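- On Ubuntu, for example, Java 8 can be installed with:
sudo apt install openjdk-8-jdk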
- Follow the installation instructions for Polycraft (PAL).
- Navigate to the pal directory and run the following command:
xvfb-run -s '-screen 0 1280x1024x24' ./gradlew --no-daemon --stacktrace runclient
- This will run Polycraft independently in headless mode. Gradle will install any dependencies the Java runtime needs, and eventually the message "Minecraft finished loading" will appear in the log output, which signifies that Polycraft is ready to use. Exit the application.
- Follow the installation instructions for Numeric-SAM (N-SAM).
- Install all of this project's requirements with pip:
python -m pip install -r requirements.txt
- While installing PAL, running ./setup_linux_headless.sh may fail because of Windows-style (CRLF) line endings; strip the carriage returns with the following command and then try again:
sed -i -e 's/\r$//' setup_linux_headless.sh
- If you get an error while installing gym:
pip install setuptools==66
- If you still get errors while installing gym:
pip install wheel==0.38.4
- You need to update all the paths in the config.py file.
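- Illustratively (the variable names below are placeholders, not the real ones; use the names that actually appear in config.py):
PAL_PATH = "/home/<user>/pal"      # placeholder: path to your Polycraft (PAL) checkout
ENHSP_PATH = "/home/<user>/enhsp"  # placeholder: path to your ENHSP installation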
- Now you can run the demo agent with the following command:
python demo.py
- The demo agent performs random actions in order to solve the environment.
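- A minimal sketch of such a random-action loop (assuming the Gym-style reset/step API that BasicMinecraft exposes; demo.py may differ in its details):
env = BasicMinecraft(visually=True)
state = env.reset()
done = False
while not done:
    action = env.action_space.sample()  # sample a random action
    state, reward, done, info = env.step(action)
env.close()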
- To run only the Polycraft server, you can use the following code:
env = BasicMinecraft(visually=True, keep_alive=True)
env.reset()
env.close()
- To run a custom RL agent, use the following code (PPO here comes from stable-baselines3):
from stable_baselines3 import PPO

env = BasicMinecraft(visually=True)
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=1000)
env.close()
- This is the same pattern used in playground.py.
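- After training, you can save and reload the policy with the standard stable-baselines3 API (the file name here is just an example):
model.save("ppo_polycraft")
model = PPO.load("ppo_polycraft", env=env)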
- If you would like to export the expert trajectories for planning or behavioural cloning:
python custom_agent.py
- The agent crafts a wooden pogo stick by following a list of commands (my_script.txt).
- If you would like to train the agent to learn the last k actions of an environment, use:
env = PolycraftGymEnvKLA(BasicMinecraft, k=1, expert_actions=11, visually=True)
- You can edit "my_script.txt" as you like; set expert_actions to the number of lines in the file.
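- For example, you can keep expert_actions in sync with the script automatically (a small sketch, assuming my_script.txt is in the working directory):
with open("my_script.txt") as f:
    expert_actions = sum(1 for line in f if line.strip())
env = PolycraftGymEnvKLA(BasicMinecraft, k=1, expert_actions=expert_actions, visually=True)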
- To start a planning agent:
enhsp = ENHSP()
plan = enhsp.create_plan()
model = FixedScriptAgent(env, script=plan)
- You need to update the ENHSP location in the config.py file and change the Java version accordingly.
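- To inspect the plan before executing it (a small sketch, assuming create_plan() returns the plan as a list of action strings):
print("\n".join(plan))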
- To see the model's learning results, run the following command in a shell:
tensorboard --logdir logs
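- TensorBoard will only have data to show if training wrote event files to the logs directory; with stable-baselines3 you control this through the tensorboard_log argument (a sketch reusing the PPO example above):
model = PPO("MlpPolicy", env, verbose=1, tensorboard_log="logs")
model.learn(total_timesteps=1000)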
- The code of our Hybrid approach can be seen in:
agents/exploring_sam.py
agents/hybrid_ppo_model.py
- Recreate the maps:
python constructor.py
- Run the offline RL algorithms:
python playground_offline.py
- Run the offline N-SAM algorithm:
python playground_nsam.py
- Run the online algorithms:
python playground_online.py
- Run the Hybrid algorithm:
python playground_online_hybrid.py
- Note that to reproduce the Wooden Sword task, use the equivalent file whose name ends with "sword".
If you find our work interesting or the repo useful, please consider citing this paper:
@misc{benyamin2025integratingreinforcementlearningaction,
      title={Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex Tasks},
      author={Yarin Benyamin and Argaman Mordoch and Shahaf S. Shperberg and Roni Stern},
      year={2025},
      eprint={2502.13006},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2502.13006},
}