Separating Value Functions Across Time-Scales

Read the full paper: https://arxiv.org/abs/1902.01883

@article{separatingvalues2019,
  title={Separating value functions across time-scales},
  author={Romoff, Joshua and Henderson, Peter and Touati, Ahmed and Ollivier, Yann and Brunskill, Emma and Pineau, Joelle},
  journal={arXiv preprint arXiv:1902.01883},
  year={2019}
}

Our code is based on ikostrikov's pytorch-a2c-ppo-acktr repository.

@misc{pytorchrl,
  author = {Kostrikov, Ilya},
  title = {PyTorch Implementations of Reinforcement Learning Algorithms},
  year = {2018},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/ikostrikov/pytorch-a2c-ppo-acktr}},
}

Installation

PyTorch

Without CUDA:

conda install pytorch=0.4.1 -c pytorch

With CUDA:

conda install pytorch=0.4.1 cuda90 -c pytorch

(or cuda92, cuda80, or cuda75, depending on which CUDA version you have installed)

Baselines for Atari preprocessing

git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .

Other requirements

pip install -r requirements.txt

Replicating results

To replicate our Atari experiments, run:

python main.py --run-index [0-720]
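
The bracketed range above is not spelled out further in this README. A minimal sketch for sweeping several indices, assuming that `--run-index` takes a single integer and that valid values are 0 through 720 inclusive (both are assumptions, and the helper names `build_command` and `run_range` are hypothetical, not part of this repo):

```python
import subprocess
import sys

# Assumption: --run-index accepts one integer, valid values 0..720 inclusive.
FIRST_INDEX = 0
LAST_INDEX = 720


def build_command(run_index, python_executable=sys.executable):
    """Return the argument list for a single experiment configuration."""
    if not FIRST_INDEX <= run_index <= LAST_INDEX:
        raise ValueError(
            f"run_index must be in [{FIRST_INDEX}, {LAST_INDEX}], got {run_index}"
        )
    return [python_executable, "main.py", "--run-index", str(run_index)]


def run_range(start, stop, dry_run=True):
    """Print (dry_run=True) or execute the commands for start..stop inclusive."""
    commands = [build_command(i) for i in range(start, stop + 1)]
    for command in commands:
        if dry_run:
            print(" ".join(command))
        else:
            # check=True stops the sweep at the first failing run.
            subprocess.run(command, check=True)
    return commands


if __name__ == "__main__":
    # Dry run over the first three indices; set dry_run=False to launch them.
    run_range(0, 2, dry_run=True)
```

The runs are launched sequentially here; on a cluster you would more likely submit one job per index.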

Visualization

To visualize performance (requires Visdom), first start a Visdom server:

python -m visdom.server

Then run:

python visualize.py

License

This repo is CC-BY-NC licensed, as found in the LICENSE file.
