Federated Dynamic Treatment Regime (FDTR)

Code for the paper Federated Offline Reinforcement Learning

Overview

Running simulations.py will

Generate a training dataset using random behavior policy
Train an FDTR policy
Train LDTR, LDTR (MV), and 3 different Q-learning policies (see the paper for details)
Evaluate the policies on K hospital sites

Results are saved as a CSV file and estimated parameters from Algorithm 1 are saved as a pickle file which contains a dictionary.

To begin the process simulations.py with the following options:

python simulations.py Hs_dim ${1} Ps_dim ${2} a_No ${3} H ${4} episodes_No ${5} K ${6}

where

There are three other files:

utils.py contains all functions
utils_sepsis.py contains aditional functions for the sepsis data analysis
sepsis_FDTR.py contains code to run the analysis using the MIMIV-IV data set which is publicly available

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
img		img
.DS_Store		.DS_Store
.gitattributes		.gitattributes
README.md		README.md
sepsis_FDTR.py		sepsis_FDTR.py
simulations.py		simulations.py
utils.py		utils.py
utils_sepsis.py		utils_sepsis.py