I am looking for someone to write an implementation of of an openai gym environment which is a wrapper around a specific http endpoint.
The environment simulates a single person game, where turns played is to be maximised. See attached payload.
Additionally, I would like the paired up learning agent (using openai baselines), to train a neural network for optimal gains.
This is a simple project. I expect both files combined to be less than 100 lines of code.
In order to be considered for this project, please name your favourite piece of furniture in your proposal.