If you have experience with
1. Q-Learning
2. Sarsa
3. POMDP
I will Provide you the environment code + Q-Learning Algo code as well and you just need to integrate together to make them work.
Please read the job description and bid.
Thanks
Hi! I'm a PhD candidate studying Reinforcement Learning algorithms and applications. Your project sounds like something that can be delivered as per your request. If you need any modifications in the implemented environment or algorithms, I can do that as well! Let me know if you're interested in getting it done soon!