Multi-agent deep reinforcement learning to enable dynamic TDD in a multi-cell environment