On enabling 5G Dynamic TDD by leveraging deep reinforcement learning and O-RAN: demo