Efficient logistics path optimization and scheduling using deep reinforcement learning and convolutional neural networks