Sobhan Mohammadpour
Searching for a global minimum.
3441
Address: 2920 Ch de la Tour
Montréal, QC H3T 1N8
I’m a master’s student at Université de Montréal (MILA and CIRRELT), advised by Emma Frejinger and Pierre-Luc Bacon. I work on end-to-end, or decision-aware, learning: I try to find the real problem we should be optimizing, not the auxiliary loss that happens to be convenient. Concretely, I’ve worked on learning the features needed for inverse reinforcement learning. At the moment, I’m trying to define a gradient for the shortest path, and I’m working on finding a better smooth Bellman operator for reinforcement learning.
selected publications
- Arc travel time and path choice model estimation subsumed. arXiv preprint arXiv:2210.14351, 2022.