Learning for Planning, not Reaction

I am currently researching in the general problem set of Continual sequential decision making & Long horizon planning, trying to create algorithms with tools from meta learning, probabilistic inference, and latent planning. We recieve advising from Professor Ying Nian Wu from UCLA Statistics & Data Science Department (we previously recieved advising from Professor Sicun Gao from UCSD's Computer Science & Engineering Department and dived into this problem from an search & optimization perspective).

Artistic representation of RL as inference
Artistic representation of planning as inference (credit to GenSpark for generation)
Optimization Codebase ❌ Planning Codebase (Closed repo for now, will open later)