Lightning talk at Frontiers of Online Reinforcement Learning Workshop, IMSI
Talk title: "An Oracle Taxonomy for LM Post Training".
Complete list of talks, milestones, submissions, and releases.
Talk title: "An Oracle Taxonomy for LM Post Training".
Thesis title: "Structure and Reliability in Interactive Decision Making".
Presented our ongoing work on LLM reasoning.
Presentation venue: Tangier, Morocco.
Conference location: Los Angeles, CA.
Completed PhD proposal defense milestone.