blog | Hieu N. Nguyen

These notes are my attempt to distill and present materials in an on-policy manner (see this and this). I also try to use different mediums to communicate my research.

Learning by teaching - RL
- Multi-Armed Bandits #reinforcement-learning
  26.05.01
Scaling the Giants: A Guide to Efficient Parallelism in LLM Inference #llm-inference
25.12.01
Tản mạn #daily
25.09.01
Just know stuffs #research
25.07.30
Thinking in Language Models - The mechanistic questions
- Scaling compute #reasoning,llm
  25.09.14
- Learning to search #reasoning #LLM
  25.09.14
Optimization in Deep Learning: From convexity to invexity
- Learning as optimization #math
  22.04.02
- Convergence to critical point #math
  22.04.02
- Convergence of GD under Convexity #math
  22.04.02
- Beyond convexity #math
  22.04.02