Lifan Sun's blog

Welcome to my blog.

  • NLP (16)
  • MLSys (14)
  • RecSys (1)
  • System (22)
  • C++ (2)
  • daily-life (1)
  • SE-Paper Reading (6)
  • LLVM (1)
  • Java (1)

Reading Notes: “FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness”

Mar 8, 2025

Reading Notes: “Efficient Memory Management for Large Language Model Serving with PagedAttention”

Mar 7, 2025

Reading Notes: “Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning”

Feb 23, 2025

Reading Notes: “GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism”

Feb 22, 2025

Distributed Training Basics

Feb 15, 2025

Reading Note: Megatron-LM v1

Feb 15, 2025

Quantization for NN Inference

Feb 6, 2025

Reading Note: TVM

Feb 1, 2025

Reading Note: Triton

Feb 1, 2025

Moore’s Law, and the future of computing beyond Moore’s Law

Jan 27, 2025

Deep Learning Performance Background

Jan 26, 2025

Reading Notes: MI300X vs H100 vs H200 Benchmark Part 1: Training – CUDA Moat Still Alive

Jan 26, 2025

An Architecture Overview of ML Systems

Jan 21, 2025

PMPP Reading Notes

Jan 21, 2025


© Lifan Sun 2023 - 2025