Tutorial #1

Title: Recent Advances in End-to-End Learned Image and Video Coding

Presenter: Prof. Heming Sun and Prof. Wen-Hsiao Peng

Prof. Heming Su
Prof. Wen-Hsiao Peng

Part I: Overview of Learned Image/Video Coding (by Prof. Peng; 15 mins)

  • Introduction to end-to-end learned image and video coding
  • The rate-distortion performance of SOTA learned image/video codecs
  • Standardization activities on neural image/video coding in JPEG and MPEG

Part II: End-to-End Learned Image Coding (by Prof. Sun; 70 mins)

  • Elements of end-to-end learned image coding
  • Review of a few notable tool features (e.g. fast context models)
  • Network pruning and quantization for learned image codecs
  • Implicit Neural Representation (INR)-based image coding systems
  • Real-time implementation of learned image codecs

Coffee Break (20 mins)

Part III: End-to-End Learned Video Coding (by Prof. Peng; 60 mins)

  • End-to-end learned video coding frameworks: residual coding, conditional coding, and conditional residual coding
  • Review of some notable systems
  • The explicit, implicit, and hybrid temporal buffering strategies
  • The rate-distortion-complexity trade-offs from the perspectives of coding frameworks and buffering strategies
  • Network quantization for learned video codecs

Part IV: Practical Implementation (30 minutes)

  • Emerging learned coding techniques for 3D/4D Gaussian Splatting and multi-modal large language modals
  • Open issues and concluding remarks

Venue: Lotus I