BAMM: Human Motion Modeling & Architecture

January 2026
25 min read
Generative Models, Motion Models, Transformers, BAMM
Loading...
Use arrow keys or click to navigate slides. Press 'F' or Fullscreen icon for best experience.

What You'll Learn

  • The Challenge of Current Text-to-Motion Models
  • BAMM Architecture: Motion Tokenizer & Masked SA Transformer
  • Training with Hybrid Attention Masking
  • Inference using Cascaded Motion Decoding
  • Classifier-Free Guidance (CFG)

Key Concepts Covered

Strategy combining masking patterns for robust training.

Iterative decoding process for finer motion control.

Discretizing continuous motion data.

Resources

Slide Overview

  • Challenges & Architecture (Slides 1-10)
  • Training Methodologies (Slides 11-20)
  • Inference & Guidance (Slides 21-28)
  • Evaluation & Results (Slides 29-end)

Further Reading