LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Great to have an 8B-scale BERT-like masked language model and see its benchmark performance! Is the inference inefficient? · Issue #2 · ML-GSAI/LLaDA