About LoongForge

A training framework born from real-world, large-scale production workloads, and shared back with the community

🐉 Our Story

LoongForge didn't start as an open-source project. It grew out of AIAK-Training, Baidu Baige's commercial training acceleration offering delivered to enterprise customers (previously closed-source), after years of hardening under real production workloads.

Before going open source, LoongForge was already powering large-scale models in production:

It now joins the Baige Loong open-source series, named after the traditional Chinese loong boat (龙舟), a symbol of coordinated power and forward momentum. Sister project: LoongFlow (A Thinking & Learning Framework for Expert-Grade AI Agents).

🧭 On the Horizon

A glimpse of what is next:

  • 📦 Continuous coverage of frontier foundation models
  • 🤖 Deeper investment in Embodied AI training capabilities
  • ⚡ Ongoing training-performance optimization driven by real-world workloads
  • 💎 Continued enhancement and optimization of Kunlun XPU support

📄 License & Citation

License

LoongForge is released under the Apache License 2.0. Some files are derived from third-party open-source projects; please consult the file headers for their specific notices.

Citation

@software{LoongForge2026,
  title  = {LoongForge: A modular, scalable, high-performance training framework for LLMs, VLMs, diffusion, and embodied models},
  author = {{The LoongForge Authors}},
  year   = {2026},
  url    = {https://github.com/baidu-baige/LoongForge}
}

🙏 Acknowledgments

LoongForge is built upon NVIDIA's Megatron-LM. We also referenced and drew inspiration from excellent open-source projects, including Transformers, LLaMA-Factory, and Megatron-Bridge. We sincerely thank these communities for their outstanding contributions.