jshn9515/deep-learning-notes

Source: https://github.com/jshn9515/deep-learning-notes

Deep Learning Notes

English | 简体中文

[dnnl-title banner image]

For a long time, I struggled with how to learn deep learning effectively.

Dive into Deep Learning is an excellent introductory book, but its update pace has gradually fallen behind the speed of progress in this field. Since the rise of Transformers, topics like CLIP, Diffusion, and vLLM have become increasingly important. Although there is no shortage of online material, most of it is scattered. One day you study Attention, the next day LoRA, and the day after that diffusion models. In the end, what often remains are only fragments, and it is hard to build a truly coherent understanding.

So I decided to systematically organize what I have learned. From the fundamentals of PyTorch, to Attention and Transformers, and then to GANs, CLIP, Stable Diffusion, and SAM3, I try to explain the core ideas, mathematical derivations, code implementations, and common pitfalls of each topic as clearly as possible. This repository is the public version of those notes. If you are also learning deep learning on your own, I hope it can be helpful to you.

📌 About These Notes

This project is primarily maintained and published in Quarto Markdown and built as a static website. Quarto Markdown (`.qmd`) is a plain-text format that extends Markdown with a YAML header and executable code cells, which makes it well suited for version control and continuous updates.
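As a concrete sketch, a minimal `.qmd` page combines a YAML header, prose, and executable code cells (this is an illustrative example, not a file from this repository):

````markdown
---
title: "Attention Basics"
format: html
---

## Scaled dot-product attention

Prose, math, and executable cells live side by side:

```{python}
import torch
x = torch.randn(2, 3)
print(x.shape)
```
````

When rendered with `quarto render`, the Python cell is executed and its output is embedded in the generated HTML.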

The content mainly includes:

  • PyTorch fundamentals and engineering practice
  • Attention mechanisms and Transformer-based models
  • Generative models, such as GANs, VAEs, and diffusion models
  • Multimodal models, such as CLIP
  • The Hugging Face ecosystem and its practical use
  • Practical notes covering the full workflow from data processing to training, inference, and deployment
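To give a flavor of the level of detail in the notes, scaled dot-product attention, the core operation of the Transformer, can be sketched in a few lines (a framework-agnostic NumPy illustration; not code taken from the repository):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.swapaxes(-2, -1) / np.sqrt(d_k)  # (seq_q, seq_k)
    scores -= scores.max(axis=-1, keepdims=True)    # for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # each row sums to 1
    return weights @ V                              # (seq_q, d_v)

# Toy shapes: 2 queries, 3 keys/values, d_k = d_v = 4
rng = np.random.default_rng(0)
Q = rng.normal(size=(2, 4))
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))
out = scaled_dot_product_attention(Q, K, V)
```

The notes themselves work through the batched, multi-head PyTorch version of this idea, along with masking and the surrounding Transformer blocks.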

To make the material easier to use, I also periodically prepare corresponding Jupyter Notebook versions:

  • Monthly Releases: relatively stable, packaged Notebook versions
  • GitHub Actions Artifacts: the latest build outputs

If you want a stable version, please check the Releases page. If you want the latest version, please check the Artifacts in GitHub Actions.

If you prefer generating the Notebook files from the source yourself, you can also install Quarto locally and use the quarto convert command to convert .qmd files into Jupyter Notebooks; by default the resulting .ipynb is written next to the source, and the --output option lets you choose a different path. For example:

quarto convert path/to/file.qmd

🔧 Environment

All code in this repository has been tested in the following environment:

  • Python 3.14
  • PyTorch 2.11

See requirements.txt for the full list of dependencies.

Before running the related content, please first enter the dnnl directory and install the dnnl library according to the instructions in dnnl/README.md. This library contains some custom implementations and utility functions used throughout the notes, and many examples will not run properly without it.

This project uses Transformers v5. If you are following other repositories or tutorials based on v4, there may be significant API differences (such as tokenizers and quantization configurations). Please refer to the official migration guide for adjustments.

🤝 Contributions

If you find an explanation unclear, notice a problem in the code, or have topics you would like me to add, feel free to contribute through Issues or Pull Requests.

Possible contributions include, but are not limited to:

  • Pointing out errors or inaccuracies in the notes
  • Adding clearer explanations, derivations, or code comments
  • Suggesting improvements to structure, wording, or formatting
  • Recommending topics or practical cases for future coverage

Since this is a project I am building and refining while learning, there will inevitably be places where my understanding is incomplete or my explanations are not precise enough. I read all helpful feedback carefully and try to improve the notes whenever possible.

If you would like to make a larger change, it is recommended to open an Issue first with a brief description so that we can discuss it in advance.

🙏 Acknowledgements

While organizing these notes, I have benefited from many excellent resources. In particular, Dive into Deep Learning by Aston Zhang, Zachary C. Lipton, Mu Li, and Alexander J. Smola, as well as Professor Hung-yi Lee’s deep learning lecture series, have helped me greatly in understanding many core concepts in deep learning.

This project website is built with Quarto.

📄 License

  • The notes in this repository are licensed under CC BY-NC 4.0.
  • The dnnl library is licensed under MIT.
