Skip to content

bwAI123/ClassicAIPapers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

36篇AI经典论文阅读清单 — 链接整理

来源:微信公众号「张小珺访谈|36篇经典论文演义:探索 AI 发展的所有论文!」 整理时间:2026-06-10


计算机视觉

01. AlexNet — ImageNet Classification with Deep Convolutional Neural Networks (2012)

02. VGGNet — Very Deep Convolutional Networks for Large-Scale Image Recognition (2014)

03. GoogLeNet/Inception — Going Deeper with Convolutions (2014)

04. ResNet — Deep Residual Learning for Image Recognition (2015)

05. U-Net — Convolutional Networks for Biomedical Image Segmentation (2015)

06. Batch Normalization — Accelerating Deep Network Training by Reducing Internal Covariate Shift (2015)

07. Faster R-CNN — Towards Real-Time Object Detection with Region Proposal Networks (2015)

08. YOLO — You Only Look Once: Unified, Real-Time Object Detection (2016)

09. MobileNet — Efficient Convolutional Neural Networks for Mobile Vision Applications (2017)

10. EfficientNet — Rethinking Model Scaling for Convolutional Neural Networks (2019)


自然语言处理

11. Transformer — Attention Is All You Need (2017)

12. BERT — Pre-training of Deep Bidirectional Transformers for Language Understanding (2018)

13. GPT-1 — Improving Language Understanding by Generative Pre-Training (2018)

14. GPT-2 — Language Models are Unsupervised Multitask Learners (2019)

15. RoBERTa — A Robustly Optimized BERT Pretraining Approach (2019)

16. T5 — Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (2019)

17. GPT-3 — Language Models are Few-Shot Learners (2020)


多模态与视觉语言

18. CLIP — Learning Transferable Visual Models From Natural Language Supervision (2021)

19. DALL-E — Zero-Shot Text-to-Image Generation (2021)

20. BLIP — Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation (2022)

21. Flamingo — A Visual Language Model for Few-Shot Learning (2022)

22. LLaVA — Visual Instruction Tuning (2023)

23. GPT-4V — The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision) (2023)


生成模型与扩散

24. GAN — Generative Adversarial Networks (2014)

25. VAE — Auto-Encoding Variational Bayes (2013)

26. StyleGAN — A Style-Based Generator Architecture for Generative Adversarial Networks (2019)

27. DDPM — Denoising Diffusion Probabilistic Models (2020)

28. Stable Diffusion — High-Resolution Image Synthesis with Latent Diffusion Models (2022)

29. DiT — Scalable Diffusion Models with Transformers (2023)


视觉Transformer与自监督

30. ViT — An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (2020)

31. MAE — Masked Autoencoders Are Scalable Vision Learners (2021)

32. SimCLR — A Simple Framework for Contrastive Learning of Visual Representations (2020)

33. MoCo — Momentum Contrast for Unsupervised Visual Representation Learning (2019)


强化学习

34. DQN — Playing Atari with Deep Reinforcement Learning (2013)

35. AlphaGo — Mastering the Game of Go with Deep Neural Networks and Tree Search (2016)

36. PPO — Proximal Policy Optimization Algorithms (2017)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages