BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Flamingo: a Visual Language Model for Few-Shot Learning
Learning Transferable Visual Models From Natural Language Supervision
You Only Look Once: Unified, Real-Time Object Detection
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
DELTAR: Depth Estimation from a Light-weight ToF Sensor and RGB Image
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale