-
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Paper • 2003.08934 • Published • 2 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 19 -
Emerging Properties in Self-Supervised Vision Transformers
Paper • 2104.14294 • Published • 4 -
Segment Anything
Paper • 2304.02643 • Published • 5
Collections
Discover the best community collections!
Collections including paper arxiv:2104.14294
-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 5 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1
-
FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation
Paper • 2403.06775 • Published • 5 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Data Incubation -- Synthesizing Missing Data for Handwriting Recognition
Paper • 2110.07040 • Published • 2 -
A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks
Paper • 1811.00056 • Published • 2
-
MIO: A Foundation Model on Multimodal Tokens
Paper • 2409.17692 • Published • 53 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Going deeper with Image Transformers
Paper • 2103.17239 • Published -
Training data-efficient image transformers & distillation through attention
Paper • 2012.12877 • Published • 2
-
Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology
Paper • 2203.00585 • Published • 2 -
Emerging Properties in Self-Supervised Vision Transformers
Paper • 2104.14294 • Published • 4 -
DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
Paper • 2404.06903 • Published • 21 -
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
Paper • 2404.07973 • Published • 32
-
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Paper • 2003.08934 • Published • 2 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 19 -
Emerging Properties in Self-Supervised Vision Transformers
Paper • 2104.14294 • Published • 4 -
Segment Anything
Paper • 2304.02643 • Published • 5
-
MIO: A Foundation Model on Multimodal Tokens
Paper • 2409.17692 • Published • 53 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Going deeper with Image Transformers
Paper • 2103.17239 • Published -
Training data-efficient image transformers & distillation through attention
Paper • 2012.12877 • Published • 2
-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 5 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1
-
Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology
Paper • 2203.00585 • Published • 2 -
Emerging Properties in Self-Supervised Vision Transformers
Paper • 2104.14294 • Published • 4 -
DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
Paper • 2404.06903 • Published • 21 -
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
Paper • 2404.07973 • Published • 32
-
FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation
Paper • 2403.06775 • Published • 5 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Data Incubation -- Synthesizing Missing Data for Handwriting Recognition
Paper • 2110.07040 • Published • 2 -
A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks
Paper • 1811.00056 • Published • 2