Image Captioning with nlpconnect/vit-gpt2-image-captioning
- https://huggingface.co/nlpconnect/vit-gpt2-image-captioning
- The Illustrated Image Captioning using transformers
- Image captioning is the process of generating a caption, i.e. a textual description, from an input image. It requires both natural language processing and computer vision.
- facebook/detr-resnet-50, Sample:
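Example
1. Open: https://huggingface.co/spaces/SRDdev/Image-Caption
2. Upload an image by dragging and dropping it, or by clicking "Upload any Image".
3. Click Submit and wait a moment; a description of the image appears under Captions on the right.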
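
The same kind of captioning can also be run locally instead of through the hosted demo. The following is a minimal sketch, assuming the `transformers`, `torch` and `Pillow` packages are installed and that the model is loaded through the `image-to-text` pipeline task, as shown on the nlpconnect/vit-gpt2-image-captioning model card. The helper names `load_captioner` and `caption_image` are hypothetical and exist only for illustration. Newer `transformers` releases may name or handle this task differently.

```python
from typing import Callable, Optional


def load_captioner():
    """Build an image-to-text pipeline for the ViT + GPT-2 captioning model."""
    # Imported lazily so caption_image() can be used with any other
    # captioner callable without transformers being installed.
    from transformers import pipeline

    # The model weights are downloaded on first use.
    return pipeline("image-to-text", model="nlpconnect/vit-gpt2-image-captioning")


def caption_image(image: str, captioner: Optional[Callable] = None) -> str:
    """Return a caption for `image` (a local file path or an image URL).

    `captioner` is any callable that returns a list of dicts with a
    "generated_text" key, which is the output shape of the pipeline above.
    """
    if captioner is None:
        captioner = load_captioner()
    results = captioner(image)
    # Take the first result and trim surrounding whitespace.
    return results[0]["generated_text"].strip()
```

Calling `caption_image("photo.jpg")` with a real image path loads the model and returns a single caption string. When captioning many images, build the pipeline once with `load_captioner()` and pass it in as `captioner` to avoid reloading the model each time.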