Skip to content
#

flickr30k

Here are 16 public repositories matching this topic...

Image captioning model using EfficientNetB0 as encoder and a custom Transformer decoder, trained on the Flickr30k dataset. Demonstrates full model architecture, preprocessing, and BLEU-based evaluation in TensorFlow. Built as an educational resource to explain Transformer architecture step-by-step.

  • Updated Jun 20, 2025
  • Jupyter Notebook

"Flickr30k_image_captioning" is a project or repository focused on image captioning using the Flickr30k dataset. The project aims to develop and showcase algorithms and models that generate descriptive captions for images.

  • Updated May 2, 2023
  • Jupyter Notebook

This project aims to provide real-time visual-to-audio conversion, empowering visually impaired users by describing images through generated captions and synthesized audio. The system employs a Transformer-based image captioning model and integrates with a browser extension for seamless functionality.

  • Updated Nov 28, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the flickr30k topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the flickr30k topic, visit your repo's landing page and select "manage topics."

Learn more