A new technique for Image Captioning based on Hierarchical Clustering, K-means Clustering and Deep Learning

Image captioning is the task of generating a natural-language caption for an image. We explore new models and analyse their performance. A key challenge is that the model must determine the relationships between the subjects in an image and describe the scene accurately. This has been accomplished with encoder-decoder models, in which a CNN extracts features from the image and a language model is trained to generate the caption word by word. We seek to make this process more efficient and more accurate using clustering techniques.
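To illustrate what "word by word" generation means, here is a minimal sketch of a greedy decoding loop. It is not the notebook's model: `next_word_scores`, `TOY_SCORES`, and the `<start>`/`<end>` tokens are hypothetical stand-ins, and greedy selection is an assumption (the text does not say which decoding strategy is used). A real decoder would condition on the CNN image features, which this toy ignores.

```python
# Toy stand-in for a trained decoder: previous word -> next-word scores.
# Hypothetical values, used only to make the loop runnable.
TOY_SCORES = {
    "<start>": {"a": 0.9, "dog": 0.1},
    "a": {"dog": 0.7, "ball": 0.3},
    "dog": {"runs": 0.6, "<end>": 0.4},
    "runs": {"<end>": 1.0},
}


def next_word_scores(image_features, words):
    """Hypothetical decoder step. A real model would use image_features."""
    return TOY_SCORES.get(words[-1], {"<end>": 1.0})


def greedy_caption(image_features, max_len=10):
    """Build a caption one word at a time, always taking the top-scoring word."""
    words = ["<start>"]
    for _ in range(max_len):
        scores = next_word_scores(image_features, words)
        best = max(scores, key=scores.get)
        if best == "<end>":
            break
        words.append(best)
    # Drop the start token before joining the caption.
    return " ".join(words[1:])


print(greedy_caption([0.0, 0.0]))
```

The loop stops either when the decoder emits the end token or when `max_len` words have been produced, whichever comes first.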

In most previous work, the models were found to be excessively data-hungry and time-consuming to train. We seek to address this using clustering techniques. We followed two approaches:

Flow Diagram

Clustering on images

We perform hierarchical agglomerative clustering on features extracted from images.
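As a rough illustration of the idea, the sketch below runs bottom-up (agglomerative) clustering on a handful of toy feature vectors using only the standard library. The Euclidean distance, the average-linkage rule, the stopping criterion (a fixed number of clusters), and all function names are assumptions made for this example; the text above does not specify them. In practice a library implementation such as scikit-learn's `AgglomerativeClustering` would normally be used on the CNN features.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def average_linkage(features, c1, c2):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(euclidean(features[i], features[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def agglomerative_clusters(features, n_clusters):
    """Merge the two closest clusters until n_clusters remain.

    Returns a list of clusters, each a sorted list of indices into features.
    """
    if n_clusters < 1:
        raise ValueError("n_clusters must be at least 1")
    # Start with every image in its own cluster.
    clusters = [[i] for i in range(len(features))]
    while len(clusters) > n_clusters:
        # Find the pair of clusters with the smallest average-linkage distance.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: average_linkage(features, clusters[p[0]], clusters[p[1]]),
        )
        # a < b always holds, so deleting b leaves index a valid.
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Toy 2-D "features"; real CNN features would have far more dimensions.
toy_features = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0], [0.0, 0.1]]
print(agglomerative_clusters(toy_features, n_clusters=2))
```

This naive version recomputes all pairwise cluster distances at every merge, so it is only suitable for small examples.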
