OCR with Google's AI technology (Cloud Vision API)
Updated Feb 22, 2023 - Python
Caption generator using lavis and argostranslate.
A lightweight and high-speed ComfyUI custom node for generating image captions using BLIP models. Optimized for both GPU and CPU environments to deliver fast and efficient caption generation.
This Python package lets you access the img2txt.io API through a clean interface.
Printext is a lightweight application that extracts text from images.
Turn an image into an audio story (upload an image and let AI tell a story about it).
This project showcases a comprehensive solution for converting images to text using OCR (Optical Character Recognition) and fine-tuning the extracted content. Leveraging powerful tools like AWS, Docker, OpenAI, and CI/CD pipelines, this system ensures high accuracy and efficiency.
An automated system for renaming and classifying images based on their content, using deep neural networks.
Image To Text with Florence 2