Category: ocr

How to Apply Vision Language Models to Long Documents

Learn how to apply powerful VLMs for long-context document understanding tasks. By Eivind Kjosbakken, Towards Data Science.

November 4, 2025

From Pixels to Plots

How I built an AI-powered prototype to turn images into insights. By Jens Winkelmann, Towards Data Science.

July 1, 2025

The Invisible Bug That Broke My Automation: How OCR Changed The Game

The evolution of AI in test automation: from locators to generative AI (Part 3). By Abdelkader HASSINE, Towards Data Science.

December 18, 2024

How Did Open Food Facts Fix OCR-Extracted Ingredients Using Open-Source LLMs?

Delve into an end-to-end machine learning project to improve the quality of the Open Food Facts database. Open Food Facts’ purpose is to create the largest open-source food database in the world. To this day, it has collected over 3 million products…

November 30, 2024