Category: gemini

  • Detecting and Editing Visual Objects with Gemini

    A practical guide to identifying, restoring, and transforming elements within your images. By Laurent Picard; first appeared on Towards Data Science.

  • How to Leverage Slash Commands to Code Effectively

    Learn how I use slash commands to be a more efficient engineer. By Eivind Kjosbakken; first appeared on Towards Data Science.

  • How to Optimize Your AI Coding Agent Context

    Make your coding agents more efficient. By Eivind Kjosbakken; first appeared on Towards Data Science.

  • How to Scale Your LLM Usage

    Learn how to increase your LLM usage to become more productive. By Eivind Kjosbakken; first appeared on Towards Data Science.

  • How to Use Gemini 3 Pro Efficiently

    Learn the pros and cons of Gemini 3 Pro, based on testing with both coding and console usage. By Eivind Kjosbakken; first appeared on Towards Data Science.

  • Generating Consistent Imagery with Gemini

    A practical guide to building a prompt-based generation pipeline for your image library. By Laurent Picard; first appeared on Towards Data Science.

  • The Rise of Semantic Entity Resolution

    Semantic entity resolution uses language models to bring an increased level of automation to schema alignment, blocking (grouping records into smaller, efficient blocks for all-pairs comparison at quadratic, n² complexity), matching, and even merging duplicate nodes and edges. In the past, entity resolution systems relied on statistical tricks such…

  • Unlocking Multimodal Video Transcription with Gemini

    Explore how to transcribe videos with speaker identification in a single prompt. By Laurent Picard; first appeared on Towards Data Science.

  • Google’s URL Context Grounding: Another Nail in RAG’s Coffin?

    Google’s hot streak in AI-related releases continues unabated. Just a few days ago, it released a new tool for Gemini called URL context grounding. URL context grounding can be used stand-alone or combined with Google search grounding to conduct deep dives into internet content. What is…

  • Google’s AlphaEvolve Is Evolving New Algorithms — And It Could Be a Game Changer

    Large Language Models have undeniably revolutionized how many of us approach coding, but they’re often more like a super-powered…

  • Step-by-Step Guide to Build and Deploy an LLM-Powered Chat with Memory in Streamlit

    In this post, I’ll show you step by step how to build and deploy a chat powered by an LLM (Gemini) in Streamlit and monitor the API usage on Google Cloud Console. Streamlit is a Python framework that makes it super easy to turn your…

  • Building a Scalable and Accurate Audio Interview Transcription Pipeline with Google Gemini

    This article is co-authored by Ugo Pradère and David Haüet. How hard can it be to transcribe an interview? You feed the audio to an AI model, wait a few minutes, and boom: perfect transcript, right? Well… not quite. When it comes to…

  • Building Autonomous Multi-Tool Agents with Gemini 2.0 and LangGraph

    A practical tutorial with full code examples for building and running multi-tool agents. By Youness Mansar, on Towards Data Science.

  • Google Gemini Is Entering the Advent of Code Challenge

    An open-source project to explore the capabilities and limitations of LLMs on coding challenges. What is this about? If 2024 taught us anything in the realm of Generative AI, then it is that coding is one of the most promising…