API Connect

Join this online group to communicate across IBM product users and experts by sharing advice and best practices with peers and staying up to date regarding product enhancements.


#API Connect
#Applicationintegration
#APIConnect

How to Handle Images & Graphs in PDF/Word AI Analysis?

  • 1.  How to Handle Images & Graphs in PDF/Word AI Analysis?

    Posted Tue August 26, 2025 09:25 AM

    🚀 I'm building a tool on www.getbot.ai where users can upload PDF/Word documents.
    Right now, I extract the text and send it to AI for analysis.

    But here's the challenge:
    📌 Many documents contain images, graphs, or charts, and I want to handle them in a smarter way.

    Some approaches I'm considering:

    • πŸ“ OCR (Tesseract, PaddleOCR, AWS Textract, Azure Read, etc.) β†’ Extract text from images inside the docs.

    • πŸ‘€ Vision models (like GPT-4o, Gemini, Claude with vision, LLaVA, Donut, etc.) β†’ Interpret graphs/charts/images directly.

    • πŸ”— Hybrid workflow β†’ First OCR the image, then pass both raw text + AI-generated description of the visual content into the analysis pipeline.

    • πŸ—‚ Embedding strategies β†’ Store text + image captions as embeddings for semantic search and context retrieval.
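
    To make the hybrid workflow concrete, here is a minimal sketch of how the OCR text and the AI-generated description could be merged into one context block per page. Everything in it is an assumption for illustration: `ImageContext`, `describe_image`, and `build_prompt_context` are hypothetical names, and `run_ocr` / `run_caption` are placeholders for whichever OCR engine and vision model end up being used.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ImageContext:
    """OCR text and model-written description for one embedded image."""
    page: int
    ocr_text: str
    caption: str


def describe_image(
    image_bytes: bytes,
    page: int,
    run_ocr: Callable[[bytes], str],      # hypothetical: wraps an OCR engine
    run_caption: Callable[[bytes], str],  # hypothetical: wraps a vision model
) -> ImageContext:
    """Run both steps on one image and keep the results together."""
    return ImageContext(
        page=page,
        ocr_text=run_ocr(image_bytes).strip(),
        caption=run_caption(image_bytes).strip(),
    )


def build_prompt_context(page_text: str, images: List[ImageContext]) -> str:
    """Append one labeled block per image to the extracted page text."""
    parts = [page_text.strip()]
    for i, img in enumerate(images, start=1):
        parts.append(
            f"[Image {i}, page {img.page}]\n"
            f"OCR text: {img.ocr_text or '(none)'}\n"
            f"Description: {img.caption or '(none)'}"
        )
    return "\n\n".join(parts)


if __name__ == "__main__":
    # Stub callables stand in for a real OCR engine and vision model.
    ctx = describe_image(b"fake-bytes", 3, lambda b: "Q1 42", lambda b: "bar chart")
    print(build_prompt_context("Body text of page 3.", [ctx]))
```

    Passing the OCR and captioning steps in as callables keeps the merging logic independent of any particular provider, so the same labeled blocks could also be what gets embedded for semantic search.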

    💡 Questions for the community:
    👉 What's the most practical way to analyze images/graphs in documents so the AI can understand them well?
    👉 Any tools, libraries, or best practices you'd recommend for handling this at scale?
    👉 If an entire PDF is image-based and 30–40+ pages long, what's the best approach to extract and process the content efficiently?

    Thanks in advance



    ------------------------------
    Afgan Shahguliyev
    Co Founder
    FutureTech Nexus
    Richmond
    ------------------------------


  • 2.  RE: How to Handle Images & Graphs in PDF/Word AI Analysis?

    Posted Wed August 27, 2025 01:41 AM

    Please keep questions here about IBM API Connect. There are other communities and forums for more general content and AI; this one is just for IBM API Connect.



    ------------------------------
    Chris Dudley
    ------------------------------