tl  tr
  Home | Tutorials | Articles | Videos | Products | Tools | Search
Interviews | Open Source | Tag Cloud | Follow Us | Bookmark | Contact   
 Generative AI > Large Language Models > Claude Vision - Analysing Images with Claude

Claude Vision - Analysing Images with Claude

Author: Venkata Sudhakar

Claude can read and reason about images just as naturally as text. You pass an image alongside your text prompt and Claude describes what it sees, extracts information, compares visuals, or answers questions about the content. For businesses this opens up powerful workflows that previously required expensive custom computer vision models: reading handwritten delivery receipts, extracting line items from invoice photos taken on a phone, checking whether a product photo meets listing quality standards, or identifying damage in a customer-submitted claim photo. All of these work with a simple API call - no model training, no labelling data.

Images are passed in the content array of a user message as a block with type "image". You can provide the image as a base64-encoded string (for images stored locally or in your database) or as a URL (for images hosted on a public web server). Claude supports JPEG, PNG, GIF, and WebP. You can include multiple images in a single message - useful for before/after comparisons, multi-page documents, or asking Claude to compare two product photos side by side.

The below example shows three business image tasks: extracting product details from a product listing image, reading a handwritten expense receipt, and checking whether a customer-submitted photo shows actual product damage.


It gives the following output,

=== Product Photo Analysis ===
{
  "category": "Laptop",
  "brand": "Dell",
  "colour": "Silver",
  "condition": "New",
  "listing_quality": {
    "acceptable": true,
    "lighting": "Good - evenly lit, no harsh shadows",
    "focus": "Sharp",
    "background": "Clean white background - ideal for marketplace listing"
  }
}

It gives the following output,

=== Receipt Extraction ===
{
  "vendor_name": "Taj Darbar Restaurant",
  "date": "2025-03-18",
  "total_amount": 2340.00,
  "currency": "INR",
  "items": [
    {"description": "Butter Chicken", "amount": 480},
    {"description": "Dal Makhani",     "amount": 320},
    {"description": "Garlic Naan x3", "amount": 180},
    {"description": "Soft Drinks x4", "amount": 280},
    {"description": "Service Charge", "amount": 126},
    {"description": "GST 18%",        "amount": 954}
  ],
  "payment_method": "Credit Card"
}

=== Damage Assessment ===
Damage is clearly visible: a deep crack runs across the top-right corner of
the laptop lid. The crack pattern and scuff marks on the packaging are
consistent with an impact during shipping rather than pre-existing damage.
RECOMMENDATION: APPROVE_CLAIM - damage is genuine and shipping-related.

Vision tasks where Claude adds immediate business value: reading handwritten forms and receipts (eliminates manual data entry), quality-checking product photos before marketplace listing (catches blurry or inappropriate images automatically), processing insurance claim photos (triage before a human adjuster sees them), extracting text from scanned documents and business cards, and comparing before/after photos for maintenance or renovation projects. For high-volume image processing, use the Batch API covered in Tutorial 286 to process thousands of images overnight at 50% cost reduction.


 
  


  
bl  br