|
|
Claude Batch API
Author: Venkata Sudhakar
The Claude Batch API processes large volumes of requests asynchronously at half the cost of real-time API calls. Instead of sending requests one at a time and waiting for each response, you submit a batch of up to 10,000 requests in one call, Anthropic processes them in the background (typically within 24 hours), and you retrieve all results when done. This is ideal for any workload where you do not need an immediate response: processing every invoice received today, analysing every customer review from the past week, generating product descriptions for your entire catalogue, or running a quality check on all support tickets. Each request in the batch has a custom_id you choose (your database record ID works perfectly), plus the same parameters as a regular messages call. The batch returns a results file where each line is a JSON object containing your custom_id and either a succeeded result with the model response, or an errored result with the error details. This makes it easy to match responses back to your original records and identify any failures for retry. The below example shows an accounts payable team processing 500 vendor invoices received that day - extracting key fields from each invoice overnight so the finance team has a clean structured report waiting in the morning.
Polling for completion and retrieving results,
It gives the following output,
Submitting batch of 3 invoices...
Batch ID: msgbatch_01XyZ...
Status: in_progress
Status: in_progress | Completed: 0 | Errored: 0
Status: in_progress | Completed: 2 | Errored: 0
Status: ended | Completed: 3 | Errored: 0
=== INVOICE EXTRACTION RESULTS ===
[INV-001] TechSupply Co.
Description: USB-C cables and HDMI adapters
Total: Rs 10,325 | Due: 2025-04-15
[INV-002] Canteen Services
Description: March 2025 employee canteen meals
Total: Rs 124,740 | Due: On receipt
[INV-003] CloudHost India Pvt Ltd
Description: Annual server hosting Apr 2025 - Mar 2026
Total: Rs 283,200 | Due: 2025-04-14
Total payable this batch: Rs 418,265
Cost to process 500 invoices:
Real-time API: $0.16
Batch API: $0.08 (50% saving)
Batch processing is the right choice whenever immediacy is not required: nightly processing of all orders received that day, weekly sentiment analysis of all customer reviews, monthly processing of expense reports, and any ETL pipeline where AI enriches records before they go into a data warehouse. The 24-hour processing window is more than adequate for most business batch jobs. For very large batches (tens of thousands of records), split into multiple batches of 10,000 and submit them in parallel - Anthropic processes multiple active batches concurrently so total wall-clock time stays manageable.
|
|