
OpenAI Batch API for Cost-Efficient Processing

Author: Venkata Sudhakar

The OpenAI Batch API lets you submit large volumes of requests at a lower cost by processing them asynchronously. Instead of sending each request in real time, you bundle all requests into a JSONL file, upload it, and retrieve the results after processing completes, typically within 24 hours. For ShopMax India, this is ideal for overnight tasks like generating product descriptions for thousands of SKUs.

Each line in the JSONL file is a self-contained request object with a custom_id, an HTTP method, a url endpoint, and a body containing the usual chat completions payload. You upload the file using the Files API, create a batch job referencing the file ID, poll for completion, and then download the output file.
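For example, a single line in the JSONL file might look like the following (the custom_id, model, and prompt shown here are illustrative):

```json
{"custom_id": "sku-1001", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "gpt-4o-mini", "messages": [{"role": "user", "content": "Write a short product description for a stainless steel water bottle."}], "max_tokens": 200}}
```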

The following example generates product descriptions for three ShopMax India products using the Batch API. Each product is a separate request in the JSONL file, and all requests are processed together at a 50 percent cost saving compared to synchronous calls.
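A minimal sketch of such a script is shown below. The product names, prompt wording, and model are illustrative assumptions; the file upload and batch creation calls use the official `openai` Python client.

```python
import json

# Illustrative ShopMax India products (assumed SKUs and names for this sketch).
PRODUCTS = [
    ("sku-1001", "Stainless steel water bottle, 1 litre"),
    ("sku-1002", "Handloom cotton kurta, size M"),
    ("sku-1003", "Bluetooth earbuds with noise cancellation"),
]


def build_request(custom_id: str, product: str) -> dict:
    """Build one Batch API request object for the chat completions endpoint."""
    return {
        "custom_id": custom_id,
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "gpt-4o-mini",  # assumed model; use any chat model you have access to
            "messages": [
                {
                    "role": "user",
                    "content": f"Write a two-sentence e-commerce description for: {product}",
                }
            ],
            "max_tokens": 150,
        },
    }


def write_jsonl(path: str) -> None:
    """Write one request object per line, as the Batch API expects."""
    with open(path, "w", encoding="utf-8") as f:
        for custom_id, product in PRODUCTS:
            f.write(json.dumps(build_request(custom_id, product)) + "\n")


if __name__ == "__main__":
    from openai import OpenAI  # requires the `openai` package and an API key

    write_jsonl("batch_input.jsonl")
    client = OpenAI()

    # Upload the JSONL file with purpose="batch".
    batch_file = client.files.create(
        file=open("batch_input.jsonl", "rb"), purpose="batch"
    )
    print(f"File uploaded: {batch_file.id}")

    # Create the batch job; "24h" is the supported completion window.
    batch_job = client.batches.create(
        input_file_id=batch_file.id,
        endpoint="/v1/chat/completions",
        completion_window="24h",
    )
    print(f"Batch job created: {batch_job.id} - Status: {batch_job.status}")
```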


It produces output similar to the following:

File uploaded: file-abc123xyz
Batch job created: batch_6789def - Status: validating

Poll the batch status with client.batches.retrieve(batch_job.id) until status equals completed. Then download results with client.files.content(batch_job.output_file_id) and parse each JSONL line using the custom_id to match results back to the original product. ShopMax India can schedule these batch jobs nightly, reducing API costs by up to 50 percent compared to real-time calls.
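These steps can be sketched as follows. The parsing helper is our own name, the batch ID is the one returned at creation time, and the result line shape follows the Batch API's documented output format:

```python
import json
import time


def parse_results(jsonl_text: str) -> dict:
    """Map each batch result back to its originating request via custom_id."""
    descriptions = {}
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue
        result = json.loads(line)
        body = result["response"]["body"]
        descriptions[result["custom_id"]] = body["choices"][0]["message"]["content"]
    return descriptions


if __name__ == "__main__":
    from openai import OpenAI  # requires the `openai` package and an API key

    client = OpenAI()
    batch_id = "batch_6789def"  # the ID returned when the batch was created

    # Poll until the job reaches a terminal status; batches may take up to 24 hours.
    while True:
        batch_job = client.batches.retrieve(batch_id)
        if batch_job.status in ("completed", "failed", "expired", "cancelled"):
            break
        time.sleep(60)

    if batch_job.status == "completed":
        output = client.files.content(batch_job.output_file_id)
        for custom_id, text in parse_results(output.text).items():
            print(custom_id, "->", text)
```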


 
  


  