Getting Started with Pinecone in Python
Author: Venkata Sudhakar
Pinecone is a fully managed vector database that makes it easy to store, index, and query high-dimensional embeddings at scale. Unlike self-hosted solutions such as ChromaDB or FAISS, Pinecone requires no infrastructure management - you create an index, upsert vectors, and query them through a simple API. It supports billions of vectors with millisecond query latency, making it suitable for production RAG systems at ShopMax India scale.

Pinecone organises vectors into indexes. Each index has a fixed dimension (matching your embedding model) and a distance metric (cosine for semantic search, euclidean for geometric similarity). Namespaces let you partition an index into logical groups - for example, separating product embeddings from policy document embeddings within the same index.

The example below creates a Pinecone index, upserts ShopMax India product embeddings, and performs a semantic similarity search using the Pinecone Python client.
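A minimal sketch of that workflow is shown below. It assumes the v3+ `pinecone` client, a `sentence-transformers` model as the embedding function, a serverless index in AWS us-east-1, and sample product metadata - all illustrative choices, not the only way to set this up. You would need a valid PINECONE_API_KEY in the environment before calling run_demo().

```python
import os

# Sample ShopMax India catalogue: (id, description, metadata) triples.
# The metadata values here are illustrative.
PRODUCTS = [
    ("p1", "OnePlus 12 5G smartphone 256GB Rs 64999 Delhi",
     {"price": 64999, "city": "Delhi"}),
    ("p2", "Samsung Galaxy S24 256GB smartphone Rs 74999 Mumbai",
     {"price": 74999, "city": "Mumbai"}),
    ("p3", "Sony WH-1000XM5 wireless headphones Rs 29999 Bengaluru",
     {"price": 29999, "city": "Bengaluru"}),
    ("p4", "LG 55-inch 4K OLED TV Rs 129999 Chennai",
     {"price": 129999, "city": "Chennai"}),
]

def run_demo():
    # Imports are deferred so the sample data above can be inspected
    # without pinecone or sentence-transformers installed.
    from pinecone import Pinecone, ServerlessSpec
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("all-MiniLM-L6-v2")  # 384-dim embeddings
    embed = lambda text: model.encode(text).tolist()

    pc = Pinecone(api_key=os.environ["PINECONE_API_KEY"])
    name = "shopmax-products"
    if name not in pc.list_indexes().names():
        pc.create_index(
            name=name,
            dimension=384,        # must match the embedding model
            metric="cosine",      # cosine similarity for semantic search
            spec=ServerlessSpec(cloud="aws", region="us-east-1"),
        )
    index = pc.Index(name)

    # Upsert vectors, storing the description as queryable metadata.
    index.upsert(vectors=[(pid, embed(desc), {**meta, "description": desc})
                          for pid, desc, meta in PRODUCTS])
    print(f"Upserted {len(PRODUCTS)} products")

    query = "affordable phone under Rs 70000"
    results = index.query(vector=embed(query), top_k=2, include_metadata=True)
    print(f"Top results for: {query}")
    for match in results.matches:
        print(f"[{match.score:.3f}] {match.metadata['description']}")
```

Note that the index dimension (384 here) is fixed at creation time, so switching to a different embedding model later means creating a new index.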
It gives the following output:
Upserted 4 products
Top results for: affordable phone under Rs 70000
[0.891] OnePlus 12 5G smartphone 256GB Rs 64999 Delhi
[0.847] Samsung Galaxy S24 256GB smartphone Rs 74999 Mumbai
Pinecone correctly ranked the OnePlus 12 highest, as it is both a phone and under Rs 70000. The Samsung Galaxy S24 scored second despite being slightly over budget because it is semantically similar. In production, add metadata filters to enforce hard constraints such as a price range: index.query(vector=query_emb, filter={"price": {"$lte": 70000}}, top_k=5). Note that Pinecone filter operators are prefixed with $ ($lte, $gte, $in, and so on). Use Pinecone for ShopMax product search when your catalogue grows beyond what ChromaDB can handle efficiently in memory.
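A filtered query could be sketched as below. The helper names (build_price_filter, filtered_search) and the top_k default are illustrative; only the $-prefixed filter operators are part of Pinecone's query API.

```python
def build_price_filter(max_price):
    """Return a Pinecone metadata filter enforcing price <= max_price.

    Pinecone filters use MongoDB-style operators ($lte, $gte, $in, ...),
    applied to the metadata stored alongside each vector at upsert time.
    """
    return {"price": {"$lte": max_price}}

def filtered_search(index, query_emb, max_price, top_k=5):
    # Only vectors whose metadata satisfies the filter are scored and
    # returned, so over-budget items are excluded outright rather than
    # merely ranked lower.
    return index.query(
        vector=query_emb,
        filter=build_price_filter(max_price),
        top_k=top_k,
        include_metadata=True,
    )
```

With this filter in place, the Samsung Galaxy S24 (Rs 74999) would be excluded from the results entirely instead of appearing in second place.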