Free Tier Mechanics: 1GB RAM, 0.5 vCPU, and the 7-Day Inactivity Pause
Qdrant Cloud's free tier provides a single-node cluster with 1 GB of RAM and 0.5 vCPU, deployed in a single region. This is a meaningful resource allocation for a vector database — substantially more than a toy experiment allows, but not enough for a production workload with more than a few hundred thousand typical vectors. At 768 dimensions and float32 precision, 1 GB of RAM comfortably holds approximately 300,000–350,000 vectors when accounting for collection index overhead. That is sufficient for proof-of-concept work, tutorials, and small internal tools.
Like Pinecone's Starter tier, Qdrant Cloud's free cluster is paused after seven consecutive days of inactivity. A paused cluster does not respond to API requests until it is manually resumed through the Qdrant Cloud console. Your collection data, vectors, and payload fields are preserved — pausing is not deletion. The cluster typically resumes within 30-60 seconds of the resume action. Importantly, the definition of 'activity' in Qdrant Cloud's inactivity timer appears to include both read (search) and write (upsert/update) operations. A passive cluster that receives no API traffic will pause regardless of whether it was recently created.
The prevention strategy is the same as for Pinecone: a lightweight keep-alive cron job that issues a simple search request to your collection on a schedule shorter than seven days. A five-day interval provides comfortable margin. Unlike production databases where idle connections consume resources, a single Qdrant search query with k=1 against a small collection completes in milliseconds and consumes negligible compute. The free tier is appropriate for development environments, personal projects, and initial prototyping. Any application serving real users should be on a paid tier — not primarily because of the inactivity pause, but because of the absence of an SLA and the single-node, single-region architecture that makes the free tier unsuitable for reliability-sensitive workloads.