Published on 22 August 2025
You too can run the Vidore Benchmark with less than 32GB of GPU VRAM
Tags: vidore, benchmark, colpali, rag, gpu-poor, pytorch, mteb
Quick, practical notes to run the Vidore benchmark smoothly on a single 32GB GPU: dtype, batch size, and common OOM fixes.

Published on 15 August 2025
The Most Beautiful RAG: Starring ColPali, Qdrant, Minio and Friends
Tags: rag, retrieval-augmented-generation, vision, multimodal, colpali, qdrant, minio, fastapi, python, gradio, vector-search, nextjs, frontend, binary-quantization
An end-to-end, page-level Vision RAG template with ColPali-style embeddings, optional Next.js frontend, Qdrant multivector retrieval (with optional binary quantization), and MinIO-backed storage: dockerized, API-first, and optionally UI-powered.

Published on 12 August 2025
ColQwen2.5 FastAPI Integration
Tags: fastapi, embeddings, qdrant, colpali, colqwen, api-development, little-scripts
A little-script to create a FastAPI server for ColQwen2.5.

Published on 17 July 2025
Audio RAG with ColQwen2.5-Omni
Tags: rag, retrieval-augmented-generation, audio, video-processing, colqwen, openai, gradio, little-scripts, multimodal, embeddings, semantic-search, python
An audio RAG system that processes video URLs and answers questions about their content using ColQwen2.5-Omni and OpenAI audio.

Published on 3 July 2025
The Most Beautiful RAG: Starring Colnomic, Qdrant, Minio and Friends
Tags: rag, retrieval-augmented-generation, qdrant, vector-search, colbert, late-interaction, llm, python, little-scripts, embeddings, semantic-search, colpali, colnomic
Introducing the first project in my little-scripts monorepo: a simple yet beautiful RAG implementation using Colnomic, Qdrant and Nomic.