lift over default SaaS search on Indian auto-parts catalogs

Tuned catalog search as a service

You bring a catalog. We tune a domain-specific search model on it. You get a hosted /search endpoint. Integrate like Algolia — except your search actually understands Hindi, Hinglish, misspellings, and brand-as-generic queries.

Benchmark

Benchmark: 149-query graded evaluation on 26K auto-parts catalog
Default SaaSOur /searchOpenAI
Overall nDCG@100.230.450.47
Hindi / Hinglish0.140.470.54
Misspelled queries0.200.760.53
Symptom queries0.140.500.55
Zero-result queries44 / 1490 / 149

Default SaaS = raw keyword search. Algolia, Typesense, Elastic, Meilisearch Cloud all behave similarly out-of-the-box on Indian catalogs. Numbers are Meilisearch-default as category proxy.

Who this is for

Mid-market e-commerce and marketplace teams in India running catalogs where:

How it works

Stack: fine-tuned BGE-m3 multilingual embedder (open source, domain-adapted on HSN + ITI + NHTSA data) + Meilisearch BM25 with custom Indic tokenizer + 2,700-pair Hinglish bridge dictionary + class-routed cross-encoder reranker. Hybrid RRF fusion, tuned per query class.

All open-source base components. No proprietary embedding API dependency.

The bigger idea

The auto-parts endpoint is a reference implementation. The actual product I'm chasing: make catalog-search tuning agentic. You bring a catalog and 30 example queries; AI agents (Claude + Codex) walk the same 11 stages end-to-end; human in the loop only for taste decisions.

Playbook-as-code. I ran the auto-parts version manually so the playbook could exist. Read the 11-stage playbook.

Interested? DM me on LinkedIn

If you're running catalog search in India and your users are frustrated — I'll run a free benchmark on your data. Send a CSV sample + 20-30 example queries real customers type.

→ LinkedIn  |  → Code + playbook on GitHub