Try Demo
Paytm Build for India Hackathon
Paytm Soundbox

Your Soundbox just became your CFO.

14.4 million kirana owners trust the Soundbox for payments. Now it answers their real business questions, in Hinglish, hands-free.

Scroll to explore
The device

Already on
every counter.

The Soundbox already runs 10 to 14 hours a day on kirana counters. Merchants know it. They trust it. The only thing missing is intelligence.

0 Soundboxes
Active across India today
0
Of Paytm merchant loans go to Soundbox users
0
Average daily interaction with their Soundbox
The gap

The data exists.
Merchants can't reach it.

Paytm has 30 days of transaction data for every merchant. But accessing it mid-sale requires unlocking a phone, opening an app, and navigating. That friction kills the insight.

📱

The app is too far away

Merchants already double-press the Soundbox button to hear their daily total. They do not want to open an app mid-transaction. Hands-free is the only UX that works.

Disputes happen in real time

"Customer bola payment nahi hua" is the number one merchant pain point. Resolving it currently means finding a phone, opening transaction history, and scrolling. It should take one sentence.

📊

Business trends stay invisible

Merchants do not know their peak hours, slow days, or weekly trends unless they dig through the app. Decisions get made blind.

🏦

Loan eligibility is never surfaced proactively

Paytm already has the behavioral data to say "you qualify for 80,000 rupees." Merchants never hear it unless they go looking.

The answer

Soundbox CFO

A conversational AI layer added to the Soundbox the merchant already owns. No new hardware. No app download. No onboarding. Just ask.

🎤

Voice-first

Speak in Hinglish. The Soundbox listens, understands, and responds in under 3 seconds.

Web Speech API, hi-IN
🧠

Contextually intelligent

Reasons over 30 days of real transaction data. Not generic advice, not a lookup. Actual reasoning.

Paytm Inference API
🇮🇳

Hinglish by design

Responds the way merchants actually talk. Warm, direct, specific. Like a trusted munshi, not a chatbot.

llama-3.3-70b
Under the hood

Four steps.
Under 3 seconds.

Built on Paytm's own Inference API. P50 latency under 80ms on Groq LPUs.

1

Merchant speaks in Hinglish

"Aaj kitna hua?" or "Loan milega?" or "Customer bola payment nahi hua." Any natural query works.

Web Speech API, hi-IN locale
2

Speech is transcribed

hi-IN locale handles Hinglish code-switching naturally. Typed input and quick-ask buttons serve as fallbacks for noisy environments.

90%+ accuracy in quiet conditions
3

LLM reasons over merchant data

Pre-computed transaction summaries are injected into the system prompt. The model does not calculate; it formats exact figures into a Hinglish response. No hallucinations by design.

Paytm Inference API, llama-3.3-70b-versatile
4

Soundbox responds out loud

2 to 3 sentences. Specific numbers. Warm tone. Web Speech Synthesis reads the response aloud while the text appears on screen.

Web Speech Synthesis API
See it in action

Five questions.
One device answers them all.

Tap any query to see the kind of response the CFO gives.

🎤 Soundbox CFO Live
Tap a query on the left to see a response.
Stack

Built in one day.
Zero dependencies.

No backend. No build step. One HTML file. Open in browser, demo instantly.

Frontend
Single HTML file
No framework, no bundler, no npm. Open in browser and it works.
LLM
llama-3.3-70b-versatile
Paytm Inference API. Auto-fallback to Groq (same model, same LPU infra).
Voice In
Web Speech API
hi-IN locale, handles Hinglish code-switching. Typed fallback auto-shown.
Voice Out
Speech Synthesis API
Built-in browser TTS. No external dependency. hi-IN voice preferred.
Design
Paytm PODS
Paytm for Business color system. Roboto. #002E6E + #00B9F1.
Data
Mock JSON (inline)
30 days, realistic patterns. Pre-computed summaries fed to LLM to prevent hallucination.
Why it matters

The intelligence layer
Paytm already has the data for.

The Soundbox generates behavioral data every day. Soundbox CFO closes the loop by returning that intelligence to the merchant.

Every "Loan milega?" is a qualified lead

85% of Paytm's merchant loans go to Soundbox users. The device already generates the data for loan approval. Surfacing eligibility proactively through voice converts passive data into active revenue.

0

New hardware required

Runs on the Soundbox that is already deployed. The AI layer is software only.

~0.08

Rupees per merchant per day

At 10 queries per day, 150 tokens each, on Paytm Inference pricing. Well within subscription economics.

"The AI Soundbox will become a CFO, COO, and CMO for every merchant, giving them intelligence previously only available to large enterprises."

Vijay Shekhar Sharma, Founder and CEO, Paytm. GFF 2025.
Built by
N
Nimit Bhargava