Author: Callum Gracie, Founder, Gia AI
Voice commerce is no longer about asking Alexa to reorder paper towels. Voice commerce has evolved into a fully transactional channel, and most small businesses have no idea what that means for their bottom line.
I have spent the better part of a decade helping small businesses show up on Google. That work is about to change in ways most service business owners are not prepared for. At CES 2026 in January, SoundHound AI unveiled Amelia 7, an agentic voice commerce platform that lets people order food, pay for parking, and book tickets through their car dashboards, televisions, and smart devices. The system operates across 60,000 restaurant partners and more than 90 million parking spaces. It does not just take a command. It processes payment, confirms the transaction, and completes fulfilment without the user touching a screen.
That shift from voice search to voice transaction is the part most people are missing
Voice Commerce by the Numbers
The global voice commerce market now exceeds exceeds $151 billion. One in five consumers has already completed a purchase using a voice command. Germany forecasts that 30% of its e-commerce volume will flow through voice channels by 2030. Meanwhile, Apple, Google, and Amazon are all building payment authorisation directly into voice interaction flows. As a result, the gap between saying “I want that” and money leaving your account is collapsing to seconds.
These figures are not projections from a single analyst report. They represent converging bets from every major technology company on the planet. Amazon has embedded transactional capability into Alexa across more than 500 million devices globally. Google Assistant now supports purchase flows in over 30 countries. Apple’s integration of payment authorisation into Siri reflects the same strategic direction. All three companies understand that the future of commerce is conversational, and they are building infrastructure to match.
For fintech, voice commerce creates an entirely new payment channel. As voice commerce adoption accelerates, voice biometrics are becoming standard for transaction security. Your voice becomes both the command and the authentication. No card number. No PIN. Not even a phone unlock. Just intent, identity, and settlement in a single spoken sentence.
However, here is what matters for the businesses I work with every day. None of this works if the business on the other end is not structured to receive it.
Why Service Businesses Are Not Ready for Voice Commerce
I run an SEO agency with over 40 SME clients across trades, hospitality, and professional services. I also co-founded GiaAI, an AI platform built to automate lead generation for service businesses. So I sit at the intersection of how customers find businesses and how those businesses convert attention into revenue.
Right now, most of my clients optimise for typed search queries. They build content around keywords and structure their Google Business Profiles for map pack visibility. Blog posts target long-tail phrases. All of that still matters. But voice commerce changes the game because voice queries are fundamentally different from typed ones.
Consider the difference. When someone types “plumber Canberra emergency,” they get a list of ten results to compare. They scan reviews, compare pricing, and make a deliberate choice. When someone says “find me a plumber who can come right now and let me pay over the phone,” they get one answer. The AI agent picks for them. There is no list. There is no comparison. If your business is not structured in a way that lets an AI agent understand your services, pricing, availability, and payment options, you are invisible in that interaction.
Think about what that means for a typical trades business. The owner has spent years building a reputation, collecting Google reviews, and paying for local ads. All of that effort assumes a customer will see a search results page and make a decision. In a voice-first environment, the customer never sees that page. They never read those reviews. They trust the AI agent to choose for them based on structured data, availability signals, and payment compatibility.
This is what I call the shift from SEO to AEO, or Answer Engine Optimisation. The goal is no longer ranking on a page of results. Instead, the goal is being the single answer an AI agent selects and transacts with. Voice commerce makes that selection process invisible to the customer and automatic for the AI.
Voice Commerce Needs Structured Data, Not Better Copy
The practical requirements are more technical than creative. To begin with, service businesses need machine-readable schema markup that describes what they offer, where they operate, when they are available, and how payment works. Beyond that, API endpoints or structured booking systems must exist for an AI agent to interact with programmatically. On top of that, payment infrastructure needs to connect to platforms that support voice-initiated transactions.
Most small businesses have none of this. Their websites are brochures. Booking systems require human interaction. Payment processing assumes a customer will type in card details on a checkout page. In a voice commerce environment, that friction kills the transaction before it starts.
Here is a practical example. A swim school in suburban Sydney offers lessons for children aged six weeks and up. Parents searching by voice might say “book a baby swim lesson near me this Saturday.” For an AI agent to fulfil that request, it needs to read structured data confirming the business offers infant swimming, operates on Saturdays, serves the user’s suburb, and accepts online payment. If any of those data points are missing or buried in unstructured paragraph text on a website, the agent moves on to a competitor that provides them cleanly.
The same applies across every service category. Electricians, landscapers, personal trainers, and accountants all face the same challenge. Their expertise is real, but their digital infrastructure does not speak the language that AI agents understand.
SoundHound’s voice commerce architecture already handles multi-step transactions: identifying user intent, selecting a provider, confirming details, processing payment, and sending confirmation. Mastercard launched its Agent Suite for merchants in early 2026. Visa is piloting Intelligent Commerce with DBS Bank in Singapore. That infrastructure is being built right now. Consequently, businesses that plug into it early will capture demand that their competitors do not even know exists.
The Fintech Opportunity Sitting in the Middle Layer
For fintech companies, the voice commerce opportunity sits between the voice platform and the service business. Someone needs to build the connective tissue. That means booking APIs that speak to voice agents, payment rails optimised for voice-initiated micro-transactions, identity verification layers that work through voice biometrics, and discovery protocols that help AI agents match intent to provider.
Stripe, Square, and PayPal are all positioning for this. Google’s Universal Commerce Protocol, launched at NRF 2026 with Walmart and Shopify, is an early attempt at standardising how AI agents interact with merchants. But those protocols are designed for retail and e-commerce. Service businesses, the plumbers and swim schools and personal trainers, remain an afterthought.
This is a significant blind spot. Service businesses account for roughly 70% of GDP in most developed economies, yet they receive a fraction of the fintech infrastructure investment that retail and e-commerce attract. The reason is simple: service transactions are harder to standardise. A plumber’s quote depends on the job. A personal trainer’s availability changes daily. A swim school’s class structure varies by age group and location. Building payment and booking systems that handle that complexity through a voice interface is harder than processing a standard product purchase.
That gap is precisely where the next wave of vertical fintech will emerge. The platforms that make a local service business voice commerce ready without requiring a development team will own the distribution layer for an entirely new payment channel. In other words, whoever solves the B2B payment friction between voice platforms and service providers will control how millions of local transactions flow.
What Voice Commerce Means for Your Business Right Now
Voice search changed how customers find businesses. Voice commerce will change how they pay. This is not a trend on the horizon. Voice commerce is happening right now, and the window to prepare is shrinking fast.
If you run a service business, start with three steps. First, audit your website for structured data. Can a machine read your services, hours, service areas, and payment methods without interpreting paragraph text? Second, evaluate your booking system. Can it accept a reservation programmatically, or does it require a human to answer a phone or respond to an email? Third, review your payment infrastructure. Can a transaction complete without a customer manually entering card details on a screen?
These are not futuristic requirements. They are the minimum conditions for participation in a payment channel that already processes billions in transactions globally.
Businesses and fintech platforms that act now will not need to compete for attention later. An AI agent will simply choose them. Those that wait will find themselves invisible to the fastest-growing transaction channel in a generation.
The question is not whether voice commerce will reach your industry. The question is whether your business will be ready when it does.
Callum Gracie is the founder of Otto Media Group and co-founder of GiaAI. He works with SMEs across multiple industries and is building AI-powered tools for service businesses.
