The Banking Sector Strikes a Balance Between Opportunity and Risk in AI Adoption
The banking industry is at a crossroads, facing significant opportunities alongside substantial concerns. Research indicates that the market for AI agents in financial services could reach $6.54 billion by 2035, while search interest in “AI bank risks” has surged by an astonishing 33,125%. As financial institutions work to integrate AI systems into customer-facing operations, navigating the complexities of safe deployment has become a pressing challenge.
Neal Lathia, Co-Founder and CTO of Gradient Labs, emphasizes that successful AI adoption hinges not on evading regulation but on building systems that exceed regulatory benchmarks. Drawing on more than a decade of experience in AI and a background at Monzo, Lathia discussed the essential principles of safe AI deployment with The Fintech Times. He also tackled the “black box” dilemma that keeps compliance officers on edge.
Ensuring Transparency in AI Systems
The primary obstacle for banking executives contemplating AI is transparency. Traditional Large Language Models (LLMs) are often perceived as black boxes: information goes in and an answer comes out, with little insight into the reasoning behind the conclusion. Lathia asserts that Gradient Labs addresses this by building agents that do not operate as a closed loop.
According to Lathia, the core of their AI agents is designed to maintain a strict set of decision traces that can be reviewed and understood. By binding non-deterministic LLMs to narrow, specific tasks, financial institutions gain visibility not only into the actions the AI took, but also into the logical framework it used to arrive at its decisions. This transparency is essential for meeting the demands of regulators and risk assessment committees.
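Gradient Labs has not published its implementation, but the idea of a reviewable decision trace can be illustrated with a minimal sketch. Every class, field and method name below is hypothetical; the point is simply that each narrow task, its inputs, its output and its stated rationale are logged for later review.

```python
# Illustrative sketch only: a per-conversation decision trace in which every
# narrow LLM task records what it was asked, what it saw, what it returned,
# and the reasoning it surfaced.
from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass
class DecisionStep:
    timestamp: str
    task: str            # the narrow, specific task the LLM was bound to
    inputs_summary: str  # what information the step was given
    output: str          # what the model returned
    rationale: str       # the reasoning surfaced for reviewers

@dataclass
class DecisionTrace:
    conversation_id: str
    steps: list[DecisionStep] = field(default_factory=list)

    def record(self, task: str, inputs_summary: str, output: str, rationale: str) -> None:
        self.steps.append(DecisionStep(
            timestamp=datetime.now(timezone.utc).isoformat(),
            task=task,
            inputs_summary=inputs_summary,
            output=output,
            rationale=rationale,
        ))

    def audit_log(self) -> list[dict]:
        """Flatten the trace into a reviewable log for regulators and risk teams."""
        return [vars(step) for step in self.steps]
```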
Exceeding Human Performance Standards
Proving that an AI system is production-ready is a highly debated subject. Lathia insists that current human performance sets the standard that AI must surpass. To achieve this, Gradient Labs employs rigorous internal quality assurance processes that benchmark AI performance against that of human agents.
The aim is to deliver a transformative customer experience through AI, which positions current human capability as the baseline. For a banking AI to move into live operations, it must demonstrate that it meets or exceeds human standards for accuracy and compliance. The challenge is not only speed but also the capacity to handle the wide variety of customer queries that banks encounter, a range far exceeding what typical e-commerce platforms face.
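Lathia did not walk through the QA tooling in detail, but the comparison can be sketched roughly: grade samples of AI-handled and human-handled conversations against the same rubric, and promote the agent only if it meets or exceeds the human baseline. The rubric keys below are assumptions for illustration.

```python
# Hypothetical QA benchmark: compare graded samples of AI-handled and
# human-handled conversations on the same rubric.
from statistics import mean

def passes_benchmark(ai_scores: list[dict], human_scores: list[dict]) -> bool:
    """Each score dict holds rubric results, e.g. {"accurate": True, "compliant": True}."""
    def rate(scores: list[dict], key: str) -> float:
        return mean(1.0 if s[key] else 0.0 for s in scores)

    # The agent clears the bar only if it meets or exceeds the human baseline
    # on both accuracy and compliance.
    return all(
        rate(ai_scores, key) >= rate(human_scores, key)
        for key in ("accurate", "compliant")
    )
```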
Addressing Compliance Concerns: The Tipping Off Risk
In the UK, alerting a customer about a suspicious activity report (SAR) or an ongoing investigation is deemed a criminal offense. This creates a complex challenge for AI agents, which often utilize extensive internal data. The risk of unintentionally revealing sensitive information poses a significant concern for compliance officers.
Lathia points out that it is nearly impossible to entirely shield an AI from information that might lead to a potential tip-off. The company has responded to this challenge by developing an independent, auditable control that verifies all agent communications before they reach customers, ensuring that no sensitive investigative details are inadvertently disclosed.
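How that control works internally has not been disclosed. As a rough illustration only, a standalone pre-send check might screen each draft reply and block anything that references investigative activity; a real control would be far more sophisticated, and likely model-assisted. The patterns and function name here are hypothetical.

```python
# Illustrative, independent pre-send control: it runs separately from the agent
# that drafted the reply and can veto the message before it reaches a customer.
import re

# Hypothetical patterns covering obvious tipping-off risks.
SENSITIVE_PATTERNS = [
    r"\bsuspicious activity report\b",
    r"\bSAR\b",
    r"\bunder investigation\b",
    r"\blaw enforcement\b",
]

def release_to_customer(draft_reply: str) -> tuple[bool, str]:
    """Return (approved, reason) so every check leaves an auditable record."""
    for pattern in SENSITIVE_PATTERNS:
        if re.search(pattern, draft_reply, flags=re.IGNORECASE):
            return False, f"blocked: matched sensitive pattern {pattern!r}"
    return True, "approved"
```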
Evaluating Historical Data Without Bias
A prevalent concern among Chief Risk Officers is that relying on historical data might lead AI to adopt existing human biases or outdated practices. Gradient Labs counters this by using a specialized onboarding agent to extract relevant “knowledge snippets” from past interactions.
A critical aspect of this process is verification: facts must be substantiated across multiple conversations before being considered valid. This rigorous approach is complemented by human oversight, in which operators review and adjust the information the AI uses, so the system learns from earlier experience without repeating past mistakes. A minimal sketch of that verification rule, with an assumed corroboration threshold and hypothetical field names, follows below.
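```python
# Illustrative sketch: a candidate "knowledge snippet" becomes usable only once
# several distinct conversations corroborate it and a human reviewer signs it off.
from dataclasses import dataclass, field

@dataclass
class KnowledgeSnippet:
    fact: str
    supporting_conversations: set[str] = field(default_factory=set)
    human_approved: bool = False

    def add_evidence(self, conversation_id: str) -> None:
        self.supporting_conversations.add(conversation_id)

    def is_usable(self, min_corroborations: int = 3) -> bool:
        # The threshold of three corroborating conversations is an assumption,
        # not a figure from the interview.
        return (
            len(self.supporting_conversations) >= min_corroborations
            and self.human_approved
        )
```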
Metrics for Boardroom Oversight
For executives, AI deployments are ultimately judged against the institution’s risk appetite. Lathia identifies three key “control-plane metrics” that should be reported to the board: resolution rates, customer satisfaction indicators, and specialized metrics that capture complaint volumes. These metrics enable Chief Risk Officers to monitor the system’s health continuously, keeping AI functionality aligned with regulatory requirements.
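As a rough illustration, these control-plane metrics could be computed from per-conversation outcome records; the record shape below is an assumption rather than Gradient Labs’ reporting format.

```python
# Illustrative aggregation of board-level metrics from conversation outcomes.
from statistics import mean

def control_plane_metrics(outcomes: list[dict]) -> dict:
    """Summarise outcomes such as {"resolved": True, "csat": 4.5, "complaint": False}."""
    if not outcomes:
        return {}
    return {
        "resolution_rate": sum(o["resolved"] for o in outcomes) / len(outcomes),
        "average_csat": mean(o["csat"] for o in outcomes if o["csat"] is not None),
        "complaint_rate": sum(o["complaint"] for o in outcomes) / len(outcomes),
    }
```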
By focusing on these high-level outcomes, banks can transition from viewing AI as a potentially risky endeavor to recognizing it as a reliable, scalable resource.
Looking Ahead to 2035: Bridging Technology and Culture
As the financial sector looks toward the multi-billion-dollar opportunities that lie ahead, the question persists: is the challenge primarily technological or cultural? Lathia asserts that the two are intricately connected, and believes that effective technology flourishes under the guidance of regulation rather than by subverting it.
As AI increasingly participates in financial decision-making, regulators will play a pivotal role in safeguarding customer experiences. For banks, the most effective strategy will not involve circumventing regulations but rather developing transparent, auditable, and nuanced systems that facilitate compliance.
