FyscalTech | Choosing Foundational Models for AI Agents in Banking

Choosing Foundational Models for AI Agents in Banking

Learn how senior leaders can select the optimal foundational models for AI agents to drive productivity gains and escape the technical debt trap in 2026.

Written By

FT Scholar Desk

The Transition from Chatbots to Autonomous Orchestrators

The global banking sector has arrived at a structural crossroads. For the last three years, financial institutions have experimented with generative AI as a surface level feature, primarily for internal knowledge retrieval and basic customer service. However, as we enter 2026, the strategic focus has shifted from "generative" to "agentic." Senior decision makers are no longer content with models that simply provide information; they require systems that can independently interpret objectives, break them into subtasks, interact with legacy ledgers, and execute complex workflows with minimal human oversight.

The urgency of this shift is underscored by recent data from Accenture, which reveals that approximately 70 percent of banking IT budgets are currently consumed by the maintenance of technical debt. To break this cycle, leaders are looking toward the "10x bank" model, where one professional manages a team of AI agents to deliver exponential impact. The fundamental challenge lies in the first step of this transformation: selecting the right foundational model. In an era where the choice of architecture determines long term competitiveness, selecting the wrong model is not just a technical error but a significant strategic risk.

The Strategic Chasm: Large vs Small Language Models

A common misconception among C Suite executives is that the largest model is inherently the best for every application. While frontier Large Language Models (LLMs) such as GPT-5 or Claude 4.5 offer unmatched general purpose reasoning and creative capabilities, they are frequently an inefficient choice for specific banking tasks. These massive models carry high latency, prohibitive infrastructure costs, and a higher propensity for "hallucinations" in structured financial contexts.

Leading institutions are increasingly adopting a heterogeneous approach, utilizing Small Language Models (SLMs) for targeted, high frequency operations. Research indicates that SLMs, typically those under 14 billion parameters, can be 30 times cheaper to operate than their larger counterparts. For tasks such as parsing invoice data, triaging support tickets, or performing automated KYC checks, SLMs provide sub-second response times and can be trained on proprietary bank data to ensure higher precision.

The decision path for 2026 involves a "fit for purpose" assessment:

Frontier LLMs: Best for open ended reasoning, complex investment research, and high level strategic advisory where broad context is indispensable.
Specialised SLMs: Ideal for repetitive, predictable routines like command parsing, structured JSON output for tool calls, and real time fraud detection.
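
The "fit for purpose" assessment above can be sketched as a simple routing rule. This is a minimal illustration in Python, not a production router: the names `TaskProfile`, `ModelTier`, and `choose_tier`, and the two boolean criteria, are assumptions introduced here to mirror the two categories in the list.

```python
from dataclasses import dataclass
from enum import Enum


class ModelTier(Enum):
    FRONTIER_LLM = "frontier_llm"
    SPECIALISED_SLM = "specialised_slm"


@dataclass(frozen=True)
class TaskProfile:
    """Hypothetical description of a workload, used only for routing."""
    name: str
    needs_broad_context: bool  # open ended reasoning, research, advisory
    high_frequency: bool       # repetitive, predictable, latency sensitive


def choose_tier(task: TaskProfile) -> ModelTier:
    """Send open ended, low volume work to a frontier LLM; everything else to an SLM."""
    if task.needs_broad_context and not task.high_frequency:
        return ModelTier.FRONTIER_LLM
    # Routine or high volume work defaults to the cheaper, faster tier.
    return ModelTier.SPECIALISED_SLM


if __name__ == "__main__":
    tasks = [
        TaskProfile("complex investment research", True, False),
        TaskProfile("automated KYC check", False, True),
        TaskProfile("support ticket triage", False, True),
    ]
    for task in tasks:
        print(f"{task.name}: {choose_tier(task).value}")
```

In practice the criteria would come from measured latency, cost, and accuracy on the bank's own evaluation set rather than from two hand-set flags.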

Evaluating the Capacity to Act: The Function Calling Imperative

The defining characteristic of an AI agent is its ability to interact with the external world through APIs and software tools. This means that when evaluating a foundational model, the most critical benchmark is not its prose but its "function calling" accuracy. A model that fails to correctly invoke a core banking API can trigger significant operational failures or compliance breaches.

Senior technical strategists are now looking toward the Berkeley Function Calling Leaderboard (BFCL, https://gorilla.cs.berkeley.edu/leaderboard.html) as a primary metric for model selection. Current data suggests that while leading models are nearing human level performance in single turn tasks, a gap remains in "long horizon" reasoning where multiple sequential API calls are required. For example, an agent tasked with "rebalancing a corporate treasury portfolio" must check balances, calculate FX spreads, and initiate multiple transfers across different chains. Models that score highly on the BFCL for relevance detection, meaning they decline to call a tool when none of the available tools fits the request, are essential for preventing "hallucinated" actions that could lead to financial loss.
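
One way to contain hallucinated actions, whatever model is chosen, is to validate every proposed tool call before it reaches a banking API. The sketch below assumes the model emits a JSON object with `name` and `arguments` fields. The tool names and the `TOOL_REGISTRY` structure are hypothetical and do not correspond to any real core banking interface.

```python
import json

# Hypothetical registry: tool name -> required argument names and their types.
TOOL_REGISTRY = {
    "get_balance": {"account_id": str},
    "initiate_transfer": {"from_account": str, "to_account": str, "amount": float},
}


def validate_tool_call(raw: str) -> tuple[bool, str]:
    """Return (is_valid, reason) for a model-proposed tool call given as JSON text."""
    try:
        call = json.loads(raw)
    except json.JSONDecodeError:
        return False, "not valid JSON"
    if not isinstance(call, dict):
        return False, "call must be a JSON object"

    name = call.get("name")
    args = call.get("arguments")
    if not isinstance(name, str) or name not in TOOL_REGISTRY:
        return False, "unknown tool"
    if not isinstance(args, dict):
        return False, "arguments must be an object"

    schema = TOOL_REGISTRY[name]
    # Reject both missing and unexpected arguments.
    if set(args) != set(schema):
        return False, "argument names do not match the schema"
    for key, expected_type in schema.items():
        if not isinstance(args[key], expected_type):
            return False, f"wrong type for {key}"
    return True, "ok"


if __name__ == "__main__":
    good = '{"name": "get_balance", "arguments": {"account_id": "ACC-1"}}'
    bad = '{"name": "close_all_accounts", "arguments": {}}'
    print(validate_tool_call(good))
    print(validate_tool_call(bad))
```

A rejected call can be returned to the model as an error message or escalated to a human, so that an invented tool name never turns into an executed action.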

Quantifying Business Impact and ROI

The shift to an agentic architecture is producing measurable gains that redefine the bottom line. According to research from McKinsey, banks that successfully redesign frontline domains end to end using AI agents can see revenues per relationship manager rise by up to 15 percent, while the cost to serve can fall by 20 to 40 percent.

Furthermore, organisations are achieving an average 2.3x return on agentic AI investments within 13 months. This ROI is driven by:

Operational Excellence: Reducing documentation time by up to 42 percent and automating manual prospecting.
Compliance Resilience: Transitioning from "after the fact" monitoring to real time, programmable policy enforcement embedded directly in the agent's reasoning loop.
Revenue Uplift: Using agents to provide hyper personalised advice at scale, a capability that previously was reserved only for high net worth segments.
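
The idea of programmable policy enforcement inside the agent's loop can be illustrated with a small pre-execution check. This is a sketch under stated assumptions: the `ProposedAction` fields, the 10,000 threshold, and the placeholder country code "XX" are invented for illustration and are not taken from any regulation or from the research cited above.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class ProposedAction:
    """Hypothetical action the agent wants to take next."""
    kind: str
    amount: float
    country: str


# A policy returns None when satisfied, or a reason string when it blocks.
Policy = Callable[[ProposedAction], Optional[str]]


def transfer_limit(action: ProposedAction) -> Optional[str]:
    # Illustrative threshold only; a real limit would come from the bank's policy.
    if action.kind == "transfer" and action.amount > 10_000:
        return "transfer above limit requires human approval"
    return None


def blocked_country(action: ProposedAction) -> Optional[str]:
    # "XX" is a placeholder, not a real jurisdiction code.
    if action.country in {"XX"}:
        return "destination country is blocked"
    return None


def enforce(action: ProposedAction, policies: list[Policy]) -> list[str]:
    """Run every policy before execution and collect the violations."""
    violations = []
    for policy in policies:
        reason = policy(action)
        if reason is not None:
            violations.append(reason)
    return violations


if __name__ == "__main__":
    policies = [transfer_limit, blocked_country]
    action = ProposedAction(kind="transfer", amount=25_000.0, country="XX")
    print(enforce(action, policies))
```

Because the check runs before the action executes, a non-empty result stops the step in real time instead of surfacing later in an after-the-fact review.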

Navigating Governance and Data Sovereignty

The final and most sensitive consideration in model selection is the regulatory environment. For global institutions, data sovereignty is a non-negotiable requirement. Models that operate exclusively in public clouds may not meet the rigorous standards of the EU AI Act or local data residency laws.

The move toward "On-Premises" or "Private Cloud" deployment of SLMs offers a path to resilience. By keeping the model within the bank's firewall, leaders ensure that sensitive client data never leaves the secured environment. This approach also allows for better auditability, as every reasoning trace generated by the agent can be stored and inspected to satisfy regulatory inquiries. Fyscal Technologies advocates for a "compliance by design" architecture, where model choice is balanced against the institution's risk appetite and jurisdictional requirements.
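
Storing reasoning traces for later inspection could look like the sketch below. The hash chaining is an assumption added here to make tampering detectable; the article itself only says that traces are stored and inspected. The class name `AuditTrail` and its record fields are hypothetical.

```python
import hashlib
import json

GENESIS_HASH = "0" * 64  # placeholder hash for the first entry


def _digest(body: dict) -> str:
    """SHA-256 over a canonical JSON encoding of the entry body."""
    encoded = json.dumps(body, sort_keys=True).encode("utf-8")
    return hashlib.sha256(encoded).hexdigest()


class AuditTrail:
    """Append-only, in-memory log in which each entry commits to the previous one."""

    def __init__(self) -> None:
        self.entries: list[dict] = []

    def append(self, agent_id: str, step: str, detail: str) -> dict:
        prev_hash = self.entries[-1]["hash"] if self.entries else GENESIS_HASH
        body = {"agent_id": agent_id, "step": step, "detail": detail, "prev_hash": prev_hash}
        entry = {**body, "hash": _digest(body)}
        self.entries.append(entry)
        return entry

    def verify(self) -> bool:
        """Recompute every hash and check that the chain links are intact."""
        prev_hash = GENESIS_HASH
        for entry in self.entries:
            body = {k: v for k, v in entry.items() if k != "hash"}
            if body["prev_hash"] != prev_hash or _digest(body) != entry["hash"]:
                return False
            prev_hash = entry["hash"]
        return True


if __name__ == "__main__":
    trail = AuditTrail()
    trail.append("agent-1", "plan", "check balances before rebalancing")
    trail.append("agent-1", "tool_call", "get_balance(account_id=ACC-1)")
    print(trail.verify())
```

A real deployment would persist entries to durable, access-controlled storage inside the bank's environment; the in-memory list here only demonstrates the record structure.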

Strategic Implementation with Fyscal Technologies

Building a resilient, agentic workforce requires more than just an API key. It demands a vendor agnostic execution strategy that prevents the bank from becoming locked into a single model's roadmap. Fyscal Technologies serves as a strategic partner to help banks modernise their core systems and build the "ontology" necessary for agents to function effectively across departments.

Our engineering approach focuses on building a composable AI mesh, an orchestration layer that allows agents to reason and collaborate across different language models. Whether it is implementing secure multi party computation for wallet management or designing the data foundation for autonomous sales acceleration, we ensure that your digital transformation is built for agility and regulatory confidence.
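
A vendor agnostic orchestration layer can be sketched as a thin interface that agents call instead of a specific model's SDK. The code below is an illustration only and does not describe Fyscal Technologies' actual implementation: `ModelBackend`, `EchoBackend`, and `Orchestrator` are hypothetical names, and the stand-in backend returns a labelled string where a real one would call a hosted or on-premises model.

```python
from typing import Protocol


class ModelBackend(Protocol):
    """Minimal contract every model backend must satisfy."""

    def complete(self, prompt: str) -> str: ...


class EchoBackend:
    """Stand-in backend used here in place of a real model client."""

    def __init__(self, label: str) -> None:
        self.label = label

    def complete(self, prompt: str) -> str:
        return f"[{self.label}] {prompt}"


class Orchestrator:
    """Maps agent roles to backends so a model can be swapped without touching agents."""

    def __init__(self) -> None:
        self._backends: dict[str, ModelBackend] = {}

    def register(self, role: str, backend: ModelBackend) -> None:
        self._backends[role] = backend

    def run(self, role: str, prompt: str) -> str:
        if role not in self._backends:
            raise KeyError(f"no backend registered for role: {role}")
        return self._backends[role].complete(prompt)


if __name__ == "__main__":
    mesh = Orchestrator()
    mesh.register("research", EchoBackend("frontier-llm"))
    mesh.register("kyc", EchoBackend("on-prem-slm"))
    print(mesh.run("kyc", "verify customer document"))
    # Swapping vendors is a one line change at registration time.
    mesh.register("kyc", EchoBackend("alternative-slm"))
    print(mesh.run("kyc", "verify customer document"))
```

Keeping agents dependent on the role name rather than on a vendor client is what limits lock-in to a single model's roadmap.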

Senior executives must transition from the "if" of AI to the "how" of agentic deployment. The organisations that move first to right-size their AI adoption will not only capture the productivity dividend but also secure their place as the primary financial interface for the next decade of digital commerce.

Conclusion: The Strategic Imperative for 2026

The era of monolithic, slow moving banking software is over. The rise of agentic AI represents a fundamental upgrade of the financial stack, one that rewards precision, speed, and autonomy. Decision makers who prioritise model fit for purpose over brand recognition will be best positioned to scale their AI capabilities safely and profitably. The window to gain strategic distance through these technologies is open now, but as the gap between market leaders and laggards widens, the cost of delay will only grow.

Last Updated

February 1, 2026
