Blog

Choosing Foundational Models for AI Agents in Banking

Learn how senior leaders can select the optimal foundational models for AI agents to drive productivity gains and escape the technical debt trap in 2026.

Written By
FT Scholar Desk

Unlock exclusive
FyscalTech Content & Insights

Subscribe now for best practices, research reports, and more.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Heading 1

Heading 2

Heading 3

Heading 3

Heading 4

Heading 5
Heading 6

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.

Block quote

Ordered list

  1. Item 1
  2. Item 2
  3. Item 3

Unordered list

  • Item A
  • Item B
  • Item C

Text link

Bold text

Emphasis

Superscript

Subscript

The Transition from Chatbots to Autonomous Orchestrators

The global banking sector has arrived at a structural crossroads. For the last three years, financial institutions have experimented with generative AI as a surface level feature, primarily for internal knowledge retrieval and basic customer service. However, as we enter 2026, the strategic focus has shifted from "generative" to "agentic." Senior decision makers are no longer content with models that simply provide information; they require systems that can independently interpret objectives, break them into subtasks, interact with legacy ledgers, and execute complex workflows with minimal human oversight.

The urgency of this shift is underscored by recent data from Accenture, which reveals that approximately 70 percent of banking IT budgets are currently consumed by the maintenance of technical debt. To break this cycle, leaders are looking toward the "10x bank" model, where one professional manages a team of AI agents to deliver exponential impact. The fundamental challenge lies in the first step of this transformation: selecting the right foundational model. In an era where the choice of architecture determines long term competitiveness, selecting the wrong model is not just a technical error but a significant strategic risk.

Heading 1

Heading 2

Heading 3

Heading 4

Heading 5
Type image caption here (optional)
Heading 6

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.

Block quote

Ordered list

  1. Item 1
  2. Item 2
  3. Item 3

Unordered list

  • Item A
  • Item B
  • Item C

Text link

Bold text

Emphasis

Superscript

Subscript

The Strategic Chasm: Large vs Small Language Models

A common misconception among C Suite executives is that the largest model is inherently the best for every application. While frontier Large Language Models (LLMs) such as GPT-5 or Claude 4.5 offer unmatched general purpose reasoning and creative capabilities, they are frequently an inefficient choice for specific banking tasks. These massive models carry high latency, prohibitive infrastructure costs, and a higher propensity for "hallucinations" in structured financial contexts.

Leading institutions are increasingly adopting a heterogeneous approach, utilizing Small Language Models (SLMs) for targeted, high frequency operations. Research indicates that SLMs, typically those under 14 billion parameters, can be 30 times cheaper to operate than their larger counterparts. For tasks such as parsing invoice data, triaging support tickets, or performing automated KYC checks, SLMs provide sub-second response times and can be trained on proprietary bank data to ensure higher precision.

The decision path for 2026 involves a "fit for purpose" assessment:

  • Frontier LLMs: Best for open ended reasoning, complex investment research, and high level strategic advisory where broad context is indispensable.
  • Specialised SLMs: Ideal for repetitive, predictable routines like command parsing, structured JSON output for tool calls, and real time fraud detection.

Evaluating the Capacity to Act: The Function Calling Imperative

The defining characteristic of an AI agent is its ability to interact with the external world through APIs and software tools. This means that when evaluating a foundational model, the most critical benchmark is not its prose but its "function calling" accuracy. A model that fails to correctly invoke a core banking API can trigger significant operational failures or compliance breaches.

Senior technical strategists are now looking toward the(https://gorilla.cs.berkeley.edu/leaderboard.html) (BFCL) as a primary metric for model selection. Current data suggests that while leading models are nearing human level performance in single turn tasks, a gap remains in "long horizon" reasoning where multiple sequential API calls are required. For example, an agent tasked with "rebalancing a corporate treasury portfolio" must check balances, calculate FX spreads, and initiate multiple transfers across different chains. Models that score high on the BFCL for relevance detection are essential for preventing "hallucinated" actions that could lead to financial loss.

Quantifying Business Impact and ROI

The shift to an agentic architecture is producing measurable gains that redefine the bottom line. According to research from McKinsey, banks that successfully redesign frontline domains end to end using AI agents can see revenues per relationship manager rise by up to 15 percent, while the cost to serve can fall by 20 to 40 percent.

Furthermore, organisations are achieving an average 2.3x return on agentic AI investments within 13 months. This ROI is driven by:

  • Operational Excellence: Reducing documentation time by up to 42 percent and automating manual prospecting.
  • Compliance Resilience: Transitioning from "after the fact" monitoring to real time, programmable policy enforcement embedded directly in the agent's reasoning loop.
  • Revenue Uplift: Using agents to provide hyper personalised advice at scale, a capability that previously was reserved only for high net worth segments.

Navigating Governance and Data Sovereignty

The final and most sensitive consideration in model selection is the regulatory environment. For global institutions, data sovereignty is a non-negotiable requirement. Models that operate exclusively in public clouds may not meet the rigorous standards of the EU AI Act or local data residency laws.

The move toward "On-Premises" or "Private Cloud" deployment of SLMs offers a path to resilience. By keeping the model within the bank's firewall, leaders ensure that sensitive client data never leaves the secured environment. This approach also allows for better auditability, as every reasoning trace generated by the agent can be stored and inspected to satisfy regulatory inquiries. Fyscal Technologies advocates for a "compliance by design" architecture, where model choice is balanced against the institution's risk appetite and jurisdictional requirements.

Strategic Implementation with Fyscal Technologies

Building a resilient, agentic workforce requires more than just an API key. It demands a vendor agnostic execution strategy that prevents the bank from becoming locked into a single model's roadmap. Fyscal Technologies serves as a strategic partner to help banks modernise their core systems and build the "ontology" necessary for agents to function effectively across departments.

Our engineering approach focuses on building a composable AI mesh, an orchestration layer that allows agents to reason and collaborate across different language models. Whether it is implementing secure multi party computation for wallet management or designing the data foundation for autonomous sales acceleration, we ensure that your digital transformation is built for agility and regulatory confidence.

Senior executives must transition from the "if" of AI to the "how" of agentic deployment. The organisations that move first to right-size their AI adoption will not only capture the productivity dividend but also secure their place as the primary financial interface for the next decade of digital commerce.

Conclusion: The Strategic Imperative for 2026

The era of monolithic, slow moving banking software is over. The rise of agentic AI represents a fundamental upgrade of the financial stack, one that rewards precision, speed, and autonomy. Decision makers who prioritise model fit for purpose over brand recognition will be best positioned to scale their AI capabilities safely and profitably. The window to gain strategic distance through these technologies is open now, but as the gap between market leaders and laggards widens, the cost of delay will only grow.

Ready to explore how Fyscal Technologies can help you achieve this

Book a Strategy Call →

Last Updated
February 1, 2026
CATEGORY
INSIGHTS

Get started for free

Try Webflow for as long as you like with our free Starter plan. Purchase a paid Site plan to publish, host, and unlock additional features.

Book a Strategy Call →
TRANSFORMING THE DESIGN PROCESS AT