Choosing Foundational Models for AI Agents in Banking
Learn how senior leaders can select the optimal foundational models for AI agents to drive productivity gains and escape the technical debt trap in 2026.

Learn how senior leaders can select the optimal foundational models for AI agents to drive productivity gains and escape the technical debt trap in 2026.

Subscribe now for best practices, research reports, and more.
The global banking sector has arrived at a structural crossroads. For the last three years, financial institutions have experimented with generative AI as a surface level feature, primarily for internal knowledge retrieval and basic customer service. However, as we enter 2026, the strategic focus has shifted from "generative" to "agentic." Senior decision makers are no longer content with models that simply provide information; they require systems that can independently interpret objectives, break them into subtasks, interact with legacy ledgers, and execute complex workflows with minimal human oversight.
The urgency of this shift is underscored by recent data from Accenture, which reveals that approximately 70 percent of banking IT budgets are currently consumed by the maintenance of technical debt. To break this cycle, leaders are looking toward the "10x bank" model, where one professional manages a team of AI agents to deliver exponential impact. The fundamental challenge lies in the first step of this transformation: selecting the right foundational model. In an era where the choice of architecture determines long term competitiveness, selecting the wrong model is not just a technical error but a significant strategic risk.
A common misconception among C Suite executives is that the largest model is inherently the best for every application. While frontier Large Language Models (LLMs) such as GPT-5 or Claude 4.5 offer unmatched general purpose reasoning and creative capabilities, they are frequently an inefficient choice for specific banking tasks. These massive models carry high latency, prohibitive infrastructure costs, and a higher propensity for "hallucinations" in structured financial contexts.
Leading institutions are increasingly adopting a heterogeneous approach, utilizing Small Language Models (SLMs) for targeted, high frequency operations. Research indicates that SLMs, typically those under 14 billion parameters, can be 30 times cheaper to operate than their larger counterparts. For tasks such as parsing invoice data, triaging support tickets, or performing automated KYC checks, SLMs provide sub-second response times and can be trained on proprietary bank data to ensure higher precision.
The decision path for 2026 involves a "fit for purpose" assessment:
The defining characteristic of an AI agent is its ability to interact with the external world through APIs and software tools. This means that when evaluating a foundational model, the most critical benchmark is not its prose but its "function calling" accuracy. A model that fails to correctly invoke a core banking API can trigger significant operational failures or compliance breaches.
Senior technical strategists are now looking toward the(https://gorilla.cs.berkeley.edu/leaderboard.html) (BFCL) as a primary metric for model selection. Current data suggests that while leading models are nearing human level performance in single turn tasks, a gap remains in "long horizon" reasoning where multiple sequential API calls are required. For example, an agent tasked with "rebalancing a corporate treasury portfolio" must check balances, calculate FX spreads, and initiate multiple transfers across different chains. Models that score high on the BFCL for relevance detection are essential for preventing "hallucinated" actions that could lead to financial loss.
The shift to an agentic architecture is producing measurable gains that redefine the bottom line. According to research from McKinsey, banks that successfully redesign frontline domains end to end using AI agents can see revenues per relationship manager rise by up to 15 percent, while the cost to serve can fall by 20 to 40 percent.
Furthermore, organisations are achieving an average 2.3x return on agentic AI investments within 13 months. This ROI is driven by:
The final and most sensitive consideration in model selection is the regulatory environment. For global institutions, data sovereignty is a non-negotiable requirement. Models that operate exclusively in public clouds may not meet the rigorous standards of the EU AI Act or local data residency laws.
The move toward "On-Premises" or "Private Cloud" deployment of SLMs offers a path to resilience. By keeping the model within the bank's firewall, leaders ensure that sensitive client data never leaves the secured environment. This approach also allows for better auditability, as every reasoning trace generated by the agent can be stored and inspected to satisfy regulatory inquiries. Fyscal Technologies advocates for a "compliance by design" architecture, where model choice is balanced against the institution's risk appetite and jurisdictional requirements.
Building a resilient, agentic workforce requires more than just an API key. It demands a vendor agnostic execution strategy that prevents the bank from becoming locked into a single model's roadmap. Fyscal Technologies serves as a strategic partner to help banks modernise their core systems and build the "ontology" necessary for agents to function effectively across departments.
Our engineering approach focuses on building a composable AI mesh, an orchestration layer that allows agents to reason and collaborate across different language models. Whether it is implementing secure multi party computation for wallet management or designing the data foundation for autonomous sales acceleration, we ensure that your digital transformation is built for agility and regulatory confidence.
Senior executives must transition from the "if" of AI to the "how" of agentic deployment. The organisations that move first to right-size their AI adoption will not only capture the productivity dividend but also secure their place as the primary financial interface for the next decade of digital commerce.
The era of monolithic, slow moving banking software is over. The rise of agentic AI represents a fundamental upgrade of the financial stack, one that rewards precision, speed, and autonomy. Decision makers who prioritise model fit for purpose over brand recognition will be best positioned to scale their AI capabilities safely and profitably. The window to gain strategic distance through these technologies is open now, but as the gap between market leaders and laggards widens, the cost of delay will only grow.
Ready to explore how Fyscal Technologies can help you achieve this