Beyond the Hype: Causal AI in Education Needs a Spurious Regression Check
David O’Neill is a Professor of Finance and Data Analytics at the Gordon School of Business, SIAI. A Swiss-based researcher, his work explores the intersection of quantitative finance, AI, and educational innovation, particularly in designing executive-level curricula for AI-driven investment strategy. In addition to teaching, he manages the operational and financial oversight of SIAI’s education programs in Europe, contributing to the institute’s broader initiatives in hedge fund research and emerging market financial systems.
AI in education is pattern matching, not true thinking
The danger is confusing correlation with real causal insight
Schools must demand causal evidence before using AI in high-stakes decisions
In 2025, a national survey in the United Kingdom found that 88 percent of university students were using generative AI tools such as ChatGPT for their assessments. Yet a global UNESCO survey of more than 450 schools and universities found that fewer than 10 percent provided any formal guidance on the use of AI in teaching and learning. AI use in education is growing from the ground up, far ahead of institutional policy. At the same time, institutions debate the nature of these systems. Many public figures suggest each new model brings us closer to 'human-level' thinking. In reality, classrooms are running on powerful pattern machines that do not truly grasp cause and effect. Education policy must start from that fact: causal AI in education means demanding tools and practices grounded in causal relationships, not in superficial pattern matching.
Causal AI in Education: Why Pattern Matching Is Not Intelligence
Much of the current debate assumes AI is steadily advancing towards general intelligence. When a system writes an essay or solves a math problem, some people claim the benchmarks for intelligence have shifted. This perspective overlooks what today's systems actually do. They are massive engines for statistical pattern matching. They learn from trillions of words and images, identifying which tokens typically follow others. If we define intelligence as the ability to build causal models, operate across varied domains, and learn from experience, the picture looks different. These systems do not resemble self-learning agents such as children or animals. In econometrics, this shallow connection between variables is known as spurious regression. Two lines may move in tandem and the regression may report an excellent fit, yet nothing indicates that one causes the other. That is what most generative models offer in classrooms today: they predict and imitate without understanding why. Industry practitioners observe the same pattern. Systems excel at finding correlations in large datasets, but humans must determine whether those signals reflect actual causal forces.
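To see how easily a strong fit can masquerade as insight, consider a minimal simulation. The sketch below is purely illustrative and assumes the standard Python scientific stack: it regresses one independent random walk on another and shows how the apparent relationship in levels disappears once the shared drift is removed.

```python
import numpy as np
import statsmodels.api as sm

# Illustrative sketch: two independent random walks with no causal link between them.
rng = np.random.default_rng(42)
n = 500
x = np.cumsum(rng.normal(size=n))   # e.g., a slowly trending "internet usage" series
y = np.cumsum(rng.normal(size=n))   # e.g., a slowly trending "exports" series

# Naive regression in levels: the shared trends alone can produce an impressive-looking fit.
levels = sm.OLS(y, sm.add_constant(x)).fit()
print(f"R-squared in levels: {levels.rsquared:.2f}")

# Differencing removes the drift; the apparent relationship usually collapses.
diffs = sm.OLS(np.diff(y), sm.add_constant(np.diff(x))).fit()
print(f"R-squared in differences: {diffs.rsquared:.2f}")
```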
Recent evidence from technical evaluations supports this. The 2025 AI Index report indicates that while cutting-edge models now match or surpass humans on many benchmark tests, they still struggle with complex reasoning tasks like planning and logic puzzles, even when exact solutions exist. In other words, they perform well when the problem resembles their training data but fail when faced with fresh causal structures. Studies on large reasoning models from major labs tell a similar story. Performance declines as tasks become more complex, and models often resort to shortcuts rather than thinking harder. Additional work shows that top chatbots achieve high accuracy on factual questions, yet still confuse belief with fact and treat surface patterns as explanations. These are clear signs of spurious regression, not signs of minds awakening. This distinction is crucial for causal AI in education.
Spurious Regression, AGI Hype, and Causal AI in Education
Spurious regression isn't just a technical term; it reflects a common human tendency. When two trends move together over time, people want to create a narrative linking them. Economists learn to approach this impulse with caution. A country's exports and internet usage might rise together for a decade, yet that does not mean browsing the web causes trade surpluses. Similarly, when a model mimics human style, passes exams, and engages in fluent conversation, people often interpret it as a sign of a thinking agent. What is actually happening is closer to a very large auto-complete. The system selects tokens that fit the statistical pattern. It does not account for what would happen if a curriculum changed, an experiment were conducted, or a rule were broken. It lacks any notion of intervention. This is why causal AI in education matters; it emphasizes what causes real-world changes rather than just what predicts the next word.
The public already senses something is amiss. Surveys indicate that just over half of adults in the United States feel more anxious than excited about AI in everyday life, with only a small share feeling mostly excited. At the same time, adoption in education is advancing rapidly. A 2024 global survey found that 86 percent of higher education students used AI tools in their studies, most of them weekly or more. By 2025, a survey in the United Kingdom found that 88 percent of students used generative AI tools for assessments, and more than nine in ten used AI for some academic work. Yet the same surveys showed that about half of students did not feel "AI-ready" and lacked confidence in their ability to use these tools effectively. Meanwhile, UNESCO found that fewer than one in ten institutions had formal AI policies in place. This is the context for causal AI in education: heavy usage, minimal guidance, and significant concern.
Figure 1: Most students use AI, but very few feel prepared or know their university’s rules, exposing a readiness gap that causal AI in education must address.
Other sectors are beginning to correct similar issues by adopting causal methods. Healthcare serves as a helpful example. For years, clinical teams relied on machine learning models that ranked patients by risk. These systems were effective at identifying patterns, such as the connection between specific lab results and future complications, but they could not show what would happen if a treatment changed. New developments in causal AI for drug trials and care pathways aim to close this gap. They build models that ask 'what if' questions and employ rigorous experimental or quasi-experimental designs to distinguish genuine drivers from misleading correlations. The goal is not to make models seem more human but to make their recommendations safer and more reliable when circumstances change. Education policy should draw on the same insight by supporting research that builds causal models capable of predicting the impact of interventions, so that AI tools in education remain effective and trustworthy in dynamic classroom environments.
Policy Priorities for Causal AI in Education
The first priority is to teach the next generation what these systems can and cannot do. Currently, students see attractive interfaces and smooth language and assume there is a mind behind the screen. Research tells a different story. Large language models can solve Olympiad-level math problems and produce plausible step-by-step reasoning, yet independent evaluations show they still struggle with many logic tasks, lose track of constraints, and cannot reliably plan over multiple steps. When tasks become more challenging, models often guess quickly instead of thinking more deeply. In classroom settings, this means AI will excel at drafting, summarizing, and spotting patterns but will fall short when asked to justify a causal claim. Educators can turn this into a valuable teaching moment. Causal AI in education should emphasize critical use of current tools by encouraging students to probe model answers for hidden assumptions, missing mechanisms, and spurious regression.
The second priority is institutional. AI literacy should not rest solely on individual teachers. Parents and students expect schools to act. Recent surveys in the United States show that nearly nine in ten parents believe knowledge of AI is crucial for their children's education and career success, and about two-thirds think schools should explicitly teach students how to use AI. Yet UNESCO's survey indicates that fewer than one in ten institutions provide formal guidance. Many universities rely on ad hoc policies crafted by small committees under time pressure. This mismatch invites both overreaction and negligence. One extreme leads to blanket bans that push AI use underground and shut down honest discussion. The other leads to blind trust in "AI tutors" taking charge of feedback and assessment. A policy agenda focused on causal AI in education should ask a straightforward question: under what conditions does a specific AI tool measurably help students learn, and how do those effects vary across different groups?
Answering that question requires better evidence. The tools of causal AI in education are practical, straightforward methods that many institutions already use in other areas. Randomized trials can compare sections using an AI writing assistant to those that do not, tracking not only grades but also the transfer of skills to new tasks. Quasi-experimental designs can study the impact of AI-enabled homework support in districts that introduce new tools in phased implementations. Learning analytics teams can shift from dashboards that merely describe past behavior to models that simulate the consequences of changes in grading rules or feedback styles. None of this necessitates systems that think like humans; it requires clarity about causes and careful control groups. The potential benefits are considerable. Rather than debating abstractly whether AI is "good" or "bad" for learning, institutions can begin to identify where causal AI in education truly adds value and where it merely automates spurious regression.
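As a concrete illustration of how modest the required tooling is, the sketch below simulates the kind of analysis a learning analytics team might run after a randomized rollout of an AI writing assistant. All variable names, numbers, and the assumed effect size are invented for illustration; the point is the design, not the estimate.

```python
import numpy as np
import statsmodels.api as sm

# Hypothetical randomized rollout: half of course sections get an AI writing
# assistant, half do not. Every number below is invented for illustration.
rng = np.random.default_rng(0)
n = 400
baseline = rng.normal(70, 10, n)        # prior writing score
treated = rng.integers(0, 2, n)         # random assignment to the AI assistant
true_effect = 2.0                       # assumed effect, in score points
outcome = baseline + true_effect * treated + rng.normal(0, 8, n)

# Regression-adjusted estimate of the average treatment effect.
X = sm.add_constant(np.column_stack([treated, baseline]))
fit = sm.OLS(outcome, X).fit()
est, se = fit.params[1], fit.bse[1]
print(f"Estimated effect: {est:.2f} points "
      f"(95% CI roughly {est - 1.96 * se:.2f} to {est + 1.96 * se:.2f})")
```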
Figure 2: In just one year, UK students moved from majority to near-universal AI use, especially for assessments, showing why causal AI in education policy cannot wait.
The final priority is to establish regulatory clarity. Governments are rapidly addressing AI in general, but education often remains on the sidelines of those discussions. Regulations that treat all advanced models as steps toward general intelligence can overlook the real risks. In education, the most urgent dangers arise from pattern-matching systems that appear authoritative while concealing their limitations. Recent technical studies show that even leading chatbots released after mid-2024 still struggle to distinguish fact from belief and can confidently present incorrect explanations in sensitive areas such as law and medicine. Other research highlights a significant drop in accuracy when reasoning models tackle more complex challenges, even though they perform well on simpler tasks. These are not minor issues on the path to human-like thought; they signal the risks of relying too heavily on correlation. Regulations shaped by causal AI in education should mandate that any high-stakes use, such as grading or placement, undergo experimental testing for robustness to change, rather than relying solely on benchmark scores based on past data.
Education systems face a critical choice. One path tells a story in which every new AI breakthrough prompts us to redefine intelligence and brings us closer to machines that think like us. The other path is more realistic: today's systems are fast, cost-effective engines for statistical fitting that can benefit classrooms when used responsibly, and that can cause serious harm when spurious regression is mistaken for genuine understanding. The statistics cited at the outset should command attention. When nearly 90 percent of students depend on AI for their work and fewer than 10 percent of institutions provide solid guidance, the problem is not that machines are becoming too intelligent; it is that policy is not yet smart enough about machines. The task ahead is straightforward. Foster a culture of causal AI in education that encourages students to question patterns, empowers teachers to run simple experiments, and insists on evidence before high-stakes deployment. If we succeed, the goalposts do not need to shift at all.
The views expressed in this article are those of the author(s) and do not necessarily reflect the official position of the Swiss Institute of Artificial Intelligence (SIAI) or its affiliates.
References
College Board. (2025). U.S. high school students' use of generative artificial intelligence. Research Brief.
DIA Global Forum. (2024). Correlation vs. causation: How causal AI is helping determine key connections in healthcare and clinical trials.
Digital Education Council. (2024). Key results from DEC Global AI Student Survey 2024.
Higher Education Policy Institute. (2025). Student generative AI survey 2025.
Pew Research Center. (2023). What the data says about Americans' views of artificial intelligence.
Qymatix. (2025). Correlation and causality in artificial intelligence: What does this mean for wholesalers?
Stanford Institute for Human-Centered Artificial Intelligence. (2025). AI Index Report 2025.
UNESCO. (2023). Guidance for generative AI in education and research: Executive summary.
Governing at Machine Speed: How to Prevent AI Bank Runs from Becoming the Next Crisis
Keith Lee is a Professor of AI and Data Science at the Gordon School of Business, part of the Swiss Institute of Artificial Intelligence (SIAI), where he leads research and teaching on AI-driven finance and data science. He is also a Senior Research Fellow with the GIAI Council, advising on the institute’s global research and financial strategy, including initiatives in Asia and the Middle East.
AI is accelerating bank-run risk into “AI bank runs”
Supervisors lag far behind banks in AI tools and skills
We need real-time oversight, automated crisis tools, and targeted training now
In March 2023, a mid-sized American bank experienced a day when depositors attempted to withdraw $42 billion. Most of this rush came from simple phone taps, not from long lines outside the bank. Although this scramble did not yet use fully autonomous systems, it demonstrated how quickly fear can spread when money moves as fast as a push notification. Now, imagine an added layer. Corporate treasurers use AI tools to track every rumor. Households rely on chatbots that adjust savings in real time. Trading engines react to each other's moves in milliseconds. The outcome is not a traditional bank run. Instead, it’s the potential for AI-driven bank runs, in which algorithms trigger, amplify, and execute waves of withdrawals and funding cuts long before human supervisors can act.
Why AI Bank Runs Change the Crisis Playbook
Systemic banking crises have always involved everyone rushing for the same exit at the same time. In the past, this coordination relied on rumors, news broadcasts, and human analysis of financial reports spread over hours or days. Today, AI systems can scan markets, social media, and private data streams in seconds. They update risk models and send orders almost immediately. The rush that caused Silicon Valley Bank’s collapse was one example. Customers attempted to withdraw tens of billions of dollars in a single day, highlighting the power of digital channels and online coordination. Future AI bank runs will shorten that timeline even more. Automated systems will learn to respond not only to general news but also to each other’s activities in markets and payment flows. Tools that smooth out minor fluctuations during calm periods can trigger sudden, synchronized movements during stressful periods.
The groundwork for AI bank runs is already established. In developing economies, the share of adults making or receiving digital payments grew from 35 percent in 2014 to 57 percent in 2021; in high-income economies, the share is nearly universal. Recent Global Findex updates show that 42 percent of adults made a digital payment to a merchant in 2024, up from 35 percent in 2021. At the same time, 86 percent of adults worldwide now own a mobile phone, and 68 percent have a smartphone. On the supply side, surveys by European and national authorities indicate that a clear majority of banks already use AI for tasks such as credit scoring, fraud detection, and trading. The technical preconditions for AI bank runs are therefore in place today: highly digital customers, near-instant payments, and widespread AI use among firms. What is missing is a supervisory framework that can monitor and influence these dynamics before they develop into full-blown crises.
What Emerging Markets Teach Us about AI Bank Runs
The weakest link in this situation is not the sophistication of private sector tools but the readiness of supervisory agencies. A recent survey of 27 financial sector authorities in emerging markets and developing economies shows that most anticipate AI will have a net positive effect. Yet their own use of AI for supervision remains limited. Many authorities are still in pilot mode for essential tasks such as data collection, anomaly detection, and off-site monitoring. Only about a quarter have formal internal policies on AI use. In parts of Africa, the percentage is even lower. The usual barriers are present. Data is often fragmented or incomplete. IT systems may be unreliable. Staff with the right technical skills are scarce. Concerns about data privacy, sovereignty, and vendor risk add further complications. Supervisors face the threat of AI bank runs with inadequate tools, incomplete data, and insufficient personnel.
In contrast, financial institutions in these same regions are advancing more quickly. The survey reveals that where AI is implemented, banks and firms focus on customer service chatbots, fraud detection, and anti-money laundering checks. African institutions are more inclined to use AI for credit scoring and underwriting in markets with limited data. These applications may seem low risk compared to high-frequency trading. However, they can create new pathways for a sudden loss of trust. A failure in an AI fraud detection system can freeze thousands of accounts simultaneously. A flawed credit model can halt lending in entire sectors overnight. The Financial Stability Board has noted that many authorities, including those in advanced economies, struggle to monitor AI-related vulnerabilities using current data-collection methods. Therefore, emerging markets serve as an early warning system. They illustrate how quickly a gap can widen between supervised firms and supervisory agencies. They also show how that gap can evolve into a serious vulnerability if not treated as an essential issue.
Figure 1: AI in emerging-market banks is concentrated in customer service, fraud/AML, and analytics, showing how fast private adoption is racing ahead of supervisory capacity.
Designing Concrete Defenses against AI Bank Runs
If AI bank runs pose a structural risk, then mere "AI-aware supervision" is insufficient. Authorities need a solid framework for prevention and response. The first building block is straightforward but often overlooked. Supervisors need real-time insight into where AI is used within large firms. Some authorities have begun asking banks to disclose whether they use AI in areas such as risk management, credit, and treasury. The results are inconsistent and hard to compare. A more serious method would treat high-impact AI systems like vital infrastructure. Banks would maintain a live register of models that influence liquidity management, deposit pricing, collateral valuation, and payment routing. This register should include key vendors and data sources. Supervisors could then see where similar models or vendors are used across institutions and where synchronized actions might occur under stress. This step is essential for any coherent response to AI bank runs.
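To make the idea tangible, here is a hypothetical sketch of such a register, not any regulator's actual schema. Each entry records a model's function, vendor, and data sources, so a supervisor can group entries and spot where the same vendor sits behind the same function at many institutions.

```python
from collections import Counter
from dataclasses import dataclass

@dataclass
class ModelRegisterEntry:
    # Hypothetical fields; a real supervisory schema would be far richer.
    bank: str
    function: str               # e.g., "liquidity management", "deposit pricing"
    vendor: str
    data_sources: tuple[str, ...]

register = [
    ModelRegisterEntry("Bank A", "deposit pricing", "VendorX", ("rates feed",)),
    ModelRegisterEntry("Bank B", "deposit pricing", "VendorX", ("rates feed",)),
    ModelRegisterEntry("Bank C", "collateral valuation", "VendorY", ("market data",)),
]

# Which vendors sit behind the same function at many banks? High counts flag
# places where similar models could act in sync under stress.
concentration = Counter((e.function, e.vendor) for e in register)
for (function, vendor), count in concentration.most_common():
    print(f"{function} via {vendor}: used by {count} institution(s)")
```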
The second building block is shared early warning systems that reflect, at least partially, the speed and complexity of private algorithms. Some central banks already conduct interactive stress tests in which firms adapt their strategies as conditions change. Groups of authorities could build on this idea and use methods such as federated learning to jointly train models on local data without sharing raw data. These supervisory models would monitor not just traditional indicators but also the behavior of high-impact AI systems across banks, payment providers, and large non-bank platforms. Signals from these models, combined with insights about vendor concentration, would enable authorities to detect when AI bank runs are developing. They wouldn’t simply observe deposit outflows after the fact.
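Federated learning, in this context, means each authority or firm fits a model on its own confidential data and shares only the fitted parameters, which a coordinator then averages. The sketch below shows the basic federated-averaging idea on a toy linear model with invented data; it is a schematic illustration, not a production supervisory system.

```python
import numpy as np

def local_fit(X, y):
    """Fit a linear model on one jurisdiction's local data (ordinary least squares)."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

def federated_average(local_params, weights):
    """Combine local parameter vectors without ever pooling the raw data."""
    return np.average(np.stack(local_params), axis=0, weights=np.asarray(weights, float))

# Toy setup: three jurisdictions, each with its own confidential stress indicators.
rng = np.random.default_rng(1)
true_coef = np.array([0.8, -1.2])
local_params, sizes = [], []
for n in (200, 350, 150):
    X = rng.normal(size=(n, 2))            # e.g., outflow speed, vendor concentration
    y = X @ true_coef + rng.normal(0, 0.5, n)
    local_params.append(local_fit(X, y))
    sizes.append(n)

global_coef = federated_average(local_params, sizes)
print("Jointly learned coefficients:", np.round(global_coef, 2))
```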
The third building block is crisis infrastructure that can act quickly when early warnings appear. Authorities already use tools such as market circuit breakers and standing liquidity facilities. For AI bank runs, these tools must be redesigned with automated triggers. This could mean pre-authorized liquidity lines that activate when particular patterns of outflows are detected across multiple institutions. It might involve temporary restrictions on certain types of algorithmic orders or instant transfer products during extreme stress. None of this eliminates the need for human judgment. It simply acknowledges that by the time a crisis committee gathers, AI-driven actions may have already changed balance sheets. Without pre-set responses linked to well-defined metrics, supervisors will remain bystanders in events they should oversee.
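A pre-set response only works if the trigger itself is explicit and machine-checkable. The rule below is a deliberately simplified, hypothetical example of the kind of metric-based trigger described here: it flags stress when fast outflows appear at several institutions within the same short window, the point at which pre-authorized tools would activate. The thresholds are placeholders, not calibrated values.

```python
from datetime import datetime, timedelta

# Hypothetical thresholds; real calibration would come from supervisory data.
OUTFLOW_THRESHOLD = 0.05        # 5% of deposits within the window
WINDOW = timedelta(minutes=30)
MIN_INSTITUTIONS = 3            # synchronized stress across several banks

def crisis_trigger(events, now):
    """events: list of (timestamp, bank, outflow_fraction) observations."""
    recent = [(t, bank, frac) for t, bank, frac in events
              if now - t <= WINDOW and frac >= OUTFLOW_THRESHOLD]
    stressed_banks = {bank for _, bank, _ in recent}
    return len(stressed_banks) >= MIN_INSTITUTIONS

# Example: three banks report fast outflows within half an hour.
now = datetime(2030, 1, 1, 12, 0)
events = [(now - timedelta(minutes=m), bank, 0.06)
          for m, bank in [(5, "Bank A"), (12, "Bank B"), (20, "Bank C")]]
print("Activate pre-authorized liquidity lines:", crisis_trigger(events, now))
```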
An Education Agenda for an Era of AI Bank Runs
These defenses will be ineffective if they rely on a small group of technical experts. The survey of emerging markets highlights that internal skill shortages are a significant obstacle to the use of AI in supervision. This issue is also prevalent in many advanced economies, where central bank staff and frontline supervisors often lack practical experience with modern machine learning tools. At the same time, most adults now carry smartphones, send digital payments, and interact with AI-powered systems in their daily lives. The human bottleneck lies within the authorities, not beyond them. Bridging that gap is therefore as much an educational challenge as it is a regulatory one. It requires a new collaboration between financial authorities, universities, and online training providers.
Figure 2: Supervisors plan to use AI first for data and risk analysis, while more complex tasks stay in the medium-to-long-term bucket, revealing a strategic but still cautious adoption path.
That collaboration should start from the reality of AI bank runs, not from vague courses on "innovation." Supervisors, policymakers, and bank risk teams need access to practical programs that blend basic coding and data literacy with a solid understanding of liquidity risk, market dynamics, and consumer behavior in digital settings. Scenario labs in which participants simulate AI bank runs can build intuition more effectively than traditional lectures; in these exercises, chatbots, robo-advisers, treasurers, and central bank tools all interact on the same screen. Micro-credentials for board members, regulators, and journalists can spread that knowledge beyond a small group of experts. Online education platforms can make these resources available to authorities in low-income countries that cannot afford large in-house training programs. The goal is not to turn every supervisor into a data scientist. It is to ensure that enough people in the system understand what an AI bank run looks like in practice, so that the institutional response can be informed, swift, and coordinated.
The window to act is still open. The same global surveys that indicate rapid adoption of AI in banks also show that supervisors’ use of AI remains at an earlier stage. The gap is vast in emerging markets. Today’s digital bank runs, like the surge that caused a mid-sized lender to collapse in 2023, still unfold over hours instead of seconds. However, that margin is decreasing as AI transitions from test projects to essential systems in both finance and everyday life. Authorities can still alter their path. Investments in data infrastructure, shared supervisory models, machine-speed crisis tools, and a serious education agenda can prevent AI bank runs from becoming the major crises of the next decade. If they hesitate, the next systemic event may not begin with a rumor in a line. It may start with a silent cascade of algorithm-driven decisions that no one in charge can detect or understand.
The views expressed in this article are those of the author(s) and do not necessarily reflect the official position of the Swiss Institute of Artificial Intelligence (SIAI) or its affiliates.
References
Bank of England. 2024. Artificial intelligence in UK financial services 2024. London: Bank of England.
Boeddu, Gian, Erik Feyen, Sergio Jose de Mesquita Gomes, Serafin Martinez Jaramillo, Arpita Sarkar, Srishti Sinha, Yasemin Palta, and Alexandra Gutiérrez Traverso. 2025. "AI for financial sector supervision: New evidence from emerging market and developing economies." VoxEU, 18 November.
Danielsson, Jon. 2025. "How financial authorities best respond to AI challenges." VoxEU, 25 November.
European Central Bank. 2024. "The rise of artificial intelligence: benefits and risks for financial stability." Financial Stability Review, May.
Federal Deposit Insurance Corporation. 2023. "Recent bank failures and the federal regulatory response." Speech by Chairman Martin J. Gruenberg, 27 March.
Financial Stability Board. 2025. Monitoring adoption of artificial intelligence and related technologies. Basel: FSB.
Financial Times. 2024. "Banks' use of AI could be included in stress tests, says Bank of England deputy governor." 6 November.
Visa Economic Empowerment Institute. 2025. "World Bank Global Findex 2025 insight." 9 October.
World Bank. 2022. The Global Findex Database 2021: Financial inclusion, digital payments, and resilience in the age of COVID-19. Washington, DC: World Bank.
World Bank. 2025a. "Mobile-phone technology powers saving surge in developing economies." Press release, 16 July.
World Bank. 2025b. Artificial Intelligence for Financial Sector Supervision: An Emerging Market and Developing Economies Perspective. Washington, DC: World Bank.