Sector-Specific AI: Why Finance Isn’t ChatGPT—and Why That Matters
Ethan McGowan
Ethan McGowan is a Professor of Financial Technology and Legal Analytics at the Gordon School of Business, SIAI. Originally from the United Kingdom, he works at the frontier of AI applications in financial regulation and institutional strategy, advising on governance and legal frameworks for next-generation investment vehicles. McGowan plays a key role in SIAI’s expansion into global finance hubs, including oversight of the institute’s initiatives in the Middle East and its emerging hedge fund operations.
AI works best when built for each sector’s data and goals
Finance needs domain-grounded models and risk-based metrics, not generic chatbots
Teach, buy, and regulate using sector-specific measures
The most important figure in today's AI discussion isn't a parameter count. It is a divide. In 2024, only 13.5% of EU firms reported using AI at all, yet nearly half of information and communications firms had adopted it, while most other sectors were below one in six. The pattern is revealing: adoption happens where data and tasks match the tools, and stalls where they do not. It also challenges the idea that "AI equals ChatGPT." Sector-specific AI depends on the domain's data, error structure, incentives, and the metrics the field actually uses. Image recognition and text generation benefit from patterns in large, clean, stable datasets. Financial time series, by contrast, are noisy, prone to regime shifts, and far more costly to get wrong. If we keep treating AI as a single entity, we will teach students the wrong things, buy the wrong systems, and mismanage risk. The answer isn't more hype; it's sector-specific AI built around the needs of each field.
Sector-specific AI is a practice, not a product
Sector-specific AI isn’t just a chatbot dressed up; it’s a group of probabilistic systems tailored to specific contexts. In some areas, these systems already outperform previous top standards. Take weather forecasting. In peer-reviewed studies, DeepMind’s GraphCast beat the leading European physics model on 90% of 1,380 verification targets for forecasts ranging from three to ten days, delivering results in under a minute. This achievement stemmed from training on decades of structured reanalysis data, which had well-understood physical constraints and stable measurement processes. In short, the domain provided the model with valuable signals. This lesson applies broadly: when data are plentiful, consistent, and connected to governing dynamics, the benefits of learning are significant and rapid. That’s the essence of sector-specific AI.
The adoption rates illustrate the same story. Across the European Union, information and communications firms led AI use at 48.7% in 2024, followed by professional, scientific, and technical services at 30.5%. Adoption rates were lower in other sectors, highlighting differences in how well tasks match available data. UK business surveys present a similar picture: about 15% of firms reported using AI in late 2024, with adoption increasing alongside firm size and digital readiness. Global executive surveys show another layer: many organizations claim to be "using AI" somewhere, but the actual use is primarily in IT and sales/marketing, not in the complex, data-scarce areas of each sector. Sector-specific AI doesn’t spread like a single product; it expands where data, incentives, and measurement are aligned.
Why financial time series breaks the hype
Finance is where the idea that “AI = ChatGPT” causes the most damage. Equity and foreign exchange returns exhibit heavy tails, clustered volatility, and frequent regime changes—factors that render simple pattern-finding unreliable. The signal-to-noise ratio is low, labels change because of market perceptions (prices move because others believe they will), and feedback loops can reverse correlations. Recent studies clarify this issue: careful analyses find that language-model features generally do not beat strong, purpose-built time-series baselines. In many cases, simpler attention models or traditional structures perform just as well or better at a much lower cost. This isn’t a critique of AI; it’s a reminder that sector-specific AI for markets must begin with the data-generating process, not the latest general-purpose model.
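To make the point concrete, the short sketch below simulates a GARCH(1,1)-style return series in Python. The parameters are illustrative assumptions, not estimates from any market; even in this toy setting, raw returns are nearly uncorrelated while squared returns are not, and kurtosis sits well above the Gaussian value, which is exactly the combination that defeats naive pattern-finding.

```python
# A toy GARCH(1,1) simulation (illustrative parameters, not market estimates):
# volatility clusters, tails are heavy, yet raw returns carry almost no
# linear signal -- the combination that defeats naive pattern-finding.
import numpy as np

rng = np.random.default_rng(0)
T = 5_000
omega, alpha, beta = 1e-6, 0.08, 0.90          # assumed GARCH(1,1) parameters
sigma2 = np.empty(T)
r = np.empty(T)
sigma2[0] = omega / (1 - alpha - beta)         # unconditional variance
r[0] = np.sqrt(sigma2[0]) * rng.standard_normal()

for t in range(1, T):
    sigma2[t] = omega + alpha * r[t - 1] ** 2 + beta * sigma2[t - 1]
    r[t] = np.sqrt(sigma2[t]) * rng.standard_normal()

def autocorr(x, lag=1):
    x = x - x.mean()
    return float((x[:-lag] * x[lag:]).sum() / (x ** 2).sum())

excess_kurtosis = ((r - r.mean()) ** 4).mean() / r.var() ** 2 - 3
print(f"excess kurtosis:          {excess_kurtosis:.2f}")   # > 0: heavier tails than Gaussian
print(f"autocorr of returns:      {autocorr(r):.3f}")       # near zero: little linear signal
print(f"autocorr of squared rets: {autocorr(r ** 2):.3f}")  # well above zero: clustering
```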
What works in weather or image recognition doesn’t translate directly to forecasting returns. Weather models benefit from physics-consistent data with dense, continuous coverage. Corporate earnings, yields, and prices fluctuate due to policy changes, accounting adjustments, liquidity pressures, and narratives that affect risk pricing. Advanced models in finance can assist by nowcasting macroeconomic series from mixed data, classifying the tone of news, and identifying structural breaks, but the objectives differ. A trading desk cares less about root mean square error on tomorrow’s price and more about drawdowns, turnover, and tail risk during stress periods. This requires sector-specific AI that combines causal structure, reliable features, and stringent controls for backtesting and live deployment. If the benchmark is whether it surpasses a well-tuned seasonal naive or a hedged factor model after costs, many flashy systems fall short. That isn’t a failure of AI; it reflects the nature of finance.
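A minimal sketch of that benchmark logic, assuming two aligned daily pandas Series, `returns` and a model forecast `signal` (both hypothetical inputs): positions are scored net of transaction costs on drawdown, Sharpe, and turnover, and compared with a naive rule standing in for the seasonal-naive baseline in the text.

```python
# Judge a candidate signal by what a desk cares about -- net-of-cost
# performance, turnover, and drawdown -- relative to a naive baseline.
# `returns` and `signal` are assumed, pre-aligned pandas Series of daily data.
import numpy as np
import pandas as pd

def evaluate(positions: pd.Series, returns: pd.Series, cost_bps: float = 5.0) -> dict:
    """Net-of-cost P&L, annualized Sharpe, and max drawdown for a position series."""
    turnover = positions.diff().abs().fillna(positions.abs())
    pnl = positions.shift(1) * returns - turnover * cost_bps / 1e4
    pnl = pnl.dropna()
    equity = (1 + pnl).cumprod()
    drawdown = equity / equity.cummax() - 1
    sharpe = np.sqrt(252) * pnl.mean() / pnl.std() if pnl.std() > 0 else np.nan
    return {"net_sharpe": sharpe,
            "max_drawdown": drawdown.min(),
            "avg_turnover": turnover.mean()}

# Candidate: positions follow the model's forecast sign.
# Baseline: a simple naive rule (yesterday's sign), standing in for the
# "well-tuned seasonal naive" described above.
candidate = evaluate(np.sign(signal), returns)
baseline = evaluate(np.sign(returns.shift(1)), returns)
print(candidate, baseline, sep="\n")
```

The design choice is the point: if the candidate does not beat the naive rule on these numbers after costs, its RMSE on next-day prices is irrelevant.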
Figure 1: AI user adoption (ChatGPT as proxy) has been exponentially faster than that of earlier digital platforms, underscoring that general-purpose tools spread faster than sector-specific ones but also face sharper saturation and adaptation limits.
Sector-specific AI needs local metrics, not generic demos
The quickest benefits from sector-specific AI come when evaluations align with the job to be done. In meteorology, the goal is timely, location-specific forecasts on variables that impact operations; benchmarks reward physical consistency and accuracy across different times. That’s why the GraphCast work—and hybrid systems that combine it with ensemble methods—are significant to grid planners and disaster response teams, not just to machine learning experts. The technique, data, and impact metric all align. In health imaging, sensitivity and specificity, depending on the condition and scanner type, matter more than polished fluency. In manufacturing, defects per million and scrap rates determine success, not flashy presentations. Sectors that establish these local metrics see real compounding benefits from AI.
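As an illustration of such a local metric, the sketch below computes sensitivity and specificity stratified by scanner type; the DataFrame `df` and its columns are assumptions for the example, not a reference dataset.

```python
# A "local metric" for health imaging: sensitivity and specificity reported
# per scanner type rather than as one pooled number.
# `df` is an assumed DataFrame with columns: label (0/1), pred (0/1), scanner.
import pandas as pd

def sens_spec(group: pd.DataFrame) -> pd.Series:
    tp = ((group.label == 1) & (group.pred == 1)).sum()
    fn = ((group.label == 1) & (group.pred == 0)).sum()
    tn = ((group.label == 0) & (group.pred == 0)).sum()
    fp = ((group.label == 0) & (group.pred == 1)).sum()
    return pd.Series({
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
        "n": len(group),
    })

per_scanner = df.groupby("scanner")[["label", "pred"]].apply(sens_spec)
print(per_scanner)  # a model that looks fine pooled can still fail on one scanner
```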
Figure 2: The structure of AI research reveals distinct architectures—computer vision, NLP, robotics, and time-series analysis—each producing different error behaviors and performance ceilings. Policy must recognize these domain boundaries to design realistic AI adoption strategies.
Financial markets require an even stricter approach. A credible sector-specific AI roadmap starts with governance: controlled data pipelines, baseline econometrics, and pre-registered backtesting protocols that penalize overfitting. It then focuses evaluations on finance-specific outcomes: probability of failure under stress, realized Sharpe ratio after fees and slippage, turnover-adjusted alpha, and worst-case liquidity scenarios. This lens explains the gap between firms that say "we use AI somewhere" and those that say "we trust an AI signal in real time." Business surveys show widespread experimentation—78% of firms reported AI use in at least one function in 2025—but not widespread transformation. In finance, trust will increase when developers acknowledge the noise and start with the right benchmarks. Sector-specific AI works when it optimizes for the metrics that the sector already uses.
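One way to operationalize a pre-registered protocol is a walk-forward evaluation whose splits, costs, and scoring are fixed before any model is trained. The sketch below assumes a scikit-learn-style `make_model` factory and hypothetical `features` and `returns` inputs; it is a sketch of the idea, not a prescribed implementation.

```python
# A walk-forward protocol: the evaluation plan (splits, costs, scoring) is fixed
# up front, and every model -- simple or not -- is scored the same way out of sample.
# `make_model`, `features`, and `returns` are assumed placeholders.
import numpy as np
import pandas as pd

def walk_forward(features: pd.DataFrame, returns: pd.Series,
                 make_model, n_splits: int = 5, cost_bps: float = 5.0) -> pd.Series:
    """Expanding-window out-of-sample P&L; the split grid is decided before fitting."""
    edges = np.linspace(len(returns) // 2, len(returns), n_splits + 1, dtype=int)
    oos = []
    for start, end in zip(edges[:-1], edges[1:]):
        model = make_model()
        model.fit(features.iloc[:start], returns.iloc[:start])     # train on the past only
        pos = pd.Series(np.sign(model.predict(features.iloc[start:end])),
                        index=returns.index[start:end])
        turnover = pos.diff().abs().fillna(pos.abs())
        oos.append(pos.shift(1) * returns.iloc[start:end] - turnover * cost_bps / 1e4)
    # Score the concatenated P&L on net Sharpe, drawdown, and stress windows.
    return pd.concat(oos).dropna()
```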
What educators, administrators, and policymakers should do next
Educators should teach sector-specific AI as a skill grounded in domain data. This begins with projects that use real sector datasets and metrics. A finance capstone should connect students with anonymized tick or macro datasets, run competitions against seasonal naïve, ARIMA-GARCH, and transformer models, and assess based on out-of-sample risk and costs, rather than classroom-friendly accuracy. A public-sector project should simulate casework prioritization or fraud detection, focusing on fairness, false-positive costs, and auditability from the start. An energy systems project should optimize fleet dispatch based on weather forecasts and price volatility. The core message is clear: the sector defines the loss function, and the model adapts accordingly. OECD analysis indicates that sectoral AI dynamics differ significantly; curricula should reflect that reality, rather than compress it into a single "AI course."
Administrators should also invest in sector metrics. In universities and research hospitals, purchases should require direct validation on outcomes that matter to the unit—such as sensitivity on critical findings and workload effects in radiology—rather than broad natural language processing benchmarks. In business schools and engineering programs, computing resources should be tied to reproducible methods and solid baselines, not just parameter counts. In finance labs, live-trading environments should be separated from model development and subject to strict change controls. For many institutions, the first step is straightforward: improve data management. UK and EU business surveys show that AI adoption increases with digital maturity; companies with good data management and security gain more tangible benefits from AI than those that skip the basics. Sector-specific AI relies on clean data.
Policymakers should create regulations based on sector needs, not a one-size-fits-all approach. Generic rules regarding "AI systems" overlook the actual risk landscape. Financial markets require model-risk management, record-keeping, and post-trade auditing. Health care needs documentation of training groups, site-to-site testing, and commitments to real-world performance. Critical infrastructure needs stress tests for rare events. International organizations are starting to recognize these differences as they assess sectoral AI use in manufacturing, finance, and government; the most effective frameworks relate controls to impacts and error costs rather than vague capability labels. In short, regulate the interface between model errors and human welfare where it occurs.
The counterargument is familiar: if models improve continuously, won’t ChatGPT-like systems soon handle everything? Progress is real, and overlaps will occur. However, the immediate path to impact relies on aligning with sector needs. Even in areas with significant breakthroughs, improvements stem from tailoring architecture and data to the task. Weather AI’s success wasn’t about chatbots; it was about domain knowledge. Finance will eventually reach its own breakthroughs, but they will involve effective risk management and improved hedging, rather than a general model extracting alpha from random noise. Since incentives vary by sector, the adoption curve will also differ. The right approach is not to wait for a single, universal solution; it’s to build according to the unique needs of each domain now.
The 13.5% overall adoption figure against nearly half in information and communications was never just a number. It shows how sector-specific AI spreads. Where data are organized, outcomes are clear, and metrics are relevant, AI advances quickly; where data are messy, outcomes are uneven, and metrics are flawed, it stalls or fails. Finance is the cautionary case: its time series are complex, rich in feedback, and resistant to pattern-finding that lacks a foundation in risk economics. The takeaway isn't to lower expectations. It's to teach, invest, build, and regulate as if AI encompasses many things—because it does. If educators train builders who start from the data-generating process, administrators buy against sector metrics, and policymakers regulate where errors touch people and capital, we will get the benefits along with the necessary safeguards. That is the route out of the "AI = ChatGPT" trap and toward sector-specific AI grounded in the needs of each field.
The views expressed in this article are those of the author(s) and do not necessarily reflect the official position of the Swiss Institute of Artificial Intelligence (SIAI) or its affiliates.
References
DeepMind (2023). GraphCast: AI model for faster and more accurate global weather forecasting. DeepMind blog, November 14, 2023; see also Lam, R., et al. (2023). "Learning skillful medium-range global weather forecasting." Science, 382(6673), eadi2336.
Eurostat (2025). Use of artificial intelligence in enterprises—Statistics Explained (data extracted January 2025). European Commission.
McKinsey & Company (2024). The state of AI in early 2024: Gen AI adoption. McKinsey Global Survey.
McKinsey & Company (2025). The State of AI: Global survey. March 12, 2025.
OECD (2025). "How do different sectors engage with AI?" OECD Blog, February 13, 2025.
OECD (2025). Governing with Artificial Intelligence: The State of Play and Way Forward in Core Government Functions (media advisory and report). September 18, 2025.
Office for National Statistics, UK (2024). Business insights and impact on the UK economy. October 3, 2024.
Office for National Statistics, UK (2025). Management practices and the adoption of technology and artificial intelligence in UK firms, 2023. March 24, 2025.
Tan, M., Merrill, M. A., Gupta, V., Althoff, T., & Hartvigsen, T. (2024). "Are Language Models Actually Useful for Time Series Forecasting?" NeurIPS 2024 (conference paper and arXiv).
Xu, D. J., & Kim, D. J. (2025). "Modeling Stylized Facts in FX Markets with FINGAN-BiLSTM." Entropy, 27(6), 635.
Yan, Z., et al. (2025). "Evaluation of precipitation forecasting based on GraphCast." Scientific Reports (Nature).
Domain-Specific AI Is the Safer Bet for Classrooms and Markets
Ethan McGowan
General AI predicts probabilities, not context-specific safety
Domain-specific AI fits the task and lowers risk in classrooms and markets
Use ISO 42001, NIST RMF, and the EU AI Act, and test on domain benchmarks
Reported AI incidents reached 233 in 2024, a 56% increase over the previous year. The sharp rise tracks real-world failures as generative systems become part of everyday tools, and the public has noticed. By September 2025, half of U.S. adults said they were more concerned than excited about AI in daily life, with only 10% more excited than worried. Behind that sentiment is a mismatch that cannot be ignored: general-purpose models are trained to predict the next token, not to meet safety standards that differ by context. They operate on probability, not purpose. As usage shifts from chat interfaces to classrooms and financial settings, probability alone is an inadequate measure of safety. The answer is domain-specific AI: systems designed for a narrow task, with safety measures matched to that task's stakes.
The case for domain-specific AI
The main issue with the push for greater generality is that it misaligns model learning with safety requirements. Models learn from training data trends, while safety is situational. What is acceptable in an open forum may be harmful in a school feedback tool or a trading assistant. As regulators outline risk tiers, this misalignment becomes both a compliance and ethical concern. The European Union’s AI Act, which took effect on August 1, 2024, categorizes AI used for educational decisions as high-risk, imposing stricter obligations and documentation requirements. General-purpose models also fall under a specific category with unique responsibilities. In short, the law reflects reality: risk depends on usage, not hype. Domain-specific AI acknowledges this reality. It narrows inputs and outputs, aligns evaluations with workflow, and establishes error budgets that correspond to the potential harm involved.
General benchmarks support the same idea. When researchers adapted the traditional MMLU exam into MMLU-Pro, top models experienced a 16-33% drop in accuracy. Performance on today’s leaderboards tends to falter when faced with real-life scenarios. This serves as a warning against unscoped deployment. Meanwhile, government labs now publish pre-deployment safety evaluations because jailbreak vulnerabilities still occur. The UK’s AI Safety Institute has outlined methods for measuring attack success and correctness; its May 2024 update bluntly addresses models’ dangerous capabilities and the need for contextual testing. If failure modes vary by context, safety must also be context-dependent. Domain-specific AI enables this possibility.
Figure 1: Domain-specific tuning reduces error rates in responses involving under-represented groups, showing safer alignment than general fine-tuning.
Evidence from incidents, benchmarks, and data drift
The incident curve is steep. The Stanford AI Index 2025 counted 233 incidents in 2024, the highest number on record, and it is a system-wide measure rather than a reaction to media coverage. Meanwhile, the proportion of "restricted" tokens in common web data rose from about 5-7% in 2019 to 20-33% by 2023, altering both what models ingest and how they behave. As the training mix shifts, the likelihood of general models misfiring in edge cases grows. Safety cannot be a one-off exercise; it must be a continuous practice tied to specific domains, with ongoing tests that reflect real tasks.
Safety evaluations confirm this trend. New assessment frameworks like HELM-Safety and AIR-Bench 2024 reveal that measured safety depends heavily on which harms are tested and how prompts are structured. The UK AISI's methodology likewise reports attack success rates and underscores that risk is contingent on deployment context rather than on model capability alone. The conclusion is clear: a general score does not guarantee safety for a specific classroom workflow, exam proctor, or FX-desk summary bot. Domain-specific AI allows us to select relevant benchmarks, limit the input space, and establish refusal criteria that reflect the stakes involved.
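A minimal sketch of what a domain-scoped safety check might look like, assuming a list of reviewed adversarial test cases (`responses`) rather than any specific AISI or HELM tooling; the refusal markers and field names are illustrative assumptions.

```python
# Score a model's replies to adversarial prompts against a *domain* policy,
# reporting attack success and refusal rates.
# `responses` is an assumed list of dicts with keys "prompt", "reply",
# and "violates_policy" (labelled by a reviewer or a separate classifier).
REFUSAL_MARKERS = ("i can't help with", "i cannot assist", "outside the scope")

def safety_report(responses: list[dict]) -> dict:
    refusals = sum(any(m in r["reply"].lower() for m in REFUSAL_MARKERS)
                   for r in responses)
    successes = sum(bool(r["violates_policy"]) for r in responses)
    n = len(responses)
    return {
        "attack_success_rate": successes / n,   # harmful content got through
        "refusal_rate": refusals / n,           # model declined, as the policy requires
        "n_prompts": n,
    }
```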
Data presents another limitation. As access to high-quality text decreases, developers increasingly rely on synthetic data and smaller, curated datasets. This raises the risk of overfitting and makes distribution drift more likely. Therefore, targeted curation gains importance. A model for history essay feedback should include exemplars of rubrics, standard errors, and grade-level writing—not millions of random web pages. A model for news reading in FX requires calendars, policy documents, and specific press releases, rather than a general assortment of internet text. In both cases, domain-specific AI addresses the data issue by defining what “good coverage” means.
What domain-specific AI changes in education and finance
In education, the risks are personal and concrete. AI tools that determine student access or evaluate performance fall into high-risk categories in the EU. This requires rigorous documentation, monitoring, and human oversight. It also calls for choosing the right design for the task. A model that accepts rubric-aligned inputs, generates structured feedback, and cites authoritative sources is easier to audit and to keep within an error budget than a general chat model that can veer off-topic. NIST's AI Risk Management Framework and its 2024 Generative AI profile provide practical guidance: govern, map, measure, manage—applied to specific use cases. Schools can use these tools to define what "good" means for each need, such as formative feedback, plagiarism checks, or support accommodations, and to determine when AI should not be applied at all.
In finance, the most effective approaches are focused and testable. The best outcomes currently emerge from news-to-signal pipelines rather than general conversation agents. A study by the ECB this summer found that using a large model to analyze two pages of commentary in PMI releases significantly enhanced GDP forecasts. Academic and industry research shows similar improvements when models extract structured signals from curated news instead of trying to act like investors. One EMNLP 2024 study showed that fine-tuning LLMs on newsflow yielded return signals that surpassed conventional sentiment scores in out-of-sample tests. A 2024 EUR/USD study combined LLM-derived text features with market data, reducing MAE by 10.7% and RMSE by 9.6% compared to the best existing baseline. This demonstrates domain-specific AI in action: focused inputs, clear targets, and straightforward validation.
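The pipeline pattern these studies describe can be sketched in a few lines: a text-derived score enters as one feature alongside market variables, and the result is judged out of sample against a naive baseline. The DataFrame `data` and its column names below are assumptions for illustration, not the studies' actual setups.

```python
# News-to-signal sketch: a text-derived score is just one more column next to
# market features, and the whole thing is validated out of sample on an error metric.
# `data` and the column names ("news_score", "ret_lag1", "ret_lag5", "target")
# are assumed placeholders.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_absolute_error

split = int(len(data) * 0.8)                       # chronological split, no shuffling
X_cols = ["news_score", "ret_lag1", "ret_lag5"]
train, test = data.iloc[:split], data.iloc[split:]

model = Ridge(alpha=1.0).fit(train[X_cols], train["target"])
pred = model.predict(test[X_cols])

baseline = np.zeros(len(test))                     # naive "no change" forecast
print("MAE model:   ", mean_absolute_error(test["target"], pred))
print("MAE baseline:", mean_absolute_error(test["target"], baseline))
```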
The governance layer must align with this technical focus. ISO/IEC 42001:2023, the first AI management system standard, provides organizations with a way to integrate safety into daily operations: defining roles, establishing controls, implementing monitoring, and improving processes. Combining this with the EU AI Act’s risk tiers and NIST’s RMF creates a coherent protocol for schools, ministries, and finance firms. Start small. Measure what truly matters. Prove it. Domain-specific AI is not a retreat from innovation; it is how innovation thrives amid real-world challenges.
Answering the pushback—and governing the shift
Critics may argue that “narrow” models will lag behind general models that continue to scale. However, the issue isn’t about measuring intelligence; it’s about being fit for purpose. When researchers modify tests or change prompts, general models struggle. MMLU-Pro’s 16-33 point decline illustrates that today’s apparent mastery could collapse under shifting distributions. General models remain prone to jailbreak vulnerabilities, so safety teams continue to publish defenses because clever attacks still work. The UK AI Safety Institute’s methodologies and the follow-up reports from labs and security firms emphasize one need: we must evaluate safety against the specific hazards of each task. Domain-specific AI achieves this inherently.
Figure 2: User evaluations show that domain-specific dialogue systems achieve higher persuasiveness and competence with no increase in discomfort.
Cost is another concern: building many smaller, focused systems can look duplicative. In practice, effective stacks combine both. Use a general model for drafting or retrieval, then run its outputs through a policy engine that checks rubric compliance, data lineage, and refusal rules. ISO 42001 and NIST RMF guide teams through this process by documenting what data was used, what tests were run, and how failures are handled. The EU AI Act rewards such design with clearer paths to compliance, particularly for high-risk educational applications and for general models used in regulated environments. The lesson from recent evaluations is plain: governance must live where the work happens, and that is cheaper than incident response, reputational damage, and rework after audits.
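A minimal sketch of such a policy gate for a classroom feedback tool follows; the required rubric fields and checks are illustrative assumptions, not requirements of ISO 42001, NIST RMF, or the AI Act.

```python
# The "general model + policy engine" stack: the draft can come from any general
# model; release is decided by domain checks that fail closed.
# REQUIRED_RUBRIC_FIELDS, the PII pattern, and `model_draft` are assumptions.
import re

REQUIRED_RUBRIC_FIELDS = ("strengths", "areas to improve", "next step")
PII_PATTERN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")     # e.g. US SSN-like strings

def policy_gate(draft: str) -> tuple[bool, list[str]]:
    """Return (release_ok, reasons). Any failed check blocks release."""
    reasons = []
    lowered = draft.lower()
    for field in REQUIRED_RUBRIC_FIELDS:
        if field not in lowered:
            reasons.append(f"missing rubric section: {field}")
    if PII_PATTERN.search(draft):
        reasons.append("possible personal identifier in output")
    if "final grade" in lowered:
        reasons.append("grading decision reserved for the teacher")
    return (len(reasons) == 0, reasons)

ok, reasons = policy_gate(model_draft)   # `model_draft` comes from the general model
```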
A final objection is that specialization could limit creativity. Evidence from the finance sector suggests otherwise. The most reliable improvements currently come from models that analyze specific news to generate precise signals, validated against clear benchmarks. The ECB example demonstrated how small amounts of focused text improved projections, while the EUR/USD study outperformed baselines using task-specific features. Industry research indicates that fine-tuned newsflows perform better than generic sentiment analysis. None of these systems “thinks like a fund manager.” They excel at one function, making them easier to evaluate when they falter. In education, the parallel is clear: assist teachers in providing rubric-based feedback, highlight patterns of misunderstanding, and reserve critical judgment for humans. This approach keeps the helpful tool and mitigates harm.
The evidence is compelling. Incidents rose to 233 in 2024, public trust is fragile, and stricter benchmarks reveal brittle performance. The solution isn't abstract "alignment" with universal values implemented in ever-larger models. The remedy is to connect capability with context. Domain-specific AI narrows inputs and outputs, uses curated data, and demonstrates effectiveness through relevant tests. It establishes concrete governance through ISO 42001 and NIST RMF. It aligns with the EU AI Act's risk-driven framework. Its value is already visible in lesson feedback and news-based macro signals. The call to action is straightforward. Schools and ministries should set domain-specific error budgets, adopt risk frameworks, and choose systems that prove their reliability in domain tests before they interact with students. Financial firms should confine assistants to information extraction and scoring, not decision-making, and hold them to measurable performance standards. We can continue to chase broad applications and hope for safety, or we can build domain-specific AI that meets the standards our classrooms and markets require.
The views expressed in this article are those of the author(s) and do not necessarily reflect the official position of the Swiss Institute of Artificial Intelligence (SIAI) or its affiliates.
References
Ada Lovelace Institute. (2024). Under the radar? London: Ada Lovelace Institute.
Carriero, A., Clark, T., & Pettenuzzo, D. (2024). Macroeconomic Forecasting with Large Language Models (slides). Boston College.
Ding, H., Zhao, X., Jiang, Z., Abdullah, S. N., & Dewi, D. A. (2024). EUR/USD Exchange Rate Forecasting Based on Information Fusion with LLMs and Deep Learning. arXiv:2408.13214.
European Commission. (2024). EU AI Act enters into force. Brussels.
Guo, T., & Hauptmann, E. (2024). Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow. EMNLP Industry Track. ACL Anthology.
ISO. (2023). ISO/IEC 42001:2023—Artificial intelligence management system. Geneva: International Organization for Standardization.
NIST. (2023). Artificial Intelligence Risk Management Framework (AI RMF 1.0). Gaithersburg, MD.
NIST. (2024). AI RMF: Generative AI Profile (NIST AI 600-1). Gaithersburg, MD.
RAND Corporation. (2024). Risk-Based AI Regulation: A Primer on the EU AI Act. Santa Monica, CA.
Reuters. (2025, June 26). ECB economists improve GDP forecasting with ChatGPT.
Stanford CRFM. (2024). HELM-Safety. Stanford University.
Stanford HAI. (2025). AI Index Report 2025—Responsible AI. Stanford University.
UK AI Safety Institute (AISI). (2024). Approach to evaluations & Advanced AI evaluations—May update. London: DSIT & AISI.
Wang, Y., Ma, X., Zhang, G., et al. (2024). MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark. NeurIPS Datasets & Benchmarks (Spotlight).
David O’Neill is a Professor of Finance and Data Analytics at the Gordon School of Business, SIAI. A Swiss-based researcher, his work explores the intersection of quantitative finance, AI, and educational innovation, particularly in designing executive-level curricula for AI-driven investment strategy. In addition to teaching, he manages the operational and financial oversight of SIAI’s education programs in Europe, contributing to the institute’s broader initiatives in hedge fund research and emerging market financial systems.
AI is making labor borderless as online services surge
Opportunity expands, but standards, audits, and broadband are crucial
Schools must teach task-first skills, platform literacy, and safeguards
The fastest-growing part of global trade is undergoing a profound transformation. In 2023, digitally deliverable services—work sent and received through networks—reached about $4.5 trillion. For the first time, developing economies surpassed $1 trillion in exports. This accounts for more than half of all services trade. Borders still exist, but for more tasks, they function like dimmer switches rather than walls. This is the world of borderless AI labor, a transformative force where millions of people annotate data, moderate content, build software, translate, advise, and teach from anywhere. The rise of large language models and affordable coordination tools has accelerated this shift. Employers can now swiftly create distributed teams, manage them with algorithmic systems, and pay them across time zones. The key question is not whether jobs will shift, but who sets the rules for that shift and how schools and governments prepare people to succeed in it.
The borderless AI labor market is here
The numbers illustrate the trend. In a 2025 survey of employers, 40-41% of firms indicated they would reduce staff where AI automates tasks. At the same time, they are reorienting their business models around AI and hiring for new skills. This change does not spell disaster. By 2030, the best estimate is a net gain of 78 million jobs globally, with 170 million created and 92 million displaced. Put another way: tasks are moving, talent strategies are changing, and opportunities are opening, often across borders. The borderless AI labor market is not just a disruption; it is an opening for workers and firms prepared to use it.
Figure 1: Global digital services hit $4.5T in 2023; developing economies passed $1T, confirming the rise of borderless AI labor.
Hiring practices are already reflecting this shift. Data from platforms show that remote and cross-border hiring increased through 2024. Over 80% of roles on one global platform were classified as remote, and cross-border hiring rose by about 42% year-over-year. In the United States alone, 64 million people—or 38% of the workforce—freelanced in 2023, many serving clients they never met in person. These trends, coupled with the surge in digitally delivered services, confirm that borderless AI labor has scale and momentum.
Winners and risks in borderless AI labor
The benefits are significant. For the 2 billion people in informal work—over 60% of workers in much of the Global South—AI-enabled matching, translation, and reputation tools can make skills legible to employers and open paths to higher-value tasks. This is the inclusive promise of borderless AI labor, where visibility and trust are essential in global services. When communication, payments, and rating systems travel with the worker, location matters less and demonstrated performance matters more.
However, the risks are real and immediate. The global supply chain for AI, which relies on data labelers and content moderators in cities like Nairobi, Manila, Accra, Bogotá, and others, is a key part of this issue. Over the past two years, research and litigation have highlighted issues like psychological harm, low pay, and unclear practices within this ecosystem. New projects by policy groups are beginning to show how moderation systems can fail in non-English contexts and low-resource languages. The solution is not to withdraw from global work but to set enforceable standards for wages, safety, and mental health support wherever moderation and annotation take place. While automation can filter some content, it cannot replace the need for decent labor conditions.
A subtler risk comes from algorithmic management itself. This refers to the use of AI tools to assign tasks, monitor productivity, and evaluate workers at scale. Recent surveys in advanced economies reveal widespread use of these systems and varied governance. Many companies have policies, but many workers still express concerns about transparency and fairness. In cross-border teams, power imbalances can worsen if ratings and algorithm-based decision-making are unaccountable. The solution is not to ban these tools; it is to make them auditable and explainable, providing clear recourse for workers when mistakes happen.
There is also a broader change. As tasks move online, the demand for office space is shifting unevenly. In the U.S., office vacancies reached around 20% by late 2024, the highest level in decades, with distress in the sector nearing $52 billion. This does not mean cities will die; rather, it indicates a shift away from low-performing spaces toward digital infrastructure and talent. Employers will continue to focus on time zones, skills, and cost—rather than postal codes. Borderless AI labor will keep driving this change.
Rules for a fair marketplace in borderless AI labor
First, establish portable labor standards for digital work. Governments that buy AI services should require contractors—and their subcontracted platforms—to meet basic conditions, including living-wage floors based on purchasing power, mandated rest periods, and access to mental health support for high-risk jobs like content moderation. International cooperation can help align these standards with existing due diligence laws. A credible standard would connect buyers in cities like New York or Berlin to the outcomes for workers in places like Nairobi or Cebu.
Second, include algorithmic management in policy discussions. Regulators should require companies that use automated assignment, monitoring, or evaluation to disclose such practices, document their effects, and provide appeals processes. Evidence shows that companies already report governance measures, but their implementation often outpaces oversight. Clear rules, alongside audits by independent organizations, can reduce bias, limit intrusive monitoring, and protect the integrity of cross-border teams. Where national laws do not apply, public and institutional buyers can enforce contract obligations.
Third, invest in connectivity. An estimated 3 billion people still lack internet access. A rigorous estimate from 2023 places the global broadband investment needed for near-universal access at about $418 billion, mostly in emerging markets. This investment is essential infrastructure for borderless AI labor. Prioritize open-access fiber, shared 4G/5G in rural areas, and community networks. Public funds should promote open competition and fair-use rules so smaller firms and schools can benefit.
Fourth, speed up cross-border payments and credentials. Workers need quick and low-cost methods to get paid, demonstrate their skills, and maintain verified work histories. Trade organizations are already tracking a surge in digital services. The next step is mutual recognition of micro-credentials and compatible identity standards. Reliable certificates and verifiable portfolios allow employers to assess workers based on evidence, not stereotypes—an essential principle in borderless AI labor.
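As a rough illustration of what a verifiable micro-credential could look like, the sketch below signs and checks a credential payload with a shared key. Every field and the key handling here are hypothetical; a production system would use public-key signatures and an established credential standard.

```python
# Minimal sketch of a verifiable micro-credential: the issuer signs a canonical
# payload so an employer holding the verification key can check it offline.
# Shared-key HMAC is used only for brevity; real systems would use public-key
# signatures. All fields and the key are illustrative assumptions.
import hashlib, hmac, json

ISSUER_KEY = b"issuer-secret-key"        # in practice: a managed signing key

def issue(credential: dict) -> dict:
    payload = json.dumps(credential, sort_keys=True).encode()
    signature = hmac.new(ISSUER_KEY, payload, hashlib.sha256).hexdigest()
    return {"credential": credential, "signature": signature}

def verify(signed: dict) -> bool:
    payload = json.dumps(signed["credential"], sort_keys=True).encode()
    expected = hmac.new(ISSUER_KEY, payload, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, signed["signature"])

cert = issue({"holder": "learner-123", "skill": "data-cleaning", "level": "intermediate"})
assert verify(cert)
```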
Preparing schools for borderless AI labor
Education systems must shift from a geography-focused model to a task-focused model. Start with language-aware and AI-aware curricula. Students should learn how to use model-assisted tools for translation, summarization, coding, and data cleaning. They also need to judge outputs, cite sources, and respect privacy. Employer surveys indicate substantial investment in AI skills and ongoing job redesign. Schools should reflect this reality with projects that involve actual work for external clients and peers in different time zones, building the trust that borderless AI labor values.
Figure 2: Freelancing is mainstream: 64M Americans—38% of the workforce—freelanced in 2023.
Next, teach platform literacy and how to promote portfolios. Many graduates will have careers that combine organizational roles with platform work. They must understand service-level agreements, dispute processes, rating systems, and the basics of tax compliance across borders. Capstone projects should culminate in public portfolios featuring versioned code, reproducible analyses, and evidence of collaboration across cultures and time zones. Credentialing should be modular and stackable, so a learner in Accra can combine a local certificate with a global micro-credential recognized by the same employer.
Finally, prioritize well-being and ethics. The data work behind AI can be demanding. Students who choose annotation, moderation, or risk analysis need training to handle exposure to harmful content and find pathways to safer, higher-value roles. Programs should normalize mental health support, rotate students out of traumatic tasks, and teach how to create safer processes—such as automated filtering, triage, and escalation—to reduce harm. The goal is not to deter students but to empower them and provide safeguards in a market that will continue to grow.
The world has already crossed a significant threshold. When digitally deliverable services surpass $4.5 trillion and developing economies exceed $1 trillion in exports, we aren’t discussing a future trend; we’re managing a current reality. Borderless AI labor is not just a concept; it involves a daily flow of tasks, talent, and trust—organized by algorithms, traded in global markets, and facilitated over networks. We can let this system develop by default, with weak protections, inadequate training, and increasing disparities. Or we can take action: establish portable standards for digital work, regulate algorithmic management, invest in connectivity, and teach for a task-driven environment. If we choose the latter, the potential is genuine. Millions who have been invisible to global employers can gain recognition, verifiable reputations, and access to safer, better-paying jobs. This is a policy decision—one that schools, ministries, and buyers can act on now, while the new market is still taking shape.
The views expressed in this article are those of the author(s) and do not necessarily reflect the official position of the Swiss Institute of Artificial Intelligence (SIAI) or its affiliates.
References
Amnesty International. (2025, April 3). Kenya: Meta can be sued in Kenya for role in Ethiopia conflict.
Brookings Institution. (2025, October 7). Reimagining the future of data and AI labor in the Global South.
Center for Democracy & Technology (CDT). (2024, January 30). Investigating content moderation systems in the Global South.
Deel. (2025, April 13). Global hiring trends show cross-border growth and 82% remote roles.
MSCI Real Assets. (2025, July 15). The office-market recovery is here, just not everywhere.
Moody's CRE. (2025, January 3). Q4 2024 preliminary trend announcement: Office vacancy hits 20.4%.
OECD. (2024, March). Using AI in the workplace.
OECD. (2025, February 6). Algorithmic management in the workplace.
Oughton, E., Amaglobeli, D., & Moszoro, M. (2023). What would it cost to connect the unconnected? arXiv.
United Nations Conference on Trade and Development (UNCTAD), via WAM. (2024, December 7). Developing economies surpass $1 trillion in digitally deliverable services; global total $4.5 trillion.
Upwork. (2023, December 12). Freelance Forward 2023.
World Economic Forum. (2025, January 7). Future of Jobs Report 2025 (press release and digest).
World Economic Forum. (2025, May 13). How AI is reshaping the future of informal work in the Global South.
World Trade Organization. (2025). Digitally Delivered Services Trade Dataset.