Migration Sentiment, Digital Technology and AI

Rationale

Public attitudes towards migration shape electoral outcomes, policy design, integration trajectories and social cohesion. Yet, conventional instruments used to measure such attitudes, particularly surveys, are temporally coarse, spatially aggregated and slow to update. They are poorly suited to capture rapid opinion shifts during crises such as pandemics, refugee inflows or political campaigns.

At the same time, digital platforms generate vast volumes of user-produced text that encode real-time expressions of political opinion. Advances in natural language processing and machine learning make it possible to extract structured measures of migration-related sentiment from these data at unprecedented temporal and spatial resolution. However, digital trace data are not neutral reflections of public opinion. They are shaped by demographic selection, platform affordances, algorithmic amplification and linguistic bias. In parallel, artificial intelligence is increasingly embedded within migration governance systems including risk scoring, border technologies and automated decision systems. This creates a dual analytical challenge: (i) how to measure migration sentiment rigorously using AI while correcting for bias and (ii) how to understand the feedback effects between digital technologies and public attitudes.

My research addresses this dual challenge through an integrated programme combining computational social science, geographic data science and AI system development in collaboration with international partners including the International Organization for Migration (IOM-UN Migration).

Overall research programme

My work on migration sentiment and digital technology has developed across four interlinked strands:

  1. Measuring migration sentiment using social media based on migration theory
  2. Structural and network dynamics of anti-immigration discourse
  3. Policy translation and ethical AI guidance with IOM-UN Migration
  4. Multilingual large language model (LLM) classification at planetary scale

Together, these strands move from feasibility demonstration to scalable AI systems capable of analysing billions of multilingual posts.

1. Measuring and understanding migration sentiment through social media data

Early work established the feasibility of measuring migration sentiment using large-scale social media data. In Rowe et al. (2021), I analysed over 30 million tweets across five countries to examine immigration sentiment during the early stages of the COVID-19 pandemic. The study demonstrated that migration-related discourse responds to epidemiological shocks and political developments and that sentiment content varies across national contexts. This provided empirical evidence that digital trace data can detect rapid attitudinal shifts that conventional surveys cannot capture in real time.

Building on social psychology theory, we (Freire-Vidal, Graells-Garrido and Rowe, 2021) developed a theory-grounded framework to classify attitudes towards immigration using Twitter data. Drawing on Intergroup Contact Theory and Integrated Threat Theory, we operationalised empathy and threat as measurable stance categories. The framework combined seed-based labelling, machine learning classification, network interactions and psycholinguistic profiling to identify stance groups and characterise their emotional and semantic structure. This work moved beyond simple sentiment polarity to capture theoretically meaningful attitude dimensions.

I subsequently extended this research spatially with my PhD student, Matt Mason (preprint). We analysed nearly a decade of data to demonstrate persistent sub-national variation in migration sentiment across Great Britain. This research showed that attitudes towards migration are embedded within local socio-economic and political contexts. It established that digital sentiment measures can be geographically disaggregated to reveal enduring spatial inequalities. We also demonstrated that Twitter-based migration sentiment produces long-term trends comparable to survey-based data.

2. Understanding online anti-immigration networks

Beyond aggregate sentiment trends, with my PhD student Andrea Nasuto, I examined the structural diffusion of anti-immigration discourse (Nasuto and Rowe, 2024). We analysed over 220,000 UK tweets to investigate how anti-immigration narratives spread online. We demonstrated that although anti-immigration users constitute a numerical minority, they form dense and cohesive network structures that enable disproportionate amplification of hostile narratives. This finding challenges the assumption that volume alone drives visibility and highlights the importance of network topology in shaping discourse. This work contributed to understanding how digital infrastructures interact with political polarisation and how small but organised communities can exert outsized influence in online migration debates.

3. Policy translation and ethical AI with IOM-UN Migration

In collaboration with IOM-UN Migration, I translated computational methods into practitioner-oriented guidance. This work contributed to the IOM handbook Harnessing Data Innovation for Migration Policy: A Handbook for Practitioners (Rowe et al., 2021). The handbook provides operational guidance on using social media and digital trace data for migration policy while emphasising transparency, bias awareness and ethical safeguards. This collaboration bridged methodological innovation and institutional practice, embedding digital sentiment analysis within real-world migration governance discussions. It positioned digital data not as a replacement for official statistics but as a complementary monitoring tool when used with appropriate safeguards.

4. Multilingual large language models (LLMs) and scalable AI systems

Recent work extends my agenda to multilingual, AI-native classification at planetary scale. With colleagues at Harvard University and Andrea Nasuto (Nasuto, Iacus, Rowe and Jain, 2025), we developed a lightweight open-source LLaMA 3.2-3B fine-tuned framework to classify immigration-related discourse across 13 languages. Unlike prior work relying on BERT-style encoders or translation pipelines, we jointly modelled topic detection and nuanced stance classification (pro, anti, neutral). We demonstrate three key findings: (1) fine-tuning in one or two languages enables reliable cross-lingual transfer for identifying whether content is about immigration; (2) capturing nuanced ideological stance benefits from multilingual exposure during fine-tuning; and (3) incorporating even small volumes of low-resource language data mitigates English-centric pretraining bias and improves classification accuracy.

By using 4-bit quantisation and LoRA fine-tuning, we achieved 26-168x faster inference and orders-of-magnitude cost reductions relative to proprietary LLMs. This makes it feasible to analyse billions of tweets in near-real time using open, reproducible infrastructure. The framework provides a scalable alternative to commercial API-dependent systems and advances theory on cross-lingual representation in LLMs.

Addressing important gaps

Across these strands, my work has sought to address three persistent gaps in the literature:

  1. Representativeness and bias. The idea has been to move beyond descriptive sentiment analysis to theory-informed classification and explicit consideration of platform and sampling bias.
  2. Spatial and network structure. Our core intent has been to integrate geographic and network perspectives to understand local inequalities and amplification dynamics.
  3. Scalable multilingual AI. We sought to demonstrate that open-source LLMs can generalise migration discourse classification across languages without reliance on costly translation or proprietary systems.

Contributions

We have aimed to make substantive and innovative contributions to the literature by establishing a coherent framework for studying migration sentiment in the digital age. We have provided evidence that digital trace data can provide high-frequency, geographically disaggregated measures of migration attitudes. We showed that theoretical constructs such as empathy and threat can be operationalised computationally. We evidenced that network structure shapes the amplification of anti-immigration narratives. We demonstrated that open-source multilingual LLMs can classify migration discourse at global scale in a cost-efficient and reproducible manner. Collectively, our research has contributed to moving work on migration sentiment analysis from exploratory computational experiments to bias-aware, theoretically grounded and institutionally relevant AI systems capable of informing policy debates in real time.

References

Freire-Vidal, Y., Graells-Garrido, E., & Rowe, F. (2021). A framework to understand attitudes towards immigration through Twitter. Applied Sciences, 11(20), 9689. https://doi.org/10.3390/app11209689

Rowe, F., Mahony, M., Sievers, N., Rango, M., & Graells-Garrido, E. (2021). Sentiment towards migration during COVID-19: What Twitter data can tell us. OSF Preprints. https://doi.org/10.31219/osf.io/sf7u4

Nasuto, A., & Rowe, F. (2024). Understanding anti-immigration sentiment spreading on Twitter. PLOS ONE. https://doi.org/10.1371/journal.pone.0307917

Nasuto, A., Iacus, S., Rowe, F., & Jain, S. (2025). Learning the topic, not the language: How LLMs classify online immigration discourse across languages. arXiv (Cornell University). https://doi.org/10.48550/arXiv.2508.06435

Rowe, F., Mahony, M., Graells-Garrido, E., Rango, M., & Sievers, N. (2021). Using Twitter to track immigration sentiment during early stages of the COVID-19 pandemic. Data & Policy, 3, E36. https://doi.org/10.1017/dap.2021.38

Rowe, F., et al. (2021). Harnessing Data Innovation for Migration Policy: A Handbook for Practitioners. International Organization for Migration. https://publications.iom.int/books/harnessing-data-innovation-migration-policy-handbook-practitioners