Tackling Contextual Bias in AI Models for Online Safety
About the Project
Background
Artificial Intelligence (AI) systems are increasingly used to detect and moderate online harms such as cyberbullying, misinformation, and disinformation. However, many current models suffer from contextual bias: they misinterpret content when it is removed from its social, cultural, or multimodal context. For example, AI models may:
- Misclassify sarcasm or satire as harmful misinformation.
- Fail to recognise subtle, repeated bullying when viewing a single message in isolation.
- Fail to understand how different elements, such as text and imagery, interact to convey meaning.
These limitations not only reduce accuracy but also raise ethical concerns about fairness, inclusivity, and explainability. Developing context-aware AI systems is therefore critical for safeguarding digital platforms, ensuring trustworthy moderation, and protecting vulnerable users.
Research Aims
This PhD project aims to investigate and mitigate contextual bias in AI models used for detecting online harms. The research will focus on:
- Defining and operationalising contextual bias in the domain of online harms (linguistic, multimodal, cultural, and platform-specific).
- Developing context-aware AI methods that incorporate social, emotional, and interactional cues beyond surface-level content.
- Evaluating fairness and explainability in online harm detection systems, ensuring robustness across platforms and populations.
- Applying the methods to real-world challenges, with a primary focus on cyberbullying detection and misinformation/disinformation identification.
Unlike approaches that focus solely on technical performance, this work prioritises safety, inclusion, and accountability: it will design AI systems that explain their decisions in ways people can understand, reduce unfair impacts on diverse communities, and support responsible use in real-world settings.