
Prf Harith Alani
Director Kmi
Biography
Professional biography
Professor of Web Science and leader of the Social Data Science group at the Knowledge Media institute, The Open University. His work is mainly focused on applying data science methods and social media analytics to better model, understand and track various social phenomena on the web. Currently, Prof Alani is a Principal Investigator on multiple multimillion international projects, including HERoS; to study the dynamics of COVID19 related misinformation and fact-checks, Co-Inform; to analyse individuals' interaction with misinformation on social media, CIMPLE; to investigate knowledge-based explanations of AI misinformation detection techniques, and NoBias; a European Training Network on Bias in AI. He was also Coordinator of the €2M COMRADES and the €2M DecarboNet international R&D projects. Prof Alani published over 160 scientific papers, and the General Chair for the 2021 International Semantic Web Conference.
See homepage for further detail.
Research interests
social data science, computational social science, social media analysis, online behaviour analysis, web science, semantic web, online misinformation.
Projects
Countering Creative Information Manipulation with Explainable AI
“Explainability is of significant importance in the move towards trusted, responsible and ethical AI, yet remains in infancy. Most relevant efforts focus on the increased transparency of AI model design and training data, and on statistics-based interpretations of resulting decisions. The understandability of such explanations and their suitability to particular users and application domains received very little attention so far. Hence there is a need for an interdisciplinary and drastic evolution in XAI methods, to design more understandable, reconfigurable and personalisable explanations. Knowledge Graphs offer significant potential to better structure the core of AI models, and to use semantic representations when producing explanations for their decisions. By capturing the context and application domain in a granular manner, such graphs offer a much needed semantic layer that is currently missing from typical brute-force machine learning approaches. Human factors are key determinants of the success of relevant AI models. In some contexts, such as misinformation detection, existing XAI technical explainability methods do not suffice as the complexity of the domain and the variety of relevant social and psychological factors can heavily influence users’ trust in derived explanations. Past research has shown that presenting users with true / false credibility decisions is inadequate and ineffective, particularly when a black-box algorithm is used. To this end, CIMPLE aims to experiment with innovative social and knowledge-driven AI explanations, and to use computational creativity techniques to generate powerful, engaging, and easily and quicky understandable explanations of rather complex AI decisions and behaviour. These explanations will be tested in the domain of detection and tracking of manipulated information, taking into account social, psychological and technical explainability needs and requirements.”
Health Emergency Response in Interconnected Systems
The Corona-virus outbreak is continuing to spread. By beginning of February, the number of infected people surpasses 42,000 infections, and the death toll continues to rise. As authorities and responders are struggling to contain the spread, news about mass quarantine camps or shortages of personal protective equipment threaten the health systems globally, fueled by rumors and mis-information. The disruptions of (medical) supply chains, the lack of capacity to treat patients and the spread of rumours fuel an atmosphere of uncertainty and mistrust, hampering an effective response. While traditional models of disease outbreaks largely focus on infection rates, new methods are needed to integrate behaviour from the bottom up, and integrated in macro-level models to coordinate the response world-wide.
Climate Misinformation Surveillance with Multidimensional GIS
This research project aims to tackle the pressing issue of misinformation, especially concerning climate change, by leveraging the powers of GIS. Despite numerous efforts to combat misinformation, its prevalence continues to rise, influencing public perceptions and derailing climate policy debates and decisions. This proposal identifies and addresses several key challenges: the absence of effective tools for geospatial mapping and analyses of climate misinformation alongside climate data, the limited understanding of how climate misinformation spreads geographically, and the lack of effective tools to visualise the interplay of geospatial, temporal, and topical patterns of climate misinformation. To address these challenges, we propose the development of a Multidimensional Geographic Information System (GIS). This GIS will integrate climate data (temperature, precipitation, wind, carbon emission), climate misinformation data from fact-checkers and traditional and social media platforms, analyse its correlations with geographical and climate information, and enhance our predictive capabilities to counter the spread of misinformation effectively and proactively. The next three Conference Of the Parties (COP) will be used as use cases to train and evaluate our research and outcomes. This project, ClimateSense, aims to maximise the capabilities of GIS, enabling it to significantly influence societal understanding and policymaking in the face of climate change and misinformation.
COMRADES
The aim of this proposal is, firstly and fore-mostly, to create an open-source, community-driven resilience platform, designed by the communities, for the communities, to enable them to self-organise during crisis, to co-create and share knowledge and trustworthy advise, and to identify assess, and validate humanitarian needs at the community (macro) and citizen (micro) levels, to target their relief and recovery efforts more effectively and rapidly. Secondly, the project will foster social innovation during crises by developing and integrating automated methods for advanced processing and linking of crowdsourced information, and for safeguarding communities during critical scenarios from inaccurate, distrusted, and overhyped information, and for arming humanitarian communities with enriched, high quality, and actionable information. More specifically, the objectives are to: 1. Extract the socio-technical requirements for community resilience platforms by reviewing the role of technology during previous crises scenarios, and by engaging communities in participatory design and requirements gathering initiatives 2. Produce novel automated methods for identifying and semantically representing the geographical, temporal, and topic clusters of citizen emergency events, and the community network broadcasting these events. 3. Design and develop programs and algorithms for measuring the trustworthiness, informativeness, and veracity of information in multiple languages, gathered from distributed social data sources and communities during crisis 4. Integrate and release project output with Ushahidi;7 a popular open-source platform for crisis situations. The platform will extensible, and grounded on open data, open source, and open hardware. 5. Train and evaluate project tools and algorithms with real communities and in live events, as well as with over 10 million multilingual historical social media messages collected for nearly 30 crises including floods, earthquakes, terrorist attacks, hurricanes, and wildfires. The proposal aims to target the objectives above with an interdisciplinary consortium of computer scientists, social scientists, and humanitarian research organisations, through user driven designs and pilots, and involving several existing communities.
Publications
Book
The Semantic Web – ISWC 2021 (2021)
Proceedings of the 5th Annual ACM Web Science Conference, 2013 Paris, France, WebSci ‘13 (2013)
Book Chapter
Artificial Intelligence and Online Extremism: Challenges and Opportunities (2021)
Supporting Policy-Makers with Social Media Analysis Tools to Get Aware of Citizens’ Opinions (2014)
Features for killer apps from a semantic web perspective (2008)
Geographical Terminology Servers - Closing the Semantic Divide (2003)
Digital Artefact
Journal Article
New Frontiers in Fighting Misinformation (2025)
Exploring the impact of automated correction of misinformation in social media (2024)
Semantic Web technologies and bias in artificial intelligence: A systematic literature review (2023)
Mediating learning with learning analytics technology: guidelines for practice (2022)
Bias in data-driven artificial intelligence systems - An introductory survey (2020)
Relevancy Identification Across Languages and Crisis Types (2019)
Radicalisation Influence in Social Media (2019)
Pro-Environmental Campaigns via Social Media: Analysing Awareness and Behaviour Patterns (2017)
Sentiment Lexicon Adaptation with Context and Semantics for the Social Web (2017)
Contextual semantics for sentiment analysis of Twitter (2016)
Detecting Important Life Events on Twitter Using Frequent Semantic and Syntactic Subgraphs (2016)
Semantic Topic Compass – Classification Based on Unsupervised Feature Ambiguity Gradation (2016)
Modelling and analysis of user behaviour in online communities (2013)
Community analysis through semantic rules and role composition derivation (2013)
Review of the state of the art: discovering and associating semantics to tags in folksonomies (2012)
Engaging politicians with citizens on social networking sites: the WeGov Toolbox (2012)
Social support for ontological mediation and data integration (2009)
The CKC challenge: exploring tools for collaborative knowledge construction (2008)
Building a pragmatic Semantic Web (2008)
Ontologies as facilitators for repurposing web documents (2007)
Identifying communities of practice through ontology network analysis (2003)
Automatic ontology-based knowledge extraction from web documents (2003)
Augmenting thesaurus relationships: Possibilities for retrieval (2001)
Voronoi-based region approximation for geographical information retrieval with gazetteers (2001)
Other
Technical Report: The CKC Challenge: Exploring Tools for Collaborative Knowledge Construction (2007)
Presentation / Conference
CimpleKG: A Continuously Updated Knowledge Graph on Misinformation, Factors and Fact-Checks (2025)
Enhancing Hate Speech Annotations with Background Semantics (2024)
Towards AI-mediated Meme Generation for Misinformation Correction Explanation (2024)
Knowledge-Grounded Target Group Language Recognition in Hate Speech (2023)
MisinfoMe: A Tool for Longitudinal Assessment of Twitter Accounts’ Sharing of Misinformation (2023)
On the Readability of Misinformation in Comparison to the Truth (2023)
Estimating Ground Truth in a Low-labelled Data Regime: A Study of Racism Detection in Spanish (2022)
Supporting Online Toxicity Detection with Knowledge Graphs (2022)
Chatbots to Support Children in Coping with Online Threats: Socio-technical Requirements (2021)
On the use of Jargon and Word Embeddings to Explore Subculture within the Reddit’s Manosphere (2020)
Co-Spread of Misinformation and Fact-Checking Content during the Covid-19 Pandemic (2020)
Towards a Cross-article Narrative Comparison of News (2020)
News Source Credibility in the Eyes of Different Assessors (2019)
MisinfoMe: Who’s Interacting with Misinformation? (2019)
SenZi: A Sentiment Analysis Lexicon for the Latinised Arabic (Arabizi) (2019)
Exploring Misogyny across the Manosphere in Reddit (2019)
Chasing the Chatbots: Directions for Interaction and Design Research (2019)
Understanding the Role of Human Values in the Spread of Misinformation (2019)
Contextual Semantics for Radicalisation Detection on Twitter (2018)
Cross-Lingual Classification of Crisis Data (2018)
What’s going on in my city? Recommender systems and electronic participatory budgeting (2018)
Designing Chatbots for Crises: A Case Study Contrasting Potential and Reality (2018)
Classifying Crises-Information Relevancy with Semantics (2018)
Understanding the Roots of Radicalisation on Twitter (2018)
Online Misinformation: Challenges and Future Directions (2018)
Semantic Wide and Deep Learning for Detecting Crisis-Information Categories on Social Media (2017)
Statistical Semantic Classification of Crisis Information (2017)
An analysis of UK Policing Engagement via Social Media (2017)
Prospecting Socially-Aware Concepts and Artefacts for Designing for Community Resilience (2017)
A Semantic Graph-Based Approach for Radicalisation Detection on Social Media (2017)
On Semantics and Deep Learning for Event Detection in Crisis Situations (2017)
DoRES — A Three-tier Ontology for Modelling Crises in the Digital Age (2017)
On the Role of Semantics for Detecting pro-ISIS Stances on Social Media (2016)
Strategies and Tools to Raise Energy Awareness Collectively (2016)
Talking Climate Change via Social Media: Communication, Engagement and Behaviour (2016)
Climate Change Engagement: Results of a Multi-Task Game with a Purpose (2016)
Identifying Important Life Events from Twitter Using Semantic and Syntactic Patterns (2016)
A Linked Open Data Approach for Sentiment Lexicon Adaptation (2016)
SentiCircles: A Platform for Contextual and Conceptual Sentiment Analysis (2016)
Identifying Prominent Life Events on Twitter (2015)
Predicting Answering Behaviour in Online Question Answering Communities (2015)
Modelling Question Selection Behaviour in Online Communities (2015)
Analysing engagement towards the 2014 Earth Hour Campaign in Twitter (2015)
Automatic Identification of Personal Life Events in Twitter (2015)
Detecting child grooming behaviour patterns on social media (2014)
Policing engagement via social media (2014)
The topics they are a-changing - characterising topics with time-stamped semantic graphs (2014)
Stretching the life of Twitter classifiers with time-stamped semantic graphs (2014)
Automatic stopword generation using contextual semantics for sentiment analysis of Twitter (2014)
Semantic patterns for sentiment analysis of Twitter (2014)
OUSocial2: a platform for gathering students’ feedback from social media (2014)
Personal life event detection from social media (2014)
Using social media to inform policy making: to whom are we listening? (2014)
Exploring user behavior and needs in Q & A communities (2014)
Motivating online engagement and debates on energy consumption (2014)
Mining and comparing engagement dynamics across multiple social media platforms (2014)
User profile modelling in online communities (2014)
Adapting sentiment lexicons using contextual semantics for sentiment analysis of Twitter (2014)
Energy consumption awareness in the workplace: technical artefacts and practices (2014)
On stopwords, filtering and data sparsity for sentiment analysis of Twitter (2014)
SentiCircles for contextual and conceptual semantic sentiment analysis of Twitter (2014)
Evaluation datasets for Twitter sentiment analysis: a survey and a new dataset, the STS-Gold (2013)
OU Social: reaching students in social media (2013)
Measuring the topical specificity of online communities (2013)
What catches your attention? An empirical study of attention patterns in community forums (2012)
Automatic identification of best answers in online enquiry communities (2012)
Semantic sentiment analysis of twitter (2012)
What makes communities tick? Community health analysis using role compositions (2012)
Ignorance isn't bliss: an empirical analysis of attention patterns in online communities (2012)
Behaviour analysis across different types of Enterprise Online Communities (2012)
Alleviating data sparsity for Twitter sentiment analysis (2012)
Semantic smoothing for Twitter sentiment analysis (2011)
Anticipating discussion activity on community forums (2011)
Automatically extracting polarity-bearing topics for cross-domain sentiment classification (2011)
The Effect of User Features on Churn in Social Networks (2011)
Predicting discussions on the social semantic web (2011)
Modelling and analysis of user behaviour in online communities (2011)
Social dynamics in conferences: analyses of data from the Live Social Semantics application (2010)
Exploring English lexicon knowledge for Chinese sentiment analysis (2010)
Global integration of public sector information (2010)
Preliminary results in tag disambiguation using DBpedia (2009)
Collaborative support for community data sharing (2008)
Semantic modelling of user interests based on cross-folksonomy analysis (2008)
A community based approach for managing ontology alignments (2008)
Demo: A community based approach for managing ontology alignments (2008)
Enriching ontological user profiles with tagging history for multi-domain recommendations (2008)
Correlating user profiles from multiple folksonomies (2008)
A community based approach to managing ontology alignments (2008)
Mining for Social Serendipity (2008)
Advanced knowledge system for coatings and the gas turbine MRO industry (2008)
Survey of tools for collaborative knowledge construction and sharing (2007)
Searching ontologies based on content: Experiments in the biomedical domain (2007)
Folksonomies, the semantic web, and movie recommendation (2007)
Unlocking the potential of public sector information with Semantic Web technology (2007)
Searching biomedical ontologies based on content (2007)
The application of advanced knowledge technologies for emergency response (2007)
Ranking ontologies with AKTiveRank (2006)
Ontologies change and queries break: Towards a solution (2006)
Content-based ontology ranking (2006)
Winnowing ontologies based on application use (2006)
Ontology construction from online ontologies (2006)
Metrics for ranking ontologies (2006)
Searching and ranking ontologies on the Semantic Web (2005)
Ontology Winnowing: A Case Study on the AKT Reference Ontology (2005)
Towards a killer app for the Semantic Web (2005)
Ontology ranking based on the analysis of concept structures (2005)
Common features of killer apps: A comparison with Protégé (2005)
Monitoring research collaborations using semantic web technologies (2005)
Trust Strategies for the Semantic Web (2004)
On the emergent Semantic Web and overlooked issues (2004)
The Semantic Web as a Semantic Soup (2004)
Using Protege for automatic ontology instantiation (2004)
Data driven ontology evaluation (2004)
CS AKTive Space: Building a Semantic Web Application (2004)
Web based knowledge extraction and consolidation for automatic ontology instantiation (2003)
TGVizTab: An ontology visualisation extension for Protégé (2003)
Automatic extraction of knowledge from web documents (2003)
Generating adaptive hypertext content from the semantic web (2003)
ONTOCOPI: Methods and tools for identifying communities of practice (2002)
Exploiting synergy between ontologies and recommender systems (2002)
Design issues for agent-based resource locator systems (2002)
Initiating organizational memories using ontology network analysis (2002)
Managing reference: ensuring referential integrity of ontologies for the semantic web (2002)
Geographical information retrieval with ontologies of place (2001)
Ontology-driven geographical information retrieval (2000)
Associative and spatial relationships in thesaurus-based retrieval (2000)
Thesaural and spatial knowledge in cultural heritage information retrieval systems (2000)