Biography

Professional biography

Dr. Angelo Salatino is a Research Fellow at the Scholarly Knowledge team (SKM3), at the Knowledge Media Institute (KMi) of the Open University. He obtained a Ph.D., studying methods for the early detection of research trends. In particular, his project aimed at identifying the emergence of new research topics at their embryonic stage (i.e., before being recognised by the research community).

Currently, he is mainly working on: i) new technologies for classifying scientific papers according to their relevant research topics, and ii) how the research output of academia fosters innovation in the industry. 

Research interests

His research interests are in the areas of Semantic Web, Network Science and Knowledge Discovery technologies, with focus on the structure and evolution of science: Science of Science

Projects

TRACK-ing Stars and Unicorns: The trajectory of funded and unfunded people and teams

Scientific progress and innovation depend on a complex interplay between researchers, funding, teams, time and ideas. Funding agencies, including UKRI, balance a choice between investing in people, as the cornerstone of a robust UK research workforce, with investing in research ideas underpinning potentially commercial innovations. For example, whereas previous research has worked to identify relevant markers for high-performing research individuals (stars) [citeme], it is more difficult to identify indicators of market success early in the innovation pipeline. This increases the risk associated with framing an entire funding system around early investment in potential research ideas, when the vast majority of these will fail to materialise [citeme]. Furthermore, early investment in ideas as sole focus, risks underinvesting in a workforce that is necessary to underpin the development of novel ideas in the long term. This project labels this paradox as a choice between funding research “stars” (individual research careers), versus research “unicorns” (ideas or concepts). What is needed is a funding mechanism that invests in people and the sustainability of a research workforce, but also provides an environment for novel, and potentially commercial ideas, can thrive. This project leverages Natural Language Processing (NLP), and Knowledge Graphs extracting concepts in the titles and abstracts. And it matches them with bibliometric data, monitoring of research success on individuals who were both successful and unsuccessful to attract UKRI research funding. It will result in the development of an AI-driven database capable of track-ing the evolution of ideas and careers, over time. This database, known as the TRACK-database (Trajectories of Research Applications and Career Knowledge), will be the world’s first longitudinal funding database linked to pseudonymised individuals and their academic output capable of mapping UKRI-funded and unfunded grant applications to individual researchers, research concepts, and innovation over time. Creation of this database represents a step change in research policy evaluation by introducing a scalable, AI-powered framework that will enable large-scale, AI-driven analyses. These include analysing how careers and ideas develop over time, as well as causally linking these longitudinal outcomes with different funding types (fellowship or teams) at the early stage of a research career (<5 years first publication). Further investigations include whether certain disciplines benefit more from specific funding models and assess the UK's position in the global research landscape by comparing its research trends with international competitors. Engagement with UKRI, as a major beneficiary of this project’s outcomes, will ensure that research insights from the TRACK-database are utilised towards the development and implementation of actionable insights used to facilitate a UK funding environment that contributes to both a sustainable UK workforce, as well as fosters the development of novel ideas and concepts. Such policies will be used to develop a more resilient and productive research ecosystem, thereby enhancing UK’s international competitiveness.

Analysis of AI publication trends

The researcher will join a team dedicated to developing a custom analysis of AI bibliometric trends. Specifically, Angelo will consult with the AI Index team on designing a classification scheme for AI publications to be featured in the 2025 AI Index report. Once the scheme is established, there is also interest in publishing the analysis as a standalone research paper. While the AI Index team will conduct the analysis, Angelo’s role will be consultative, providing strategic guidance on project design.

Publications

Book Chapter

Knowledge Graph Construction for Health, Lifestyle and Fitness Applications (2025)

Detection, Analysis, and Prediction of Research Topics with Scientific Knowledge Graphs (2021)

Ontology Extraction and Usage in the Scholarly Knowledge Domain (2020)

Journal Article

Large language models for scholarly ontology generation: An extensive analysis in the engineering field (2026)

A Survey on Knowledge Organization Systems of Research Fields: Resources and Challenges (2025)

The Epistemology of Fine-Grained News Classification (2025)

Exploring Large Language Models for Scientific Question Answering via Natural Language to SPARQL Translation (2025)

A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals (2025)

Optimizing Tourism Accommodation Offers by Integrating Language Models and Knowledge Graph Technologies (2024)

A comparative analysis of knowledge injection strategies for large language models in the scholarly domain (2024)

Exploring Environmental, Social, and Governance (ESG) Discourse in News: An AI-Powered Investigation Through Knowledge Graph Analysis (2024)

Artificial intelligence for literature reviews: opportunities and challenges (2024)

Data-Driven Methodology for Knowledge Graph Generation Within the Tourism Domain (2023)

Integrating Conversational Agents and Knowledge Graphs Within the Scholarly Domain (2023)

AIDA: a Knowledge Graph about Research Dynamics in Academia and Industry (2022)

The AIDA Dashboard: a Web Application for Assessing and Comparing Scientific Conferences (2022)

R-classify: Extracting research papers’ relevant concepts from a controlled vocabulary (2022)

Link Prediction of Weighted Triples for Knowledge Graph Completion Within the Scholarly Domain (2021)

Trans4E: Link Prediction on Scholarly Knowledge Graphs (2021)

CSO Classifier 3.0: a scalable unsupervised method for classifying documents in terms of research topics (2021)

New Trends in Scientific Knowledge Graphs and Research Impact Assessment (2021)

The Computer Science Ontology: A Comprehensive Automatically-Generated Taxonomy of Research Areas (2020)

How are topics born? Understanding the research dynamics preceding the emergence of new areas (2017)

Presentation / Conference

Assessing Large Language Models for SPARQL Query Generation in Scientific Question Answering (2025)

Capturing the Viewpoint Dynamics in the News Domain (2024)

HUMAD 2024: International Workshop on Human-Centered Modeling and Adaptation for Digital Transformation (2024)

Identifying Semantic Relationships Between Research Topics Using Large Language Models in a Zero-Shot Learning Setting (2024)

Classifying Scientific Topic Relationships with SciBERT (2024)

Leveraging Language Models for Generating Ontologies of Research Topics (2024)

Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark (2024)

Investigating Environmental, Social, and Governance (ESG) Discussions in News: A Knowledge Graph Analysis Empowered by AI (2024)

AIDA-Bot 2.0: Enhancing Conversational Agents with Knowledge Graphs for Analysing the Research Landscape (2023)

Leveraging Knowledge Graphs with Large Language Models for Classification Tasks in the Tourism Domain (2023)

Enhancing Scholarly Understanding: A Comparison of Knowledge Injection Strategies in Large Language Models (2023)

3rd International Workshop on Scientific Knowledge Representation, Discovery, and Assessment (Sci-K 2023) (2023)

Sci-K 2022 - International Workshop on Scientific Knowledge: Representation, Discovery, and Assessment (2022)

Leveraging Knowledge Graph Technologies to Assess Journals and Conferences at Springer Nature (2022)

Enriching Data Lakes with Knowledge Graphs (2022)

Assessing Scientific Conferences through Knowledge Graphs (2021)

AIDA-Bot: A Conversational Agent to ExploreScholarly Knowledge Graphs (2021)

The AIDA Dashboard: Analysing Conferences with Semantic Technologies (2020)

ResearchFlow: Understanding the Knowledge Flow between Academia and Industry (2020)

Databases and Information Systems in the AI Era: Contributions from ADBIS, TPDL and EDA 2020 Workshops and Doctoral Consortium (2020)

Integrating Knowledge Graphs for Analysing Academia and Industry Dynamics (2020)

Improving Editorial Workflow and Metadata Quality at Springer Nature (2019)

Smart Topics Miner 2: Improving Proceedings Retrievability at Springer Nature (2019)

Integrating Knowledge Graphs for Comparing the Scientific Output of Academia and Industry (2019)

The CSO Classifier: Ontology-Driven Detection of Research Topics in Scholarly Articles (2019)

Classifying Research Papers with the Computer Science Ontology (2018)

The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas (2018)

AUGUR: Forecasting the Emergence of New Research Topics (2018)

2100 AI: Reflections on the mechanisation of scientific discovery (2017)

Smart Book Recommender: A Semantic Recommendation Engine for Editorial Products (2017)

Supporting Springer Nature Editors by means of Semantic Technologies (2017)

Detection of Embryonic Research Topics by Analysing Semantic Topic Networks (2016)

Ontology Forecasting in Scientific Literature: Semantic Concepts Prediction based on Innovation-Adoption Priors (2016)

Ontology Forecasting in Scientific Literature: Semantic Concepts Prediction based on Innovation-Adoption Priors (2016)

Smart Topic Miner: Supporting Springer Nature Editors with Semantic Web Technologies (2016)

Automatic Classification of Springer Nature Proceedings with Smart Topic Miner (2016)

Early Detection and Forecasting of Research Trends (2015)

Presentation / Conference Contribution

Benchmarking Large Language Models for Sustainable Development Goals Classification: Evaluating In-Context Learning and Fine-Tuning Strategies (2025)

Thesis

Early Detection of Research Trends (2019)