
Prof Advaith Siddharthan
Professor Of Computer Science And Society
Biography
Professional biography
I read Physics at the University of Delhi, and Computer Science at the University of Cambridge before gained my PhD in Computational Linguistics at the University of Cambridge (2003). After Postdoctoral research at Columbia University and the University of Cambridge, I took up my first faculty position at the University of Aberdeen in 2009 before joining the Open University’s Knowledge Media Institute in 2017 as a Reader. I have been Professor of Computer Science and society since 2022. I have over 80 peer-reviewed publications and have been PI on grants from UKRI, EPSRC, NERC and ESRC and Co-I on grants from EPSRC, ESRC, H2020, National Geographic and NERC. I currently work on several research projects developing technologies for biodiversity citizen science, including to make these relevant and accessible to primary and secondary schools, while continuing to dabble in disparate topics in Computational Linguistics.
Research interests
My research intersects Citizen Science, Artificial Intelligence, Data Science, and Sustainability Education. I develop socially responsible AI technologies that bridge the divide between professional scientists and lay public, facilitate meaningful public engagement with science and foster attitudinal and behavioural change, particularly around biodiversity issues. I am the academic lead for four citizen science projects at the OU:
My current research investigates science learning within such citizen science projects, especially how citizens can learn alongside artificial intelligence from data. More details of my projects can be found at the Citizen Science and Artificial Intelligence group pages or on my Personal Webpages.
Impact and engagement
Current projects integrate citizen science learning around pollinators into school curricula in the UK and Italy, collecting and analysing data while encouraging schools and students to create habitats and act as pollinator advocates in society. This research is referenced in the UKRI public engagement strategy document and was showcased during Bees’ Needs Week 2020, a public engagement event coordinated by DEFRA.
I previously led the development of novel technologies aimed at public engagement with nature conservation schemes, co-created in partnership with leading UK charities, the Royal Society for Protection of Birds, Bumblebee Conservation Trust and Royal Horticultural Society. Two projects, Blogging Birds (redkite.abdn.ac.uk) and BeeWatch (beewatch.abdn.ac.uk) were among the 8 selected to feature in the RCUK impact summary report for its Digital Economy theme “Celebrating Success in the Digital Economy". Blogging Birds demonstrated the ability to generate complex automated data-driven texts, received an EPSRC prize (Telling Tales of Engagement Competition), and resulted in a publication in the prestigious Communications of the ACM (Siddharthan et al., 2019). BeeWatch was an online citizen science initiative that generated valuable bumblebee records across the UK by integrating artificial intelligence and data science into citizen science. Following publication of our results on bumblebee feeding patterns in Nature Scientific Reports in 2020, we are working with the RHS to use citizen science data to improve pollinator-friendly planting lists.
External collaborations
Key research collaborators on our citizen science projects include University of Aberdeen, Swedish University of Agricultural Sciences, University of Edinburgh and Imperial College London. I work with a much larger set of partners, within and outside of academia, including on European projects such as https://cos4cloud-eosc.eu/.
Publications
Journal Article
Blogging Birds: Telling informative stories about the lives of birds from telemetric data (2019)
Extractive and Abstractive Sentence Labelling of Sentiment-bearing Topics (2019)
SaferDrive: an NLG-based Behaviour Change Support System for Drivers (2018)
Recognizing cited facts and principles in legal judgements (2017)
Presentation / Conference
CSS: Contrastive Semantic Similarities for Uncertainty Quantification of LLMs (2024)
iSpot & AI: Integrating FASTCAT-Cloud and PI@ntNET-API in the Cos4Cloud framework (2023)
Empirical Optimal Risk to Quantify Model Trustworthiness for Failure Detection (2023)
Consensus building in on-line citizen science (2022)
Confidence-Aware Calibration and Scoring Functions for Curriculum Learning (2022)
Gender equality work in a distance learning institution (2022)
Summarising Historical Text in Modern Languages (2021)
Incorporating Constraints into Matrix Factorization for Clothes Package Recommendation (2018)
Generating Summaries of Sets of Consumer Products: Learning from Experiments (2018)
Understanding how to Explain Package Recommendations in the Clothes Domain (2018)
Matrix Factorization for Package Recommendations (2017)
Should Learning Material's Selection be Adapted to Learning Style and Personality? (2017)
Automatically Labelling Sentiment-Bearing Topics with Descriptive Sentence Labels (2017)
Bumblebee friendly planting recommendations with citizen science data (2017)
Summarising News Stories for Children (2016)
Summarising the points made in online political debates (2016)
Exploring the impact of extroversion on the selection of learning materials (2016)
Scrutable Feature Sets for Stance Classification (2016)
Lexico-syntactic Text Simplification And Compression With Typed Dependencies (2014)
Investigation into Human Preference between Common and Unambiguous Lexical Substitutions (2011)
Complex lexico-syntactic reformulation of sentences using typed dependency representations (2010)
Reformulating discourse connectives for non-expert readers (2010)
Corpora for the conceptualisation and zoning of scientific papers (2010)
Whose idea was this, and why does it matter? Attributing scientific work to citations (2007)
An annotation scheme for citation function (2006)
Automatic Classification of Citation Function (2006)
Syntactic Simplification for Improving Content Selection in Multi-Document Summarization (2004)