JavaScript is Disabled
Your browser's JavaScript functionality is disabled. It has to be enabled to use this function of ConfTool.
Here you can find information on how to enable JavaScript
If you have any problems, please contact the organisers at FF2025@bl.uk.

Conference Agenda

Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available).

Please note that all times are shown in the time zone of the conference. The current conference time is: 29th Mar 2026, 03:38:50am BST

Only Sessions at Date / Time

Agenda Overview

Session

LP03: Long papers

Time:

Friday, 05/Dec/2025:

9:00am - 10:00am

Session Chair: Neil Fitzgerald

Location: Pigott Theatre (Auditorium)

Knowledge Centre, capacity 255

Presentations

9:00am - 9:30am

Developing Archival AI chatbots: risks, benefits, and future directions

Finola Finn¹, Caio Mello¹, Yves Maurer²

¹University of Luxembourg, Luxembourg; ²National Library of Luxembourg, Luxembourg

AI chatbots are increasingly used for navigating and analysing the contents of major archives and collections. Applying Retrieval Augmented Generation to existing large language models, these tools draw on indexes of the relevant collections to answer, in natural language, users’ questions. This presentation brings together three professionals from history, digital humanities, the GLAM sector, and computer science to discuss the risks, benefits, and future directions of these tools. We explore how AI chatbots could be used to optimise the accessibility of collections in ways that maintain, or even enhance, research integrity.

The discussion draws on hands-on experiences with developing and using archival AI chatbots. Caio Mello reflects on the challenges and questions faced by his team in developing Barista, a chatbot designed as part of the ‘Impresso – Media Monitoring of the Past II’ project to transform prompts into search queries on the Impresso Web App. Yves Maurer brings insights from the design and launch of chat.eluxemburgensia.lu, a pioneering AI chatbot released in 2023 to help users search heritage documents digitized by the National Library of Luxembourg (BNL). Finola Finn complements these inputs with methodological and epistemological considerations, examining the implications of using chatbots for historical research and public engagement. Together, the speakers offer suggestions for how providers could carefully design, frame, and describe the intended use of archival chatbots. Throughout, the discussion centers around two key themes:

Thoughtful UX design and communication
Given the current influx of very similar tools available for public use (from ChatGPT to customer service bots), a key issue in the development of archival AI chatbots is expectation management. Each chatbot works in a different way and was built for a different purpose, but these distinctions and their downstream effects are often not immediately clear to users. So, how can we best inform users of chatbots’ capabilities and limitations? Have users developed certain knowledge and habits through engagement with other chatbots that might need curbing or reorienting? What skills and information do we need to make available for users to be able to use chatbots effectively and responsibly?

Maintaining rigorous research practices
In addition to being highly powerful finding aids, allowing users to locate and access relevant documents more easily, some archival AI chatbots are also presented as being capable of providing useful, automated answers to questions about the past. What are the epistemological differences between these two uses of archival chatbots (i.e. navigation vs analysis), and why is it important to demarcate them? What are the implications and possibilities for integrating these tools into the research practices of humanities scholars and their efforts to interrogate archives and collections?

9:30am - 10:00am

Old system, new tricks; Using AI to Improve AV Metadata in a Legacy System

Agnes Toftgård, Emma Rende

National Library of Sweden, Sweden

What can you do when you find a problem that could be solved with the help of AI, but are currently saddled with an inflexible legacy system? How do you handle problems that need solving now without investing too much time and effort in a solution that will be made redundant within a few years by ever evolving technology? These questions are the starting point of our presentation - a case study from the National library of Sweden (KB) involving automation of metadata enrichment for audio-visual (AV) materials in a legacy system, assisting staff in anticipation of future restructuring.

Currently, AV materials are catalogued using metadata purchased from a Swedish news agency, including program schedules, air times, and subject matter. However, for many programs - particularly live or news broadcasts - this metadata is incomplete or too generic to be useful. In these cases, human staff manually supplement archive searchability by listening to news broadcasts and writing their own summaries. This manual work is time-consuming and can require experienced staff to spend up to a full workday each week on this task.

To address these issues, we have developed an automated solution that makes use of speech-to-text and large language models. Selected TV and radio programs are transcribed using KB-Whisper [1], a transcription model trained in-house. The transcribed text is then summarized using a language model (currently Llama 3.1-8B-Instruct [2]). These summaries are automatically added to the institution’s national AV catalogue, SMDB, in a dedicated field clearly labeled as AI-generated. Catalogue staff manage only the selection of programs to be processed - everything else happens without manual input. This means that catalogue staff can redirect their efforts from basic summarization toward tasks that require human judgment and domain expertise. Furthermore, the searchable AI-generated summaries enhance discovery and access for end users, improving the overall usability of the system without the need for significant development in the legacy platform.

The process has been highly collaborative. Data scientists, developers, and cataloging experts have worked closely together throughout. In addition to solving a technical problem, the project has fostered important conversations about the role of AI in cultural heritage institutions. Topics have included quality standards for AI-generated text, transparency toward users, and internal expectations for automated processes.

Although our solution is not a long-term replacement for a modern infrastructure, it threads the needle of using the tools we have now while anticipating changing circumstances in the future. The AI-enhanced cataloguing pipeline provides practical value now, easing workloads, improving metadata quality, and building organizational experience with AI technologies.

In conclusion, the project shows how AI can be used to improve public sector operations even within the limitations of outdated systems. The focus on incremental improvement, cross-functional collaboration, and transparency has helped turn a short-term constraint into an opportunity for innovation. As institutions across the cultural heritage sector face similar challenges - aging systems, rising content volumes, and increasing user expectations - this type of applied AI work offers a concrete and transferable model. It demonstrates that meaningful change is possible without waiting for perfect conditions, and that small steps can lead to significant long-term impact.

Background:

Since 2019, KB has operated KB Lab, a data lab focused on training AI models for the Swedish language and supporting researchers with access to structured collection data.

The KBx initiative was launched by KB to begin implementing AI within the organization through small-scale solutions and an experimental approach. The core team is small, and additional expertise - such as subject matter experts, system developers, and others - is brought in as needed, depending on the nature of the ongoing work.

By developing tangible products, even on a small scale, we hope to make the advantages and possibilities of AI more accessible and easier to understand for our staff. Through this approach, we aim to foster a sense of ambassadorship for AI, making it more approachable.

References:

[1] Leonora Vesterbacka, Faton Rekathati, Robin Kurtz, Justyna Sikora, Agnes Toftgård. Swedish Whispers; Leveraging a Massive Speech Corpus for Swedish Speech Recognition. https://arxiv.org/abs/2505.17538 (2025).

[2] Grattafiori, Aaron, et al. The llama 3 herd of models. https://arxiv.org/abs/2407.21783 (2024).

Fantastic Futures 2025

AI4LAM and the British Library

3 - 5 December 2025
London, UK

Conference Agenda