Olivier Lartillot

Researcher - RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion (IMV)

Norwegian version of this page

Email olivier.lartillot@imv.uio.no

Mobile phone +47 966 81 210?

Username

Visiting address 澳门葡京手机版app下载sveien 3A 0373 Oslo

Postal address P.O. Box 1133 Blindern 0318 Oslo

Press photo Download business card

Short bio:?Olivier Lartillot is a researcher working in the fields of computational music and sound analysis and artificial intelligence. He is a co-leader of the Work Package "AI for cultural heritage" at?MishMash Centre for AI and Creativity. He obtained?a funding from the Research Council of Norway under the FRIPRO-IKTPLUSS program, for a project called MIRAGE?- A Comprehensive AI-Based System for Advanced Music Analysis?(2020-2023).

He designed MIRtoolbox, a recognised tool for music feature extraction from audio. He also works on symbolic music analysis, notably on sequential pattern mining. In the context of his 5-year Academy of Finland research fellowship, he conceived the MiningSuite, an analytical framework that combines audio and symbolic research.

Olivier is a partner in the HIGH-M project (Human Interaction assessment and Generative segmentation in Health and Music). Olivier previously worked for?the SoundTracer project, an innovation project in collaboration with the National Library of Norway.?He also?collaborates?on the TIME project.

Olivier has given courses in Music Information Retrieval, conference tutorials and has taught in various summer schools. He has collaborated on various projects around the topics of artificial intelligence, signal processing, cognitive science, neuroscience, music analysis, cross-cultural studies and music therapy. He has written 80 articles, with more than 3000 citations. He is a member of the?Editorial Board of the?Transactions of the International Society for Music Information Retrieval,?is an expert evaluator?for the European Commission's Horizon 2020 program and has participated to the Program Committees of various conferences.

As part of the SoundTracer project, he worked in particular on the automated transcription of Norwegian fiddle music. He discussed this topic in the international conference on music perception and cognition (ICMPC 2018):

Background

Researcher?at the Department for Architecture, Design and Media Technology, Aalborg University, Denmark (2014-2016)
Academy of Finland research fellow (2009-2014) and previously post-doc (2004-2009)?at the?Finnish Centre of Excellence in Interdisciplinary Music Research, University of Jyv?skyl?, Finland
Scientific collaborator at the Swiss Center for Affective Sciences, University of Geneva, Switzerland (2012-2013)
PhD at Ircam - Centre George Pompidou, UPMC Paris University (2000-2004)
ATIAM Master?at Ircam - Centre George Pompidou, UPMC Paris University (1999-2000)
Sup��lec engineering school, France (1996-1999)
Art degree in Musicology, Sorbonne University, Paris, France (1996-1999)

Video Visualization of a String Quartet Performance of a Bach Fugue: Design and Subjective Evaluation Image may contain: Chordophone, String instrument, Violin family, Musical ensemble, Musical instrument.

Music & Science,?Special Collection on MusicLab Copenhagen: A research concert with the Danish String Quartet

We design and test a visualization strategy aimed at explicating to a large audience with diverse backgrounds��especially novices��the multifaceted beauty of the final Contrapunctus in J.S. Bach's The Art of Fugue, performed by the Danish String Quartet.?At the surface level of the musical structure, the rich fluctuation of pitch shaped by each musician was depicted in the form of undulating pitch curves. At a deeper structural level, the repetition of pitch curves, distinctive of fugues, was highlighted through vertical alignment��inspired by a technique called paradigmatic analysis, originating from anthropology and music semiology.

Musicological and Technological Perspectives on Computational Analysis of Electroacoustic Music

Chapter from the Sonic Design?book

I examine electroacoustic music analysis, covering musicological investigations and desires and technological challenges and potentials. Starting from Pierre Schaeffer��s?Trait�� des objects musicaux, I present?an overview of core analytical principles underpinning more recent musicological approaches. Based on a state of the art in computational analysis of electroacoustic music, I sketch the principles of what could be a?Toolbox des objets sonores.

A Dataset of Norwegian Hardanger Fiddle Recordings with Precise Annotation of Note and Beat Onsets

Transactions of the International Society for Music Information Retrieval

We present a dataset of several hours of recordings of Hardanger fiddle music, with note annotations of onsets, offsets and pitches, provided by the performers themselves. A subset has also been annotated with beat onset positions by the performer as well as three expert musicians. The complexity of the music genre��polyphonic, highly ornamented and with a very irregular pulsation, among other aspects��motivated the design of a new annotation software adapted to these particular needs.?

Segmentation, Transcription, Analysis and Visualisation of the Norwegian Folk Music Archive

Presented at the?9th International Conference on Digital Libraries for Musicology,?part of the annual conference of the International Association of Music Libraries (IAML)

We present an ongoing project dedicated to the transmutation of a collection of field recordings of Norwegian folk music established in the 1960s into an easily accessible online catalogue augmented with advanced music technology and computer musicology tools. We focus in particular on a major highlight of this collection: Hardanger fiddle music.

MusicLab 8: Synaesthesia

We explored the synaesthesia of guitarist and artistic researcher Bj?rn Charles Dreyer in a multimodal and interactive concert?based on new technologies specifically developed for this collaboration. You can watch the concert, followed by a panel discussion and a technical demonstration:

MIRAGE Symposium #1: Computational Musicology

The 1st MIRAGE Symposium, which took?place on 8-9 June, 2021, is available on replay.

New method for computational analysis of sound chosen as JASA highlight

Article about new method for attack detection chosen as a highlight by The Journal of the Acoustical Society of America (JASA).

Computational Musicological Analysis of Notated Music: A Brief Overview

A chapter from the newly released book "Notated Music in the Digital Sphere:?Possibilities and Limitations", Nota Bene 15

Artificial intelligence can help you understand music better

Algorithms and technology have so far helped listeners to more of the same music. Now, UiO researchers are working on new technology that can get people interested in a greater musical variety.

Christodoulou, Anna-Maria & Lartillot, Olivier (2025). A Multimodal Dataset of Greek Folk Music. In Luca, Elsa De (Eds.), DLfM '25: Proceedings of the 12th International Conference on Digital Libraries for Musicology. Association for Computing Machinery (ACM). ISSN 9798400720833. p. 19�C27. doi: https:/dl.acm.org/doi/10.1145/3748336.3748339. Full text in Research Archive Show summary
This paper presents a multimodal dataset of Greek folk dance music, focusing on syrtos and balos. Developed to support research in computational musicology, the dataset improves access to Greek musical heritage through manually transcribed MIDI scores, aligned lyrics, and rich metadata, all curated by expert musicologists. Through pattern analysis and feature extraction, we examine both shared melodic structures and unique characteristics of each dance, with some examples reflecting traces of oral transmission. While metadata accompanies the collection to support organization and context, our primary emphasis is on the musical and lyrical content. This work contributes to digital ethnomusicology by showing how multimodal datasets of folk music can inform both analytical research and cultural heritage preservation.
Christodoulou, Anna-Maria; Glette, Kyrre; Lartillot, Olivier & Jensenius, Alexander Refsum (2025). MusiQAl: A Dataset for Music Question�CAnswering through Audio�CVideo Fusion. Transactions of the International Society for Music Information Retrieval. 8(1), p. 265�C282. doi: 10.5334/tismir.222. Full text in Research Archive Show summary
Music question�Canswering (MQA) is a machine learning task where a computational system analyzes and answers questions about music?related data. Traditional methods prioritize audio, overlooking visual and embodied aspects crucial to music performance understanding. We introduce MusiQAl, a multimodal dataset of 310 music performance videos and 11,793 human?annotated question�Canswer pairs, spanning diverse musical traditions and styles. Grounded in musicology and music psychology, MusiQAl emphasizes multimodal reasoning, causal inference, and cross?cultural understanding of performer�Cmusic interaction. We benchmark AVST and LAVISH architectures on MusiQAI, revealing strengths and limitations, underscoring the importance of integrating multimodal learning and domain expertise to advance MQA and music information retrieval.
Lartillot, Olivier; Swarbrick, Dana; Upham, Finn & Cancino-Chac��n, Carlos Eduardo (2025). Video Visualization of a String Quartet Performance of a Bach Fugue: Design and Subjective Evaluation. Music & Science. 8. doi: 10.1177/20592043251352299. Full text in Research Archive Show summary
Visualizing music��through music notation, analytical representations, or music videos��might potentially boost the appreciation of music in all its richness. The purpose of this study was to design and test a visualization strategy aimed at explicating to a large audience with diverse backgrounds��especially novices��the multifaceted beauty of the final Contrapunctus in J.S. Bach's The Art of Fugue, performed by the Danish String Quartet. At the surface level of the musical structure, the rich fluctuation of pitch shaped by each musician was depicted in the form of undulating pitch curves. At a deeper structural level, the repetition of pitch curves, distinctive of fugues, was highlighted through vertical alignment��inspired by a technique called paradigmatic analysis, originating from anthropology and music semiology. The visualization was initially prototyped in the form of a real-time technology as part of the MusicLab Copenhagen research concert. The concert audience focused on the performance itself, and did not pay much attention to, nor appreciate, the visualization. To evaluate more thoroughly the potential of the visualization, participants with varied musical expertise and taste were invited to listen to a recorded performance of the piece and watch the visualization on their own computer. A large majority reported that they felt they understood the visualization, around half of them felt that it enhanced their musical understanding, and a small group felt that it helped them to better appreciate the music.
H?ffding, Simon; Bergstr?m, Rebecca Josefine Five; Bishop, Laura; Bravo, Pedro Pablo Lucas; Burnim, Kayla & Cancino-Chac��n, Carlos Eduardo [Show all 28 contributors for this article] (2025). Introducing the MusicLab Copenhagen Dataset. Music & Science. 8. doi: 10.1177/20592043241303288. Full text in Research Archive Show summary
MusicLab Copenhagen was a unique research concert featuring the world-renowned Danish String Quartet in a naturalistic setting. The audience was split between one group physically located in the hall, another group listening to a radio broadcast, and a third group watching a live stream. Qualitative and quantitative data were captured from both musicians and audiences, resulting in a comprehensive dataset that can be used to address many research questions. This document introduces the dataset, explains its structure, and reflects on the related data collection, storing, publishing, and archiving processes.
Christodoulou, Anna-Maria; Dutta, Sagar; Lartillot, Olivier Serge Gabriel; Glette, Kyrre & Jensenius, Alexander Refsum (2024). Exploring Convolutional Neural Network Models for Multimodal Classification of Expressive Piano Performance, Proceedings of the Sound and Music Computing Conference 2024. SMC Network. ISSN 9789893520758. Full text in Research Archive Show summary
This paper addresses improving performance analysis by automating the recognition of expressive performance styles. We propose a multimodal fusion approach integrating audio, video, and motion data. We demonstrate the effectiveness of our approach by utilizing convolutional neural network (CNN) models. Training is done on a classical piano dataset of 211 excerpts containing audio, video, MIDI, and motion capture data. The results highlight the robustness of the CNN models; they achieve high accuracy even when trained on a limited dataset. Our study contributes to advancing the field of performance analysis by applying deep learning techniques to multimodal data.
Danielsen, Anne; Br?vig, Ragnhild; B?hler, Kjetil Klette; C?mara, Guilherme Schmidt; Haugen, Mari Romarheim & Jacobsen, Eirik [Show all 13 contributors for this article] (2024). There��s More to Timing than Time: Investigating Musical Microrhythm Across Disciplines and Cultures. Music Perception. ISSN 0730-7829. 41(3), p. 176�C198. doi: 10.1525/mp.2024.41.3.176. Full text in Research Archive Show summary
The TIME project: Timing and Sound in Musical Microrhythm (2017�C2022) studied microrhythm; that is, how dynamic envelope, timbre, and center frequency, as well as the microtiming of a variety of sounds, affect their perceived rhythmic properties. The project involved theoretical work regarding the basic aspects of microrhythm; experimental studies of microrhythm perception, exploring both stimulus features and the participants�� enculturated expertise; observational studies of how musicians produce particular microrhythms; and ethnographic studies of musicians�� descriptions of microrhythm. Collectively, we show that: (a) altering the microstructure of a sound (��what�� the sound is) changes its perceived temporal location (��when�� it occurs), (b) there are systematic effects of core acoustic factors (duration, attack) on microrhythmic perception, (c) microrhythmic features in longer and more complex sounds can give rise to different perceptions of the same sound, and (d) musicians are highly aware of microrhythms and have developed vocabularies for describing them. In addition, our results shed light on conflicting results regarding the effect of microtiming on the ��grooviness�� of a rhythm. Our use of multiple, interdisciplinary methodologies enabled us to uncover the complexity of microrhythm perception and production in both laboratory and real-world musical contexts.
Lartillot, Olivier (2024). Musicological and Technological Perspectives on Computational Analysis of Electroacoustic Music. In Jensenius, Alexander Refsum (Eds.), Sonic Design: Explorations Between Art and Science. Springer Nature. ISSN 9783031578922. p. 271�C297. doi: 10.1007/978-3-031-57892-2_15. Full text in Research Archive Show summary
Analysing electroacoustic music remains challenging, leaving this artistic treasure somewhat out of reach of mainstream musicology and many music lovers. This chapter examines electroacoustic music analysis, covering musicological investigations and desires and technological challenges and potentials. The aim is to develop new technologies to overcome the current limitations. The compositional and musicological foundations of electroacoustic music analysis are based on Pierre Schaeffer��s Trait�� des objects musicaux. The chapter presents an overview of core analytical principles underpinning more recent musicological approaches, including R. Murray Schafer��s soundscape analysis, Denis Smalley��s spectro-morphology, and Lasse Thoresen��s graphical formalisation. Then the state of the art in computational analysis of electroacoustic music is compiled and organised along broad themes, from detecting sound objects to estimating dynamics, facture and grain, mass, motions, space, timbre and rhythm. Finally, I sketch the principles of what could be a Toolbox des objets sonores.
Christodoulou, Anna-Maria; Lartillot, Olivier & Jensenius, Alexander Refsum (2024). Multimodal music datasets? Challenges and future goals in music processing. International Journal of Multimedia Information Retrieval. ISSN 2192-6611. 13(3). doi: 10.1007/s13735-024-00344-6. Full text in Research Archive Show summary
The term ��multimodal music dataset�� is often used to describe music-related datasets that represent music as a multimedia art form and multimodal experience. However, the term ��multimodality�� is often used differently in disciplines such as musicology, music psychology, and music technology. This paper proposes a definition of multimodality that works across different music disciplines. Many challenges are related to constructing, evaluating, and using multimodal music datasets. We provide a task-based categorization of multimodal datasets and suggest guidelines for their development. Diverse data pre-processing methods are illuminated, highlighting their contributions to transparent and reproducible music analysis. Additionally, evaluation metrics, methods, and benchmarks tailored for multimodal music processing tasks are scrutinized, empowering researchers to make informed decisions and facilitating cross-study comparisons.
Ros-F��bregas, Emilio; Page, Kevin; Saccomano, Mark; Lartillot, Olivier & Mazurenko, Anastasiia (2024). PERSPECTIVES ON DIGITAL MUSIC EDITION, HERITAGE PRESERVATION, ANNOTATION AND ANALYSIS: A COST ACTION MEETING IN BARCELONA. Anuario Musical. ISSN 0211-3538. doi: 10.3989/anuariomusical.2024.79.560. Full text in Research Archive
Thedens, Hans-Hinrich & Lartillot, Olivier (2023). AudioSegmentor: Et verkt?y for formidling av arkivopptak p? nettet. Studia Musicologica Norvegica. ISSN 0332-5024. 49(1), p. 92�C101. doi: 10.18261/smn.49.1.7. Full text in Research Archive Show summary
Norske folkemusikkarkiver st?r foran utfordringer vedr?rende ? gj?re sine samlinger tilgjengelige p? nett n?r feltopptak faller i det fri etter 50 ?r. Denne artikkelen beskriver et fors?k p? ? tilrettelegge brukskopier i form av lydfiler slik at brukerne kan lytte til enkeltopptak i en nettpresentasjon av et arkivs innhold. Lydklassifiseringsteknologi er i stand til ? finne og markere starttidspunkt p? enkeltmelodier og spare arkivpersonalet for mange timers manuelt arbeid. Mirage-prosjektet p? UiOs RITMO-senter har utviklet et grensesnitt for et slikt verkt?y for Nasjonalbibliotekets folkemusikkarkiv og dets nettkatalog WebbFIOL. L?sningen vil kunne tas i bruk av alle som st?r overfor liknende utfordringer.
Bishop, Laura; H?ffding, Simon; Lartillot, Olivier Serge Gabriel & Laeng, Bruno (2023). Mental Effort and Expressive Interaction in Expert and Student String Quartet Performance. Music & Science. 6. doi: 10.1177/20592043231208000. Full text in Research Archive
Lartillot, Olivier; Johansson, Mats Sigvard; Elowsson, Anders; Monstad, Lars L?berg & Cyvin, Mattias Stor?s (2023). A Dataset of Norwegian Hardanger Fiddle Recordings with Precise Annotation of Note and Beat Onsets. Transactions of the International Society for Music Information Retrieval. 6(1), p. 186�C202. doi: 10.5334/TISMIR.139. Full text in Research Archive
Maidhof, Clemens; M��ller, Viktor; Lartillot, Olivier; Agres, Kat; Bloska, Jodie & Asano, Rie [Show all 8 contributors for this article] (2023). Intra- and inter-brain coupling and activity dynamics during improvisational music therapy with a person with dementia: an explorative EEG-hyperscanning single case study. Frontiers in Psychology. 14. doi: 10.3389/fpsyg.2023.1155732. Full text in Research Archive
Szorkovszky, Alexander; Veenstra, Frank; Lartillot, Olivier Serge Gabriel; Jensenius, Alexander Refsum & Glette, Kyrre (2023). Embodied Tempo Tracking with a Virtual Quadruped, Proceedings of the Sound and Music Computing Conference 2023. SMC Network . ISSN 9789152773727. doi: 10.5281/zenodo.10060970. Full text in Research Archive
Lartillot, Olivier; Elovsson, Anders; Johansson, Mats Sigvard; Thedens, Hans-Hinrich & Monstad, Lars Alfred L?berg (2022). Segmentation, Transcription, Analysis and Visualisation of the Norwegian Folk Music Archive. In Pugin, Laurent (Eds.), DLfM '22: 9th International Conference on Digital Libraries for Musicology. Association for Computing Machinery (ACM). ISSN 9781450396684. p. 1�C9. doi: 10.1145/3543882.3543883. Full text in Research Archive Show summary
We present an ongoing project dedicated to the transmutation of a collection of field recordings of Norwegian folk music established in the 1960s into an easily accessible online catalogue augmented with advanced music technology and computer musicology tools. We focus in particular on a major highlight of this collection: Hardanger fiddle music. The studied corpus was available as a series of 600 tape recordings, each tape containing up to 2 hours of recordings, associated with metadata indicating approximate positions of pieces of music. We first need to retrieve the individual recording associated with each tune, through the combination of an automated pre-segmentation based on sound classification and audio analysis, and a subsequent manual verification and fine-tuning of the temporal positions, using a home-made user interface. Note detection is carried out by a deep learning method. To adapt the model to Hardanger fiddle music, musicians were asked to record themselves and annotate all played note, using a dedicated interface. Data augmentation techniques have been designed to accelerate the process, in particular using alignment of varied performances of same tunes. The transcription also requires the reconstruction of the metrical structure, which is particularly challenging in this style of music. We have also collected ground-truth data, and are conceiving a computational model. The next step consists in carrying out detailed music analysis of the transcriptions, in order to reveal in particular intertextuality within the corpus. A last direction of research is aimed at designing tools to visualise each tune and the whole catalogue, both for musicologists and general public.
Juslin, Patrik N.; Sakka, Laura S.; Barradas, Gon?alo T. & Lartillot, Olivier (2022). Emotions, mechanisms, and individual differences in music listening: A stratified random sampling approach. Music Perception. ISSN 0730-7829. 40(1), p. 55�C86. doi: 10.1525/mp.2022.40.1.55. Full text in Research Archive
Lartillot, Olivier; Nymoen, Kristian; C?mara, Guilherme Schmidt & Danielsen, Anne (2021). Computational localization of attack regions through a direct observation of the audio waveform. Journal of the Acoustical Society of America. ISSN 0001-4966. 149(1), p. 723�C736. doi: 10.1121/10.0003374. Full text in Research Archive Show summary
This article addresses the computational estimation of attack regions in audio recordings. Previous attempts to do so were based on the reduction of the audio waveform into an envelope curve, which decreases its temporal resolution. The proposed approach detects the attack region directly from the audio waveform. The attack region is modeled as a line starting from a low-amplitude point and intersecting one of the local maxima according to two principles: (1) maximizing the slope, while favoring, at the same time, a higher peak if the slope remains only slightly lower and (2) dismissing initial attack regions of relatively low amplitude. The attack start position is fine-tuned by intersecting the attack slope with the audio waveform. The proposed method precisely pinpoints the attack region in cases where it is unambiguously observable from the waveform itself. In such cases, previous methods selected a broader attack region due to the loss of temporal resolution. When attack regions are less evident, the proposed method��s estimation remains within the range of results provided by other methods. Applied to the prediction of judgments of P-center localization [Danielsen, Nymoen, Anderson, C^amara, Langer?d, Thompson, and London, J. Exp. Psychol. Hum. Percept. Perform. 45, 402�C418 (2019)], the proposed method shows a significant increase in precision, at the expense of recall.
Lartillot, Olivier (2021). Computational Musicological Analysis of Notated Music: a Brief Overview. Nota Bene. ISSN 1891-4829. 15, p. 142�C161. Full text in Research Archive Show summary
I present a short overview of computational methods for musicological analysis of notated music. We first need to clarify the various levels of computational representations of music: on one side, notated music, on the other, audio recordings, and in the middle, a note-level representa- tion of music performance where higher-level musical descriptions are absent. The article provides a synthetic and partial panorama of the different types of music analysis that have been systematised and auto- mated using computers. While pioneering works were mainly focused on statistical descriptions of the surface of music, other dimensions of music analysis such as harmony, metre and structure have been taken into consideration since. I conclude by sketching my personal vision of the future of computational music analysis.
Elovsson, Anders & Lartillot, Olivier (2021). A Hardanger Fiddle Dataset with Performances Spanning Emotional Expressions and Annotations Aligned using Image Registration, Proceedings of the 22nd International Society for Music Information Retrieval Conference, Online, Nov 7-12, 2021. International Society for Music Information Retrieval. ISSN 9781732729902. p. 174�C181. Full text in Research Archive Show summary
This paper presents a Hardanger fiddle dataset ��HF1�� with polyphonic performances spanning five different emotional expressions: normal, angry, sad, happy, and tender. The performances thus cover the four quadrants of the activity/valence-space. The onsets and offsets, together with an associated pitch, were human-annotated for each note in each performance by the fiddle players themselves. First, they annotated the normal version. These annotations were then transferred to the expressive performances using music alignment and finally human-verified. Two separate music alignment methods based on image registration were developed for this purpose; a B-spline implementation that produces a continuous temporal transformation curve and a Demons algorithm that produces displacement matrices for time and pitch that also account for local timing variations across the pitch range. Both methods start from an ��Onsetgram�� of onset salience across pitch and time and perform the alignment task accurately. Various settings of the Demons algorithm were further evaluated in an ablation study. The final dataset is around 43 minutes long and consists of 19 734 notes of Hardanger fiddle music, recorded in stereo. The dataset and source code are available online. The dataset will be used in MIR research for tasks involving polyphonic transcription, score alignment, beat tracking, downbeat tracking, tempo estimation, and classification of emotional expressions.
Weisser, St��phanie; Lartillot, Olivier & Sechehaye, H��l��ne (2021). Investiguer la gr��sillance. Pour une approche ethno-acoustique du timbre musical. Cahiers d'ethnomusicologie. ISSN 2235-7688. 34, p. 37�C58. Full text in Research Archive Show summary
��
Lartillot, Olivier & Bruford, Fred (2020). Bistate reduction and comparison of drum patterns, Proceedings of the 21st International Society for Music Information Retrieval (ISMIR) Conference. McGill-Queen's University Press. ISSN 9780981353708. p. 318�C324. Full text in Research Archive Show summary
This paper develops the hypothesis that symbolic drum patterns can be represented in a reduced form as a sim- ple oscillation between two states, a Low state (commonly associated with kick drum events) and a High state (often associated with either snare drum or high hat). Both an onset time and an accent time is associated to each state. The systematic inference of the reduced form is formal- ized. This enables the specification of a rhythmic struc- tural similarity measure on drum patterns, where reduced patterns are compared through alignment. The two-state representation allows a low computational cost alignment, once the complex topological formalization is fully taken into account. A comparison with the Hamming distance, as well as similarity ratings collected from listeners on a drum loop dataset, indicates that the bistate reduction enables to convey subtle aspects that goes beyond surface-level com- parison of rhythmic textures.
Bruford, Fred & Lartillot, Olivier (2020). Multidimensional similarity modelling of complex drum loops using the GrooveToolbox, Proceedings of the 21st International Society for Music Information Retrieval (ISMIR) Conference. McGill-Queen's University Press. ISSN 9780981353708. p. 263�C270. Full text in Research Archive Show summary
The GrooveToolbox is a new Python toolbox implementing various algorithms, new and pre-existing, for the analysis and comparison of symbolic drum loops, including rhythm features, similarity metrics and microtiming features. As part of the GrooveToolbox we introduce two new metrics of rhythm similarity and four features for describing the significant properties of microtiming deviations in drum loops. Based on a two-part perceptual evaluation, we show these four new microtiming features can each correlate to similarity perception, and be used with rhythm similarity metrics to improve personalized similarity models for drum loops. A new measure of structural rhythmic similarity is also shown to correlate more strongly to similarity perception of drum loops than the more com- monly used Hamming distance. These results point to the potential application of the GrooveToolbox and its new features in drum loop analysis for intelligent music production tools. The GrooveToolbox may be found at: https://github.com/fredbru/GrooveToolbox
C?mara, Guilherme Schmidt; Nymoen, Kristian; Lartillot, Olivier & Danielsen, Anne (2020). Effects of instructed timing on electric guitar and bass sound in groove performance. Journal of the Acoustical Society of America. ISSN 0001-4966. 147(2), p. 1028�C1041. doi: 10.1121/10.0000724. Full text in Research Archive Show summary
This paper reports on two experiments that investigated the expressive means through which musicians well versed in groove-based music signal the intended timing of a rhythmic event. Data were collected from 21 expert electric guitarists and 21 bassists, who were instructed to perform a simple rhythmic pattern in three different timing styles��laidback,�� on-the-beat,�� and ��pushed��in tandem with a metronome. As expected, onset and peak timing locations corresponded to the instructed timing styles for both instruments. Regarding sound, results for guitarists revealed systematic differences across participants in the duration and brightness [spectral centroid (SC)] of the guitar strokes played using these different timing styles. In general, laid-back strokes were played with a longer duration and a lower SC relative to on-the-beat and pushed strokes. Results for the bassists indicated systematic differences in intensity (sound-pressure level): pushed strokes were played with higher intensity than on-the-beat and laid-back strokes. These results lend further credence to the hypothesis that both temporal and sound-related features are important indications of the intended timing of a rhythmic event, and together these features offer deeper insight into the ways in which musicians communicate at the microrhythmic level in groove-based music.
C?mara, Guilherme Schmidt; Nymoen, Kristian; Lartillot, Olivier & Danielsen, Anne (2020). Timing Is Everything... Or Is It? Effects of Instructed Timing Style, Reference and Pattern on Drum Kit Sound in Groove-Based Performance. Music Perception. ISSN 0730-7829. 38(1), p. 1�C26. doi: 10.1525/mp.2020.38.1.1. Full text in Research Archive
Lartillot, Olivier; Cancino-Chac��n, Carlos & Brazier, Charles (2020). Real-Time Visualisation Of Fugue Played By A String Quartet. In Spagnol, Simone & Valle, Andrea (Ed.), Proceedings of the 17th Sound and Music Computing Conference. Axea sas/SMC Network. ISSN 9788894541502. p. 115�C122. Full text in Research Archive Show summary
We present a new system for real-time visualisation of music performance, focused for the moment on a fugue played by a string quartet. The basic principle is to offer a visual guide to better understand music using strategies that should be as engaging, accessible and effective as possible. The pitch curves related to the separate voices are drawn on a space whose temporal axis is normalised with respect to metrical positions, and aligned vertically with respect to their thematic and motivic classification. Aspects related to tonality are represented as well. We describe the underlying technologies we have developed and the technical setting. In particular, the rhythmical and structural representation of the piece relies on real-time polyphonic audio-to-score alignment using online dynamic time warping. The visualisation will be presented at a concert of the Danish String Quartet, performing the last piece of The Art of Fugue by Johann Sebastian Bach.
Diaz, Ximena Alarc��n; Bojorquez, Lucia Nikolaia L��pez; Lartillot, Olivier & Flamtermesky, Helga (2019). From collecting an archive to artistic practice in the intimal project lessons learned from listening to a colombian migrant women��s oral history archive. Acervo. ISSN 0102-700X. 32(3), p. 48�C63. Full text in Research Archive
Lartillot, Olivier & Grandjean, Didier (2019). Tempo and Metrical Analysis by Tracking Multiple Metrical Levels Using Autocorrelation. Applied Sciences. 9(23). doi: 10.3390/app9235121. Full text in Research Archive Show summary
We present a method for tempo estimation from audio recordings based on signal processing and peak tracking, and not depending on training on ground-truth data. First, an accentuation curve, emphasizing the temporal location and accentuation of notes, is based on a detection of bursts of energy localized in time and frequency. This enables the detection of notes in dense polyphonic texture, while ignoring spectral fluctuation produced by vibrato and tremolo. Periodicities in the accentuation curve are detected using an improved version of autocorrelation function. Hierarchical metrical structures, composed of a large set of periodicities in pairwise harmonic relationships, are tracked over time. In this way, the metrical structure can be tracked even if the rhythmical emphasis switches from one metrical level to another. This approach, compared to all the other participants to the Music Information Retrieval Evaluation eXchange (MIREX) Audio Tempo Extraction competition from 2006 to 2018, is the third best one among those that can track tempo variations. While the two best methods are based on machine learning, our method suggests a way to track tempo founded on signal processing and heuristics-based peak tracking. Moreover, the approach offers for the first time a detailed representation of the dynamic evolution of the metrical structure. The method is integrated into MIRtoolbox, a Matlab toolbox freely available.
Diaz, Ximena Alarc��n; Boj��rquez, Lucia Nikolaia Lopez; Lartillot, Olivier & Flamtermesky, Helga (2019). From collecting an archive to artistic practice in the INTIMAL project: lessons learned from listening to a Colombian migrant women��s oral history archive. Acervo. ISSN 0102-700X. 32(3), p. 48�C63. Full text in Research Archive Show summary
This paper describes a multidisciplinary encounter with oral testimony archives and their incorporation in the artistic research project Intimal. It explores ways in which to creatively listen to stories which might be emotionally challenging, and to create and sustain empathy with these disembodied voices, which contain a shared history as fissures of sixty years of violence.
Lartillot, Olivier (2019). Miningsuite: A Comprehensive Matlab Framework for Signal, Audio and Music Analysis, Articulating Audio and Symbolic Approaches. In Barbancho, Isabel; Tard��n, Lorenzo J.; Peinado, Alberto & Barbancho, Ana M. (Ed.), SMC 2019 Proceedings of the 16th Sound & Music Computing Conference. Society for Sound and Music Computing. ISSN 9788409085187. p. 489�C489. Full text in Research Archive
Lartillot, Olivier & Grandjean, Didier (2019). Tempo and Metrical Analysis by Tracking Multiple Metrical Levels Using Autocorrelation. In Barbancho, Isabel; Tard��n, Lorenzo J.; Peinado, Alberto & Barbancho, Ana M. (Ed.), SMC 2019 Proceedings of the 16th Sound & Music Computing Conference. Society for Sound and Music Computing. ISSN 9788409085187. p. 174�C181. doi: 10.3390/app9235121. Full text in Research Archive

View all works in NVA

Lartillot, Olivier (2025). Computational music analysis. Full text in Research Archive
Wosch, Thomas; Vobig, Bastian & Lartillot, Olivier (2025). Human Interaction assessment and Generative segmentation in Health & Music. doi: https:/www.youtube.com/watch?v=I4jaZIzX0wg. Full text in Research Archive Show summary
Improvisation in music therapy has been shown to be an effective technique for engaging clients in emotionally rooted (inter)action to treat affective disorders such as major depression (Aalbers et al., 2017; Erkkil? et al., 2011). During improvisation, however, a variety of musical information is exchanged, resulting in a highly complex musical and interpersonal situation. While traditional models of music therapy analysis emphasise aural analysis and assessment of single sessions (Bruscia, 1987), more recent and elaborated methods, such as microanalysis, focus on the detailed development of improvisation sessions (Wosch, 2021; Wosch & Erkkil?, 2016), which comes at the cost of a more time-consuming application process. Digital processing, as in music information retrieval and machine learning, seems promising to accelerate the analysis process, but requires considerable preliminary work in data preprocessing and formalisation of the high-level concepts used in music therapy to develop a suitable dataset for model training. Moreover, additional benefits of digital processing comprehend a more detailed and precise analysis of musical data.
Sudo, Marina; Ziegler, Michelle; Akkermann, Miriam & Lartillot, Olivier (2025). Towards Collaborative Analysis: Kaija Saariaho��s Io (1986�C87). Full text in Research Archive
Sudo, Marina & Lartillot, Olivier (2025). Contemporary Music Analysis and Auditory Memory: The Use of Computational Tools as an Aid for Listening. doi: https:/fabricadesites.fcsh.unl.pt/ncmm/ncmm-2025-program/. Full text in Research Archive Show summary
Music analysis involves categorising and interpreting sonic elements to uncover the structure and meaning of a work. In contemporary music studies, analysts often face methodological challenges in this process, especially when dealing with works that contain high degrees of complexity and ambiguity in terms of timbre, texture and temporal structure. This paper proposes a methodological model for analysing spatiotemporal complexities commonly observed in contemporary repertoires, utilising computational tools to enhance auditory memory and expand interpretative possibilities. Auditory memory plays a pivotal role in aural analysis, an approach that serves as a valuable alternative or complement to traditional score-based analysis. Rooted in Pierre Schaeffer��s typomorphology of objets sonores and the work of other analysts in electroacoustic music studies, the general principles of aural analysis can be outlined in a three-step process: 1) attentive listening to the acoustic properties of sounds, 2) describing and categorising their sonic variations, and 3) assessing their functions within a large-scale formal structure. Computational sound visualisation tools are frequently employed in this process to assist in transcribing and retaining musical events that are either absent from the score or difficult to interpret aurally due to textural complexities and/or timbral elusiveness. Despite their increasing use, however, the full potential of these tools remains largely unexplored in contemporary music studies. By digitally decomposing the transformation processes of ambiguous musical flows and supporting the organisation and structuring of auditory memory, computational analysis of audio data and various visualisation methods can deepen our understanding of both local sonic morphology and large-scale formal trajectory. In line with these considerations, the paper investigates how specialised computer interfaces can facilitate music analytical processes. Two research questions guide this investigation: 1) How can we analyse a stream of sonic textures; and 2) How can we outline the formal structure of a work that embraces extremes of sonic energy and polyrhythmic intricacy? To explore these questions, we have developed muScope, a new computer program that enables users to browse within high-resolution sonograms in tandem with a range of graphical representations capturing audio, timbral, rhythmic and structural descriptions. The analysis of spectral ��fluctuations�� allows for the identification of rapid pulsations at the middle ground between rhythm and timbre. Self-similarity matrix representations can serve as a tool for outlining the structural division of the audio data based on various sonic attributes. We integrate these visual representations into an analytical workflow designed to support the construction of a composition��s formal structure. Our methods are demonstrated through an analysis of excerpts from Kaija Saariaho��s Io for large ensemble and electronics (1986�C87) and Rapha?l Cendo��s Corps for piano and ensemble (2015). This integrated analytical approach offers new insights into the interplay between musical perception, memory and analytical interpretation using digital tools.
Christodoulou, Anna-Maria; Glette, Kyrre; Lartillot, Olivier & Jensenius, Alexander Refsum (2025). MusiQAl: Music Question Answering through Audio-Video fusion. doi: https:/ismir2025.ismir.net/program-detailed-schedule. Full text in Research Archive
Lartillot, Olivier (2025). Computational Music Analysis Applied to Music Therapy Improvisation. doi: https:/ifas.thws.de/fileadmin/user_upload/250917_HIGH-M_Symposium_Programme_updated.pdf. Full text in Research Archive
Lartillot, Olivier (2025). Computational Music Analysis: Toolbox and application to music psychology & therapy. Full text in Research Archive
Christodoulou, Anna-Maria & Lartillot, Olivier (2025). A Multimodal Dataset of Greek Folk Music. doi: https:/dlfm.web.ox.ac.uk/2025-programme. Full text in Research Archive
Lartillot, Olivier (2024). Successes and challenges of computational approaches for audio and music analysis and for predicting music-evoked emotion. Full text in Research Archive Show summary
Background Decades of research in computational sound and music analysis has led to a large range of analysis tools offering rich and diverse description of music, although a large part of the subtlety of music remains out of reach. These descriptors are used to establish computational models predicting perceived or induced emotion directly from music. Although the models can predict a significant amount of variability of emotions experimentally measured (Panda et al., 2023), further progress seems hard to achieve, probably due to the subtlety of music and of the mechanisms underlying the evocation of emotion from music. Aims An extensive but synthetic panorama of computational research in sound and music analysis as well as emotion prediction from music is presented. Core challenges are highlighted and prospective ways forward are suggested. Main contribution For each separate music dimension (dynamics, timbre, rhythm, tonality and mode, motifs, phrasing, structure and form), a synthetic panorama of the state of the art is evoked, highlighting strengths and challenges as well as indicating how particular sound and music features have been found to correlate with rated emotions. The various strategies for modelling emotional reactions to audio and musical features are presented and discussed. One common general analytical approach carries out a broad and approximate analysis of the audio recording based on simple mathematical models, describing individual audio or musical characteristics numerically. It is suggested that such loose approach might tend to drift away from commonly understood musical processes and to generate artefacts. This vindicates a more traditional musicological approach based on a focus on the score or approximations of it �C through automated transcription if necessary �C and a reconstruction of the types of traditional representations commonly studied in musicology. I also argue for the need to closely reflect the way humans listen to and understand music, inspired by a cognitive perspective. Guided by these insights, I sketch the idea of a complex system made of interdependent modules, founded on sequential pattern inference and activation scores not based on statistical sampling. I also suggest perspectives for the improvement of computational prediction of emotions evoked by music. Discussion and conclusion Further improvements of computational music analysis methods, as well as emotion prediction, seem to call for a change of modelling paradigm. References R. Panda, R. Malheiro, R. Paiva, "Audio Features for Music Emotion Recognition: A Survey", IEEE Transactions on Affective Computing, 14-1, 68-88, 2023.
Lartillot, Olivier (2024). Introduction to the MiningSuite toolbox. Full text in Research Archive
Lartillot, Olivier (2024). KI-verkt?y for h?ndtering, transkribering og analyse av musikkarkiver. Full text in Research Archive Show summary
Jeg presenterer en rekke verkt?y utviklet i 澳门葡京手机版app下载 med Nasjonalbiblioteket. AudioSegmentor deler automatisk b?ndopptak i individuelle musikkstykker. Dette verkt?yet forenklet digitaliseringen av Norsk folkemusikksamling. Vi bruker avanserte dyp l?ringsmetoder for ? skape et banebrytende automatisk musikktranskriberingssystem, MusScribe, f?rst finjustert for Hardingfele, og n? gjort tilgjengelig for musikkarkivprofesjonelle for et bredt spekter av musikk. Jeg diskuterer ogs? v?re p?g?ende fremskritt innen den automatiserte musikologiske analysen av folkemusikkstykker og omfattende samlinger.
Ziegler, Michelle; Sudo, Marina; Akkermann, Miriam & Lartillot, Olivier (2024). Towards Collaborative Analysis: Kaija Saariaho��s IO. Full text in Research Archive
Thedens, Hans-Hinrich & Lartillot, Olivier (2024). The Norwegian Catalogue of Folk Music Online. Full text in Research Archive
Monstad, Lars L?berg & Lartillot, Olivier (2024). muScribe: a new transcription service for music professionals. Full text in Research Archive
Johansson, Mats Sigvard & Lartillot, Olivier (2024). Automated transcription of Hardanger fiddle music: Tracking the beats. Full text in Research Archive
Monstad, Lars L?berg & Lartillot, Olivier (2024). Automated transcription of Hardanger fiddle music: Detecting the notes. Full text in Research Archive
Lartillot, Olivier (2024). MIRAGE Closing Seminar: Digitisation and computer-aided music analysis of folk music. Full text in Research Archive Show summary
One aim of the MIRAGE project is to conceive new technologies allowing to better access, understand and appreciate music, with a particular focus on Norwegian folk music. This seminar presents what has been achieved during the four years of the project, leading in particular to the digital version of the Norwegian Catalogue of Folk Music. We are also conceiving tools to automatically transcribe audio recordings of folk music. More advanced musicological applications are discussed as well. To conclude, we introduce the new spinoff project, called muScribe, aimed at the development of transcription services, for a broad range of music, besides folk music, in a first stage tailored to professional organisations such as archives, publishers and producers.
Lartillot, Olivier (2024). Overview of the MIRAGE project. Full text in Research Archive
Lartillot, Olivier (2024). Harmonizing Tradition with Technology: Enhancing Norwegian Folk Music through Computational Innovation. Full text in Research Archive Show summary
My work involves developing computational tools to safeguard and elevate the cultural significance of music repertoires, with a focus on a cooperative project with the National Library of Norway related to their collection of Norwegian folk music. Our first phase centered on transforming unstructured audio tapes into a systematic dataset of melodies while ensuring its access and longevity through efficient data management and linking with other catalogues. Our core activity involves transcribing audio recordings into scores, comparing the traditional manual method with our modern attempts towards automation. Providing detailed performance notation, the close alignment between scores and audio recordings will help improve comprehension and overall accessibility, as well as a more advanced structuring of the collection. Challenges arose when incorporating this music into the International Inventory of Musical Sources (RISM) database due to the incompatible 'incipit' concept, unfitting genres like Hardanger fiddle folk music. We suggest innovative generalisations for this concept. Moreover, we're creating techniques to digitally dissect the musical corpus, aiming to extract key features of each tune. This initiative not only serves as an alternative to incipits but also provides novel metadata formats, increasing the usability and connectivity within its content and with other databases.
Christodoulou, Anna-Maria; Dutta, Sagar; Lartillot, Olivier; Glette, Kyrre & Jensenius, Alexander Refsum (2024). Exploring Convolutional Neural Network Models for Multimodal Classification of Expressive Piano Performance. Full text in Research Archive
Lartillot, Olivier (2023). Towards a Comprehensive Modelling Framework for Computational Music Transcription/Analysis. Full text in Research Archive Show summary
Computational music analysis, still in its infancy, lacking overarching reliable tools, can be seen at the same time as a promising approach to fulfill core epistemo- logical needs. Analysis in the audio domain, although approaching music in its entirety, is doomed to superficiality if it does not fully embrace the underlying symbolic system, requiring a complete automated transcription and scaffolding of metrical, modal/harmonic, voicing and formal structures on top of the layers of elementary events (such as notes). Automated transcription enables to get over the polarity between sound and music notation, providing an interfacing semiotic system that combines the advantages of both domains, and surpassing the limitation of traditional approaches based on graphic representations. Deep learning and signal processing approaches for the discretisation of the continuous signal are compared and discussed. The multi-dimensional music transcription and analysis framework (where both tasks are actually deeply intertwined) requires to take into account the far-reaching interdependencies between dimensions, for instance between motivic and metrical analysis. We propose an attempt to build such a comprehensive framework, founded on general musical and cognitive principles and an attempt to build music analysis capabilities through a combina- tion of simple and general operators. The validity of the analyses is addressed in close discussion with music experts. The potential capability to produce valid analyses for a very large corpus of music would make such a complex system a potentially relevant blueprint for a cognitive modelling of music understanding. We try to address a large diversity of music cultures and their specific challenges: among others, maqam modes (with Mondher Ayari), Norwegian Hardanger fiddle rhythm (with Mats Johansson and Hans-Hinrich Thedens), djembe drumming from Mali (with Rainer Polak) or electroacoustic music (Towards a Toolbox des objets musicaux, with Rolf Inge God?y). We aim at making the framework fully transparent, collaborative and open.
Bishop, Laura; H?ffding, Simon; Lartillot, Olivier Serge Gabriel & Laeng, Bruno (2023). Mental effort and expressive interaction in expert and student string quartet performance. Full text in Research Archive
Lartillot, Olivier (2023). Computational audio and musical features extraction: from MIRtoolbox to the MiningSuite. Full text in Research Archive
Christodoulou, Anna-Maria; Lartillot, Olivier & Anagnostopoulou, Christina (2023). Computational Analysis of Greek Folk Music of the Aegean. Full text in Research Archive
Lartillot, Olivier (2023). Dynamic Visualisation of Fugue Analysis, Demonstrated in a Live Concert by the Danish String Quartet. Full text in Research Archive
Lartillot, Olivier (2023). Towards a comprehensive model for computational music transcription and analysis: a necessary dialog between machine learning and rule-based design? Full text in Research Archive
Lartillot, Olivier & Monstad, Lars L?berg (2023). MIRAGE - A Comprehensive AI-Based System for Advanced Music Analysis. Full text in Research Archive
Bishop, Laura; H?ffding, Simon; Laeng, Bruno & Lartillot, Olivier (2023). Mental effort and expressive interaction in expert and student string quartet performance. Full text in Research Archive
Lartillot, Olivier; Swarbrick, Dana; Upham, Finn & Cancino-Chac��n, Carlos Eduardo (2023). Video visualization of a string quartet performance of a Bach Fugue: Design and subjective evaluation. Full text in Research Archive
Lartillot, Olivier (2023). MIRAGE Symposium #2: Music, emotions, analysis, therapy ... and computer. Full text in Research Archive Show summary
The 2nd MIRAGE Symposium covers a broad range of topics related to the MIRAGE project, mainly related to music and emotion, music cognition in general, music analysis and music therapy. Featuring two keynotes by Patrik Juslin and Didier Grandjean.
Maidhof, Clemens; Agres, Kat; Fachner, J?rg & Lartillot, Olivier (2023). Intra- and inter-brain coupling during music therapy. Full text in Research Archive
Wosch, Thomas; Vobig, Bastian; Lartillot, Olivier & Christodoulou, Anna-Maria (2023). HIGH-M (Human Interaction assessment and Generative segmentation in Health and Music). Full text in Research Archive
Lartillot, Olivier (2023). Music Therapy Toolbox, and prospects. Full text in Research Archive
Lartillot, Olivier; Thedens, Hans-Hinrich; Mjelva, Olav Lukseng?rd; Elovsson, Anders; Monstad, Lars L?berg & Johansson, Mats Sigvard [Show all 8 contributors for this article] (2023). Norwegian Folk Music & Computational Analysis. Full text in Research Archive Show summary
As a pr��lude for Norway's Constitution Day, this special event celebrated the Norwegian folk music tradition, showcasing our new online archive and demonstrating the richness of Hardanger fiddle music, with live performance. One aim of the project is to conceive new technologies allowing to better access, understand and appreciate Norwegian folk music. In this event, we introduced a new online version of the Norwegian Folk Music Archive and discuss underlying theoretical and technical challenges. A live concert/workshop, with the participation of Olav Lukseng?rd Mjelva, offered a lively introduction to Hardanger fiddle music and its elaborate rhythm. The interests and challenges of automated transcription and analysis were discussed, with the public release of our new software Annotemus. The symposium was organised in the context of the MIRAGE project (RITMO, in collaboration with the National Library of Norway's Digital Humanities Laboratory).
Lartillot, Olivier & Monstad, Lars L?berg (2023). Computational music analysis: Significance, challenges, and our proposed approach. Full text in Research Archive Show summary
Music is something that we mostly all appreciate, yet it remains a hidden and enigmatic concept for many of us. Music notation, in the form of music scores, facilitates practicing and enhances the understanding of the richness of musical works. However, acquiring musical scores for any music performance is a tedious and demanding task (called music transcription) that demands considerable proficiency. Hence the interest of computational automation. But music is not just notes, it is also melody, rhythm, themes, timbre, and very subtle aspects such as form. While many of us may not be consciously familiar with these concepts, they still have a subconscious influence on our aesthetic experience. Interestingly, it often happens that the more we consciously understand the underlying language of music, the more we tend to appreciate and enjoy it. Therefore, there is value in creating computational tools that can automate and enhance these types of analyses. The presenters' past work resulted in the creation of Matlab's MIRtoolbox, which measures a broad range of musical characteristics directly from audio through signal processing techniques. Currently, the MIRAGE project prioritises music transcription (with a particular focus on Norwegian folk music), blending neural-network-based deep learning with conventional rule-based models. Through this project, they highlight the importance of acknowledging the interconnectedness between all musical elements. Additionally, they have crafted animated visualisations to make analyses more accessible to the general public and are aiming to make music transcription technology available to the public, with support from UiO Growth House.
Monstad, Lars L?berg & Lartillot, Olivier (2023). Automatic Transcription Of Multi-Instrumental Songs: Integrating Demixing, Harmonic Dilated Convolution, And Joint Beat Tracking. Full text in Research Archive Show summary
In the rapidly expanding field of music information retrieval (MIR), automatic transcription remains one of the most sought-after capabilities, especially for songs that employ multiple instruments. Musscribe emerges as a state-of-the-art transcription tool that addresses this challenge by integrating three distinct methodologies: demixing, harmonic dilated convolution, and joint beat tracking. Demixing is employed to isolate individual instruments within a song by separating overlapping audio sources, thus ensuring each instrument is transcribed distinctly. Beat tracking is then run as a parallel process to extract the joint beat and downbeat estimations. These processes results in an output midi file, which is then quantized using information derived from the beat tracking. As such, this method paves the way for more accurate and sophisticated analyses, bridging the gap between human and machine understanding of music. Together, these methodologies allow us to produce transcriptions that are not only accurate but also highly representative of the original compositions. Preliminary tests and evaluations showcase the potential in transcribing complex musical pieces with high fidelity, outperforming many contemporary tools in the market. This innovative approach not only has implications for music transcription but also for broader applications in audio analysis, remixing, and digital music production. The model has been instrumental in accelerating the composition process for several Norwegian television shows. Moreover, its efficacy can be observed in the Netflix series "A Storm for Christmas." Renowned composer Peter Baden harnessed this tool to enhance his workflow, proving the demand for innovative tools like this in the professional music industry.
Lartillot, Olivier; God?y, Rolf Inge & Christodoulou, Anna-Maria (2022). Computational detection and characterisation of sonic shapes: Towards a Toolbox des objets sonores. Full text in Research Archive Show summary
Computational detection and analysis of sound objects is of high importance both for musicology and sound design. Yet Music Information Retrieval technologies have so far been mostly focusing on transcription of music into notes in a classical sense whereas we are interested in detecting sound objects and their feature categories, as was suggested by Pierre Schaeffer��s typology and morphology of sound objects in 1966, reflecting basic sound-producing action types. We propose a signal-processing based approach for segmentation, based on a tracking of the salient characteristics over time, and dually Gestalt-based segmentation decisions based on changes. Tracking of pitched sound relies on partial tracking, whereas the analysis of noisy sound requires tracking of larger frequency bands possibly varying over time. The resulting sound objects are then described based on Schaeffer��s taxonomy and morphology, expressed first in the form of numerical descriptors, each related to one type of taxonomy (percussive/sustained/iterative, stable/moving pitch vs unclear pitch) or morphology (such as grain). This multidimensional feature representation is further divided into discrete categories related to the different classes of sounds. The typological and morphological categorisation is driven by the theoretical and experimental framework of the morphodynamical theory. We first experiment on isolated sounds from the Solf��ge des objets sonores��which features a large variety of sound sources��before considering more complex configurations featuring a succession of sound objects without silence or with simultaneous sound objects. Analytical results are visualised in the form of graphical representations, aimed both for musicology and music pedagogy purposes. This will be applied to the graphical descriptions of and browsing within large music catalogues. The application of the analytical descriptions to music creation is also investigated.
Lartillot, Olivier (2022). The MIRAGE project: Unlocking new computational abilities in computational music analysis. Full text in Research Archive
Lartillot, Olivier (2022). Computational music analysis: Application to music & emotion. Full text in Research Archive
Lartillot, Olivier; Elovsson, Anders; Johansson, Mats Sigvard; Thedens, Hans-Hinrich & Monstad, Lars Alfred L?berg (2022). Segmentation, Transcription, Analysis and Visualisation of the Norwegian Folk Music Archive. Full text in Research Archive Show summary
We present an ongoing project dedicated to the transmutation of a collection of field recordings of Norwegian folk music established in the 1960s into an easily accessible online catalogue augmented with advanced music technology and computer musicology tools. We focus in particular on a major highlight of this collection: Hardanger fiddle music. The studied corpus was available as a series of 600 tape recordings, each tape containing up to 2 hours of recordings, associated with metadata indicating approximate positions of pieces of music. We first need to retrieve the individual recording associated with each tune, through the combination of an automated pre-segmentation based on sound classification and audio analysis, and a subsequent manual verification and fine-tuning of the temporal positions, using a home-made user interface. Note detection is carried out by a deep learning method. To adapt the model to Hardanger fiddle music, musicians were asked to record themselves and annotate all played note, using a dedicated interface. Data augmentation techniques have been designed to accelerate the process, in particular using alignment of varied performances of same tunes. The transcription also requires the reconstruction of the metrical structure, which is particularly challenging in this style of music. We have also collected ground-truth data, and are conceiving a computational model. The next step consists in carrying out detailed music analysis of the transcriptions, in order to reveal in particular intertextuality within the corpus. A last direction of research is aimed at designing tools to visualise each tune and the whole catalogue, both for musicologists and general public.
Danielsen, Anne; C?mara, Guilherme Schmidt; Lartillot, Olivier; Leske, Sabine Liliana & Spiech, Connor (2022). Musical rhythm. Behavioural, computational and neurophysiological perspectives. Full text in Research Archive
Lartillot, Olivier & Lillesl?tten, Mari (2021). Olivier Lartillot utvikler verkt?y for ? forst? musikk bedre. [Internet]. Det humanistiske fakultet UiO YouTube account. Full text in Research Archive Show summary
Kunstig intelligens kan hjelpe deg ? forst? musikk bedre. UiO-forsker Olivier Lartillot jobber for at ny teknologi kan ?pne folks ?rer for ny musikk.
Lartillot, Olivier & Lillesl?tten, Mari (2021). Artificial intelligence can help you understand music better. [Internet]. RITMO News. Full text in Research Archive Show summary
Algorithms and technology have so far helped listeners to more of the same music. Now, UiO researchers are working on new technology that can get people interested in a greater musical variety.
Elovsson, Anders & Lartillot, Olivier (2021). A Hardanger Fiddle Dataset with Performances Spanning Emotional Expressions and Annotations Aligned using Image Registration. Full text in Research Archive Show summary
This paper presents a Hardanger fiddle dataset ��HF1�� with polyphonic performances spanning five different emotional expressions: normal, angry, sad, happy, and tender. The performances thus cover the four quadrants of the activity/valence-space. The onsets and offsets, together with an associated pitch, were human-annotated for each note in each performance by the fiddle players themselves. First, they annotated the normal version. These annotations were then transferred to the expressive performances using music alignment and finally human-verified. Two separate music alignment methods based on image registration were developed for this purpose; a B-spline implementation that produces a continuous temporal transformation curve and a Demons algorithm that produces displacement matrices for time and pitch that also account for local timing variations across the pitch range. Both methods start from an ��Onsetgram�� of onset salience across pitch and time and perform the alignment task accurately. Various settings of the Demons algorithm were further evaluated in an ablation study. The final dataset is around 43 minutes long and consists of 19 734 notes of Hardanger fiddle music, recorded in stereo. The dataset and source code are available online. The dataset will be used in MIR research for tasks involving polyphonic transcription, score alignment, beat tracking, downbeat tracking, tempo estimation, and classification of emotional expressions.
Lartillot, Olivier & Weisser, St��phanie (2021). Roughness, Crackliness, Buzzingness, ...: Characterizations of Sonic Unsteadiness and Application to the Analysis of Traditional Music from Ethiopia, Kenya, Morocco and India. Full text in Research Archive
Tidemann, Aleksander & Lartillot, Olivier (2021). Interactive tools for exploring performance patterns in hardanger fiddle music. Full text in Research Archive
Lartillot, Olivier; Guldbrandsen, Erling Eliseus & Cancino-Chac��n, Carlos Eduardo (2021). Dynamics analysis, and application to a comparative study of Bruckner performances. Full text in Research Archive
Lartillot, Olivier (2021). Presentation of MIRAGE project. Full text in Research Archive
Lartillot, Olivier & Johansson, Mats Sigvard (2021). Tracking beats in Hardanger fiddle tunes. Full text in Research Archive
Lartillot, Olivier; Elovsson, Anders & Mjelva, Olav Lukseng?rd (2021). A new software for computer-assisted annotation of music recordings, with a focus on transcription. Full text in Research Archive
God?y, Rolf Inge & Lartillot, Olivier (2021). Acoustic substrates of musique concr��te features: Towards a Toolbox de l'objet musical? Full text in Research Archive
Tidemann, Aleksander; Lartillot, Olivier & Johansson, Mats Sigvard (2021). Towards New Analysis And Visualization Software For Studying Performance Patterns in Hardanger Fiddle Music. Full text in Research Archive Show summary
Analyzing musical performances is a challenging and emergent field of computational music research, aiming to reveal performance patterns and link them to musical contexts. There exists a modest amount of computational research on Hardanger fiddle performances. The MIRAGE research project is currently contributing to this scientific body, developing advanced MIR frameworks that build on recent musicological research. This paper presents the development and evaluation of two Max/MSP/Jitter software applications for music analysis and data visualization that integrate contemporary research perspectives on the complex rhythmical structuring of springar performances, investigating how we can design user-friendly computational tools that explore performance patterns in Hardanger fiddle music, in collaboration with MIRAGE. Based on a small questionnaire and a few operational tests, the study shows an interest in more effective software tools capable of revealing complex interrelations between musical dimensions in Hardanger fiddle performances. Additionally, the study highlights design considerations for tools aiming to increase the availability of computational music research in the field of musicology, such as cross-compatibility and integrated features that actively facilitate nuanced interpretation processes.
Dalgard, Joachim; Lartillot, Olivier; Vuoskoski, Jonna Katariina & Guldbrandsen, Erling Eliseus (2021). Absorption - Somewhere between the heart and the brain. Full text in Research Archive
Lartillot, Olivier & Toiviainen, Petri (2020). Read about the Matlab MIRtoolbox. [Journal]. Young Acousticians Network (YAN) Newsletter. Full text in Research Archive Show summary
MIRtoolbox is a Matlab toolbox dedicated to the analysis of music and sound from audio recordings and to the extraction of musical features such as tonality, rhythm, or structures. It has also been used for non- musical applications, such as in Non Destructive Testing, and with non-audio signals. In this issue of the newsletter, the YAN discusses the MIRtoolbox with Olivier Lartillot (RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Norway) and Petri Toiviainen (University of Jyv?skyl?, Finland) You can also check out the MIRtoolbox website at: shorturl.at/oA038
Lartillot, Olivier & Toiviainen, Petri (2020). Read about the Matlab MIRtoolbox. [Internet]. Young Acousticians Network Newsletter. Full text in Research Archive Show summary
MIRtoolbox is a Matlab toolbox dedicated to the analysis of music and sound from audio recordings and to the extraction of musical features such as tonality, rhythm, or structures. It has also been used for non- musical applications, such as in Non Destructive Testing, and with non-audio signals. In this issue of the newsletter, the YAN discusses the MIRtoolbox with Olivier Lartillot (RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Norway) and Petri Toiviainen (University of Jyv?skyl?, Finland) You can also check out the MIRtoolbox website at: shorturl.at/oA038
Bruford, Fred & Lartillot, Olivier (2020). Multidimensional similarity modelling of complex drum loops using the GrooveToolbox. Full text in Research Archive Show summary
The GrooveToolbox is a new Python toolbox implementing various algorithms, new and pre-existing, for the analysis and comparison of symbolic drum loops, including rhythm features, similarity metrics and microtiming features. As part of the GrooveToolbox we introduce two new metrics of rhythm similarity and four features for describing the significant properties of microtiming deviations in drum loops. Based on a two-part perceptual evaluation, we show these four new microtiming features can each correlate to similarity perception, and be used with rhythm similarity metrics to improve personalized similarity models for drum loops. A new measure of structural rhythmic similarity is also shown to correlate more strongly to similarity perception of drum loops than the more com- monly used Hamming distance. These results point to the potential application of the GrooveToolbox and its new features in drum loop analysis for intelligent music production tools. The GrooveToolbox may be found at: https://github.com/fredbru/GrooveToolbox
Lartillot, Olivier & Bruford, Fred (2020). Bistate reduction and comparison of drum patterns. Full text in Research Archive Show summary
This paper develops the hypothesis that symbolic drum patterns can be represented in a reduced form as a sim- ple oscillation between two states, a Low state (commonly associated with kick drum events) and a High state (often associated with either snare drum or high hat). Both an onset time and an accent time is associated to each state. The systematic inference of the reduced form is formal- ized. This enables the specification of a rhythmic struc- tural similarity measure on drum patterns, where reduced patterns are compared through alignment. The two-state representation allows a low computational cost alignment, once the complex topological formalization is fully taken into account. A comparison with the Hamming distance, as well as similarity ratings collected from listeners on a drum loop dataset, indicates that the bistate reduction enables to convey subtle aspects that goes beyond surface-level com- parison of rhythmic textures.
Lartillot, Olivier; Cancino-Chac��n, Carlos & Brazier, Charles (2020). Real-Time Visualisation Of Fugue Played By A String Quartet. Full text in Research Archive Show summary
We present a new system for real-time visualisation of music performance, focused for the moment on a fugue played by a string quartet. The basic principle is to offer a visual guide to better understand music using strategies that should be as engaging, accessible and effective as possible. The pitch curves related to the separate voices are drawn on a space whose temporal axis is normalised with respect to metrical positions, and aligned vertically with respect to their thematic and motivic classification. Aspects related to tonality are represented as well. We describe the underlying technologies we have developed and the technical setting. In particular, the rhythmical and structural representation of the piece relies on real-time polyphonic audio-to-score alignment using online dynamic time warping. The visualisation will be presented at a concert of the Danish String Quartet, performing the last piece of The Art of Fugue by Johann Sebastian Bach.
Lartillot, Olivier (2019). The MIRAGE project. Full text in Research Archive
Haugen, Mari Romarheim; Johansson, Mats Sigvard & Lartillot, Olivier (2019). Investigating rhythm production and perception in traditional scandinavian dance music in non-isochronous meter: A case study of norwegian telespringar. Full text in Research Archive
Lartillot, Olivier (2019). Computational analysis of tempo and metre: from signal processing to cognitive musicology. Full text in Research Archive Show summary
Computational models for the analysis of tempo, metre and the tracking of beats have made significant progress during the last decades. I first present a synthetic overview of the state of the art. Up to recently, classical approaches were based on signal processing, with the integration of heuristics based on assumptions related to music perception and cognition. The standard approach is to first detect percussive events through the establishment of an accentuation curve, followed by periodicity detection, and the construction and tracking of meter. Because rhythmic emphasis can develop on various metrical levels across time, it is necessary to track the metrical structure on multiple levels. I show the benefit of such detailed analysis with the use of a model I have developed, and which obtained one of the highest grades in the MIREX tempo estimation competition. New approaches based on deep learning have achieved impressive progress and have largely surpassed signal-processing-based approached (including mine) in the recent yearly editions of MIREX. One limitation of these approaches, at least in their current stages, is that they appear as black boxes able to imitate a particular behaviour for which they were trained on particular examples. As such, they hardly offer insight on the cognitive mechanisms underlying the perception of metre. I will discuss the limitations of signal processing approaches and highlight the complexity of the musical structure. Pulsation in music is not always expressed through a periodic repetition of percussive events, but may emerge from a subtle propagation of motivic or harmonic structures. I present an approach under development that models the different components of music analysis and combine them altogether, extending further Lerdahl and Jackendoff��s vision. Motivic repetition, which plays a core role, is also one of the dimensions that is the most difficult to model and automate.
S?rb?, Solveig Isis; Bentham, John; Watson, Pia; Lartillot, Olivier; Sanchez, Victor Evaristo Gonzalez & Gonz��lez, Mar��a Isabel (2019). Aether Trouble. Full text in Research Archive Show summary
Music video for "Aether Trouble" by PYSJ. With: Pia Watson V��ctor Gonz��lez (RITMO*) Torgeir Koppang (PYSJ) Solveig S?rb? (PYSJ) Cinematography: John Bentham Camera Operators: Morten Malerstuen (camera and additional cinematography) Kristoffer Haugen Lighting: Nicholas Blakstad Andresen Morten Malerstuen Breath data collection (using FLOW* by SweetZpot): V��ctor Gonz��lez, RITMO* Visualization of breath data and audio: MIRAGE* and MIRtoolbox by Olivier Lartillot, RITMO* Editing: Mar��a Isabel Gonz��lez Story / Concept: Solveig S?rb? Olivier Lartillot Pia Watson Directed by: John Bentham Solveig S?rb? Consultant: Nicholas Blakstad Andresen Produced by: Solveig S?rb? Extras: The dogs Vips and Willy Special thanks: Sagar Sen Na��n Mendoza Fonseca Turid Svens?y SweetZpot RITMO* Workaway.info Filter Musikk Thanks to F21 for letting us use their studio MUSIC: Find the audio track on your plattform of choice: https://fanlink.to/aether Find the audio track on your plattform of choice: https://fanlink.to/aether Aether Trouble by PYSJ Written by: Solveig S?rb? Performed by PYSJ Solveig S?rb? Torgeir Koppang Andreas R?dland Haga Stig Frogner Mixing: Stig Frogner Mastering: Bj?rn Engelmann / The Cutting Room Production: Solveig S?rb? * RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo. This work was partially supported by the Research Council of Norway through its Centres of Excellence scheme, project number 262762. * Learn more about the visualization technologies on the MIRAGE project website: http://bit.ly/MirageProject * Data was collected using FLOW? sensors from SweetZpot. For more information about this technology see: https://www.sweetzpot.com/flow
Lartillot, Olivier (2019). A comprehensive framework for computational music analysis. Full text in Research Archive Show summary
During this presentation, Dr. Olivier Lartillot will give an overview of MIRtoolbox - a Matlab application enabling the extraction of a large range of audio and musical descriptions from recordings. MIRtoolbox is designed to be easy to use both for teaching at any level and for advanced research in musicology, signal analysis and music cognition. One initial aim of MIRtoolbox was to study the relationship between musical features and emotions evoked by music. MIRtoolbox focuses on signal-processing-based approaches that offer limited understanding of music. Lartillot is also developing computational methods for the analysis of notated music, starting from motivic analysis and aiming at building a comprehensive framework where audio and score are combined together.
Lartillot, Olivier & Grandjean, Didier (2019). Tempo and Metrical Analysis by Tracking Multiple Metrical Levels Using Autocorrelation. Full text in Research Archive
Lartillot, Olivier (2019). Miningsuite: A Comprehensive Matlab Framework for Signal, Audio and Music Analysis, Articulating Audio and Symbolic Approaches. Full text in Research Archive
C?mara, Guilherme Schmidt; Nymoen, Kristian; Lartillot, Olivier & Danielsen, Anne (2019). Timing is Everything... Or is it? Part I: Effects of Instructed Timing and Reference on Guitar and Bass Sound in Groove Performance. Full text in Research Archive
Lartillot, Olivier (2018). Computational sound/music/gesture analysis and application to gesture-based query in music catalogue. Full text in Research Archive Show summary
In the first part of this talk, I will give a short and broad overview of the MiningSuite, a Matlab toolbox that combines sound, music and gesture analysis. I will give a quick tour of the various types of sound and music analyses that can be carried out using the toolbox, covering a large range of musical dimensions such as timbre, rhythm, harmony or structure. The MiningSuite, integrating previous toolboxes such as MIRtoolbox and MIDI toolbox, can be used both for the analysis of audio recordings and of ��symbolic�� representations such as MIDI files. I will also present the current integration of motion capture and gesture analysis (from the MoCap Toolbox), as well as other sensor data such as breathing. The benefit of articulating these different types of analyses into a single framework will be demonstrated. In the second part, I will present a project aimed at automatically extracting melodic gestures from a catalogue of folk music recordings from the National Library of Norway. While melodic lines can be easily extracted from a cappella songs, the task is more challenging for other types of music such as Hardanger fiddle music. In such cases, we need to automatically transcribe the recordings and track melodic voices throughout the counterpoint of each composition. I will also present an iPhone app that enables to draw a gesture in the air with the phone and to find pieces of music from the catalogue that is characterised by a similar musical gesture.
Lartillot, Olivier & Sudo, Marina (2025). AcousMuScope: Users' Guide. Universitetet i Oslo. doi: https:/www.uio.no/ritmo/english/projects/mirage/software/AcousMuScope/index.html. Full text in Research Archive Show summary
AcousMuScope is a new software for music analysis of audio recordings, focusing on the graphical interface to browse into the analyses.
Joachimiak, Grzegorz; Ahrendt, Rebekah & Lartillot, Olivier (2024). Endangered Musical Sources: Strategies for Safeguarding, Digitization, and International Collaboration. Report of Working Group 2 SOURCES, Wroc?aw, 22�C24 May 2024. Zenodo. Full text in Research Archive
Christodoulou, Anna-Maria; Anagnostopoulou, Christina & Lartillot, Olivier (2022). Computational Analysis of Greek folk music of the Aegean islands. National and Kapodistrian University of Athens. Full text in Research Archive

View all works in NVA

Published Aug. 9, 2018 11:06 AM - Last modified Nov. 20, 2025 11:02 AM

Olivier Lartillot

Background

Other links

Publications

Projects

Olivier Lartillot

Background

Other links

Publications

Projects

Completed projects