Pitch Spelling Jazz Lead Sheets, Solo Transcriptions, Classical Piano and Monophonic Scores

· Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, quick

Summary

An algorithm for pitch spelling and key estimation is presented, designed to convert MIDI-like input (semitones, bar boundaries) into appropriate note names, a global Key Signature, and local scales for each bar. This process involves two optimization stages: a "modal" stage proposes a probable scale per bar using a shortest-path search to minimize accidentals, followed by a "tonal" stage that estimates the overall Key Signature and note names for optimal musical notation. Evaluations were conducted across diverse datasets, including jazz lead sheets from the Real Book, jazz solo/bass line transcriptions, traditional tunes, and classical piano/monophonic scores. Originally developed for music transcription, particularly for building digital jazz solo collections for analysis, teaching, and cultural preservation, the method also defines new distances between jazz scales, relevant for musicological studies.

Key takeaway

For music transcribers and developers of music notation software, this algorithm offers a robust approach to generating accurate and musically intuitive scores. By employing a two-stage optimization process, it effectively handles diverse musical styles from jazz to classical. You should consider integrating this method to enhance the quality of digital music collections, particularly for complex jazz improvisations, and explore its defined jazz scale distances for advanced musicological analysis.

Key insights

A two-stage optimization algorithm jointly estimates pitch spelling and key signature from MIDI-like input for musical notation.

Principles

Method

The "modal" stage proposes a probable scale per bar via shortest-path search, minimizing accidentals. The "tonal" stage then estimates the global Key Signature and note names for optimal notation across the entire piece.

In practice

Topics

Best for: AI Scientist, Research Scientist, Software Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.