MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

TitleMelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection
Publication TypeJournal Article
Year of Publication2025
AuthorsLu T., Geist C-M, Melechovsky J., Roy A., Herremans D.
JournalarXiv:2505.20979
Abstract

We propose MelodySim, a melody-aware music similarity model and dataset for plagiarism detection. First, we introduce a novel method to construct a dataset with focus on melodic similarity. By augmenting Slakh2100; an existing MIDI dataset, we generate variations of each piece while preserving the melody through modifications such as note splitting, arpeggiation, minor track dropout (excluding bass), and re-instrumentation. A user study confirms that positive pairs indeed contain similar melodies, with other musical tracks significantly changed. Second, we develop a segment-wise melodic-similarity detection model that uses a MERT encoder and applies a triplet neural network to capture melodic similarity. The resultant decision matrix highlights where plagiarism might occur. Our model achieves high accuracy on the MelodySim test set.

URLhttps://arxiv.org/abs/2505.20979