Meseguer-Brocal et al., 2020 - Google Patents

Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes.

Meseguer-Brocal et al., 2020

Document ID: 15781288772109196169
Author: Meseguer-Brocal G; Cohen-Hadria A; Peeters G
Publication year: 2020
Publication venue: Trans. Int. Soc. Music. Inf. Retr.

External Links

Cited by

Snippet

The DALI dataset is a large dataset of time-aligned symbolic vocal melody notations (notes) and lyrics at four levels of granularity. DALI contains 5358 songs in its first version and 7756 for the second one. In this article, we present the dataset, explain the developed tools to …

Continue reading at pdfs.semanticscholar.org (PDF) (other versions)

230000001360 synchronised 0 title description 12

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30743—Audio data retrieval using features automatically derived from the audio content, e.g. descriptors, fingerprints, signatures, MEP-cepstral coefficients, musical score, tempo
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H1/00—Details of electrophonic musical instruments
- G10H1/36—Accompaniment arrangements
- G10H1/38—Chord
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
- G10H2210/061—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for extraction of musical phrases, isolation of musically relevant segments, e.g. musical thumbnail generation, or for temporal structure analysis of a musical piece, e.g. determination of the movement sequence of a musical work
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H1/00—Details of electrophonic musical instruments
- G10H1/0008—Associated control or indicating means
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/19—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification

Similar Documents

Publication	Publication Date	Title
Meseguer-Brocal et al.	2019	Dali: A large dataset of synchronized audio, lyrics and notes, automatically created using teacher-student machine learning paradigm
Civit et al.	2022	A systematic review of artificial intelligence-based music generation: Scope, applications, and future trends
Meseguer-Brocal et al.	2020	Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes.
Murthy et al.	2018	Content-based music information retrieval (cb-mir) and its applications toward the music industry: A review
Lu et al.	2021	SpecTNT: A time-frequency transformer for music audio
Humphrey et al.	2018	An introduction to signal processing for singing-voice analysis: High notes in the effort to automate the understanding of vocals in music
Weiß et al.	2021	Schubert Winterreise dataset: A multimodal scenario for music analysis
Pardo et al.	2004	Name that tune: A pilot study in finding a melody from a sung query
Muller et al.	2012	A robust fitness measure for capturing repetitions in music recordings with applications to audio thumbnailing
Román et al.	2018	An End-to-end Framework for Audio-to-Score Music Transcription on Monophonic Excerpts.
Zhang et al.	2023	ATEPP: A dataset of automatically transcribed expressive piano performance
Baggi et al.	2013	Music navigation with symbols and layers: Toward content browsing with IEEE 1599 XML encoding
Sargent et al.	2016	Estimating the structural segmentation of popular music pieces under regularity constraints
Ewert et al.	2012	Towards cross-version harmonic analysis of music
Wilmering et al.	2012	High-level semantic metadata for the control of multitrack adaptive digital audio effects
Bittner et al.	2017	Pitch contours as a mid-level representation for music informatics
Zhang et al.	2023	Symbolic music representations for classification tasks: A systematic evaluation
Hung et al.	2022	A large TV dataset for speech and music activity detection
Ma et al.	2024	Foundation models for music: A survey
Lerch	2021	Audio content analysis
Edwards et al.	2023	PiJAMA: Piano Jazz with Automatic MIDI Annotations
Giraud et al.	2016	Computational analysis of musical form
Le et al.	2024	Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey
Van Balen	2016	Audio description and corpus analysis of popular music
Kroher et al.	2017	Discovery of repeated melodic phrases in folk singing recordings