Meseguer-Brocal et al., 2020 - Google Patents
Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes.Meseguer-Brocal et al., 2020
View PDF- Document ID
- 15781288772109196169
- Author
- Meseguer-Brocal G
- Cohen-Hadria A
- Peeters G
- Publication year
- Publication venue
- Trans. Int. Soc. Music. Inf. Retr.
External Links
Snippet
The DALI dataset is a large dataset of time-aligned symbolic vocal melody notations (notes) and lyrics at four levels of granularity. DALI contains 5358 songs in its first version and 7756 for the second one. In this article, we present the dataset, explain the developed tools to …
- 230000001360 synchronised 0 title description 12
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
- G06F17/30743—Audio data retrieval using features automatically derived from the audio content, e.g. descriptors, fingerprints, signatures, MEP-cepstral coefficients, musical score, tempo
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H1/00—Details of electrophonic musical instruments
- G10H1/36—Accompaniment arrangements
- G10H1/38—Chord
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
- G10H2210/061—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for extraction of musical phrases, isolation of musically relevant segments, e.g. musical thumbnail generation, or for temporal structure analysis of a musical piece, e.g. determination of the movement sequence of a musical work
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H1/00—Details of electrophonic musical instruments
- G10H1/0008—Associated control or indicating means
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/19—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Meseguer-Brocal et al. | Dali: A large dataset of synchronized audio, lyrics and notes, automatically created using teacher-student machine learning paradigm | |
Civit et al. | A systematic review of artificial intelligence-based music generation: Scope, applications, and future trends | |
Meseguer-Brocal et al. | Creating DALI, a Large Dataset of Synchronized Audio, Lyrics, and Notes. | |
Murthy et al. | Content-based music information retrieval (cb-mir) and its applications toward the music industry: A review | |
Lu et al. | SpecTNT: A time-frequency transformer for music audio | |
Humphrey et al. | An introduction to signal processing for singing-voice analysis: High notes in the effort to automate the understanding of vocals in music | |
Weiß et al. | Schubert Winterreise dataset: A multimodal scenario for music analysis | |
Pardo et al. | Name that tune: A pilot study in finding a melody from a sung query | |
Muller et al. | A robust fitness measure for capturing repetitions in music recordings with applications to audio thumbnailing | |
Román et al. | An End-to-end Framework for Audio-to-Score Music Transcription on Monophonic Excerpts. | |
Zhang et al. | ATEPP: A dataset of automatically transcribed expressive piano performance | |
Baggi et al. | Music navigation with symbols and layers: Toward content browsing with IEEE 1599 XML encoding | |
Sargent et al. | Estimating the structural segmentation of popular music pieces under regularity constraints | |
Ewert et al. | Towards cross-version harmonic analysis of music | |
Wilmering et al. | High-level semantic metadata for the control of multitrack adaptive digital audio effects | |
Bittner et al. | Pitch contours as a mid-level representation for music informatics | |
Zhang et al. | Symbolic music representations for classification tasks: A systematic evaluation | |
Hung et al. | A large TV dataset for speech and music activity detection | |
Ma et al. | Foundation models for music: A survey | |
Lerch | Audio content analysis | |
Edwards et al. | PiJAMA: Piano Jazz with Automatic MIDI Annotations | |
Giraud et al. | Computational analysis of musical form | |
Le et al. | Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey | |
Van Balen | Audio description and corpus analysis of popular music | |
Kroher et al. | Discovery of repeated melodic phrases in folk singing recordings |