EP2159717A3 - Hybrid audio-visual categorization system and method - Google Patents
Hybrid audio-visual categorization system and method Download PDFInfo
- Publication number
- EP2159717A3 EP2159717A3 EP09175042A EP09175042A EP2159717A3 EP 2159717 A3 EP2159717 A3 EP 2159717A3 EP 09175042 A EP09175042 A EP 09175042A EP 09175042 A EP09175042 A EP 09175042A EP 2159717 A3 EP2159717 A3 EP 2159717A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- tags
- categorization system
- hybrid audio
- visual categorization
- meta
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/63—Querying
- G06F16/632—Query formulation
- G06F16/634—Query by example, e.g. query by humming
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7834—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using audio features
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/075—Musical metadata derived from musical analysis or for use in electrophonic musical instruments
- G10H2240/081—Genre classification, i.e. descriptive metadata for classification or selection of musical pieces according to style
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP06300310A EP1840764A1 (en) | 2006-03-30 | 2006-03-30 | Hybrid audio-visual categorization system and method |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP06300310.7 Division | 2006-03-30 |
Publications (2)
Publication Number | Publication Date |
---|---|
EP2159717A2 EP2159717A2 (en) | 2010-03-03 |
EP2159717A3 true EP2159717A3 (en) | 2010-03-17 |
Family
ID=36782325
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP09175042A Withdrawn EP2159717A3 (en) | 2006-03-30 | 2006-03-30 | Hybrid audio-visual categorization system and method |
EP06300310A Ceased EP1840764A1 (en) | 2006-03-30 | 2006-03-30 | Hybrid audio-visual categorization system and method |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP06300310A Ceased EP1840764A1 (en) | 2006-03-30 | 2006-03-30 | Hybrid audio-visual categorization system and method |
Country Status (3)
Country | Link |
---|---|
US (2) | US8392414B2 (en) |
EP (2) | EP2159717A3 (en) |
JP (1) | JP5340554B2 (en) |
Families Citing this family (45)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2159717A3 (en) | 2006-03-30 | 2010-03-17 | Sony France S.A. | Hybrid audio-visual categorization system and method |
JP4799333B2 (en) * | 2006-09-14 | 2011-10-26 | シャープ株式会社 | Music classification method, music classification apparatus, and computer program |
US8656391B2 (en) * | 2007-06-22 | 2014-02-18 | International Business Machines Corporation | System and method for initiating the execution of a process |
US8180757B2 (en) * | 2007-12-28 | 2012-05-15 | International Business Machines Corporation | System and method for leveraging tag context |
US8682819B2 (en) * | 2008-06-19 | 2014-03-25 | Microsoft Corporation | Machine-based learning for automatically categorizing data on per-user basis |
CN101599271B (en) * | 2009-07-07 | 2011-09-14 | 华中科技大学 | Recognition method of digital music emotion |
US8190647B1 (en) * | 2009-09-15 | 2012-05-29 | Symantec Corporation | Decision tree induction that is sensitive to attribute computational complexity |
US8533134B1 (en) * | 2009-11-17 | 2013-09-10 | Google Inc. | Graph-based fusion for video classification |
US8452778B1 (en) | 2009-11-19 | 2013-05-28 | Google Inc. | Training of adapted classifiers for video categorization |
WO2011089276A1 (en) * | 2010-01-19 | 2011-07-28 | Vicomtech-Visual Interaction And Communication Technologies Center | Method and system for analysing multimedia files |
US8583647B2 (en) * | 2010-01-29 | 2013-11-12 | Panasonic Corporation | Data processing device for automatically classifying a plurality of images into predetermined categories |
CN102782678B (en) | 2010-02-01 | 2016-03-16 | 谷歌公司 | What associate for item combines embedding |
CN102193946A (en) * | 2010-03-18 | 2011-09-21 | 株式会社理光 | Method and system for adding tags into media file |
US8904472B2 (en) * | 2010-11-12 | 2014-12-02 | Riaz Ahmed SHAIKH | Validation of consistency and completeness of access control policy sets |
US9087297B1 (en) | 2010-12-17 | 2015-07-21 | Google Inc. | Accurate video concept recognition via classifier combination |
JP5633424B2 (en) * | 2011-02-23 | 2014-12-03 | 富士ゼロックス株式会社 | Program and information processing system |
US8856051B1 (en) | 2011-04-08 | 2014-10-07 | Google Inc. | Augmenting metadata of digital objects |
US10031932B2 (en) * | 2011-11-25 | 2018-07-24 | International Business Machines Corporation | Extending tags for information resources |
US9582767B2 (en) * | 2012-05-16 | 2017-02-28 | Excalibur Ip, Llc | Media recommendation using internet media stream modeling |
US9761277B2 (en) | 2012-11-01 | 2017-09-12 | Sony Corporation | Playback state control by position change detection |
US10643027B2 (en) * | 2013-03-12 | 2020-05-05 | Microsoft Technology Licensing, Llc | Customizing a common taxonomy with views and applying it to behavioral targeting |
US10623480B2 (en) | 2013-03-14 | 2020-04-14 | Aperture Investments, Llc | Music categorization using rhythm, texture and pitch |
US10242097B2 (en) * | 2013-03-14 | 2019-03-26 | Aperture Investments, Llc | Music selection and organization using rhythm, texture and pitch |
US10061476B2 (en) | 2013-03-14 | 2018-08-28 | Aperture Investments, Llc | Systems and methods for identifying, searching, organizing, selecting and distributing content based on mood |
US11271993B2 (en) | 2013-03-14 | 2022-03-08 | Aperture Investments, Llc | Streaming music categorization using rhythm, texture and pitch |
US10225328B2 (en) | 2013-03-14 | 2019-03-05 | Aperture Investments, Llc | Music selection and organization using audio fingerprints |
US20220147562A1 (en) | 2014-03-27 | 2022-05-12 | Aperture Investments, Llc | Music streaming, playlist creation and streaming architecture |
US20150294233A1 (en) * | 2014-04-10 | 2015-10-15 | Derek W. Aultman | Systems and methods for automatic metadata tagging and cataloging of optimal actionable intelligence |
US10372791B2 (en) * | 2014-10-08 | 2019-08-06 | Staples, Inc. | Content customization |
US20160162464A1 (en) * | 2014-12-09 | 2016-06-09 | Idibon, Inc. | Techniques for combining human and machine learning in natural language processing |
US10381022B1 (en) | 2015-12-23 | 2019-08-13 | Google Llc | Audio classifier |
US11354510B2 (en) | 2016-12-01 | 2022-06-07 | Spotify Ab | System and method for semantic analysis of song lyrics in a media content environment |
US10360260B2 (en) * | 2016-12-01 | 2019-07-23 | Spotify Ab | System and method for semantic analysis of song lyrics in a media content environment |
CN107357515A (en) * | 2017-06-29 | 2017-11-17 | 深圳天珑无线科技有限公司 | The method and its system that multiple utility program picture is presented simultaneously |
CN110998726B (en) * | 2017-06-29 | 2021-09-17 | 杜比国际公司 | Method, system, and computer-readable medium for adapting external content to a video stream |
CN108281138B (en) * | 2017-12-18 | 2020-03-31 | 百度在线网络技术(北京)有限公司 | Age discrimination model training and intelligent voice interaction method, equipment and storage medium |
KR102660124B1 (en) * | 2018-03-08 | 2024-04-23 | 한국전자통신연구원 | Method for generating data for learning emotion in video, method for determining emotion in video, and apparatus using the methods |
CN108762852A (en) * | 2018-06-10 | 2018-11-06 | 北京酷我科技有限公司 | A kind of implementation method of interception Audio Controls and lyrics control linkage effect |
CN109859770A (en) * | 2019-01-04 | 2019-06-07 | 平安科技(深圳)有限公司 | Music separation method, device and computer readable storage medium |
US11270077B2 (en) * | 2019-05-13 | 2022-03-08 | International Business Machines Corporation | Routing text classifications within a cross-domain conversational service |
CN114402389A (en) * | 2019-09-27 | 2022-04-26 | 雅马哈株式会社 | Sound analysis method, sound analysis device, and program |
CN113743117B (en) * | 2020-05-29 | 2024-04-09 | 华为技术有限公司 | Method and device for entity labeling |
CN113288452B (en) * | 2021-04-23 | 2022-10-04 | 北京大学 | Operation quality detection method and device |
CN113220737B (en) * | 2021-05-28 | 2023-06-20 | 平安科技(深圳)有限公司 | Data recommendation method and device, electronic equipment and storage medium |
CN113434731B (en) * | 2021-06-30 | 2024-01-19 | 平安科技(深圳)有限公司 | Music video genre classification method, device, computer equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020140843A1 (en) * | 2001-04-03 | 2002-10-03 | Tretter Daniel R. | Camera meta-data for content categorization |
US20040168118A1 (en) * | 2003-02-24 | 2004-08-26 | Wong Curtis G. | Interactive media frame display |
WO2004081814A1 (en) * | 2003-03-14 | 2004-09-23 | Eastman Kodak Company | Method for the automatic identification of entities in a digital image |
US20050262051A1 (en) * | 2004-05-13 | 2005-11-24 | International Business Machines Corporation | Method and system for propagating annotations using pattern matching |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH02292670A (en) * | 1989-05-02 | 1990-12-04 | Nippon Telegr & Teleph Corp <Ntt> | Additional information generation processing method |
JPH06215089A (en) * | 1993-01-12 | 1994-08-05 | Matsushita Electric Ind Co Ltd | Color image file managing device |
US6360234B2 (en) * | 1997-08-14 | 2002-03-19 | Virage, Inc. | Video cataloger system with synchronized encoders |
US6593936B1 (en) * | 1999-02-01 | 2003-07-15 | At&T Corp. | Synthetic audiovisual description scheme, method and system for MPEG-7 |
US6847980B1 (en) | 1999-07-03 | 2005-01-25 | Ana B. Benitez | Fundamental entity-relationship models for the generic audio visual data signal description |
GB9918611D0 (en) * | 1999-08-07 | 1999-10-13 | Sibelius Software Ltd | Music database searching |
US7548565B2 (en) * | 2000-07-24 | 2009-06-16 | Vmark, Inc. | Method and apparatus for fast metadata generation, delivery and access for live broadcast program |
US20020183984A1 (en) * | 2001-06-05 | 2002-12-05 | Yining Deng | Modular intelligent multimedia analysis system |
US8214741B2 (en) * | 2002-03-19 | 2012-07-03 | Sharp Laboratories Of America, Inc. | Synchronization of video and data |
US7548847B2 (en) * | 2002-05-10 | 2009-06-16 | Microsoft Corporation | System for automatically annotating training data for a natural language understanding system |
US7555165B2 (en) * | 2003-11-13 | 2009-06-30 | Eastman Kodak Company | Method for semantic scene classification using camera metadata and content-based cues |
US7925669B2 (en) * | 2004-10-13 | 2011-04-12 | Sony Corporation | Method and apparatus for audio/video attribute and relationship storage and retrieval for efficient composition |
US7831913B2 (en) * | 2005-07-29 | 2010-11-09 | Microsoft Corporation | Selection-based item tagging |
US20070078714A1 (en) * | 2005-09-30 | 2007-04-05 | Yahoo! Inc. | Automatically matching advertisements to media files |
EP2159717A3 (en) | 2006-03-30 | 2010-03-17 | Sony France S.A. | Hybrid audio-visual categorization system and method |
-
2006
- 2006-03-30 EP EP09175042A patent/EP2159717A3/en not_active Withdrawn
- 2006-03-30 EP EP06300310A patent/EP1840764A1/en not_active Ceased
-
2007
- 2007-03-23 US US11/690,553 patent/US8392414B2/en not_active Expired - Fee Related
- 2007-03-30 JP JP2007092399A patent/JP5340554B2/en not_active Expired - Fee Related
-
2009
- 2009-11-20 US US12/591,506 patent/US8321414B2/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020140843A1 (en) * | 2001-04-03 | 2002-10-03 | Tretter Daniel R. | Camera meta-data for content categorization |
US20040168118A1 (en) * | 2003-02-24 | 2004-08-26 | Wong Curtis G. | Interactive media frame display |
WO2004081814A1 (en) * | 2003-03-14 | 2004-09-23 | Eastman Kodak Company | Method for the automatic identification of entities in a digital image |
US20050262051A1 (en) * | 2004-05-13 | 2005-11-24 | International Business Machines Corporation | Method and system for propagating annotations using pattern matching |
Non-Patent Citations (2)
Title |
---|
MARC DAVIS ET AL: "From Context to Content: Levaraging Context to Infer Media Metadata", PROCEEDINGS OF THE 12TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, NEW YORK, NY, USA, 10 October 2004 (2004-10-10), ACM, pages 1 - 8, XP002374239, ISBN: 1-58113-893-8 * |
SARVAS R ET AL.: "Metadata creation system for mobile images", PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS, APPLICATIONS, AND SERVICES MOBISYS'04, BOSTON, USA, 6 June 2004 (2004-06-06), ACM, pages 36 - 48, XP002393963, ISBN: 1-58113-793-1 * |
Also Published As
Publication number | Publication date |
---|---|
US20080040362A1 (en) | 2008-02-14 |
JP2007317168A (en) | 2007-12-06 |
US8392414B2 (en) | 2013-03-05 |
EP1840764A1 (en) | 2007-10-03 |
EP2159717A2 (en) | 2010-03-03 |
US20100125539A1 (en) | 2010-05-20 |
US8321414B2 (en) | 2012-11-27 |
JP5340554B2 (en) | 2013-11-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2159717A3 (en) | Hybrid audio-visual categorization system and method | |
EP1953690A3 (en) | Method and system for business process management | |
WO2010078972A3 (en) | Method and arrangement for handling non-textual information | |
WO2009158135A3 (en) | Statistical approach to large-scale image annotation | |
EP1818840A3 (en) | Method and apparatus for merging data objects | |
EP3010219A3 (en) | Method and apparatus for managing images using a voice tag | |
EP1768063B8 (en) | Image checking device, image checking method, and image checking program | |
EP1914657A3 (en) | Authentication system, authentication-service-providing device, authentication-service-providing method, and program | |
WO2008045144A3 (en) | Gesture recognition method and apparatus | |
EP2261823A3 (en) | Fast and efficient nonlinear classifier generated from a trained linear classifier | |
EP3091535A3 (en) | Multi-modal input on an electronic device | |
EP2450808A3 (en) | Semantic visual search engine | |
EP1968040A3 (en) | Methods and systems for surround-specific display modeling | |
EP2846226A3 (en) | Method and system for providing haptic effects based on information complementary to multimedia content | |
EP2079234A3 (en) | Video searching apparatus, editing apparatus, video searching method, and program | |
MX2012011748A (en) | System and method for providing customer support on a user interface. | |
WO2008002356A3 (en) | System and method for using image data in connection with configuring a universal controlling device | |
WO2006099626A3 (en) | System and method for providing interactive feature selection for training a document classification system | |
EP2793218A3 (en) | Image processing device and image processing method | |
EP2521051A3 (en) | Handheld electronic device and method for recording multimedia clip | |
EP1985981A3 (en) | Information processing apparatus and method | |
EP1965316A3 (en) | Storage of multiple, related time-series data streams | |
WO2009036356A3 (en) | Dual cross-media relevance model for image annotation | |
EP1953664A3 (en) | Apparatus for detecting intrusion code and method using the same | |
EP2513858A4 (en) | Context information utilizing systems, apparatus and methods |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
17P | Request for examination filed |
Effective date: 20091104 |
|
AC | Divisional application: reference to earlier application |
Ref document number: 1840764 Country of ref document: EP Kind code of ref document: P |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE FR GB |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): DE FR GB |
|
17Q | First examination report despatched |
Effective date: 20101008 |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: SONY EUROPE LIMITED |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20130319 |