[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

audeering/opensmile-python

Repository files navigation

openSMILE Python

Test status code coverage opensmile's documentation opensmile's supported Python versions opensmile's audEERING license

Python interface for extracting openSMILE features.

$ pip install opensmile

Note

Only 64-bit Python is supported.

Feature sets

Currently, three standard sets are supported. ComParE 2016 is the largest with more than 6k features. The smaller sets GeMAPS and eGeMAPS come in variants v01a, v01b and v02 (only eGeMAPS). We suggest to use the latest version unless backward compatibility with the original papers is desired.

Each feature set can be extracted on two levels:

  • Low-level descriptors (LDD)
  • Functionals

For ComParE 2016 a third level is available:

  • LLD deltas

Note

Pre v2.0.0 some LLDs of the GeMAPS family were incorrectly output as deltas. This was corrected with v2.0.0 and these features are now correctly returned as LLDs. Note that with v2.0.0 deltas are no longer available for the GeMAPS family.

The following table lists the number of features for each set and level.

With v2.0.0

Name #features
ComParE_2016 65 / 65 / 6373
GeMAPSv01a 18 / - / 62
GeMAPSv01b 18 / - / 62
eGeMAPSv01a 23 / - / 88
eGeMAPSv01b 23 / - / 88
eGeMAPSv02 25 / - / 88

Note

Additional feature sets have been added by the community. For a full list please see the documentation of opensmile.FeatureSet.

Pre v2.0.0

Name #features
ComParE_2016 65 / 65 / 6373
GeMAPSv01a 5 / 13 / 62
GeMAPSv01b 5 / 13 / 62
eGeMAPSv01a 10 / 13 / 88
eGeMAPSv01b 10 / 13 / 88

Code example

Code example, that extracts ComParE 2016 functionals from an audio file:

import opensmile

smile = opensmile.Smile(
    feature_set=opensmile.FeatureSet.ComParE_2016,
    feature_level=opensmile.FeatureLevel.Functionals,
)
y = smile.process_file('audio.wav')

License

openSMILE follows a dual-licensing model. Since the main goal of the project is a widespread use of the software to facilitate research in the field of machine learning from audio-visual signals, the source code and binaries are freely available for private, research, and educational use under an open-source license (see LICENSE). It is not allowed to use the open-source version of openSMILE for any sort of commercial product. Fundamental research in companies, for example, is permitted, but if a product is the result of the research, we require you to buy a commercial development license. Contact us at info@audeering.com (or visit us at https://www.audeering.com) for more information.

Original authors: Florian Eyben, Felix Weninger, Martin Wöllmer, Björn Schuller

Copyright © 2008-2013, Institute for Human-Machine Communication, Technische Universität München, Germany

Copyright © 2013-2015, audEERING UG (haftungsbeschränkt)

Copyright © 2016-2020, audEERING GmbH

Citing

Please cite openSMILE in your publications by citing the following paper:

Florian Eyben, Martin Wöllmer, Björn Schuller: "openSMILE - The Munich Versatile and Fast Open-Source Audio Feature Extractor", Proc. ACM Multimedia (MM), ACM, Florence, Italy, ISBN 978-1-60558-933-6, pp. 1459-1462, 25.-29.10.2010.