August 13th, 2024

A C/C++ library for audio and music analysis

audioFlux is a deep learning library for audio analysis, featuring new pitch algorithms in version 0.1.8. It supports Python 3.6+, with modules for transformations, features, and music information retrieval.

Read original articleLink Icon
CuriositySkepticismInterest
A C/C++ library for audio and music analysis

audioFlux is a deep learning library designed for audio and music analysis, feature extraction, and various audio-related tasks. It supports multiple time-frequency analysis methods and is suitable for applications such as classification, separation, music information retrieval (MIR), and automatic speech recognition (ASR). The library emphasizes performance through C implementations and hardware acceleration for FFT, making it effective for large-scale data processing. The latest version, v0.1.8, introduces several pitch algorithms, including YIN, CEP, PEF, NCF, HPS, LHS, STFT, and FFP, along with new algorithms for pitch shifting and time stretching. audioFlux is compatible with Python 3.6 and above and can be installed via PyPI or Anaconda. The main modules include Transform, which offers various time-frequency representation algorithms; Feature, which provides spectral features and cepstrum coefficients; and MIR, which focuses on pitch and onset detection. Comprehensive documentation is available, and contributions are encouraged. The project is licensed under the MIT License.

- audioFlux is a deep learning library for audio analysis and feature extraction.

- Version 0.1.8 includes new pitch algorithms and features for audio manipulation.

- The library can be installed via PyPI or Anaconda for Python 3.6+.

- Main modules cover time-frequency transformations, spectral features, and MIR tasks.

- Contributions to the project are welcome, and it is licensed under the MIT License.

AI: What people are saying
The comments on the audioFlux article reflect various inquiries and suggestions regarding the library's features and comparisons with other tools.
  • Users are interested in comparisons with existing C++ music information retrieval libraries.
  • There are questions about the necessity of GPU-accelerated functions for deep learning applications.
  • Some commenters clarify the programming languages used, noting that it is C and Python, not C++.
  • There is curiosity about the library's capabilities, such as audio fingerprinting and feature parity with other libraries like librosa.
  • One user humorously inquires about a version in Rust, highlighting interest in language compatibility.
Link Icon 11 comments
By @jcelerier - 6 months
It would be nice to have a comparison with any of the many C++ MIR (music information retrieval) libraries in the wild:

- https://essentia.upf.edu/

- https://github.com/marsyas/marsyas

- https://github.com/ircam-ismm/pipo

- https://github.com/flucoma/flucoma-core/tree/main/include/al...

By @dsego - 6 months
It's also for Python, I just discovered it a few days ago. This is the website https://audioflux.top/
By @bravura - 6 months
If this is supposed to be used for deep-learning, shouldn't all the transforms be GPU-accelerated torch functions?
By @nesarkvechnep - 6 months
What's this C/C++ language?
By @dekken_ - 6 months
It's C and Python, not C++
By @BrannonKing - 6 months
So are they going for feature parity with librosa? I think that would be great.
By @gosub100 - 6 months
Can this be used for audio fingerprinting?
By @zombot - 6 months
How can a Python library support iOS?
By @morning-coffee - 6 months
Do you have one in safe Rust? See, we've only just met, and I don't know how you handle your ptr/len arguments in C just yet. ;)