Neural FCA

This package provides neural blind source separation methods, namely neural full-rank spatial covariance analysis (neural FCA) [Bando2021] and neural fast FCA (neural FastFCA) [Bando2023].

Tasks

AviTask(encoder, decoder, scm_estimator, ...)

PyTorch Lightning implementation of the AVI task of neural FCA [Bando2021].

JdAviTask(encoder, decoder, n_fft, ...)

Neural FastFCA task [Bando2023] implemented in PyTorch Lightning.

Encoders

DilcnvEncoder(n_fft, n_mic, n_src, dim_latent)

Encoder module using dilated depthwise separable convolutions for multichannel spectrograms.

UNetEncoder(n_fft, n_mic, n_src, dim_latent)

Encoder module using a 1D U-Net architecture for multichannel spectrograms.

JdUNetEncoder(n_fft, n_mic, n_src, ...[, ...])

U-Net-based encoder for multichannel audio representation learning.

Decoders

ResDecoder(n_fft, dim_latent[, io_ch, n_layers])

Decoder module that transforms latent representations into magnitude spectrogram estimates.

ResLinearDecoder(n_fft, dim_latent[, io_ch, ...])

Decoder module that reconstructs a magnitude spectrogram from a latent representation.

Lightning Callbacks

PsdVisualizerCallback()

Callback to visualize intermediate tensors during model validation.

XtVisualizerCallback()

Callback to visualize the time-frequency energy (Xt) of the model output during validation.

References

[Bando2021] (1,2)

Yoshiaki Bando, Kouhei Sekiguchi, Yoshiki Masuyama, Aditya Arie Nugraha, Mathieu Fontaine, and Kazuyoshi Yoshii, “Neural full-rank spatial covariance analysis for blind source separation,” IEEE Signal Processing Letters, vol. 28, pp. 1670-1674, 2021. [PDF]

[Bando2023] (1,2)

Yoshiaki Bando, Yoshiki Masuyama, Aditya Arie Nugraha, and Kazuyoshi Yoshii, “Neural fast full-rank spatial covariance analysis for blind source separation,” in Proc. 31st European Signal Processing Conference (EUSIPCO), pp. 51-55, 2023. [PDF]