Publications
2020. nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks. IEEE Access. nnAudio.pdf (10.2 MB)
2021. Revisiting the Onsets and Frames Model with Additive Attention. Proceedings of the International Joint Conference on Neural Networks (IJCNN). 2104.06607.pdf (1.52 MB)
2019. nnAudio: A PyTorch Audio Processing Tool Using 1D Convolution neural networks. ISMIR - Late Breaking Demo. nnAudio.pdf (399.08 KB)
2021. The Effect of Spectrogram Reconstructions on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy. Proceedings of the International Conference on Pattern Recognition (ICPR2020). 2010.09969.pdf (3.46 MB)
2019. Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2019). 1910.01463.pdf (934.76 KB)
2023. DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability. ICASSP. diffroll.pdf (2.2 MB)
2020. The impact of Audio input representations on neural network based music transcription. Proceedings of the International Joint Conference on Neural Networks (IJCNN). 2001.09989.pdf (1.87 MB)
2018. Blacklisted speaker identification using triplet neural networks. MCE2018 competition. SUTD_description.pdf (133.08 KB)
2022. Jointist: Joint Learning for Multi-instrument Transcription and Its Applications. 2206.10805.pdf (427.51 KB)
2020. Regression-based music emotion prediction using triplet neural networks. Proceedings of the International Joint Conference on Neural Networks (IJCNN). 2001.09988.pdf (777.31 KB)