Publications
2024. Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training. Proc. of IEEE Tencon, Singapore.
2024. Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder. Proc. of IEEE Tencon, Singapore.
2020. Acoustic prediction of flowrate: varying liquid jet stream onto a free surface. IEEE International Conference on Signal Processing and Communications (SPCOM). preprint flow.pdf (1.01 MB)
2021. [title missing].
2024. Are we there yet? A brief survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges. arXiv:2406.08809. 2406.08809v1.pdf (156.19 KB)
2020. Asthmatic versus healthy child classification based on cough and vocalised /a:/ sounds. The Journal of the Acoustical Society of America (JASA). 148, EL253.
2021. AttendAffectNet – Emotion Prediction of Movie Viewers Using Multimodal Fusion with Self-attention. Sensors. Special issue on Intelligent Sensors: Sensor Based Multi-Modal Emotion Recognition. sensors-21-08356.pdf (1.03 MB)
2021. AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies. Proceedings of the International Conference on Pattern Recognition (ICPR2020). 2010.11188.pdf (7.07 MB)
2024. BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features. arXiv:2407.10462. 2407.10462v1.pdf (2.3 MB)
2018. Blacklisted speaker identification using triplet neural networks. MCE2018 competition. SUTD_description.pdf (133.08 KB)
2015. Classification and generation of composer-specific music using global feature models and variable neighborhood search. Computer Music Journal. 39(3):91. papercmj-dh_preprint.pdf (637.63 KB)
2024. Coarse-to-Fine Text-to-Music Latent Diffusion. Audio Imagination: NeurIPS 2024 Workshop.
2015. Compose ≡ compute. 4OR. 13:335–336.
2014. Compose=Compute - Computer Generation And Classification Of Music Through Operations Research Methods. PhD Thesis, University of Antwerp. :250.
2015. Composer Classification Models for Music-Theory Building. Computational Music Analysis. Chapter_HerremansEtAl_preprint.pdf (475.26 KB)
2012. Composing counterpoint musical scores with variable neighborhood search. Annual Conference of the Belgian Operational Research Society (ORBEL26). orbel26abs_vnsforcp.pdf (116.85 KB)
2013. Composing Fifth Species Counterpoint Music With A Variable Neighborhood Search Algorithm. Expert Systems with Applications. 40. paper_preprint_cp5.pdf (405.75 KB)
2012. [title missing].
2012. Composing first species counterpoint musical scores with a variable neighbourhood search algorithm. Journal of Mathematics and the Arts. 6:169-189.
2022. Computationally Efficient Physics Approximating Neural Networks for Highly Nonlinear Maps. 2022 International Conference on Research in Adaptive and Convergent Systems.
2022. Conditional Drums Generation using Compound Word Representations. EvoMUSART (EVO*) - Lecture Notes in Computer Science. 2202.04464.pdf (525.36 KB)
2023. Constructing Time-Series Momentum Portfolios with Deep Multi-Task Learning. Expert Systems with Applications. 230(120587). 2306.13661.pdf (707.95 KB)
2014. Dance hit song prediction. Journal of New Music Research. 43:302. wp_hit.pdf (689.07 KB)
2013. Dance Hit Song Science. International Workshop on Music and Machine Learning. abstract_preprint_MML2013_DH.pdf (194.82 KB)
2024. DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech. Audio Imagination: NeurIPS 2024 Workshop.
2020. [title missing].
2020. A dataset and classification model for Malay, Hindi, Tamil and Chinese music. 13th Workshop on music and machine learning (MML) as part of ECML/PKDD. 2009.04459.pdf (234.8 KB)
2021. Deep Neural Network Based Respiratory Pathology Classification Using Cough Sounds. Sensors. 21(16):5555. 2106.12174.pdf (6.52 MB)
2024. DeepUnifiedMom: Unified Time-series Momentum Portfolio Construction via Multi-Task Learning with Multi-Gate Mixture of Experts. arXiv:2406.08742. 2406.08742v1.pdf (1.06 MB)
2019. Development of Machine Learning for asthmatic and healthy voluntary cough - a proof of concept study. Applied Sciences. 9(14). applsci-09-02833.pdf (2.06 MB)
2023. DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability. ICASSP. diffroll.pdf (2.2 MB)
2024. DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage. Proc. of IEEE Tencon, Singapore.
2023. A Domain-Knowledge-Inspired Music Embedding Space and a Novel Attention Mechanism for Symbolic Music Modeling. Proceedings of the 37th AAAI Conference on Artificial Intelligence. 2212.00973.pdf (1.74 MB)
2019. Doppler Invariant Demodulation for Shallow Water Acoustic Communications Using Deep Belief Networks. 16th IEEE Asia Pacific Wireless Communications Symposium (APWCS). 1909.02850.pdf (790.54 KB)
2022. Downscaling using Deep Convolutional Autoencoders, a case study for South East Asia. Egusphere preprint. egusphere-2022-234.pdf (8.99 MB)
2010. [title missing].
2015. The effect of repetitive structure on enjoyment and altered states in uplifting trance music. 2nd International Conference on Music and Consciousness (MUSCON 2), Brighton. AgresEtAl_muscon.pdf (12.47 KB)
2016. The Effect of Repetitive Structure on Enjoyment in Uplifting Trance Music. 14th International Conference for Music Perception and Cognition (ICMPC). :280-282. preprint_trance.pdf (139.27 KB)
2021. The Effect of Spectrogram Reconstructions on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy. Proceedings of the International Conference on Pattern Recognition (ICPR2020). 2010.09969.pdf (3.46 MB)
2019. The emergence of deep learning: new opportunities for music and audio technologies. Neural Computing and Applications. main_preprint.pdf (102.16 KB)
2022. EmoMV: Affective Music-Video Correspondence Learning Datasets for Classification and Retrieval. Information Fusion. SSRN-id4189323.pdf (2.01 MB)
2021. Evaluating the Effectiveness of an Augmented Reality Game Promoting Environmental Action. Sustainability. 13(24):13912. sustainability-13-13912.pdf (16.23 MB)
2013. First species counterpoint generation with VNS and vertical viewpoints. Digital Music Research Network (DMRN+8). dnmr8_dh_dc.pdf (147.73 KB)
2014. First species counterpoint generation with VNS and vertical viewpoints. Annual Conference of the Belgian Operational Research Society (ORBEL28). orbel28_dh.pdf (216.63 KB)
2022. Forecasting Bitcoin Volatility Spikes from Whale Transactions and Cryptoquant Data Using Synthesizer Transformer Models. SSRN. SSRN-id4247684.pdf (5.05 MB)
2018. From Context to Concept: Exploring Semantic Relationships in Music with Word2Vec. Neural Computing and Applications. paper.pdf (1.64 MB)
2017. A Functional Taxonomy of Music Generation Systems. ACM Computing Surveys. 50(5):30. music_generation_survey_dh_preprint.pdf (349.15 KB)
2013. FuX, an Android app that generates counterpoint. IEEE Symposium on Computational Intelligence for Creativity and Affective Computing (CICAC). :48-55. wp_fux.pdf (486.27 KB)
2024. Gamification and skills tree. Trends and Foresight Report on Cyber-Physical Learning.
2022. A Gaussian mixture classifier model to differentiate respiratory symptoms using phonated /ɑː/ sounds. The 18th Australasian International Conference on Speech Science and Technology (SST). ahsounds.pdf (1018.01 KB)
2015. Generating Fingerings for Polyphonic Piano Music with a Tabu Search Algorithm. Mathematics and Computation in Music. 9110:149-160. paper_mcm_preprint.pdf (405.73 KB)
2017. Generating guitar solos by integer programming. Journal of the Operational Research Society. :971-985. preprint_guitar_solo_generation_dh.pdf (772.59 KB)
2021. Generating Lead Sheets with Affect: A Novel Conditional seq2seq Framework. Proceedings of the International Joint Conference on Neural Networks (IJCNN). 2104.13056.pdf (857.78 KB)
2015. Generating music with an optimization algorithm using a Markov based objective function. ORBEL29, Belgian Conference on Operations Research. orbel29abs.pdf (138.67 KB)
2015. Generating structured music for bagana using quality metrics based on Markov models. Expert Systems With Applications. 42(21):7424–7435. paper-bagana.pdf (1.73 MB)
2014. [title missing].
2020. Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance. Workshop on Machine Learning for Music Discovery (ML4MD) as part of ICML. 2006.09833.pdf (2.81 MB)
2017. Harmonic Structure Predicts the Enjoyment of Uplifting Trance Music. Frontiers in Psychology, Cognitive Science. 7(1999). agres16ut.pdf (1.15 MB)
2022. HEAR 2021: Holistic Evaluation of Audio Representations. Proceedings of Machine Learning Research (PMLR): NeurIPS 2021 Competition Track. 2203.03022.pdf (406.58 KB)
2021. Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure. Proceedings of the International Joint Conference on Neural Networks (IJCNN). 2102.09794.pdf (1015.73 KB)
2017. Hit Song Prediction Based on Early Adopter Data and Audio Features. The 18th International Society for Music Information Retrieval Conference (ISMIR) - Late Breaking Demo. paper_preprint_hit.pdf (221.73 KB)
2019. A Hybrid Fuzzy Logic-Neural Network Approach For Multi-path Separation Of Underwater Acoustic Signals. 89th IEEE Vehicular Technology Conference. fuzzy logic.pdf (1.66 MB)
2017. IMMA-Emo: A Multimodal Interface for Visualising Score- and Audio-synchronised Emotion Annotations. Audio Mostly. IMMA-emo_preprint.pdf (1.4 MB)
2020. The impact of Audio input representations on neural network based music transcription. Proceedings of the International Joint Conference on Neural Networks (IJCNN). 2001.09989.pdf (1.87 MB)
2019. The impact of musical structure on enjoyment and absorptive listening states in trance music. Music and Consciousness 2 - Worlds, Practices, Modalities.
2022. [title missing].
2019. Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2019). 1910.01463.pdf (934.76 KB)
2023. Learning accent representation with multi-level VAE towards controllable speech synthesis. IEEE Spoken Language Technology (SLT) Workshop.
2019. [title missing].
2014. [title missing].
2022. A Machine Learning Approach for MIDI to Guitar Tablature Conversion. Sound and Music Computing Conference (SMC). 25.pdf (528.42 KB)
2019. Machine Learning Research that Matters for Music Creation: A Case Study. Journal of New Music Research. 48(1):36-55. concert_paper_preprint.pdf (1.6 MB)
2014. Markov Based Quality Metrics For Generating Structured Music With Optimization Techniques. Digital Music Research Network (DMRN+9). dmrn9_dh.pdf (133.29 KB)
2023. MERP: A Music Dataset with Emotion Ratings and Raters’ Profile Information. Sensors - Intelligent Sensors. 23(1). sensors-23-00382 (2).pdf (1.21 MB)
2019. Midi Miner – A Python library for tonal tension and track classification. ISMIR - Late Breaking Demo. midi_miner.pdf (83.7 KB)
2024. MidiCaps: A large-scale MIDI dataset with text captions. ISMIR. 2406.02255v1.pdf (699.83 KB)
2018. Minimally Simple Binaural Room Modelling Using a Single Feedback Delay Network. Journal of the Audio Engineering Society. 66(10):791-807. angus_jaes_preprint.pdf (6.39 MB)
2024. MIRFLEX: Music Information Retrieval Feature Library for Extraction. ISMIR, Late Breaking Demos. 2411.00469v1.pdf (89.86 KB)
2017. Modeling Musical Context with Word2vec. First International Workshop On Deep Learning and Music. 1:11-18. herremans2017work2vec.pdf (745.8 KB)
2018. Modeling temporal tonal relations in polyphonic music through deep networks with a novel image-based representation. The Thirty-Second AAAI Conference on Artificial Intelligence. preprint_lstm.pdf (741.28 KB)
2024. Modern Portfolio Construction with Advanced Deep Learning Models. SUTD. PhD Joel_Ong_Thesis.pdf (3.44 MB)
2016. MorpheuS: Automatic music generation with recurrent pattern constraints and tension profiles. IEEE TENCON. paper_morpheus_dh_ieee.pdf (550.61 KB)
2016. MorpheuS: constraining structure in automatic music generation. Dagstuhl seminar on Computational Music Structure Analysis. abstract_dagstuhl_dh.pdf (88.49 KB)
2017. MorpheuS: generating structured music with constrained patterns and tension. IEEE Transactions on Affective Computing. PP (In Press)(99). herremans2017morpheusFullIEEE.pdf (5.71 MB)
2019. Multimodal Deep Models for Predicting Affective Responses Evoked by Movies. The 2nd International Workshop on Computer Vision for Physiological Measurement as part of ICCV. Seoul, South Korea. 2019. 1909.06957.pdf (836.3 KB)
2023. A Multimodal Model with Twitter Finbert Embeddings for Extreme Price Movement Prediction of Bitcoin. Expert Systems with Applications. 2206.00648.pdf (3.26 MB)
2017. A multi-modal platform for semantic music analysis: visualizing audio- and score-based tension. 11th International Conference on Semantic Computing IEEE ICSC 2017. paper_preprint.pdf (1.63 MB)
2022. MusIAC: An extensible generative framework for Music Infilling Application with multi-level Control. EvoMUSART. 2202.05528.pdf (893.23 KB)
2017. Music and Motion-Detection: A Game Prototype for Rehabilitation and Strengthening in the Elderly. IEEE International Conference on Orange Technologies (ICOT). agres_herr_music_rehab_preprint.pdf (1.77 MB)
2021. Music, Computing, and Health: A roadmap for the current and future roles of music technology for healthcare and well-being. Music & Science. Preprint for OSF_Agres, Schaefer, Volk, et al. (2021)_Music & Science_watermark.pdf (4.07 MB)
2020. Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling. ISMIR. 2007.15474.pdf (2.67 MB)
2016. Music generation with structural constraints: an operations research approach. 30th Annual Conference of the Belgian Operational Research (OR) Society (ORBEL30). :37-39. orbel30_dh.pdf (117.78 KB)
2021. Musical stylometry: Characterisation of music. Multivariate Humanities.
2024. Mustango: Toward Controllable Text-to-Music Generation. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). pages 8293–8316. 2311.08355 (1).pdf (11.38 MB)
2025. Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey. ACM Computing Surveys. 2402.17467.pdf (1.01 MB)
2019. nnAudio: A PyTorch Audio Processing Tool Using 1D Convolution neural networks. ISMIR - Late Breaking Demo. nnAudio.pdf (399.08 KB)
2020. nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks. IEEE Access. nnAudio.pdf (10.2 MB)
2018. A Novel Interface for the Graphical Analysis of Music Practice Behaviours. Frontiers in Psychology - Human-Media Interaction. 9. practice_browser.pdf (4.9 MB)
2019. A novel music-based game with motion capture to support cognitive and motor function in the elderly. IEEE Conference on Games. preprint.pdf (2.6 MB)
2018. O.R. and music generation. OR/MS Today. 45(1). O.R. and music generation - INFORMS.pdf (825.66 KB)