Publications
A Multimodal Model with Twitter FinBERT Embeddings for Extreme Price Movement Prediction of Bitcoin. Expert Systems with Applications.
2206.00648.pdf (3.26 MB)
.
2023. 
Evaluating the Effectiveness of an Augmented Reality Game Promoting Environmental Action. Sustainability. 13(24):13912.
sustainability-13-13912.pdf (16.23 MB)
.
2021. 
DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage. Proc. of IEEE Tencon, Singapore.
.
2024.
HEAR 2021: Holistic Evaluation of Audio Representations. Proceedings of Machine Learning Research (PMLR): NeurIPS 2021 Competition Track.
2203.03022.pdf (406.58 KB)
.
2022. 
Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling. ISMIR.
2007.15474.pdf (2.67 MB)
.
2020. 
Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance. Workshop on Machine Learning for Music Discovery (ML4MD) as part of ICML.
2006.09833.pdf (2.81 MB)
.
2020. 
Deep Neural Network Based Respiratory Pathology Classification Using Cough Sounds. Sensors. 21(16):5555.
2106.12174.pdf (6.52 MB)
.
2021. 
Machine Learning Research that Matters for Music Creation: A Case Study. Journal of New Music Research. 48(1):36-55.
concert_paper_preprint.pdf (1.6 MB)
.
2019. 
A Novel Interface for the Graphical Analysis of Music Practice Behaviours. Frontiers in Psychology - Human-Media Interaction. 9
practice_browser.pdf (4.9 MB)
.
2018. 
Towards the future of education: cyber-physical learning. Discover Education. 4:1–16.
.
2025.
A white paper on cyberphysical learning. White paper, Singapore University of Technology and Design.
LSL_WhitePaper_Cyber-physical-Campus-Higher-Education.pdf (6.98 MB)
.
2022. 
AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies. Proceedings of the International Conference on Pattern Recognition (ICPR2020).
2010.11188.pdf (7.07 MB)
.
2021. 
Multimodal Deep Models for Predicting Affective Responses Evoked by Movies. The 2nd International Workshop on Computer Vision for Physiological Measurement as part of ICCV. Seoul, South Korea. 2019.
1909.06957.pdf (836.3 KB)
.
2019. 
AttendAffectNet – Emotion Prediction of Movie Viewers Using Multimodal Fusion with Self-attention. Sensors. Special issue on Intelligent Sensors: Sensor Based Multi-Modal Emotion Recognition.
sensors-21-08356.pdf (1.03 MB)
.
2021. 
EmoMV: Affective Music-Video Correspondence Learning Datasets for Classification and Retrieval. Information Fusion.
SSRN-id4189323.pdf (2.01 MB)
.
2022.
DeepUnifiedMom: Unified Time-series Momentum Portfolio Construction via Multi-Task Learning with Multi-Gate Mixture of Experts. arXiv:2406.08742.
2406.08742v1.pdf (1.06 MB)
.
2024. 
Modern Portfolio Construction with Advanced Deep Learning Models. SUTD. PhD
Joel_Ong_Thesis.pdf (3.44 MB)
.
2024. 
Constructing Time-Series Momentum Portfolios with Deep Multi-Task Learning. Expert Systems with Applications. 230(120587)
2306.13661.pdf (707.95 KB)
.
2023. 
A dataset and classification model for Malay, Hindi, Tamil and Chinese music. 13th Workshop on music and machine learning (MML) as part of ECML/PKDD.
2009.04459.pdf (234.8 KB)
.
2020. 
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech. Audio Imagination: NeurIPS 2024 Workshop.
.
2024.
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training. Proc. of IEEE Tencon, Singapore.
.
2024.
Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder. Proc. of IEEE Tencon, Singapore.
.
2024.
MidiCaps: A large-scale MIDI dataset with text captions. ISMIR.
2406.02255v1.pdf (699.83 KB)
.
2024. 
Learning accent representation with multi-level VAE towards controllable speech synthesis. IEEE Spoken Language Technology (SLT) Workshop.
.
2023.
Mustango: Toward Controllable Text-to-Music Generation. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). pages 8293–8316.
2311.08355 (1).pdf (11.38 MB)
.
2024. 
Conditional Drums Generation using Compound Word Representations. EvoMUSART (EVO*) - Lecture Notes in Computer Science.
2202.04464.pdf (525.36 KB)
.
2022. 
Generating Lead Sheets with Affect: A Novel Conditional seq2seq Framework. Proceedings of the International Joint Conference on Neural Networks (IJCNN).
2104.13056.pdf (857.78 KB)
.
2021. 
Singing voice conversion with disentangled representations of singer and vocal technique using variational autoencoders. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP).
1912.02613.pdf (2.9 MB)
.
2020. 
BandControlNet: Parallel Transformers-based Steerable Popular Music Generation with Fine-Grained Spatiotemporal Features. arXiv:2407.10462.
2407.10462v1.pdf (2.3 MB)
.
2024. 
Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders. ISMIR.
jyun-ismir.pdf (5.62 MB)
.
2019. 
Unsupervised disentanglement of pitch and timbre for isolated musical instrument sounds. Proceedings of the International Society of Music Information Retrieval (ISMIR).
.
2020.
Singing Voice Separation Using a Deep Convolutional Neural Network Trained by Ideal Binary Mask and Cross Entropy. Neural Computing and Applications.
main.pdf (2.59 MB)
.
2018. 
Downscaling using Deep Convolutional Autoencoders, a case study for South East Asia. EGUsphere preprint.
egusphere-2022-234.pdf (8.99 MB)
.
2022. 
Underwater Acoustic Communication Receiver Using Deep Belief Network. IEEE Transactions on Communications. :1-1.
2102.13397.pdf (12.87 MB)
.
2021. 
Doppler Invariant Demodulation for Shallow Water Acoustic Communications Using Deep Belief Networks. 16th IEEE Asia Pacific Wireless Communications Symposium (APWCS).
1909.02850.pdf (790.54 KB)
.
2019. 
A Hybrid Fuzzy Logic-Neural Network Approach For Multi-path Separation Of Underwater Acoustic Signals. 89th IEEE Vehicular Technology Conference.
fuzzy logic.pdf (1.66 MB)
.
2019. 
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey. ACM Computing Surveys.
2402.17467.pdf (1.01 MB)
.
2025. 
Coarse-to-Fine Text-to-Music Latent Diffusion. Audio Imagination: NeurIPS 2024 Workshop.
.
2024.
SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech. Proc. of IEEE Tencon, Singapore.
2211.07283.pdf (435.22 KB)
.
2024. 
PRESENT: Zero-Shot Text-to-Prosody Control. IEEE Signal Processing Letters.
2408.06827v1.pdf (367.55 KB)
.
2025. 
Understanding Audio Features via Trainable Basis Functions. arXiv preprint.
2204.11437.pdf (7.36 MB)
.
2022. 
Musical stylometry: Characterisation of music. Multivariate Humanities.
.
2021.
MERP: A Music Dataset with Emotion Ratings and Raters’ Profile Information. Sensors - Intelligent Sensors. 23(1)
sensors-23-00382 (2).pdf (1.21 MB)
.
2023.
Are we there yet? A brief survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges. arXiv:2406.08809.
2406.08809v1.pdf (156.19 KB)
.
2024. 
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model. Expert Systems with Applications.
2311.00968.pdf (5.51 MB)
.
2024. 
A Machine Learning Approach for MIDI to Guitar Tablature Conversion. Sound and Music Computing Conference (SMC).
25.pdf (528.42 KB)
.
2022. 
Single Image Video Prediction with Auto-Regressive GANs. Sensors. 22:3533.
.
2022.
Markov Based Quality Metrics For Generating Structured Music With Optimization Techniques. Digital Music Research Network (DMRN+9).
dmrn9_dh.pdf (133.29 KB)
.
2014. 
A Variable Neighborhood Search Algorithm for Composing First Species Counterpoint Musical Fragments. 2011017
wp_cp1.pdf (775.91 KB)
.
2011. 
Tabu Search voor de optimalisatie van muzikale fragmenten. Faculty of Applied Economics. MSc Business Engineer Management Information Systems
Thesis.pdf (1.25 MB)
.
2005. 
Compose=Compute - Computer Generation And Classification Of Music Through Operations Research Methods. PhD Thesis, University of Antwerp. :250.
.
2014.
First species counterpoint generation with VNS and vertical viewpoints. Digital Music Research Network (DMRN+8).
dnmr8_dh_dc.pdf (147.73 KB)
.
2013. 
Generating structured music for bagana using quality metrics based on Markov models. Expert Systems with Applications. 42(21):7424–7435.
paper-bagana.pdf (1.73 MB)
.
2015. 
MorpheuS: generating structured music with constrained patterns and tension. IEEE Transactions on Affective Computing. PP (In Press)(99)
herremans2017morpheusFullIEEE.pdf (5.71 MB)
.
2017. 
Looking into the minds of Bach, Haydn and Beethoven: Classification and generation of composer-specific music.
RPS-2014-001.pdf (575.42 KB)
.
2014. 
Composing first species counterpoint musical scores with a variable neighbourhood search algorithm. Journal of Mathematics and the Arts. 6:169-189.
.
2012.
Music generation with structural constraints: an operations research approach. 30th Annual Conference of the Belgian Operational Research (OR) Society (ORBEL30). :37-39.
orbel30_dh.pdf (117.78 KB)
.
2016. 
Compose ≡ compute. 4OR. 13:335–336.
.
2015.
Dance Hit Song Science. International Workshop on Music and Machine Learning.
abstract_preprint_MML2013_DH.pdf (194.82 KB)
.
2013. 
Forecasting Bitcoin Volatility Spikes from Whale Transactions and Cryptoquant Data Using Synthesizer Transformer Models. SSRN.
SSRN-id4247684.pdf (5.05 MB)
.
2022. 
Visualizing the evolution of alternative hit charts. The 18th International Society for Music Information Retrieval Conference (ISMIR) - Late Breaking Demo.
dh_visualiation_preprint.pdf (5.34 MB)
.
2017.
Generating music with an optimization algorithm using a Markov based objective function. ORBEL29, Belgian Conference on Operations Research.
orbel29abs.pdf (138.67 KB)
.
2015. 
MorpheuS: Automatic music generation with recurrent pattern constraints and tension profiles. IEEE TENCON.
paper_morpheus_dh_ieee.pdf (550.61 KB)
.
2016. 
A Functional Taxonomy of Music Generation Systems. ACM Computing Surveys. 50(5):30.
music_generation_survey_dh_preprint.pdf (349.15 KB)
.
2017.
A multi-modal platform for semantic music analysis: visualizing audio- and score-based tension. 11th International Conference on Semantic Computing IEEE ICSC 2017.
paper_preprint.pdf (1.63 MB)
.
2017. 
Composing Fifth Species Counterpoint Music With A Variable Neighborhood Search Algorithm. Expert Systems with Applications. 40
paper_preprint_cp5.pdf (405.75 KB)
.
2013. 
Classification and generation of composer-specific music using global feature models and variable neighborhood search. Computer Music Journal. 39(3):91.
papercmj-dh_preprint.pdf (637.63 KB)
.
2015. 
MorpheuS: constraining structure in automatic music generation. Dagstuhl seminar on Computational Music Structure Analysis.
abstract_dagstuhl_dh.pdf (88.49 KB)
.
2016. 
IMMA-Emo: A Multimodal Interface for Visualising Score- and Audio-synchronised Emotion Annotations. Audio Mostly.
IMMA-emo_preprint.pdf (1.4 MB)
.
2017. 
First species counterpoint generation with VNS and vertical viewpoints. Annual Conference of the Belgian Operational Research Society (ORBEL28).
orbel28_dh.pdf (216.63 KB)
.
2014. 
Composing counterpoint musical scores with variable neighborhood search. Annual Conference of the Belgian Operational Research Society (ORBEL26).
orbel26abs_vnsforcp.pdf (116.85 KB)
.
2012. 
Composer Classification Models for Music-Theory Building. Computational Music Analysis.
Chapter_HerremansEtAl_preprint.pdf (475.26 KB)
.
2015. 
Sampling the extrema from statistical models of music with variable neighbourhood search. ICMC/SMC.
icmc_dh.pdf (1.07 MB)
.
2014. 
Hit Song Prediction Based on Early Adopter Data and Audio Features. The 18th International Society for Music Information Retrieval Conference (ISMIR) - Late Breaking Demo.
paper_preprint_hit.pdf (221.73 KB)
.
2017.
Tension ribbons: Quantifying and visualising tonal tension. Second International Conference on Technologies for Music Notation and Representation (TENOR). 2:8-18.
paper_tenor_dh_preprint_small.pdf (1.67 MB)
.
2016. 
Towards emotion based music generation: A tonal tension model based on the spiral array. Proceedings of Cognitive Science (CogSci).
CogSci_tension (1).pdf (610.91 KB)
.
2019. 
Modeling Musical Context with Word2vec. First International Workshop On Deep Learning and Music. 1:11-18.
herremans2017work2vec.pdf (745.8 KB)
.
2017. 
The emergence of deep learning: new opportunities for music and audio technologies. Neural Computing and Applications.
main_preprint.pdf (102.16 KB)
.
2019. 
Dance hit song prediction. Journal of New Music Research. 43:302.
wp_hit.pdf (689.07 KB)
.
2014. 
FuX, an Android app that generates counterpoint. IEEE Symposium on Computational Intelligence for Creativity and Affective Computing (CICAC). :48-55.
wp_fux.pdf (486.27 KB)
.
2013. 
O.R. and music generation. OR/MS Today. 45(1)
O.R. and music generation - INFORMS.pdf (825.66 KB)
.
2018. 
Development of Machine Learning for asthmatic and healthy voluntary cough - a proof of concept study. Applied Sciences. 9(14)
applsci-09-02833.pdf (2.06 MB)
.
2019. 
MusIAC: An extensible generative framework for Music Infilling Application with multi-level Control. EvoMUSART.
2202.05528.pdf (893.23 KB)
.
2022. 
Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure. Proceedings of the International Joint Conference on Neural Networks (IJCNN).
2102.09794.pdf (1015.73 KB)
.
2021. 
A variational autoencoder for music generation controlled by tonal tension. Joint Conference on AI Music Creativity (CSMC + MuMe).
2010.06230.pdf (622.82 KB)
.
2020. 
Midi Miner – A Python library for tonal tension and track classification. ISMIR - Late Breaking Demo.
midi_miner.pdf (83.7 KB)
.
2019.
A Domain-Knowledge-Inspired Music Embedding Space and a Novel Attention Mechanism for Symbolic Music Modeling. Proceedings of the 37th AAAI Conference on Artificial Intelligence.
2212.00973.pdf (1.74 MB)
.
2023. 
PerceptionGAN: Real-world image construction from provided text through perceptual understanding. 4th Int. Conf. on Imaging, Vision and Pattern Recognition (IVPR), and 9th Int. Conf. on Informatics, Electronics & Vision (ICIEV).
perceptionGAN-preprint.pdf (2.83 MB)
.
2020. 