Abstract: Speech emotion recognition (SER) is pivotal for achieving empathetic and adaptive human–robot interaction (HRI) within Internet of Things (IoT) ecosystems. However, conventional SER methods ...