Synthetic data generation has drawn growing attention because training data is scarce in many application domains. It is useful for privacy-sensitive applications, e.g. digital health applications based on electronic medical records. It is also attractive for novel applications, e.g. multimodal applications in the metaverse, which have little data for training and evaluation. This project focuses on synthetic data generation for audio and the corresponding multimodal applications, such as mental health chatbots and digital assistants for negotiations.
For most such applications, the key technical challenge is to create disentangled representations that separate paralinguistic information from the content of speech. Here, each component of a disentangled representation contains only the information necessary for its relevant attributes or properties. General composition mechanisms will be learned so that applications can combine appropriate components to generate the desired data. For example, the script of an emotional support conversation can be combined with the desired emotion and voice patterns to generate natural and empathetic speech. However, current deep generative models perform poorly at compositional generalization [3]. Recent work shows that disentangled representations can be defined from a causal perspective [1]. This is also relevant to causal representation learning, which aims for robustness and strong out-of-distribution generalization [2]. If user-specific information can be identified and removed from the input data, the devised techniques can also be applied to privacy-sensitive applications, such as privacy-preserving automatic speech recognition (ASR).
[1] Wang, Yixin, and Michael I. Jordan. "Desiderata for representation learning: A causal perspective." arXiv preprint arXiv:2109.03795 (2021).
[2] Schölkopf, Bernhard, Francesco Locatello, Stefan Bauer, Nan Rosemary Ke, Nal Kalchbrenner, Anirudh Goyal, and Yoshua Bengio. "Toward causal representation learning." Proceedings of the IEEE 109, no. 5 (2021): 612-634.
[3] Hupkes, Dieuwke, Verna Dankers, Mathijs Mul, and Elia Bruni. "Compositionality decomposed: how do neural networks generalise?" Journal of Artificial Intelligence Research 67 (2020): 757-795.