Speech and Language Processing

Publications

2024

  • Hernandez A., Perez Toro PA., Arias-Vergara T., Vasquez-Correa JC., Yang SH., Orozco-Arroyave JR., Maier A.:
    Anonymizing Dysarthric Speech: Investigating the Effects of Voice Conversion on Pathological Information Preservation
    27th International Conference on Text, Speech, and Dialogue, TSD 2024 (Brno, CZE, 9. September 2024 - 13. September 2024)
    In: Elmar Nöth, Aleš Horák, Petr Sojka (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2024
    DOI: 10.1007/978-3-031-70566-3_14
  • Schieber H., Demir K., Kleinbeck C., Yang SH., Roth D.:
    Indoor Synthetic Data Generation: A Systematic Review
    In: Computer Vision and Image Understanding 240 (2024), Article No.: 103907
    ISSN: 1077-3142
    DOI: 10.1016/j.cviu.2023.103907
  • Tayebi Arasteh S., Arias-Vergara T., Perez Toro PA., Weise T., Packhäuser K., Schuster M., Nöth E., Maier A., Yang SH.:
    Addressing challenges in speaker anonymization to maintain utility while ensuring privacy of pathological speech
    In: Communications Medicine 4 (2024), Article No.: 182
    ISSN: 2730-664X
    DOI: 10.1038/s43856-024-00609-5
  • Weise T., Klumpp P., Demir K., Perez Toro PA., Schuster M., Nöth E., Heismann B., Maier A., Yang SH.:
    Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech
    25th Interspeech Conference 2024 (Kos Island, 1. September 2024 - 5. September 2024)
    In: Interspeech 2024 2024
    DOI: 10.21437/Interspeech.2024-1208

2023

  • Demir K., Schieber H., Weise T., May M., Maier A., Yang SH.:
    Deep Learning in Surgical Workflow Analysis: A Review of Phase and Step Recognition
    In: IEEE Journal of Biomedical and Health Informatics (2023), p. 1-14
    ISSN: 2168-2194
    DOI: 10.1109/JBHI.2023.3311628
  • Oppelt MP., Foltyn A., Deuschel J., Lang NR., Holzer N., Eskofier B., Yang SH.:
    ADABase: A Multimodal Dataset for Cognitive Load Estimation
    In: Sensors 23 (2023)
    ISSN: 1424-8220
    DOI: 10.3390/s23010340
  • Tayebi Arasteh S., Rios-Urrego CD., Nöth E., Maier A., Yang SH., Rusz J., Orozco-Arroyave JR.:
    Federated learning for secure development of AI models for Parkinson’s disease detection using speech from different languages
    Interspeech 2023 (Dublin, 21. August 2023 - 24. August 2023)
    In: Proceedings of INTERSPEECH 2023, Dublin, Ireland: 2023
    DOI: 10.21437/Interspeech.2023-2108
  • Tayebi Arasteh S., Weise T., Schuster M., Nöth E., Maier A., Yang SH.:
    The effect of speech pathology on automatic speaker verification: a large-scale study
    In: Scientific Reports 13 (2023), p. 20476
    ISSN: 2045-2322
    DOI: 10.1038/s41598-023-47711-7
    URL: https://www.nature.com/articles/s41598-023-47711-7
  • Utz J., Weise T., Schlereth M., Wagner F., Thies M., Gu M., Uderhardt S., Breininger K.:
    Focus on Content not Noise: Improving Image Generation for Nuclei Segmentation by Suppressing Steganography in CycleGAN
    2023 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 (Paris, 2. October 2023 - 6. October 2023)
    In: Proceedings - 2023 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 2023
    DOI: 10.1109/ICCVW60793.2023.00417
  • Weise T., Maier A., Demir K., Perez Toro PA., Arias Vergara T., Heismann B., Nöth E., Schuster M., Yang SH.:
    Impact of Including Pathological Speech in Pre-training on Pathology Detection
    TSD 2023: Text, Speech, and Dialogue (Pilsen, 4. September 2023 - 6. September 2023)
    In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Text, Speech, and Dialogue, Cham: 2023
    DOI: 10.1007/978-3-031-40498-6_13
  • Weise T., Maier A., Demir K., Perez Toro PA., Arias Vergara T., Heismann B., Nöth E., Schuster ME., Yang SH.:
    Impact of Including Pathological Speech in Pre-training on Pathology Detection
    Springer Science and Business Media Deutschland GmbH, 2023
    ISBN: 9783031404979
    DOI: 10.1007/978-3-031-40498-6_13
  • Weise T., Perez Toro PA., Deitermann A., Hoffmann B., Demir K., Straetz T., Nöth E., Maier A., Kallert T., Yang SH.:
    Multi-Modal Biomarker Extraction Framework for Therapy Monitoring of Social Anxiety and Depression Using Audio and Video
    International Conference on Machine Learning (Workshop on Machine Learning for Multimodal Healthcare) (Hawaii Convention Center, 1801 Kalākaua Ave, Honolulu, HI 96815, United States, 29. July 2023 - 29. July 2023)
    In: Andreas K. Maier, Julia A. Schnabel, Pallavi Tiwari, Oliver Stegle (ed.): International Conference on Machine Learning (Workshop on Machine Learning for Multimodal Healthcare), Cham: 2023
    DOI: 10.1007/978-3-031-47679-2_3
  • Yang SH., Demir K., Weise T., Schmid A., May M., Maier A.:
    PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images
    International Conference Interspeech 2023 (Dublin, 21. August 2023 - 24. August 2023)
  • Yang SH., Weise T., Demir K.:
    Impact of Including Pathological Speech in Pre-Training on Pathology Detection
    Text, Speech, and Dialogue. Satellite event of Interspeech 2023 (Pilsen, 4. September 2023 - 6. September 2023)

2022

  • Demir K., May M., Schmid A., Uder M., Breininger K., Weise T., Maier A., Yang SH.:
    PoCaP Corpus: A Multimodal Dataset for Smart Operating Room Speech Assistant Using Interventional Radiology Workflow Analysis
    25th International Conference on Text, Speech, and Dialogue, TSD 2022 (Brno, 6. September 2022 - 9. September 2022)
    In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022
    DOI: 10.1007/978-3-031-16270-1_38
  • Hernandez A., Klumpp P., Das BK., Maier A., Yang SH.:
    Autoblog 2021: The Importance of Language Models for Spontaneous Lecture Speech
    25th International Conference on Text, Speech and Dialogue (Brno, Czech Republic, 6. September 2022 - 9. September 2022)
    In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Text, Speech, and Dialogue 25th International Conference, TSD 2022, Brno, Czech Republic, September 6–9, 2022, Proceedings, Springer Nature Switzerland AG: 2022
    DOI: 10.1007/978-3-031-16270-1_24
    URL: https://link.springer.com/chapter/10.1007/978-3-031-16270-1_24
  • Hernandez A., Perez Toro PA., Nöth E., Orozco Arroyave JR., Maier A., Yang SH.:
    Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition
    Interspeech (Seoul, 18. September 2022 - 22. September 2022)
    In: Proceedings of Interspeech 2022 2022
    DOI: 10.21437/Interspeech.2022-10674
    URL: https://www.isca-speech.org/archive/interspeech_2022/hernandez22_interspeech.html
  • Maier A., Köstler H., Heisig M., Krauß P., Yang SH.:
    Known operator learning and hybrid machine learning in medical imaging - A review of the past, the present, and the future
    In: Progress in Biomedical Engineering 4 (2022), Article No.: 022002
    ISSN: 2516-1091
    DOI: 10.1088/2516-1091/ac5b13
  • Maier A., Yang SH., Maleki F., Muthukrishnan N., Forghani R.:
    Offer Proprietary Algorithms Still Protection of Intellectual Property in the Age of Machine Learning?: A Case Study Using Dual Energy CT Data
    German Workshop on Medical Image Computing, 2022 (Heidelberg, DEU, 26. June 2022 - 28. June 2022)
    In: Klaus Maier-Hein, Thomas M. Deserno, Heinz Handels, Andreas Maier, Christoph Palm, Thomas Tolxdorff (ed.): Informatik aktuell 2022
    DOI: 10.1007/978-3-658-36932-3_70
  • Sindel A., Hernandez A., Yang SH., Christlein V., Maier A.:
    SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks
    OAGM Workshop 2021 (24. November 2021 - 25. November 2021)
    In: Proceedings of the OAGM Workshop 2021. Computer Vision and Pattern Analysis Across Domains 2022
    DOI: 10.3217/978-3-85125-869-1-10
    URL: https://openlib.tugraz.at/download.php?id=621f329186973&location=browse
  • Weise T., Maier A., Nöth E., Heismann B., Schuster M., Yang SH.:
    Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment
    Proceedings of INTERSPEECH 2022 (Songdo)

2021

  • Hernandez A., Yang SH.:
    Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning
    International Conference on Speech and Computer (Online, 27. September 2021 - 30. September 2021)
    In: Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning 2021
    DOI: 10.1007/978-3-030-87802-3_24
    URL: https://link.springer.com/chapter/10.1007/978-3-030-87802-3_24

 

Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU)
Professur für Speech and Language Processing

Henkestraße 91
91052 Erlangen