|
|
|
|
|
Research Interests
- Spoken Language Processing
- Multi-lingual speech recognition and synthesis
- Speech technology for less-resourced languages
- Zero-resourced speech technology
- Spoken Language Translation
- Direct speech-to-speech translation
- Paralinguistic translation
- Social-Affective Dialog System
- Emotion recognition and emotional trigger analysis
- Response selection and generation
- Cognitive Communication
- Brain signal analysis
|
|
|
|
|
|
|
|
|
|
|
|
Education
- 2005-2008
- Doctorate degree (Dr.-Ing) in Engineering Science, University of Ulm, GERMANY
- Thesis: "Incorporating Knowledge into Statistical Acoustic Models for Spoken Language Dialogue Systems"
- (in collaboration with Spoken Language Communication Research Labs, ATR, JAPAN)
- 2000-2002
- Master degree (MSc ) in Communication Technology, University of Ulm, GERMANY
- Thesis: "Speech Recognition using Multiple Acoustic Feature Streams"
- (in collaboration with Speech Understanding Dept, Daimler Chrysler Reserach Center, GERMANY)
- 1995-1999
- Bachelor degree (BSc) in Informatics, Bandung Institute of Technology, INDONESIA
- Thesis: "Work Bench Symbolic of Machine Learning"
|
|
|
|
|
|
|
|
|
|
|
|
Work Experience
- Apr 2024 - Now
- Full Professor, Augmented Human Communication Labs, NAIST, JAPAN
- Apr 2024 - Now
- Adjunct Professor, School of Information Science, JAIST, JAPAN
- Oct 2021 - Now
- Visiting Research Scientist, RIKEN Center for Advanced Intelligent Project (AIP), JAPAN
- Jul 2021 - Now
- Adjunct Professor, Faculty of Computer Science, University of Indonesia, INDONESIA
- Oct 2021 - Mar 2024
- Associate Professor, School of Information Science, JAIST, JAPAN
- Oct 2021 - Mar 2024
- Adjunct Associate Professor, Augmented Human Communication Labs, NAIST, JAPAN
- Jan 2018 - Sep 2021
- Research Associate Professor, Augmented Human Communication Labs, NAIST, JAPAN
- Jan 2018 - Sep 2021
- Research Scientist, RIKEN Center for Advanced Intelligent Project (AIP), JAPAN
- Jun 2011 - Dec 2017
- Assistant Professor, Augmented Human Communication Labs, NAIST, JAPAN
- Feb 2015 - Jan 2016
- Visiting Scientific Researcher, Robotics and Intelligent Transportation System (RITS) Team, INRIA Roqcuencourt, FRANCE
- Jun 2009 - Sep 2011
- Visiting Professor, Faculty of Computer Science, University of Indonesia, INDONESIA
- Apr 2006 - May 2011
- Expert Researcher, Spoken Language Communication Research Groups, NICT, JAPAN
- Apr 2003 - Mar 2009
- Researcher, Spoken Language Communication Research Labs, ATR, JAPAN
- Oct 2001 - May 2002
- Masterarbeit, Speech Understanding Dept, DaimlerChrysler Research Center, GERMANY
- Nov 1999 - Mar 2000
- Junior IT Consultant, Sumarno Pabotingi Associate, INDONESIA
|
|
|
|
|
|
|
|
|
|
|
|
Teaching Experience
- Apr 2013 - Now
- Lecturer on "Sequential Data Modeling"
- for Graduate Students, Nara Institute of Science and Technology, JAPAN
- Apr 2016 - Mar 2020
- Lecturer on "Speech Processing" [in Japanese]
- for Graduate Students, Nara Institute of Science and Technology, JAPAN
- Apr 2013 - Mar 2017
- Lecturer on "Advanced Cutting-edge Research"
- for Graduate Students, Nara Institute of Science and Technology, JAPAN
- Apr 2013 - Mar 2015
- Visiting Lecturer on "Spoken Language Processing" [in Japanese]
- for Undergraduate Students, Kansai University, JAPAN
- Jun 2011 - Mar 2013
- Lecturer on "Intelligent System Design"
- for Graduate Students, Nara Institute of Science and Technology, JAPAN
- Jan 2009 - Sep 2011
- Lecturer on "Indonesian Spoken Language Processing: ASR and TTS"
- for Undergraduate and Graduate Students of the Computer Science Dept., University of Indonesia (UI), INDONESIA
- Jul 2010 (1 week)
- Distinguished Lecturer on "Spoken Language Processing: ASR, MT, TTS" Summer Course
- for Reseachers of the Agency for the Assessment and Application of Technology (BPPT), INDONESIA
- Jul 2008 (1 week)
- Inviting Lecturer on "Indonesian Large Vocabulary Continuous Speech Recognition" Workshop
- for Reseachers of the Agency for the Assessment and Application of Technology (BPPT) and
- the R&D Center of PT. Telekominikasi Indonesia (TELKOMRisTI), in BPPT, INDONESIA
- under the Asian Pacific Telecomunity (APT) Project
- Sep 2005 (1 week)
- Lecturer on "Development of Indonesian Large Vocabulary Continuous Speech Recognition" Tutorial
- for Reseachers of the R&D Center of PT. Telekominikasi Indonesia (TELKOMRisTI)
- in ATR, Kyoto, JAPAN
- under the Asian Pacific Telecomunity (APT) Project
- Nov 2003 (1 week)
- Lecturer on "Speech Recognition" Tutorial
- for Reseachers of the R&D Center of PT. Telekominikasi Indonesia (TELKOMRisTI)
- in ATR, Kyoto, JAPAN
- under the Asian Pacific Telecomunity (APT) Project
- Aug 1999 - Oct 1999
- Assistant Lecturer on "Machine Learning" Course
- for Undergraduate Students of the Informatics Dept., Bandung Institute of Technology (ITB), INDONESIA
- Jan 1999 - May 1999
- Assistant Lecturer on "Programming Language" Course
- for Undergraduate Students of the Informatics Dept., Bandung Institute of Technology (ITB), INDONESIA
|
|
|
|
|
|
|
|
|
|
|
|
Contributions on Indonesian Language Processing
- 2020
- Developed the First Cross-Lingual Machine Speech Chain on Indonesian Language Enabling Semi-Supervised Learning
- S. Novitasari, A. Tjandra, S. Sakti, S. Nakamura, "Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis", in Proc. Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL), pp 131-138, May 2020. [pdf]
- 2019
- Developed Zero-resoursed Speech Technology on Indonesian Language Enabling Unsupervised Learning
- E. Dunbar, J. Karadayi, M. Bernard, X.-N. Cao, R. Algayres, L. Ondel, L. Besacier, S. Sakti, E. Dupoux, "The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units.", in Proc. INTERSPEECH, October 2020. [pdf]
- A. Tjandra, S. Sakti, S. Nakamura, "Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge", in Proc. INTERSPEECH, October 2020. [pdf]
- E. Dunbar, R. Algayres, J. Karadayi, M. Bernard, J. Benjumea, X.-N. Cao, L. Miskic, C. Dugrain, L. Ondel, A.W. Black, L. Besacier, S. Sakti, E. Dupoux, "The Zero Resource Speech Challenge 2019: TTS Without T", in Proc. INTERSPEECH, September 2019. [pdf]
- A. Tjandra, B. Sisman, M. Zhang, S. Sakti, H. Li, S. Nakamura, "VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019", in Proc. INTERSPEECH, pp. 1118-1122, September 2019. [pdf]
- 2018
- Developed Image Description Corpus and Semantic Analysis on Indonesian Language
- K. Nur'aini, J. Effendi, S. Sakti, M. Adriani, S. Nakamura, "Corpus Construction and Semantic Analysis of Indonesian Image Description", in Proc. SLTU, August 2018. [pdf]
- 2014
- Developed the First Emotional Speech Corpus and Emotion Recognition on Indonesian Language
- N. Lubis, D. Lestari, A. Purwarianti, S. Sakti, S. Nakamura, "Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition", in IEICE Transactions on Information and Systems, Vol. E101-D, No.8, pp.2092-2100, August 2018. [pdf]
- N. Lubis, S. Sakti, G. Neubig, T. Toda, S. Nakamura, "Construction and Analysis of Social-Affective Interaction in English and Indonesian", in Proc. Oriental COCOSDA, pp. 202-206, October 2015. [pdf]
- N. Lubis, D. Lestari, A. Purwarianti, S. Sakti, S. Nakamura, "Emotion Recognition on Indonesian Television Talk Shows", in Proc. IEEE SLT, pp. to appear, Lake Tahoe, USA, December 2014. [pdf]
- N. Lubis, D. Lestari, A. Purwarianti, S. Sakti, S. Nakamura, "Construction and Analysis of Indonesian Emotional Speech Corpus", in Proc. Oriental COCOSDA, Phuket, Thailand, September
2014. [pdf]
- 2013
- Developed the ASR for Indonesian Ethnic Languages
- S. Sakti, S. Nakamura, "Recent Progress in Developing Grapheme-based Speech Recognition for Indonesian Ethnic Languages: Javanese, Sundanese, Balinese, and Bataks", in Proc. SLTU, St. Petersburg, Russia, May 2014. [pdf]
- S. Sakti, S. Nakamura, "Towards Language Preservation: Design and Collection of Graphemically Balanced and Parallel Speech Corpora of Indonesian Ethnic Languages", in Proc. Oriental COCOSDA, Gurgaon, India, November 2013. [pdf]
- A. Sani, S. Sakti, G. Neubig, T. Toda, A. Mulyanto, S. Nakamura, "Towards Language Preservation: Preliminary Collection and Vowel Analysis of Indonesian Ethnic Speech Data", in Proc. Oriental COCOSDA, pp. 128-122, Macau, China, December 2012. [pdf]
- 2009
- Establised the First Speech-to-Speech Translation System for Indonesian Language (A-STAR Project)
- S. Sakti, M. Paul, A. Finch, S. Sakai, T.-T. Vu, N. Kimura, C. Hori, E. Sumita, S. Nakamura, J. Park, C. Wutiwiwatchai, B. Xu, H. Riza, K. Arora, C.-M. Luong, H. Li, "A-STAR: Toward Tranlating Asian Spoken Languages", Special issue on Speech-to-Speech Translation, Computer Speech and Language Journal (Elsevier), vol. 27, Issue 2, pp. 509-527, February 2013 [pdf]
- S. Sakti, N. Kimura, M. Paul, C. Hori, E. Sumita, S. Nakamura, J. Park, C. Wutiwiwachai, B. Xu, H. Riza, K. Arora, C. Luong and H. Li, "The Asian Network-based Speech-to-Speech Translation System", in Proc. ASRU, pp. 507-512, Merano, Italy, December 2009. [pdf]
- S. Sakti, M. Paul, R. Maia, S.Sakai, N. Kimura, Y. Ashikari, E. Sumita, S. Nakamura, "Toward Translating Indonesian Spoken Utterances to/from Other Languages", in Proc. O-COCOSDA, pp. 137-142, Beijing, China, August 2009. [pdf]
- S. Sakti, M. Paul, R. Maia, S.Sakai, N. Kimura, Y.Ashikari, E. Sumita, S. Nakamura, "Development of Indonesian Spoken Language Technologies for Multilingual Speech-to-Speech Translation
System", in Proc. MALINDO, pp. 49-54, Singapore, August 2009. [pdf]
- 2008
- Developed the First HMM-Based Indonesian Speech Synthesis using Limited Resources
- S. Sakti, S. Sakai, R. Isotani, H. Kawai, S. Nakamura, "Quality and Intelligibility Assessment of Indonesian HMM-Basaed Speech Synthesis System", in Proc. MALINDO, pp. 51-57, Jakarta, Indonesia, August 2010. [pdf]
- S. Sakti, R. Maia, S. Sakai, T. Shimizu, S. Nakamura, "Development of HMM-based Indonesian Speech Synthesis", in Proc. O-COCOSDA, pp. 215-220, Kyoto, Japan, November, 2008. [pdf]
- 2007
- Developed the First Indonesian Large Vocabulary Continous Speech Recognition System (APT Project)
- S. Sakti, E. Kelana, H. Riza, S. Sakai, K. Markov, S. Nakamura, "Recent Progress in Developing Indonesian Large-Vocabulary Corpora and LVCSR System", in Proc. MALINDO, pp. 40-45, Cyberjaya-Selangor, Malaysia, June, 2008. [pdf]
- S. Sakti, E. Kelana, H. Riza, S. Sakai, K. Markov, S. Nakamura, "Development of Indonesian Large Vocabulary Continuous Speech Recognition System within A-STAR Project", in Proc. IJCNLP Workshop on TCAST, pp. 19-24, Hyderabad, India, January 2008. [pdf]
- S. Sakti, E. Kelana, H. Riza, S. Nakamura, "Large Vocabulary ASR for Indonesian Language in the A-STAR Project", in Proc. ASJ Autumn Meeting, pp. 47-48, Yamanashi, Japan, 2007. [pdf]
- 2005
- Establised Rapid Development for Indonesian ASR using Limited Resources based on Cross-Language Approach
- S. Sakti, K. Markov, S. Nakamura, "Rapid Development of Initial Indonesia Phoneme-Based Speech Recognition Using Cross-Language Approach", in Proc. O-COCOSDA, pp. 38-43, Jakarta, Indonesia, 2005. [pdf]
- 2004
- Developed the First Indonesian Small-Vocabulary ASR System (APT Project)
- S. Sakti, P. Hutagaol, A. Arman, S. Nakamura, "Development of Speech Corpus & Speech Recognition System for Indonesian Language", in Proc. IEICE, 2004. [pdf]
- S. Sakti, P. Hutagaol, A. Arman, S. Nakamura, "Indonesian Speech Recognition for Hearing and Speaking Impaired People", in Proc. ICSLP, Jeju, Korea, 2004. [pdf]
|
|
|
|
|
|
|
|
|
|
|
|
Memberships on Academic Societies
- Oct 2012 - Now
- Member of Society of Neuroscience (SFN)
- Sep 2012 - Now
- Member of Japanese Neuroscience Society (JNS)
- Dec 2009 - Now
- Affiliate Member of IEEE Computer Society
- Aug 2009 - Now
- Member of Association for Computational Linguistic (ACL)
- Sep 2005 - Now
- Member of Acoustical Society of Japan (ASJ)
- Oct 2004 - Now
- Member of International Speech Communication Association (ISCA)
- Oct 1998 - Sep 1999
- Chief of Communication Division in Informatics Student Federation (HMIF), ITB, INDONESIA
- Oct 1996 - Sep 1998
- Member of Consideration and Controlling Council (DPP) for Informatics Student Federation (HMIF), ITB, INDONESIA
|
|
|
|
|
|
|
|
|
|
|
|
Program Committee / Manuscript Reviewer
- Society
- Nov 2020 - Now: Speech and Language Technical Committee (SLTC) of IEEE Signal Processing Society [Nov 2020 - Now]
- Journal/Transactions
- Associate editor of IEEE IEEE/ACM Transactions on Speech, Audio, and Language Processing [Oct 2020 - Now]
- Manuscript reviewer of Speech Communication (Elsevier), IEEE Signal Processing, IEICE Transactions on Information and Systems, International Journal of Social Robotics, ACM Transaction on Low-Resource Language Information Processing (TALLIP)
- Workshops/Conference
- Manuscript reviewer of International Conference on Acoustics, Speech and Signal Processing (ICASSP), INTERSPEECH,
IEEE Automatic Speech Recognition and Understanding (ASRU), Annual Meeting of the Association for Computational Linguistics (ACL),
Conference on Empirical Methods in NLP (EMNLP), SIGDial Meeting on Discourse and Dialogue (SigDial),
International Workshop on Spoken Dialogue Systems Technology (IWSDS), Speech and Computer (SPECOM),
Intermational Conference on Intelligent Robots and System (IROS), Workshop on Multimodal Semantics for Robotic Systems (MUSRobS),
Workshops on Intelligent Environment (IE), European Signal Processing Conference (EUSIPCO),
International Workshop on Malaysian and Indonesian Language Processing (MALINDO), International Conference on Asian Language Processing (IALP),
Oriental Comittee for Coordination and Standardization of Speech Databses and Assessment Techniques (O-COCOSDA),
Spoken Language Technology for Under-resourced Languages (SLTU), Workshop on Collaboration and Computing for Under-resourced Languages (CCURL)
|
|
|
|
|
|
|
|
|
|
|
|
Committee
- 2021 - Now
- Chair of ELRA/ISCA Special Interest Group on Under-resourced Languages (SIG-UL)
- 2018 - 2020
- General Secretary of ELRA/ISCA Special Interest Group on Under-resourced Languages (SIG-UL)
- 2016 - Now
- Board Member of Spoken Language Technology for Under-resourced Language (SLTU)
- 2017
- Organization Chair of INTERSPEECH 2017 Special Session "Digital Revolution for Under-resourced Language" (DigRevURL 2017)
- 2016
- Organization Chair of Spoken Language Technology for Under-resourced Language (SLTU 2016)
- 2013
- Organization Co-Chair of APSIPA 2013 Special Section
- 2010
- Local comittee of International Workshop on Spoken Language Dialogue Systems (IWSDS 2010)
|
|
|
|
|
|
|
|
|
|
|
|
Awards
- JSPS Strategic Young Researcher Overseas Visits Program for Accelerating Brain Circulation, 2015-2016
- Siemens-DAAD Scholarship Program ASIA 21st Century, 2000-2002
|
|
|
|
|
|
|
|
|