Speech Signal Processing Lab. @ NTUT

 

Prof. Yuan-Fu Liao

Overview:

Speech is the most natural, powerful, and universal interface between humans and machines. Our focus is therefore on making speech recognition robust enough for everyday use. To demonstrate the concept and our research results, we will build a spoken-dialog-based multimedia retrieval prototype that integrates voice and data services over the Internet.

Research Direction:

Annual Speech Processing Workshop:

Group Project Link:

Demonstration:

  Projects

  Undergraduate Projects

  • Speech recognition-enabled automatic telephone directory assistant system for National Taipei University of Technology (NTUT's 104 directory assistant service)

                          Demo      Call Statistics

    NTUT extension 5104: NTUT's 104 directory assistant service (+886-2-27712171 ext. 5104)

    Example: to find Lai Bo-Zhou, chair of the Department of Electronic Engineering ("電子系主任賴柏洲", dian zi xi zhu ren lai bo zhou), you can use any of the following four methods:

    • Example 1: call extension 5104 and say the name, "賴柏洲" (lai bo zhou)
    • Example 2: call extension 5104 and say the department and name, "電子系賴柏洲" (dian zi xi lai bo zhou)
    • Example 3: call extension 5104 and say the department and title, "電子系主任" (dian zi xi zhu ren)
    • Example 4: call extension 5104 and say the department, "電子系" (dian zi xi)

     

  • Phone book files: (Phone book.txt) and (Phone_book.xls). The first column lists names or departments, the second column their pronunciations, and the third column the telephone extensions. (Updated: 2006/01/26)

    Contact person: Zhi-Ren Zeng, email: s9360382@yahoo.com.tw, extension: 2247, cell phone: 0919550114

  • Communication platform for distributed speech recognition (DSR), VoIP, text chat, and face simulation

                            Windows video (wmv, 20.1M, Mandarin)          

                            Prof. Sagayama (Tokyo Univ.) visits our demo

                            Photos with lab members

  • Speaker Verification Security System

                        Demo   

                             Prof. Fujisaki (Tokyo Univ.) visits our demo

  • Multimodal spoken-dialog-based multimedia retrieval system

                           Windows video (wmv, 15.8M, Mandarin)

  • Voice over IP (VoIP) system and real-time ETSI extended advanced DSR front-end codec (ETSI ES 202 212)

                          Windows video (wmv, 20.4M, Mandarin)
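
  The three-column phone book described for the directory assistant above (name or department, pronunciation, extension) lends itself to a simple lookup keyed on the pronunciation. The following is only a minimal sketch and not the lab's implementation: the tab-separated layout, the sample rows, the extension numbers, and the function names are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical sample rows; the real Phone book.txt is not reproduced here.
# Assumed layout: tab-separated columns = name/department, pronunciation, extension.
# The extension numbers below are made up.
SAMPLE = (
    "賴柏洲\tlai bo zhou\t1001\n"
    "電子系\tdian zi xi\t1002\n"
)


def normalize(pron):
    """Lower-case and collapse whitespace so 'Lai  Bo Zhou' matches 'lai bo zhou'."""
    return " ".join(pron.lower().split())


def load_phone_book(text):
    """Map each normalized pronunciation to an (entry, extension) pair."""
    book = {}
    for row in csv.reader(io.StringIO(text), delimiter="\t"):
        if len(row) != 3:
            continue  # skip blank or malformed lines
        entry, pron, ext = (field.strip() for field in row)
        book[normalize(pron)] = (entry, ext)
    return book


def lookup(book, recognized):
    """Return (entry, extension) for a recognized pronunciation, or None."""
    return book.get(normalize(recognized))


book = load_phone_book(SAMPLE)
print(lookup(book, "Lai  Bo Zhou"))
```

  An exact-match dictionary is the simplest choice here. A deployed recognizer would more likely score the caller's utterance against every pronunciation in the second column, so that compound queries such as department plus name can also be resolved.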

  Communication DSP Lab.

  • OFDM Transmitter and Receiver Implementation Using DSK6416

                         Demo

  • DSP-based OFDM real-time voice communication system

                        Demo

  • DSP implementation of packet synchronization and wireless channel simulation for OFDM communication systems

                  Demo

  • Automatic voice attendant for VoIP telephony based on distributed speech recognition

                  Demo

Latest Posters & Presentations:

                           

                       A Reference Model Weighting-based Method for Robust Speech Recognition

                       Multimedia Network Communication Platform

                       Latent Prosody Analysis for Robust Speaker Identification

                       Test Norm-Based Speaker Verification Security System


Lab. Overview:

  • Speech Processing (Speech)

  • Overview:
    • Cooperation with the NCTU speech lab (Prof. Sin-Horng Chen and Prof. Yi-Ru Wang)
    • National Science Council (NSC) three-year project (2004.8~2007.7): "Robust speaker verification for spoken dialogue system"
    • NCTU's ITS project: a spoken dialogue system using Galaxy Communicator
  • Graduate students: four
  • Undergraduate students: seven
  • Workstations: seven
  • Dialog cards: two
  • Pictures: 

   

    Linux workstation                   Lab                       Meeting Room

  • Teaching:

    • Digital signal processing
      • Digital signal processing laboratory
        • TI DSK6711
    • Speech processing
    • Random processes
    • Communication signal processing
      • Communication signal processing lab.
        • TI DSK6416 + Signalware AED-101 80M Wideband AD/DA

         

  • Communication/Digital Signal Processing (CommDSP/DSP Lab.)

  • Overview:
    • DSK6711: twenty
    • DSK6416: thirty
    • EVM6701: two
    • Courses
      • Communication signal processing laboratory (graduate)
      • Digital signal processing laboratory (undergraduate)
     
    1. Communication Signal Processing laboratory

                     Demo

  • Overview of TI DSK6416
    Code Composer Studio, DSP/BIOS
  • Real-time/Embedded DSP framework
  • Vocoder (GSM 06.10)
  • Convolutional coding and Viterbi decoding
  • QAM baseband Transmitter/Receiver, Timing synchronization, vocoder
  • Channel Model
    AWGN, multipath fading channel
  • OFDM baseband Transmitter/Receiver, FFT, Frequency/Phase synchronization
    Channel estimation, scrambler/interleaver, convolutional code/Viterbi decoder
    2. Digital Signal Processing laboratory
  • Overview of TI DSK6711
  • AM modulation and demodulation
  • EMIF and memory
  • Timer and interrupt service
  • DMA
  • Calling assembly-language functions from C (optional)
  • Sound recording and playback
  • Data transfer between PC and DSK (optional)
  • FFT
  • Embedded system
  • Karaoke effects (voice changing, echo)
  • Digital filter (equalizer)
  • Music synthesis

Server:

 


Contact Information

Tel.
+886-2-2771-2171 ext. 2247
Fax
+886-2-2731-7120
Address
1, Sec. 3, ChungHsiao E. Rd., Taipei, Taiwan (National Taipei University of Technology, composite accommodation 406)
Email

Copyright(C) 2002 Speech Lab., Department of Electronic Engineering, National Taipei University of Technology
Latest update: 2008/07/09