»yµ¦X¦¨ Speech synthesize note
¥»ºô¶¥H¥´³yµL»Ùê¾\Ū¬°¥Ø¼Ð¡A¥i¥H¥Î¥ô¦óÂsÄý¾¹¨ÓÆ[¬Ý¥»ºô¶
¥»¬ã¨s´Á±æ«Ø¥ß¤@®M¥H¦Û¥Ñ³nÅ鬰°ò¦ªº°ê»y»yµ¦X¦¨Àô¹Ò¡A¤£¦ý¦³¬ÛÃöªº»yµ¤u¨ã¡A¨ç¦¡®w¡A¥B¦³§K¶Oªºì©lµ{¦¡¥i¨Ñ¨Ï¥Î»P¾Ç²ß¡A±q²z½×¨ì¹ê°È¬Ò¯à¦³¨}¦nªº¥Ü½dÀô¹Ò¡AÄ~¦Ó³Ð³y¤@Ó¦¨¥\ªº°ê»y»yµ¦X¦¨pµeªºªÀ¸s¹B°Ê¡C
²¤¶
-
»yµ¦X¦¨¤S¦W¤å¥yÂà»yµ(Text-To-Speech,TTS)¡A¬O«ü±N¿é¤Jªº¤å¦r©ÎÀx¦s©ó¹q¸£¤¤ªº¤å¥ó¼ÒÀÀ¤HÁnµo¥X»yµªº§Þ³N¡C
- »yµ¦X¦¨¸û»yµ¿ëÃѪºµo®i¦¤F³\¦h¡A¦ýÀ³¥Î¼h±¤j¦h¤´¦b¾\Ū¹q¸£¿Ã¹õ¤Wªº¤å³¹¡A»yµ«ü¤Þ¡A¤¬°Ê¦^õX¡A©Î»²§U»¡©ú¡C
»yµ¦X¦¨ªº§@ªk
- ÀWÃаѼƦX¦¨¤èªk(Articulatory Synthesis)¡G
¦pHolmesªº¨ÃÁp¦@®¶®p¦X¦¨¾¹¡]1973¡^©MKlattªº¦ê/¨ÃÁp¦@®¶®p(Formant)¡]1980¡^¦X¦¨¡B°ò©óLPCµ¥Án¾Ç°Ñ¼Æªº¦X¦¨¨t²Î¡A¦ýn¦X¦¨¥X²M´·ªº»yµ»Ýn·Ç½Tªº³]©w°Ñ¼Æ¡A¨Ï¥Î§xÃø¡A¥B¦X¦¨¥Xªº»yµ¤´¤£°÷¦ÛµM¡C
- ªi§Î«÷±µªk(Formant Synthesis)¡G
¦p°òÀW¦P¨B²Ö¥[ªk(PSOLA)¡]1990¡^¦b»yµªi§Î¤W°µ®É°ì(time domain)ªºÃý«ß×¥¿¨Ó¦X¦¨»yµ¡A´N¥i¥H²£¥Í¥X¨ã¦³Ãý«ßªº¦X¦¨»yµ¡C PSOLAªº³]p«ÂI¡A¦b§ï¨}ÀW°ì(frequency domain)¯Ó®É¡A¥H¤Î¦b®É°ì(time domain)±µ¦X®ÄªG¤Ó®tªº±¡§Î¡A¨ä¦X¦¨ªº»yµ¦bµ¦â»P¦ÛµM«×³£¤j¤jªº´£¤É¡A¥B¬[ºc¸û²³æ¡A®e©ö¹ê§@¡C
¹ï©ó TTS ¨t²Î¦Ó¨¥¡AµL½×±µ¨üªº¬O¤@¬q¤å¦rªº¿é¤J©Î¬O¤@½g¤å³¹¡A³o¨Ç¤å¦r¥»¨¨Ã¨S¦³¥]§t¥ô¦óÁn¾Ç¯S©Ê ( »¡¸ÜªºÁn½Õ¡A°±¹y¤è¦¡¡Aµoµªøµuµ¥Ãý«ß ) ¡A¥u¦³»y¨¥¾Çªº¯S©Ê¡A©Ò¥H¥²¶·³z¹L¦Û°Ê¹w´úªº¾÷¨î¨Ó²£¥Í³o¨Ç¤å¦rªº¥i¯àªºÁn¾Ç¯S©Ê (acustic feature) ¦Ó©Ò¿×¦Û°Ê¹w´úªº¾÷¨î¡A¤@¯ë¦³ rule-based ¸ò knowledge-based ¨âºØ¤èªk¡A¦ý¬O³o¨âºØ¤èªk¤£¦ý¦X¦¨ªºÁnµ¥²H¤S¯Ê¥F§l¤Þ¤O¥B¹J¨ì³sÄòµoµ©În«O¯d»yªÌµ¦â®Éªí²{³£¤£¦n¡A ¦]¦¹ªñ¨Ó¦ê±µ¦X¦¨ªk¤j¦æ¨ä¹D¡C
- ¦ê±µ¦X¦¨ªk(Concatenated Synthesis)¡G
¥H¤@Ó¿ý¦nÁnµªº»y®Æ®w¨Ó·í§@¤ñ¹ïªº¼Ðªº¡A±q»y®Æ®w¤¤§ì¥X¬Û¹ïÀ³ªºÁnµ³æ¤¸¡A¤@¨Ç¦b rule-based »P knowledge-based ¤èªk¤U»Ýn°µ²Ó¸`ªºÁnÃý½Õ¾ã¤]¦]¦¹´î¤Ö¤F³\¦h¡A¦p¦¹Â²¤Æ¤Fpºâ«÷±µ»P¤fµµ¥½ÆÂøªºpºâ¡A¤]¯S§O¾A¦X¦b¤Ö¶q¦r·Jªº¿é¥X®É¨Ï¥Î¡C
»yµ¦X¦¨ªº§xÃøÂI
- µoµªº¦ÛµM«×(²M´·¡B¬yºZ)¡C
- ¯}µ¦rªº³B²z¡C
- §Y®É³B²zªº¯à¤O¡C
»yµ¦X¦¨ªº4¤j¼Ò²Õ
- ¤å¥y¤ÀªR
¤ÀªR¤å¥yªº»yªk»P»y·N«áÂন»y¨¥¯S¼x°Ñ¼Æ
Åý¹q¸£ª¾¹D¥»¤å¤¤þ¨Ç¬Oµü¡Aþ¨Ç¬O¥y¤l¡Aµo¤°»òµ¡A«ç»òµoµ¡Aµoµ®É¨ìþÀ³¸Ó°±¹y¡A°±¹y¦hªøµ¥µ¥¡C
- rule base¡G³Ì¤j¤Ç°tªk¡B¤Ï¦V³Ì¤j¤Ç°tªk¡B³vµü·j´Mªk¡B³Ì¨Î¤Ç°tªk¡B¤G¦¸±½´yªkµ¥µ¥¡C
- data driven¡G¤G¤¸¤åªkªk(Di-Grammar Method)¡B¤T¤¸¤åªkªk(Tri-Grammar Method)¡BÁôÂæ¡°¨¥i¤Ò¼Ò«¬ªk(HMM Method)©MÃþ¯«¸gºô¸ôªk(Neural Network Method)µ¥µ¥¡C
- Ãý«ß²£¥Í¾¹
±N»y¨¥¯S¼x°Ñ¼Æ°e¤JÃý«ß²£¥Í¾¹¨Ó²£¥Í¤å¥yªº¨CÓµ¸`ªº¹ïÀ³Ãý«ß°T®§¡A¥]§t°òÀWy¸ñ¡Aµ¶q¡Aµªøµ¥
±N»¡¸ÜªºÁn½Õ¡A»y®ð¡A°±¹y¤è¦¡¡AµoµªøµuÂà´«¦¨Ãý«ß°Ñ¼Æ¡C
- rule base¡G¡C
- data driven¡GÃþ¯«¸gºô¸ôªk(Neural Network Method)¡C
- ¦X¦¨³æ¤¸²£¥Í¾¹
®Ú¾Ú»yµ¸ê®Æ®w¤¤ªº³æµ¸`µ¯À»yµªi§Î¼Ë¥»¿é¥X¦X¦¨³æ¤¸.
- »yµ¦X¦¨¾¹
®Ú¾Ú»Ýnµoªºµ±qÁnµ¸ê®Æ®w¤¤¿ï¾Ü¥X¦X¾AªºÁn¾Ç°Ñ¼Æ¡AµM«á®Ú¾Ú¦bÃý«ß¼Ò«¬¤¤±o¨ìªºÃý«ß°Ñ¼Æ¡A³z¹L»yµ¦X¦¨ºtºâªk²£¥Í»yµ¡C
»yµ¬ÛÃöÀ³¥Î
- »yµ¦X¦¨(Speech Synthesize)¡G¹B¥Î¸ê°T¬ì§Þ¨Ï¹q¸£©Î¹q¤l³]³Æ¼ÒÀÀ¤HÁn¡C
- »yµ¿ëÃÑ(Speech Recognition)¡GÅý¹q¸£Å¥±oÀ´¤HÃþ»¡¸ÜªºÁnµ¡C
- »yªÌ¬ÛÃö(Speaker Dependent)¡G¤£n¨D»yªÌµoµ·Ç½T¡A»Ý¥ý¸g¹L°V½m¡C
- «D»yªÌ¬ÛÃö(Speaker Independent)¡G»yªÌµoµ»Ý¸û¥¿½T¡A¥BµL¶·°V½m¡C
- »yªÌÃѧO(Speaker Identification)¡G¿ëÃÑ»¡¸ÜªÌªº¨¥÷
»yµ¾Ç(phonetics)
- ¦b¬ã¨s»yµ¤§«e§ÚÌ¥ý¨Ó¬Ý¬Ý Ánµ¡þµ¦â(timbre)ªºn¯À¡G
- ÀW²v(frequency)
ÀW²v¬O¥H®É¶¡¬°°ò·Ç,®¶°Ê§Ö«hÀW²v°ª,µ½Õ¸û¬°°ª,¨ä¿Å¶q³æ¦ì¬°»®¯÷(Hertz,Hz),¤@¦¸®¶°Ê¬O«üªi§Î±q¤¤¶b©¹¤W¦ù©µ¦Üªi®p¡A©¹¤U¸ó¤¤¶b¦Üªi¨¦¦Aªð¦^¤¤¶b¡AºÙ¬°¤@Ó¶g´Á¡FÀW²v¶V°ª¡AµÀW(pitch)´N¶V°ª¡A¤HÃþ¦Õ¦·¥iÅ¥¨ìªºµÀW½d³ò¬O20Hz¦Ü20kHz¡CÀW²v¬OÁnªi¨C¬í®¶°Êªº¦¸¼Æ¡A¤@¤d»®¯÷(1000Hz,©ÎºÙKHz)´Nµ¥©ó¨C¬íÄÁ®¶°Ê¤@¤d¦¸,¤]´N¬O¨C¬íÄÁ²£¥Í¤@¤dÓµªi¡C
- ®¶´T(amplitude)¡þÅT«×(loudness)
®¶´T¬O«üµªiªº[®¶°Ê´T«×],¥ç¥iºÙ¬°[¤O«×],¼vÅT©Ò¤Î¬OÁnµªi§Îªº°ª§C,µªiªº®¶´T·U¤j,«hÅT«×·U¤j,¨ä¿Å¶q°ò·Ç¬O¥H®¶´Tªº¤j¤p¬°·Ç,¥Hvolt©ÎdB(¤À¨©decibe)¨Ó¿Å¶q.DBªº¤Ø«×¬O§e«ü¼Æ¼Wªøªº,¨C¹j20¤À¨©,¨äÅT«×©Î®¶´T«h¼W¥[10¿,¨Ò¦p:40¤À¨©¤ñ20¤À¨©ªºÅT«×,´£°ª¤F10¿,¦Ó60¤À¨©«h¤ñ20¤À¨©´£°ª¤F¤@¦Ê¿,80¤À¨©´£°ª¤@¤d¿¤§¦h¡C
- Digital Audio¼Æ¦ìµ®Ä
µªi¥ÑÃþ¤ñ«¬ºAÂର¼Æ¦ì«¬ºA¡AÀx¦s®æ¦¡¤è±¡APC¥¥x³Ì±`¥Îªº¬OWAV®æ¦¡¡AMac¥¥x³Ì±`¥Îªº¬OAIFF®æ¦¡¡C¡C
- ¨ú¼ËÀW²v(Sampling Rate)
«üµ®Ä¥d¦b¤@¬í¤§¤¤¹ïÁnµ(ªi§Î)°µ°O¿ýªº¦¸¼Æ¡C®Ú¾Ú¬ã¨s,Ánµ¼½¥X®Éªº«~½è±`±`¥u¯à¹F¨ì¨ú¼ËÀW²vªº¤@¥b,¦]¦¹¶·±Ä¨úÂù¿¼Ë²v¤~¯à±N쵷ǽT«²{.¤HªºÅ¥¤O·¥P¬ù¬°20KHz,©Ò¥H°ª«~½èªº¨ú¼ËÀ³¬°¨ä¨â¿¥H¤W,·íÁnµ¨Ó·½¬°µ¼Ö®É,¦]¦ì¥¦©Ò¾î¸óªºÀW²vÅܤƷ¥¬°¼e¼s,³q±`¥H±Ä44.1KHzªºÀW²v¬°CDµ¼Ö¨ú¼Ë²vªº¼Ð·Ç;¦ý¬OY¥H»yµ¬°¥D,¥Ñ©ó¤H»¡¸Üªº»yµ¤j¬ù¬°10KHz,¦]¦¹¥[¿±Ä¼Ë,¥u¨ú22KHz§Y¥i¡C¨ú¼Ë²v¶V°ª,
©Ò°O¿ý¤U¨Óªºµ½è´N¶V²M´·;·íµM,¶V°ªªº¨ú¼Ë©Ò°O¿ý¤U¨ÓªºÀÉ®×´N·|¶V¤j¡C
- ¨ú¼Ë¸ÑªR«×(sampling resolution)
¸ÑªR«×¨M©w¤F¨ú¼Ëªº¤@µªi¬O§_¯à«O«ùì¥ýªº§Îª¬,·U±µªñì§Î«h©Ò»Ý¸ÑªR«×·U°ª¡CY¥H8¦ì¤¸¨Ó°O¿ý¨ú¼Ë,«h¨ä©Ò¯àªí¹Fªº²Õ¦XºØÃþ¬O2ªº8¦¸¤è,§Y256,ªí¥Ü¥Î8¦ì¤¸ªº¨ú¼Ë¤j¤p¯à¤À¿ë¥X256Ó¼h¦¸ªºÁnµ;Y±Ä16¦ì¤¸¨Ó¨ú¼Ë,«h¯à¤À¿ëªº®t²§±N°ª¹F2ªº16¦¸¤è,¬°65536,¨äºë½T«×¦ÛµM¤j¬°´£°ª¡C
16bit,8bit¨ú¼Ëªº®t§O¦b©ó°ÊºA½d³òªº¼e¯¶;°ÊºA½d³ò¼e,µ¶q°_¥ñªº¤j¤pÅܤƴN¯à°÷§óºë²Ó¦a³Q°O¿ý¤U¨Ó¡C¦p¦¹¤@¨Ó¤£½×¬O²Ó·LªºÁnµ©Î¬O±j¯Pªº°Ê·P¾_¾Ù,³£¥i¥Hªí²{±o²OºvºÉP;¦ÓCDµ½èªº¨ú¼Ë³W®æ¥¿¬O16¦ì¤¸¨ú¼Ëªº³W®æ¡C
- ÁA¸Ñ¤FÁnµªºI´º¤§«á,Åý§Ų́ӸѪR»y¨¥ªºµ²ºc¡A»y¨¥¬[ºc¤jP¥i°Ï¤À¦p¤U¡G
- »y¨¥ªºµ²ºc[»yµ¾Ç(phonetics)]
- ¥y¤l(sentence)
¡u§Ú¬O¤@ÓÁ¿°ê»yªº¥xÆW¤H¡AÁöµM§Úªº¯ª¥ý¨Ó¦ÛºÖ«Ø¡A¦ý§Úªº¥xÆW¸Ü»¡±o¨Ã¤£¬yºZ¡C¡v
- ¤l¥y(clause)
¡u§Ú¬O¤@ÓÁ¿°ê»yªº¥xÆW¤H¡v
- µü²Õ(phrase)
- »yµü(word)
¡u¥xÆW¤H¡v
- µü¯À(morpheme)
¡u¥xÆW¤H¡v
- µ¸`(syllable)
¡u¥x¡v¡A¡uÆW¡v¡A¡u¤H¡v§Y¤@¯ë©Ò»¡ªº¦rµ¡A¬Oťı¤W³Ì®e©ö¤À¿ë¥X¨Óªº»yµ³æ¦ì¡C¤@¯ë¨Ó»¡¡A¤@Óº~¦r´N¬O¤@Óµ¸`¡Cº~»yªºµ¸`¤@¯ë¬O¥ÑÁn¥À¡BÃý¥À©MÁn½Õºc¦¨ªº¡F¤£¹L¡A¦³®É¤]¥i¥H¨S¦³Án¥À¡A¥u¥ÑÃý¥À©MÁn½Õ²Õ¦X¦Ó¦¨¡AºÙ¤§¬°¡u¹sÁn¥À¡v¡C
- µ¯À(phoneme)
¡u¤H¡v¬O¥Ñ¤TÓµ¯À/r/¡A/e/¡A/n/ ©Ò§Î¦¨¡A¬O³Ì¤pªº»yµ³æ¦ì¡C
- ÁA¸Ñ»y¨¥ªº¬[ºc¥i¥HÀ°§U§Ṳ́ÀªRn¦p¦óµoµ¡A±µ¤U¨ÓÅý§Ų́ÓÁA¸Ñµoµªº2¤jn¯À¡G
- ¤¸µ/¥Àµ(vowel)
µoµ®É¡A®ð¬y·|®¶°ÊÁn±a¡A¦b¸g¹L«|ÀY¡B¤fµÄ¡B»óµÄµ¥¦a¤è®É¡A®ð¬y´X¥GºZ³qµLªý¡C¥Ñ©óÁn±aŸ°Ê¡A©Ò¥HÁnµÅT«G¡C
- »²µ/¤lµ(consonant)
µoµ®É¡A®ð¬y¦b«|ÀY¡B¤fµÄ¡B»óµÄµ¥³¡¦ì·|¨ü¨ìªýê¡C¥Ñ©óÁn±a¤£¤@©wŸ°Ê¡A©Ò¥HÁnµ¤j¦h¤£ÅT«G¡C
- ÁA¸Ñ¤Fµoµn¯À«á¡A«nªº¬O¦p¦óŪ¥XÁnµ¨Ó¡A§Ú̺٤§¬°«÷µ¡A³o¬O·|ÀHµÛ°ê®a»P¦a°Ï©Ê¦Ó¦³©Ò¤£¦P¡A§Ú̱`¥Îªºª`µºÙ¤§¬°°êµ¤@¦¡¡A³q¥Î«÷µ»Pº~»y«÷µÁÙ¥¼½T©w½Ö¬O°êµ¤G¦¡¡A¦Ó¨Ï¥Î¦h¦~¤è«K¥~°ê¤HµoµªººÙ¤§¬°Ã¹°¨«÷µ(¤ñ¸ûªí)¡A¥H¤U¤¶²Ð°êµªº«÷µªº3¤jn¯À¡G
- Án¥À(initial)=«eµ=¤lµ=»²µ(consonant)
µo¥XªºÁnµ·|¾D¨üªýê,Án¥À¦³¿ë¸qªº§@¥Î¦p£|»P£}¡G·F¤°»ò»P¬Ý¤°»ò,£z»P£{¡G´o«ã»P¦Ñ¸ô
«ü©ñ¦bµ¸`¶}ÀYªº»²µ¡C¥Ñ©óÁn¥À¬O¥Ñ»²µ¥R·í¡A¦]¦¹µoµ¨Ã¤£ÅT«G¡A°ê»yªºÁn¥À¦³21Ó¡A«öµoµ³¡¦ì¤À¬°¤C²Õ¡C
| 1. |
Âù®BµBilabials |
b(£t) |
p(£u) |
m(£v) |
¡@ |
| 2. |
®B¾¦µLabiodental |
f(£w) |
¡@ |
¡@ |
¡@ |
| 3. |
¦Þ¦yµApicals |
d(£x) |
t(£y) |
n(£z) |
l(£{) |
| 4. |
¦Þ®ÚµVelars |
g(£|) |
k(£}) |
h(£~) |
¡@ |
| 5. |
¦Þ±µFront Palatals |
j(£¡) |
q(£¢) |
x(££) |
¡@ |
| 6. |
¦Þ¦y«áµ(¼¦Þµ)Retroflexes |
zh(£¤) |
ch(£¥) |
sh(£¦) |
r(£§) |
| 7. |
¦Þ¦y«eµ(¥¦Þµ)Blade-alveolars |
z(£¨) |
c(£©) |
s(£ª) |
¡@ |
- Ãý¥À(final)=«áµ=¥Àµ=¥Dµ=¤¸µ(vowel)
»P¤fµÄªº¶}¦X¦³Ãö¡A¦p£¸ => £® => £«
¬Oµ¸`¤¤Án¥À¥H«áªº³¡¤À¡A¥i¥H¥Ñ¤¸µ and¡þor »²µ¥R·í¡A¦Ó¥BµoµÅT«G¡C
| 1. |
¥b¤¸(¥À)µsemi-vowels |
(y)i(£¸) |
(w)u(£¹) |
(y)u(£º) |
¡@ |
| 2. |
³æÃý¥ÀSimple Vowels |
a(£«) |
o(£¬) |
e(£) |
ie(£®) |
| 3. |
½ÆÃý¥ÀDiphthongs |
ai(£¯) |
ei(£°)¡@ |
ao(£±) |
ou(£²) |
| 4. |
»óÃý¥ÀFinals with nasal endings |
an(£³) |
en(£´) |
ang(£µ) |
eng(£¶) |
| 5. |
±²¦ÞÃý¥ÀRetroflex |
er(£·) |
¡@ |
¡@ |
¡@ |
- Án½Õ(tone)
«ü¦rµªº°ª§C¤É°ÅܤơB°ê»y»yµ°ª§C¤É°¡B¨ã¦³°Ï§Oµü¸q§@¥Îªº¦³³W«hÅܤƴN¬OÁn½Õ¡CÁn¥¿½T»P§_¬O»yµ·Ç½TªºÃöÁä¡CÁ|¨Ò»¡¡G°ê»y¨Ì°òÀWy¸ñ(pitch
contour)¤À¤@Án(³±¥)¡B¤GÁn(¶§¥)¡B¤TÁn(¤WÁn)©M¥|Án(¥hÁn)¡C
- °ê»yªºµ¸`¼Æ
¨Ì·ÓÁn,Ãý,½Õªºµ²ºc,¥i¯àªºµ¸`²Õ¦X¦³22*39*5=4290ºØ¡A¦ý°ê»y¦³ÄY®æªºÁnÃý²Õ¦X³W«h,¨Ò¦pÁn¥Àªº£¡,£¢,££«á±¥u¯à¬O¥H£¸,£ºªºÃý¥À,·íÃý¥À¬O£º®É¥u¦³£¡,£¢,££,£z,£{ªºÁn¥À,¦]¦¹¹ê»Ú¥i¥Îªº°ê»yµ¸`¬ù1300¦hÓ¡AY¤£¦Ò¼{Án½Õªº¸Ü¥u¦³411Ó¡C
- «÷µªº²Õ¦¨
Án¥À¦b«e¡AÃý¥À¦b«á
Án¥À»´µu¡AÃý¥ÀÅT«G
- «÷ªk¤f³Z
«eµ»´µu«áµ«¡A¨âµ¬Û³s²r¤@¸I¡I
»yµ«~½èªºµû¶q
¹ï©ó»yµ«~½èªºµû¶q¡A¦h¦~¨Ó¤HÌ´£¥X¤F³\¦h¤èªk¡AÂk¯Ç°_¨Ó¤jP¥i¤À¬°¨âÃþ¡A§Y«ÈÆ[µû©w¤èªk©M¥DÆ[µû©w¤èªk¡C
¡@¡@ «ÈÆ[µû©w¤èªk¥Î«ÈÆ[´ú¶qªº¤â¬q¨Óµû»ù»yµ½s½Xªº½è¶q¡A±`¥Îªº¤èªk¦³«H¾¸¤ñ¡B¥[Åv«H¾¸¤ñ¡B¥§¡¤À¬q«H¾¸¤ñµ¥¡C¥¦Ì³£¬O«Ø¥ß¦b«×¶q§¡¤è»~®tªº°ò¦¤W¡A¨ä¯SÂI¬Opºâ²³æ¡A¦ý¤£¯à§¹¥þ¤Ï¬M¤H¹ï»yµ½è¶qªº·Pı¡C³oÓ°ÝÃD¹ï©ó³t²v¬°16Kbit/s¥H¤Uªº¤¤¡B§C³t²v»yµ½s½X¤×¬°¬ð¥X¡A¦]¦¹¥Dn¾A¥Î©ó³t²v¸û°ªªºªi§Î½s½XÃþ«¬¡C
¡@¡@ ¥DÆ[µû©w¤èªk²Å¦X¤HÃþÅ¥¸Ü®É¹ï»yµ½è¶qªº·Pı¡A¦]¦Ó¥Ø«e±o¨ì¼sªxÀ³¥Î¡C³Ì¥Dnªº¥DÆ[µû©w¤èªk¬O¥DÆ[µû©wµ¥¯Å¡]Subjective Opinion Scale¡^¡A©ÎºÙ¥§¡µû©w±o¤À¡]Mean Opinion Score¡AÁY¼gMOS¡^¡CMOS±o¤À±Ä¥Î¤¯Åµû¤À¼Ð·Ç¡A¨ä¤èªk¬O¡A¥Ñ¼Æ¤Q¦W¸ÕÅ¥ªÌ¦b¬Û¦P«H¹DÀô¹Ò¤¤¸ÕÅ¥¨Ãµ¹¤©µû¤À¡AµM«á¹ïµû¤À¶i¦æ²Îp³B²z¡A¨D¥X¥§¡±o¤À¡C¥Ñ©ó¥DÆ[©M«ÈÆ[¤WªººØºØì¦]¡A¨C¦¸¸ÕÅ¥©Ò±oªºµû¤À·|¦³ªi°Ê¡C¬°¤F´î¤pªi°Êªº»~®t¡A°£¤F¸ÕÅ¥ªÌ¤H¼Æn¨¬°÷¦h¤§¥~¡A©Ò´ú»yµ§÷®Æ¤]n¨¬°÷Â×´I¡A¸ÕÅ¥Àô¹Ò¤]À³¾¨¶q«O«ù¬Û¦P¡C
¦b³o¸Ìn¯S§O»Ýn»¡©úªº¬O¡A¸ÕÅ¥ªÌ¹ï»yµ½è¶qªº¥DÆ[·Pı©¹©¹¬O©M¨äª`·N¤O¶°¤¤µ{«×¬ÛÁpôªº¡A¦]¦Ó¡A¹ïÀ³©ó¥DÆ[µû©wµ¥¯Å¡AÁÙ¦³¤@Ó¦¬Å¥ª`·N¤Oµ¥¯Å ¡]Listening Effect Scale¡^¡C¤Uªíµ¹¥X¥DÆ[µû©wµ¥¯Åªº½è¶qµ¥¯Å¡B¤À¼Æ©M¬ÛÀ³ªº¦¬Å¥ª`·N¤Oµ¥¯Å¡C
¥DÆ[µû©wµ¥¯Åªí
| ½è¶qµ¥¯Å |
¤À¼Æ |
¦¬Å¥ª`·N¤Oµ¥¯Å |
| Àu |
5 |
¥i§¹¥þ©ñÃP¡A¤£»Ýnª`·N¤O |
| ¨} |
4 |
»Ýnª`·N¡A¦ý¤£»Ý©úÅã¶°¤¤ª`·N¤O |
| º¡·N¡]¥¿±`¡^ |
3 |
¤¤µ¥µ{«×ªºª`·N¤O |
| ®t |
2 |
»Ýn¶°¤¤ª`·N¤O |
| ¦H |
1 |
§Y¨Ï§V¤O¥hÅ¥¡A¤]«ÜÃøÅ¥À´ |
¡@ ¡@±q¥Î¤á¨¤«×¬Ý¡A³q±`»{¬°MOS¤À4.0~4.5¤À¬°°ª½è¶q»yµ½s½X¡A¹F¨ìªø³~¹q¸Üºôªº½è¶qn¨D¡CMOS¤À3.5¤À¥ª¥kºÙ§@³q«H½è¶q¡A³o®ÉÅ¥ªÌ¯à·Pı¨ì»y µ½è¶q¦³©Ò¤U°¡A¦ý¤£¼vÅT¥¿±`ªº³q¸Ü¡A¥i¥Hº¡¨¬¦h¼Æ³q«H¨t²Î¨Ï¥În¨D¡CMOS¤À3.0¤À¥H¤U±`ºÙ¬°¦X¦¨»yµ½è¶q¡A³oºØ»yµ¤@¯ë¥u¦³¨¬°÷°ªªº¥iÀ´«×¡A¦ý¬O¦Û µM«×¸û®t¡A¤£®e©öÃѧOÁ¿¸ÜªÌ¡C
¡@¡@ »yµ½s½X§Þ³N¼Ð·Çªº¨î©w¡A¹ï¼Æ¦ì»yµ§Þ³Nªº¹ê¥Î¤Æ©Mµo®i°_¨ì¤F±À°Ê§@¥Î¡C
°Ñ¦Ò¸ê®Æ¡Ghttp://159.226.2.5:89/gate/big5/www.kepu.net.cn/gb/technology/telecom/wireless/wrl216.html
»yµ¬ÛÃö±M¥Î³N»y
- articulatory phonetics:µoµ»yµ¾Ç(¤HÃþÁn¹D»P¼L«¬³¬¦Xµ¥©Òµo¥XÁnµ)
- acoustic phonetics:Án¾Ç»yµ¾Ç(Ánµªºª«²z©Ê½è)
- auditory phonetics:ťı»yµ¾Ç(Å¥¨ì¤£¦P»yµªº¤ÏÀ³)
- phonetics:»yµ¾Ç
- phonology:µÃý¾Ç
- phonemics:µ¦ì¾Ç
- phonetic transcription:¼Ðµ
- articulation:µoµ
- syllables:µ¸`
- suprasegmental features:ªþ¥[µ¯À
- allophones: Åܵ
- prosodic:ÁnÃý¾Çªº(Ãý«ß¾Çªº)
- phonetic:»yµ¾Çªº
- phonologic:»yµÅé¨tªº
- phonetically:»yµ¤è±¦a
- semantic:»y¸q¾Çªº
- phoneme:µ¯À
- vocal track:Án¹D
- speech organs:µoµ¾¹©x
- acoustic:ťıªº
- consonant:¤lµ
- co articulation:³sµ
- duration:µªø
- intonation:»y½Õ,Án½Õ
- juncture:³sµ
- verbal:¤fÀY¤Wªº
- vowel:¥Àµ
- nasal:»óµ
- pitch:µ°ª,«üªº¬OÁnµ¦b¤ß²z¦L¶H¤Wªº±j©Î®z¡Cµ°ª±`¥Î§@ÀW²vªº¦P¸q¦r
- utterance:¨¥Ãã
- lexicon:Ãã¨å
- LPC(Linear Predict Coding):½u©Ê¹w¦ô½s½X¡A³q±`³Q·í¦¨»yµ°T¸¹ªº¯S¼xÈ
- HMM(ÁôÂæ¡°¨¥i¤Ò¼Ò«¬ªk):¦b°ê»yªºTTS¨t²Î¤¤¡A¨ä¤å¥y¤ÀªR¥²¶·©â¨ú»y¨¥°Ñ¼Æ¨Ã§â¤å¦r¦êÂà´«¬°µ¸`¦ê¡A¦ý¤¤¤å¤å¥y¨Ã¨S¦³©úÅ㪺µüÃä¬É¡A·|¦]Â_µüÂI¤£¦P¦Ó¦³¤£¦Pªº»y·N¡A¬G¥Hn¶¥ªºHMM¨Ó°µ»y¨¥°Ñ¼Æªº©â¨ú¡C
- EGG(Electro-GlottoGraph)¡G
- IVR(Interactive Voice Response)¡G¤¬°Ê¦¡»yµ¦^ÂÐ
- wavelength:ªiªø,«üªº¬O¦bªi§Î¤¤¡A¨âÓ³sÄò°ª®p¶g´Á¶¡ªº¶ZÂ÷¤SºÙ¬°¡u¼Ö¬q¡v(period)
Ánµªº®æ¦¡Ãþ«¬
- .wav : (WAVE)Microsoft§@·~¨t²ÎªºÁnµÀɮ׮榡
- .aif : (Audio Interchange File Format,AIFF)Appleµo®iªº®æ¦¡¡A¾A¥Î©óMac»PSGI
- .au : (u-law)Unix§@·~¨t²ÎªºÁnµÀɮ׮榡
- AIFC : Unix§@·~¨t²ÎªºÁnµÀɮ׮榡 Audio Interchange Format Compressed
- .mp3 : MPEG Audio Layer-3 ªºÁnµÀ£ÁY®æ¦¡
§K¶Oªº»yµ¤ÀªR³nÅé
- WaveSurfer
- WaveSurfer is an Open Source tool for sound visualization and manipulation. It has been designed to suit both novice and advanced users. WaveSurfer has a simple and logical user interface that provides functionality in an intuitive way and which can be adapted to different tasks. It can be used as a stand-alone tool for a wide range of tasks in speech research and education. Typical applications are speech/sound analysis and sound annotation/transcription. WaveSurfer can also serve as a platform for more advanced/specialized applications. This is accomplished either through extending the WaveSurfer application with new custom plug-ins or by embedding WaveSurfer visualization components in other applications.
- Speech Filing System
- SFS 4/Windows is a free computing environment for PCs for conducting research into the nature of speech. It comprises software tools, file and data formats, subroutine libraries, graphics, special programming languages and tutorial documentation. It performs standard operations such as acquisition, replay, display and labelling, spectrographic and formant analysis and fundamental frequency estimation. It comes with a large body of ready made tools for signal processing, synthesis and recognition, as well as support for your own software development.
more....
¥¼¾ã²z¸ê®Æ
³sÄò vs. ¤£³sÄò»yµ¿é¤J
»yµ¿ëÃѧ޳N¦bÓ¤H¹q¸£¤W¥i¤À¦¨»yµ¾Þ±±¤Î»yµ¿é¤J¡C»yµ¾Þ±±¬O¥Î»yµ«ü¥O¨Ó¾Þ§@¹q¸£, ¦Ó»yµ¿é¤J«h¬O¥Î¨Ó¿é¤J¤å¦r¡C¦Ó¦´Áªº»yµ¿é¤J¬O©Ò¿×¡u¤£³sÄò¡v(discrete
©Î discontinuous) ªº, ¤]´N¬O»¡, ¦b¦r»P¦r¤§¶¡¬O»Ýn¦³µu¼È¼È°±ªº¡C¦ÓÀHµÛÓ¤H¹q¸£µwÅé©Ê¯àªº¤£Â_´£ª@¡B»ù®æªº¤£Â_¤U·Æ,
¥H¤Î»yµ¿ëÃѧ޳Nªº¤£Â_ºë¶i, ±q 1997 ¦~¤U¥b¦~°_, ¹q¸£»yµ¿é¤J¥¿¦¡¶i¤J¨ì¡u³sÄò¡v (continuous) ¿é¤J®É´Á¡C
¬Û¹ï©ó¤£³sÄò»yµ¿é¤J, ³sÄò»yµ¿é¤J¦b¦r»P¦r¤§¶¡¬O¤£»Ýn¼È°±ªº, ¨Ï¥ÎªÌ¥i¥H±N¾ãÓ¥y¤l¤@®ð¨þ¦¨¦a©À§¹¡C¥H^¤åªº»yµ¿ëÃѲ£«~¨Ó»¡, ³Ì¤jªº¨â®a¼t°Ó¬°
IBM ¤Î Dragon¡C¦Ó IBM ¤½¥q®µµÛ¨äÃe¤jªº¬ãµo¤Î¦æ¾P¸ê·½, ¤]¤£Â_¶}µo¨ä¥¦°ê®aªº»y¨¥ª©¥», ¤¤¤å´N¬O IBM ViaVoice
²£«~ªº²Ä¤KºØ¤ä´©»y¨¥¡C
°ê¤º§Þ³Nµo®i²{ªp
§Ú°êªº»yµ¿ëÃѲ£«~¶}µo¥H¥»°ê»y¨¥¡Ð¤¤¤å(°ê»y)¬°¥D¡C°ê¤º·~¬É¥H¥x±d¤½¥qº¥ý©ó1991¦~±À¥X»yµ¿ëÃѲ£«~¡u±¶³q¡v»yµ¿é¤J¨t²Î¡A¥]¬A¤¤¤å»yµÅ¥¼g¡B¤¤¤å»yµ«ü¥O¡B¤¤¤å»yµ¦X¦¨µ¥¥\¯à¡CÊ
¤Ñ¤½¥q¥ç©ó1994¦~µoªí¡u¸Ü§X¤l¡v»yµ¿ëÃѲ£«~¡C¨âªÌ§¡ÄݯS©w¤H¡B³æ¦rµ¿ëÃѪº²£«~¡A¿ëÃѲv¤£¦p²z·Q¡C¤£¦p²z·Q¡C¦¹¥~¡A°ê¬ì·| ªº²£¾Ç¦X§@pµe¥ç¦³¦h®a¼t°Ó°Ñ»P¡A¥x¤j¡þ¤¤¬ã°|ªº¡uª÷Án¡v¨t¦C
°ê»yÅ¥¼g¾÷°Ñ»P¼t°Ó¦³Ê¤Ñ¡B©úùÖ¡A¦¨¤jªº¡uµ¤¤¥P¡v¤¤¤åµü¿é¤J¨t²Î¦³¥x±d¡B§Þ¹q¡B©ô§»µ¥¡CµØ¶©·L¹q¤l¥ç´¿±À¥X¤pµü·J(20-40µü) »yµ¿ëÃÑ´¹¤ù(«¬¸¹:HM2007)¡C
1995¦~11¤ëÄ«ªG¹q¸£«Å¥¬¨ä¡u¤¤¤åÅ¥¼g¤u¨ã¡v¡AÄݯS©w¤H¡B³æµü¿ëÃѪº²£«~¡F§»ùÖ©ó1995¦~¤E¤ë±À¥X ªº¡u´÷±æ¡v¦h´CÅé®a¥Î¹q¸£¤]·f°t¤£¯S©w¤H¡B¤pµü·Jªº^¤å»yµ«ü¥O±±¨î¥\¯à¡C³Ìªñ³\¦h¼t°Ó¹ï¤¤¤å»yµ¹q¸£¤Î»yµ¿ëÃÑ´¹¤ùªº¶}µo §¡ªí¥Ü°ª«×¿³½ì¡C¥H¥Í©R´Á¦Ó¨¥¡A»yµ¿ëÃѲ£«~©|³B©óµÞªÞ°_¨B¶¥ ¬q¡A¥«³õ¦¨ªø²v°ª¡C
»yµ¿ëÃѧ޳Nªºµo®i¡A¦b¼Ú¬üµ¥¥ý¶i°ê®a¥Ñ¨Ó¤w¤[¡A§Ú°ê¦b³o¶µ §Þ³Nªºµo®i¡A¦´Á¥H¾Ç³N¬É¬°¥D¡A©l©ó¥x¤jªº°ê»yÅ¥¼g¾÷¡]1983¦~ ¡^¬ã¨spµe¡A²M¤j¡B¥æ¤j¡B¦¨¤jµ¥¥ç§¡§ë¤J¬ã¨s¦h¦~¡C¥æ³q³¡¹q«H
¬ã¨s©Ò¥ç¦³°¾«¹q«HÀ³¥Îªº»yµ¿ëÃѧ޳N¬ãµo¡C¸gÀÙ³¡¬ì§Þ±M®×¥ç ©ó1991¦~°_¤ä«ù¤u¬ã°|¹q³q©Ò§ë¤J¤¤¤å»yµ¿ëÃѧ޳Nªº¬ãµo¡A¦b°ò¦§Þ³N¤Î¹êÅçÀô¹Ò«Ø¥ß¤§«á¡A©ó1992¦~¤C¤ë°_©ó¡u«e¤©Ê¸ê°T§Þ³N
¬ã¨spµe¡v¦¨¥ß¤@¤lpµe¡A1993¦~§¹¦¨¤¤¤å»y¨¥¼Ò«¬³]©w¤ÎµwÅé¨t ²Î¥\¯à³]p¡A 1994¦~§¹¦¨¦b¤u§@¯¸¤§¤@¯S©w¤H¡B¤jµü·J¡B³æ¦rµ¤§°ê»yÅ¥¼g¾÷Âú«¬¨t²Î¾ã¦X¡A1995¦~§¹¦¨¥H¤À¬q¾÷²v¼Ò«¬¶}µo¤§¤£¯S
©w¤H¡B¤¤µü·J¡B³æ¦rµ²Õµü¿ëÃѧ޳N¡A1995¦~12¤ë¸ê°T®i®i¥X¡¨PC ª©«D¯S©w»yªÌ¤¤¤å»yµ¿ëÃѨt²Î¡¨¡A¬°¤£¯S©w¤H¡B¤¤µü·J¡B³sµµü ¿ëÃѧ޳N(¨t²Î¬yµ{¦p¹Ï¤@)¡A¥¿Ä~Äò¬ãµo»yªÌ½Õ¾A¡B¾¸Án¼Ò«¬¡B³Á
§J·½Õ¾A¡B¼Ð·ÇÀ³¥Îµ{¦¡¤¶±¡B¶i¤@¨B´£°ª¿ëÃѲvµ¥§Þ³N¡A¨Ï¸Ó§Þ ³N¥i¥H¹ê¥Î¤Æ¡B°Ó«~¤Æ¡A¥Ñ¡u¯à¥Î¡v³vº¥¨«¦V¡u¦n¥Î¡v¡B¡u¨ì³B¥i¥Î¡v¡B©M¡uÀH®É¥i¥Î¡vªº¹Ò¬É¡C¥t¤@¤è±¡A¤u¬ã°|¹q³q©Ò¤]¦P®É§ë
¤J¯S©w¤H¡B¤pµü·J»yµ¿ëÃÑ´¹¤ùªº¶}µo¡A¥Dn¬°¨ó§U¥b¾ÉÅé¼t°Ó¶i ¤J®ø¶O©Ê¹q¤l©Ò»Ýªº»yµ¿ëÃÑ´¹¤ù»â°ì¡C¦¹¥~¡A¤]±N¶}©l»yµ¦X¦¨¤ÎÀ£ÁY§Þ³Nªºµo®i
°ê¥~¬ÛÃö¬ã¨s
| ¬ã¨s³æ¦ì |
²£«~ |
| AT&T Bell Labs |
Bell Labs TTS |
| BT Labs |
Laureate |
| Entropic |
Truetalk |
| Microsoft Research |
Whistler |
| Lernout & Hauspie |
TTS3000/M |
| Lucent |
Next Generation Speech |
| CSTR University of Edinburgh |
Festival |
| ETI-Eloquence |
ETI-Eloquence |
| Lernout & Hauspie |
Realspeak |
| Elan informatique |
Elan Speech Engine |
| »´ä¤¤¤å¤j¾Ç¤H¾÷³q°T¹êÅç«Ç |
CU VOCAL ¡u±y´¡v»yµ¦X¦¨¨t²Î |
°ê¤º¬ÛÃö¬ã¨s
| ¬ã¨s³æ¦ì |
²¤¶ |
¥x¤j§õµY¤s±Ð±Â
¬ÛÃö¸ê®Æ
|
ª÷Án¤@¸¹¤G¸¹¤T¸¹µ¥Å¥¼g¨t²Î¡B¤å¦rÂà»yµ¨t²Î¡B¹ï¸Ü¨t²Î¡B»yµ¬°°ò¦¤§¸ê°TÀ˯Á¨t²Îµ¥¡C
¥D±q¦¡¬[ºc¡Bºô¸ô¤§»yµ¤¶±¡BÀHºô¸ô¸ê·½½Õ¾A¤§µü¨å©M»y¨¥¼Ò«¬¡Bºô¸ô»y®Æ³B²z¡BµL½uÀô¹Ò¤U¤§¤À´²¦¡»yµ³B²zµ¥¡C
|
| ²M¤j¤ý¤p¤t±Ð±Â |
³sÄò¤T¦~ªº°ê¬ì·|pµe¡u°ê»y»yµ¸ê®Æ®w¤§³]p»P«Ø¥ß(MATpµe)¡v(1995-1998)¡A§¹¦¨¬ù7000¤H¤§¹q¸Ü»yµ¸ê®Æ¦¬¶°¡A³o¬O°ê¤ºº¦¸¤j³W¼Òªº»yµ¦¬¶°¡A¥Øªº¦b«Ø¥ß¤@Ó¬ã¨sÀô¹Ò¡A´£¨Ñ°ê¤º»yµ³B²z§Þ³N¬ãµo¤u§@ªÌ¤@®M»yµ¸ê®Æ®w¡C¨ä¤¤¸ê®ÆÀɤ§½s¿èµ{¦¡(¨ú¦W¬°Veditor3.0)¤wµn°OµÛ§@Åv¡C³¡¤À»yµ¸ê®Æ³°µ³©e°U¤¤µØ¥Á°êpºâ»y¨¥¾Ç¾Ç·|µo¦æ¡AMAT-160¡BMAT-400¡BMAT-2400¤w´£¨Ñ¾Ç®Õ¤Î¬ã¨s³æ¦ì¨Ï¥Î¡A¨ä¤¤MAT-2400«h¥Ñ°ê¬ì·|¿ì²z§Þ³N²¾Âà¡C |
| ¦¨¤j¤ýÂ@µo±Ð±Â |
¥Î©ó®Ú¾Ú¤@ËÀWÃЫY¼Æ¹Bºâ¦¡¨Ó³B²z½u©Ê¹w´ú«Y¼Æ¤§¥H½u©Ê¹w´ú«Y¼Æ¬°°ò¦ªºËÀWÃЫY¼Æ²£¥Í¾¹
»yµ½s¸Ñ½X¤èªk¤Î»yµ½s¸Ñ½X¾¹ |
| ¥æ¤j³¯«H§»±Ð±Â |
¨Ï¥ÎÃý«ß°T®§¤§Ãþ¯«¸gºô¸ô°ê»y³sÄò»yµ¿ë»{
¤£¯S©w»yªÌ°ê»y³sÄòµ¸`¿ë»{§Þ³N¤§±´°Q
¾A¦Xµø»ÙªÌ¨Ï¥Î¤§¹q¸£¬É±§Þ³N»P¨t²Î³]p-¤lpµe¤G:ª¼¥Î¹q¸£¤§°ê»y³æµü¿é¤J¤Î»yµ¿é¥X¨t²Î¤§µo®i |
| ¤¤¬ã°|³\»D·G±Ð±Â |
¤¤¤å¦Pµ¦rªº¦Û°Ê¿ë»{¡F¤¤¤å¦rÂ൥H¤Î»yµ¦X¦¨¨t²Î¡F»yµ¿ë»{ªº«á³B²z¡]µÂà¦r¥H¤Î®e¿ù¨t²Î¡^¡FOCR¡BOLCRªº«á³B²z¨t²Î¡F¦UÃþ¦Û§Î¿é¤Jªk¦P½X¦rªº
¦Û°Ê¿ï¨ú¨t²Î¡F¤¤¤å¥y«¬åªR¡]PARSING¡^¥H¤ÎÂ_µü¨t²Îµ¥¡C |
| ¥x¤j³¯«H§Æ±Ð±Â |
1. åªR¨t²Î
2. ½u¤W§Y®É^¤å½¤¤¤åªA°È¨t²Î
3. »OÆW¥»¤g»y¨¥¤¬Ä¶¤Î»yµ¦X¦¨¨t²Î
4. ¤¤¤åÂ_µü¤Î¤H¦W¡B²Õ´¦W¿ëÃѨt²Î
5. ¦h¤å¥ó·s»D¦Û°ÊºKn¨t²Î |
| ¥x¬ì¤j¥jÂEª¢±Ð±Â |
¼W¶i°Ñ¼Æ¿W¥ß±±¨î¤§¼u©Ê¡B¨Ã¥i²£¥ÍÂ×´Iµ¦â¤§°ê»yµ¸`«H¸¹¦X¦¨¤èªk
¥i§@°ÊºAµ¦âÅÜ´«¤§ °ê»y »yµ¦X¦¨ ³nÅé"
«È®a»y(Hakka)»yµ«H¸¹¦X¦¨
¾ã¦X°ÊºAµü¨å»P°¨¥i¤Ò¤¤¤å»y¨¥¼Ò«¬¤§¤èªk |
| ªø©°§f¤¯¶é±Ð±Â |
¡u°ê¥xÂù»y»yµ¿ë»{¦Û°Ê±¾¸¹¨t²Î¡v¥H¤Î¡u¥x»y¤å¦rÂà»yµ¡]»yµ¦X¦¨¡^¨t²Î¡v
¥xÆW¦a°Ï¦h»y»yµ¸ê®Æ®w¤§«Ø¥ß
»yµ¹q¸Ü±¾¸¹Á`¾÷ |
»yµ¬ÛÃöªº¤¤¤å®ÑÄy
- ·¨Âí¥ú,"Visual Basic»P»yµ¿ëÃÑ-Åý¹q¸£Å¥¸Ü",ªQ±^,2002¡C
- ªL¾È¥Í,"¼Æ¦ì«H¸¹-¼v¹³»P»yµ³B²z",¥þµØ,1999¡C
- Á¨qµ^,"¼Æ¦ì»yµ°T¸¹°ò¥»ì²z",¥þµØ,1996¡C
- ¼B®¶·½,"Ãþ¯«¸gºô¸ô¼Ò«¬»P»yµÃѧO",¥þµØ,1995¡C
- ¤ý¤¯µØ,"¤H¾÷»yµ³q«H",Áp¸g,1995¡C
- ³\§Ó¿³,"ÁnÅQ¥d¤§À³¥Î»P»yµ¿ëÃÑ",ºX¼Ð,1994¡C(´Â¶§¹Ï®ÑÀ]¦³ÂîÑ)
- ³¯©ú¼ü,"PC¹q¸£»yµ¿ë»{¹ê°µ",ºX¼Ð,1994¡C(´Â¶§¹Ï®ÑÀ]¦³ÂîÑ)
- ¶À¹ÅµØ,"Ánµ»P¦h´CÅéPC",¥þµØ,1994¡C(´Â¶§¹Ï®ÑÀ]¦³ÂîÑ)
- ³\¹l,"·L¹q¸£À³¥Î-»yµ³B²z",¥þµØ,1993¡C(´Â¶§¹Ï®ÑÀ]¦³ÂîÑ,°¾IC³]p,±´°Qmidi¸û¦h,¸ê¤u¸ê¬ì¹q¤lI´º¸û¾A¦X)
- §d©úõ;¶À¥@¶§,"VB4.0°Êµe»P»yµ§Þ¥©¯µÓD-¨Ï¥Îª«¥ó¾É¦Vµ{¦¡³]p",ªQ±^,?¡C
ºô¸ô¸ê·½
- »yµ¬ã¨sÀ³¥Î³nÅé
- ª`µ²Å¸¹Â²¤¶
- Pin-yin
- ¤j³°ªº¤H¾÷»yµ¥æ¤¬¬ì¬ã²Õ(TTS)
- »yµ¦X¦¨§Þ³Nªºì²z
- ¤j³°ªº»yµ¦X¦¨¬ÛÃöºô¶
- Speech Synthesis & Analysis
Software
- ¤¤¥¡¬ã¨s°|»y¨¥¾Ç¬ã¨s©ÒÄw³Æ³B»yµ¹êÅç«Ç
- µÀWµø°T®æ¦¡¤¶²Ð
- §d§Ó«i³Õ¤h
- ³¯¥Ã©Ó Evan Chen
- Examples of Synthesized Speech
- Speech Analysis Tutorial
- Pitch Analysis
¥D ºô ¯¸¡Ghttp://peterju.notlong.com
(¥Ø«eÂà§}¦Ü http://irw.ncut.edu.tw/peterju/)