日本語 NINJAL
 

Repord: "Construction of the Corpus of Spontaneous Japanese" Published: 26/06/2006

 

This title was published as NINJAL report no. 124. Revised to include organized versions of the various manuals that had been created throughly the CSJ's construction, its breadth is unprecedented world-wide. It can also be treated as a manual of the CSJ.
The contents of the report are shown below. Please click on the chapter you wish to view.

"The construction of the Corpus of Spontaneous Japanese" (In Japanese)
Chapter Title Author Start - end page
Preface Seiju Sugito i
Table of Contents
iii-xvi
Chapter 1: Overview Kikuo Maekawa 1-22
Chapter 2: Transcriptions Hanae Koiso, Kenya Nishikawa, Yoko Mabuchi 23-132
Chapter 3: Morphological Information Hideki Ogura 133-186
Chapter 4: SUW and LUW databases Masaya Yamaguchi 187-254
Chapter 5: Clause unit information Takehito Maruyama, Katsuya Takanashi, Kiyotaka Uchimoto 255-322
Chapter 6: Phone information Masako Fujimoto, Hideaki Kikuchi, Kikuo Maekawa 323-346
Chapter 7: Prosodic information Yosuke Igarashi, Hideaki Kikuchi, Kikuo Maekawa 347-454
Chapter 8: XML documents Hideaki Kikuchi, Wataru Tsukahara 455-526
Chapter 9: Searching the CSJ Kikuo Maekawa 527-542
Bibliography
543-546
Index
547-552



Full ReportBatch DL(34.9MB)

"Corpus of Spontaneous Japanese DVD edition manual"

(Published: 26/06/2006, Revised: 28/11/2011)

 
TitleUpdate
Overview of the "Corpus of Spontaneous Japanese" (<- please read 1st. , Revised : 28/11/2011)
Overview of speech recording work (<- Fixed issues : 25/03/2008)
Metadata
Text transcription
Recognition of bunsetsu
Overview of the morphological information in the CSJ
Short unit word and long unit word data manual
Short unit word dictionary manual
Segmental labelling in the CSJ (<- Revised : 28/11/2011)
Intonation labelling in the CSJ (<- Revised : 28/11/2011)
Overview of impression rating data (<- Fixed issues : 25/03/2008)
Clause unit assignment in the CSJ (<- Fixed typos : 4/10/2007)
Dependency structure in the CSJ (<- Revised : 28/11/2011)
Summarization and important sentence extraction
Discourse boundary information in the CSJ
Specifications of acoustic and language models for speech recognition
CSJ XML documents (<- Revised : 28/11/2011)
CSJ clause unit XML documents (<- Newly added : 10/03/2006)
Use of the CSJ clause unit XML viewer (<- Newly added : 21/08/2006)
Text editing of important sentence data (<- Newly added : 25/03/2008)