National Institute for Japanese Language and Linguistics (NINJAL)
 
 

Edo Period Series Overview INTRODUCTION

As part of the Edo Period Series, we have released a Share-bon corpus annotated with morphological information (for Short Unit Words). The corpus comprises the 30 works listed below.
Trial versions are also available for download below: "Share-bon Corpus Ver. 0.5" (3 works, annotated for Short Unit Words) and "Ninjō-bon Corpus Ver. 0.2" (string search only).

Share-bon, Edo Period Series I

Overview

Share-bon (novelettes set in the pleasure quarters) are literary works including dialogue in which the language of Edo and Kyoto in the Late Edo Period is depicted. They are considered to be crucial materials for understanding the spoken language of the period. The source text, Share-bon Taisei Henshū Iinkai (ed.) Share-bon Taisei (1978-1988 Chūōkōron-sha) is a large-scale compendium of texts in 30 volumes transliterated into type, and is frequently referred to in research on the history of the Japanese language.

This corpus has the following characteristics:

  • In all of the texts morphological information ("reading" (pronunciation), part-of-speech, inflection, etc.) is annotated for Short Unit Words;
  • A recension has been conducted for part of the main text, with annotations for such things as categorization of dialogue, and information on the identity of the speakers;
  • For the information on the identity of the speakers, terms of reference have been regularized, and information on gender, social status, and regional differences has been added;
  • Each item in a search result links to an image of the corresponding page of the text. (However, the image may not necessarily match the text of the corpus. For details, please see the following usage information.)

Source Text

  • Share-bon Taisei Henshū Iinkai (ed.). Share-bon Taisei. Tokyo: Chūōkōron-sha, 1978-1988.

Works Comprising the Corpus

Ōsaka 大坂

  • Hijiri Yūkaku 聖遊廓
  • Gekka Yojō 月花余情
  • Shin Gekka Yojō 新月花余情
  • Yōdai Ihen / Shōkaku Higen 陽台遺編 / 〓*閣秘言
  • Ihon Kakuchū Kitan 異本郭中奇譚
  • Tanka Zuiyō 短華蘂葉
  • Hokka Tsūjō 北華通情
  • Nan'yūki 南遊記
  • Tōsei Sui no Akebono 当世粋の曙
  • Iro Fukami Sorane no Yume 色深猍睡夢
* 女+壯 (U+21742)

Kyōto 京都

  • Shimabara Yasamonogatari 原柳巷花語
  • Ron no nai Sato Mondō 無論里問答
  • Fūryū Hadaka Ningyō 風流裸人形
  • Seirō Oranda Kagami 青楼阿蘭陀鏡
  • Shōheiraku 昇平楽
  • Tōsei Uso no Kawa 当世嘘之川
  • Teike no Hana 竊潜妻
  • Ta ga Omokage 誰が面影
  • Katō Hōgen Hako-makura 河東方言箱まくら
  • Kyoto Tsuki 興斗月

Edo 江戸

  • Kakuchū Kitan 郭中奇譚
  • Kyōja Hōgen 侠者方言
  • Nankei Zatsuwa 南閨雑話
  • Kōeki Shin'wa 甲駅新話
  • Tōsei Sayōsa 当世左様候
  • Fukagawa Shin'wa 深川新話
  • Sōmagaki 総籬
  • Shikake Bunko 仕懸文庫
  • Sato-kagami 花街鑑
  • Sato-suzume 花街寿々女

For details of the works, please refer to the following: List of the Works

Overview / Annotation Guidelines

Please read before using.

Note for Users

Presentations of research results using this corpus must include a citation taking the general form of the example below (with appropriate modifications depending on the version and the date of access):

  • National Institute for Japanese Language and Linguistics (Murayama, Miwako, et al.) (eds.) (2018) "Corpus of Historical Japanese, Edo Period Series, Volume I: Share-bon." (Short Unit Word data 0.9, Chunagon version 2.4.2) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html (accessed March 30, 2018)

* As long as one of either the version or the date of access is clearly cited, the other can be omitted, as below:

  • National Institute for Japanese Language and Linguistics (2018) "Corpus of Historical Japanese, Edo Period Series, Volume I: Share-bon." http://pj.ninjal.ac.jp/corpus_center/chj/edo.html (accessed March 30, 2018)
  • National Institute for Japanese Language and Linguistics (2018) "Corpus of Historical Japanese, Edo Period Series, Volume I: Share-bon." http://pj.ninjal.ac.jp/corpus_center/chj/edo.html (Short Unit Word Data 0.9)
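
The rule above (cite the version, the access date, or both) can be sketched as a small helper. This is only an illustration: the function name `format_citation` and the constant `BASE` are hypothetical, and the helper reproduces only the two shortened forms listed above, not the full form that names the editors and the Chunagon version.

```python
from typing import Optional

# Fixed part of the shortened citation, copied from the examples above.
BASE = (
    "National Institute for Japanese Language and Linguistics (2018) "
    '"Corpus of Historical Japanese, Edo Period Series, Volume I: Share-bon." '
    "http://pj.ninjal.ac.jp/corpus_center/chj/edo.html"
)


def format_citation(version: Optional[str] = None,
                    accessed: Optional[str] = None) -> str:
    """Build a shortened citation; at least one of the two fields is required."""
    if version is None and accessed is None:
        raise ValueError("cite the version, the date of access, or both")
    parts = [BASE]
    if version is not None:
        parts.append(f"({version})")
    if accessed is not None:
        parts.append(f"(accessed {accessed})")
    return " ".join(parts)
```

Calling `format_citation(accessed="March 30, 2018")` or `format_citation(version="Short Unit Word Data 0.9")` yields the two shortened forms shown above; calling it with neither argument raises an error, mirroring the requirement that one of the two must be cited.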

Users will need to access the Corpus of Historical Japanese through the online search engine Chunagon. Completion of a User's Licensing Agreement is required.
Please refer to the following: The Corpus of Historical Japanese: How to apply

Development Staff (Edo Period Series I: Share-bon)

    Developers

  • Murayama, Miwako (Adjunct Researcher)
  • Ichimura, Tarō (Assistant Professor, Tokoha University; Former Adjunct Researcher, The National Institute for Japanese Language and Linguistics)

    Assisting in Development

  • Ogiso, Toshinobu (Professor); Takada, Tomokazu (Associate Professor); Tanaka, Makirō (Professor, Meiji University)
  • Kōno, Tomoaki (Postdoctoral Research Fellow)
  • Kawase, Akihiro; Watanabe, Yuki; Katayama, Kurumi; Zeniya, Masato; Kondō, Asuko (Adjunct Researchers)
  • Takita, Yūko; Miura, Satsuki (Support Technicians)
  • Suzuki, Nao; Kitazaki, Yūho; Yamada, Rina; Ōkubo, Ayumi (Production Assistants)

    Joint Researchers

  • Okabe, Yoshiyuki (Professor, Chiba University); Murakami, Ken (Professor, Kwansei Gakuin University)

* Titles are those held at the time of development.

The corpus compilation was supported by the NINJAL collaborative project "Design for a Diachronic Corpus" (2009-2016), the project "The Construction of Diachronic Corpora and New Developments in Research on the History of Japanese" (2016-), and JSPS KAKENHI Grant Number JP15K16765 (2015-2018).

 Himawari Version Share-bon Corpus Ver. 0.5

Overview

  • Hijiri Yūkaku 聖遊廓 (1757 [Hōreki 7] / Anonymous / 1 volume / Ōsaka)
    Source text: Share-bon Taisei Henshū Iinkai (ed.) Share-bon Taisei (1978-1988 Chūōkōron-sha)
    Image of corresponding text: NINJAL repository
  • Katō Hōgen Hako-makura 河東方言箱まくら (1822 [Bunsei 5] / Ōgokudō no Arittake (author) / 3 volumes / Kyōto)
    Source text: Share-bon Taisei Henshū Iinkai (ed.) Share-bon Taisei (1978-1988 Chūōkōron-sha)
    Image of corresponding text: NINJAL repository
  • Tama-giku Zenden Sato-kagami 玉菊全伝花街鑑 (1822 [Bunsei 5] / Bisanjin (author) / 3 volumes / Edo)
    Source text: Share-bon Taisei Henshū Iinkai (ed.) Share-bon Taisei (1978-1988 Chūōkōron-sha)
    Image of corresponding text: NINJAL repository

Share-bon (novelettes set in the pleasure quarters) are literary works including dialogue in which the language of Edo and Kyoto in the Late Edo Period is depicted. They are considered to be crucial materials for understanding the spoken language of the period. The source text, Share-bon Taisei Henshū Iinkai (ed.) Share-bon Taisei (1978-1988 Chūōkōron-sha) is a large-scale compendium of texts in 30 volumes transliterated into type, and is frequently referred to in research on the history of the Japanese language.

This corpus has the following characteristics:

  • In all of the texts morphological information ("reading" (pronunciation), part-of-speech, inflection, etc.) is annotated for Short Unit Words;
  • A recension has been conducted for part of the main text, with annotations for such things as categorization of dialogue, and information on the identity of the speakers;
  • For the information on the identity of the speakers, terms of reference have been regularized, and information on gender, social status, and regional differences has been added;
  • Each item in a search result links to an image of the corresponding page of the text.

How to access the corpus

Please read the following instructions before use:

The Himawari version Share-bon Corpus is offered under the Creative Commons Attribution-NonCommercial 4.0 International License. Please follow the terms of the license when using, sharing, or modifying the data.

Download the corpus data through the link below:

Download:

How to use the Himawari version Share-bon Corpus

  • 1. First download Himawari, following the instructions at the link below.
      Refer to: Full-Text Search System Himawari
  • 2. Download the data for the Himawari version Share-bon Corpus.
    • Windows users: open the properties of the downloaded Himawari version Share-bon Corpus file and confirm under [Properties] > [General] that the security block has been removed.
  • 3. Once the zip file for the Himawari version Share-bon Corpus has been decompressed, the folder sharebon_himawari will appear. Confirm that it contains the following:
    • Corpora folder ... the folder in which the data is stored
    • config_sharebon.xml ... a configuration file
  • 4. Move the Corpora folder and the config_sharebon.xml file into the folder in which Himawari has been installed. If you are asked to confirm overwriting, proceed with the move. This completes the installation.
  • 5. Start Himawari, choose "new" from the File menu, and open config_sharebon.xml. Searching is now enabled. For instructions on using Himawari, refer to the user manual in the folder in which Himawari has been installed.
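
Steps 3 and 4 above can also be scripted. The following is a minimal sketch, not an official installer: the function name `install_sharebon` and both path arguments are hypothetical, and it assumes the archive unpacks to a top-level folder named sharebon_himawari, as described in step 3. It copies the files instead of moving them, so the unpacked originals are left untouched.

```python
import shutil
import tempfile
import zipfile
from pathlib import Path

# Names taken from steps 3 and 4 above.
EXTRACTED_FOLDER = "sharebon_himawari"
REQUIRED_ITEMS = ("Corpora", "config_sharebon.xml")


def install_sharebon(zip_path: Path, himawari_dir: Path) -> list[Path]:
    """Unpack the corpus archive and copy its contents into the Himawari folder."""
    with tempfile.TemporaryDirectory() as tmp:
        with zipfile.ZipFile(zip_path) as archive:
            archive.extractall(tmp)
        source = Path(tmp) / EXTRACTED_FOLDER

        # Step 3: confirm that the data folder and the configuration file exist.
        missing = [name for name in REQUIRED_ITEMS if not (source / name).exists()]
        if missing:
            raise FileNotFoundError(f"missing from archive: {missing}")

        # Step 4: copy both items into the Himawari folder, overwriting
        # any files of the same name that are already there.
        installed = []
        for name in REQUIRED_ITEMS:
            src, dst = source / name, himawari_dir / name
            if src.is_dir():
                shutil.copytree(src, dst, dirs_exist_ok=True)
            else:
                shutil.copy2(src, dst)
            installed.append(dst)
        return installed


# Example call (both paths are placeholders for your own locations):
# install_sharebon(Path("sharebon_himawari.zip"), Path("C:/Himawari"))
```

The script does not replace step 5: config_sharebon.xml still has to be opened from within Himawari before searching.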

Note for Users

Presentations of research results using this corpus must include a citation taking the general form of the example below (with appropriate modifications depending on the version and the date of access):

  • National Institute for Japanese Language and Linguistics, Center for Corpus Development (Ichimura, Tarō, et al.) (eds.) (2015) Himawari version Share-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#share (Ver. 0.5) (accessed October 26, 2016)

* As long as one of either the version or the date of access is clearly cited, the other can be omitted, as below:

  • National Institute for Japanese Language and Linguistics (2015) Himawari version Share-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#share (Ver. 0.5)
  • National Institute for Japanese Language and Linguistics (2015) Himawari version Share-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#share (accessed March 31, 2016)

Development Staff (Himawari version Share-bon Corpus)

    Developers

  • Ichimura, Tarō (Adjunct Researcher)

    Assisting in Development

  • Ogiso, Toshinobu (Associate Professor); Tanaka, Makirō (Invited Scholar, The National Institute for Japanese Language and Linguistics; Professor, Meiji University); Takada, Tomokazu (Associate Professor)
  • Kōno, Tomoaki; Kawase, Akihiro; Watanabe, Yuki (Associate Researchers)
  • Murayama, Miwako; Takita, Yūko (Support Technicians)
  • Suzuki, Nao; Kitazaki, Yūho; Yamada, Rina (Production Assistants)

    Joint Researchers

  • Okabe, Yoshiyuki (Professor, Chiba University); Murakami, Ken (Professor, Kwansei Gakuin University)

* Titles are those held at the time of development.

The corpus compilation was supported by the NINJAL collaborative project "Design for a Diachronic Corpus" (2009-2016) and JSPS KAKENHI Grant Number JP15K16765 (2015-2018).

Ninjō-bon, Edo Period Series II

 Himawari Version Ninjō-bon Corpus Ver. 0.2

Overview

Ninjō-bon (lit. "sentimental books") are literary works including dialogue in which the language at the end of the Edo Period is depicted. They are considered to be crucial spoken-language materials for understanding the transitional stages from Modern Edo Japanese to contemporary Tokyo Japanese. In this corpus, the source texts have been transliterated into type in the process of building the text data.

This corpus has the following characteristics:

  • Both the main text and the ruby can be searched;
  • A recension has been conducted for part of the main text, with annotations for such things as categorization of dialogue, and information on the identity of the speakers;
  • Each item in a search result links to an image of the corresponding page;
  • The line on which a search term appears can easily be located on the image of the original page (via the IIIF viewer Mirador).

How to access the corpus

Please read the following instructions before use:

The Himawari version Ninjō-bon Corpus is offered under the Creative Commons Attribution-NonCommercial 4.0 International License. Please follow the terms of the license when using, sharing, or modifying the data.

Download the corpus data through the link below:

Download:

How to use the Himawari version Ninjō-bon Corpus

  • 1. First download Himawari, following the instructions at the link below.
      Refer to: Full-Text Search System Himawari
  • 2. Download the data for the Himawari version Ninjō-bon Corpus.
    • Windows users: open the properties of the downloaded Himawari version Ninjō-bon Corpus file and confirm under [Properties] > [General] that the security block has been removed.
  • 3. Start Himawari, choose "install" from the File menu, and open ninjobon_Himawari_0.2.zip. Searching is now enabled. For instructions on using Himawari, refer to the user manual in the folder in which Himawari has been installed.

Note for Users

Presentations of research results using this corpus must include a citation taking the general form of the example below (with appropriate modifications depending on the version and the date of access):

  • National Institute for Japanese Language and Linguistics (Fujimoto, Akari, et al.) (eds.) (2017) Himawari version Ninjō-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#ninjou (Ver. 0.2) (accessed November 8, 2017)

* As long as one of either the version or the date of access is clearly cited, the other can be omitted, as below:

  • National Institute for Japanese Language and Linguistics (2017) Himawari version Ninjō-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#ninjou (Ver. 0.2)
  • National Institute for Japanese Language and Linguistics (2017) Himawari version Ninjō-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#ninjou (accessed November 8, 2017)

Development Staff (Himawari version Ninjō-bon Corpus)

    Developers

  • Fujimoto, Akari (Project Assistant Professor); Takada, Tomokazu (Associate Professor)

    Assisting in Development

  • Okabe, Yoshiyuki (Professor, Chiba University); Ogiso, Toshinobu (Associate Professor)
  • Ichimura, Tarō; Zeniya, Masato (Associate Researchers)
  • Kitazaki, Yūho; Fukuyama, Masami (Production Assistants)

* Titles are those held at the time of development.

The corpus compilation was supported by the project "Refining the Corpus of Historical Japanese with Information on Notation and Bibliographical Form" (2009-2016) and the project "The Construction of Diachronic Corpora and New Developments in Research on the History of Japanese" (2016-).

 
 
