国立国語研究所
 
 

Edo Period Series Overview INTRODUCTION

As a trial version of the Corpus of Historical Japanese, Edo Period Version, we have made publicly available the Share-bon Corpus (Himawari version) (Ver.0.5, Short Unit Word-annotated) and the Ninjō-bon Corpus (Himawari version) (Ver.0.1, string-search only).
The error whereby "matches partially" was displayed as "matches beginning" was corrected. <2015.11.27>
The problems whereby the image of the original text could not be opened and whereby the page numbers failed to match was corrected. Furthermore, to use the data in this corpus, please install Himawari Version 1.5.3 or later. Problems may exist for earlier versions. <2015.11.06>

Share-bon, Edo Period Series I

Himawari version Share-bon Corpus Ver. 0.5 Overview

  • Hijiri Yūkaku 聖遊廓 (1757 [Hōreki 7] / Annonymous / 1 volume / Ōsaka)
    Source text: Share-bon Taisei Henshū Iinkai (ed.) Share-bon Taisei (1978-1988 Chūōkōron-sha)
    Image of corresponding text: NINJAL repository
  • Katō Hōgen Hako-makura 河東方言箱まくら (1822 [Bunsei 5] / Ōgokudō no Arittake (author) / 3 volumes / Kyōto)
    Source text: Share-bon Taisei Henshū Iinkai (ed.) Share-bon Taisei (1978-1988 Chūōkōron-sha)
    Image of Corresponding text: NINJAL repository
  • Tama-giku Zenden Sato-kagami 玉菊全伝花街鑑 (1822 [Bunsei 5] / Bisanjin (author) 3 volumes / Edo)
    Source text: Share-bon Taisei Henshū Iinkai (ed.) Share-bon Taisei (1978-1988 Chūōkōron-sha)
    Image of Corresponding text: NINJAL repository

Share-bon (novelettes set in the pleasure quarters) are literary works including dialogue in which the language of Edo and Kyoto in the Late Edo Period is depicted. They are considered to be crucial materials for understanding the spoken language of the period. The source text, Share-bon Taisei Henshū Iinkai (ed.)
Share-bon Taisei (1978-1988 Chūōkōron-sha) is a large-scale compendium of texts in 30 volumes transliterated into type, and is frequently referred to in research on the history of the Japanese language.

This corpus has the following characteristics:  

  • In all of the texts morphological information ("reading" (pronunciation), part-of-speech, inflection, etc.) is annotated for Short Unit Words;
  • A recension has been conducted for part of the main text, with annotations for such things as categorization of dialogue, and information on the identity of the speakers;
  • For the information on the identity of the speakers, terms of reference have been regularized, and information on gender, social status, and regional differences have been added;
  • From each item in a search result a link to the image of the corresponding page of the text has been provided.

How to access the corpus

Please read the following instructions before use:

Himawari version Share-bon Corpus is offered under the Creative Commons Attribution-Non-Commericial 4.0 International License. Please follow the articles of agreement when using, sharing, or modifying the data.
クリエイティブ・コモンズ・ライセンス

Download the corpus data through the link below:

Download:

How to use the Himawari version Share-bon Corpus

1. Following the instructions at the link below, first download Himawari.
  Refer to: Complete Test Search System Himawari

2. Download the data for Himawari version Share-bon Corpus.
* For Windows users, be sure to click the downloaded file Himawari version Share-bon Corpus, and confirm that the security block has been disabled at [Properties] > [General]  

3. Once the zip file for Himawari version Share-bon Corpus has been decompressed, the folder sharebon_himawari will appear. Confirm that the following files are included in the folder:
 - Corpora folder ... a folder in which data has been stored
 - config_sharebon.xml ... a configuration file

4. Move the Corpora folder and the config_sharebon.xml file into the folder in which Himawari has been installed. There might be a request for overwrite confirmation, but continue with the move. This completes the installation process.

5. Once Himawari has been started, go to File and execute "new", then open config_sharebon.xml. Now searches are enabled. For how to use Himawari, please refer to the users' manual file in the folder in which Himawari has been installed.

Note for Users

Presentations of research results using this corpus must include a citation taking the general form of the example below (with appropriate modifications depending on the version and the date of access):

  • National Institute for Japanese Language and Linguistics, Center for Corpus Development (Ichimura,Tarō, et. al.) (eds.) (2015) Himawari version Share-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#share (Ver. 0.5) (accessed October 26, 2016)

* As long as one of either the version or the date of access is clearly cited, the other can be omitted, as below:

  • National Institute for Japanese Language and Linguistics (2015) Himawari version Share-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#share (Ver. 0.5)
  • National Institute for Japanese Language and Linguistics (2015) Himawari version Share-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#share (accessed March 31, 2016)

Development Staff (Edo Period Series I, Share-bon)

    Developers

  • Ichimura,Tarō (Project researcher, temporary)

    Assisting in Development

  • Ogiso, Toshinobu (Associate Professor); Tanaka, Makirō (Invited Scholar / Professor, Meiji University); Takada, Tomokazu (Associate Professor)
  • Kōno, Tomoaki; Kawase, Akihiro; Watanabe, Yuki (Associate Researchers)
  • Murayama, Miwako; Takita, Yūko (Support Technicians)
  • Suzuki, Nao; Kitazaki, Yūho; Yamada, Rina (Production assistants)

    Joint Researchers

  • Okabe, Yoshiyuki (Professor, Chiba University); Murakami, Ken (Professor, Kansai Gakuin University)

* Titles are current with the time of development.

The corpus compilation was supported by the NINJAL collaborative project "Design for a Diachronic Corpus" (2009-2016) and JSPS KAKENHI Grant Number JP15K16765 (2015-2018).


Ninjō-bon, Edo Period Series II

Himawari version Ninjō-bon Corpus Ver. 0.2 Overview

Ninjō-bon (lit. "sentimental books") are literary works including dialogue in which the language at the end of the Edo Period is depicted. They are considered to be crucial spoken-language materials for understanding the transitional stages from Modern Edo Japanese to contemporary Tokyo Japanese. In this corpus, the source texts have been transliterated into type in the process of building the text data.

This corpus has the following characteristics:

  • Searching is possible for both the main text and the ruby.
  • A recension has been conducted for part of the main text, with annotations for such things as categorization of dialogue, and information on the identity of the speakers;
  • From each item in a search result a link to the image of the corresponding page has been provided.
  • You can easily check on what line the search term is on the page of the original image. (access to IIIF viewer Mirador)

How to access to corpus

Please read the following instructions before use:

Himawari version Ninjō-bon Corpus is offered under the Creative Commons Attribution-Non-Commericial 4.0 International License. Please follow the articles of agreement when using, sharing, or modifying the data.
creativecommons

Download the corpus data through the link below:

Download:

How to use the Himawari version Ninjō-bon Corpus

1. Following the instructions at the link below, first download Himawari.
  Refer to: Complete Test Search System Himawari

2. Download the data for Himawari version Ninjō-bon Corpus.
* For Windows users, be sure to click the downloaded file Himawari version Ninjō-bon Corpus, and confirm that the security block has been disabled at [Properties] > [General]

3. Once Himawari has been started, go to File and execute "install", then open ninjobon_Himawari_0.2.zip. Now searches are enabled. For how to use Himawari, please refer to the users' manual file in the folder in which Himawari has been installed.

Note for Users

Presentations of research results using this corpus must include a citation taking the general form of the example below (with appropriate modifications depending on the version and the date of access):

  • National Institute for Japanese Language and Linguistics (Fujimoto, Akari et. al.) (eds.) (2017) Himawari version Ninjō-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#ninjou (Ver. 0.2) (accessed November 8, 2017)

* As long as one of either the version or the date of access is clearly cited, the other can be omitted, as below:

  • National Institute for Japanese Language and Linguistics (2017) Himawari version Ninjō-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#ninjou (Ver. 0.2)
  • National Institute for Japanese Language and Linguistics (2017) Himawari version Ninjō-bon Corpus (Corpus of Historical Japanese, Edo Period Series) http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#ninjou (accessed November 8, 2017)

Development Staff (Edo Period Series II, Ninjō-bon)

    Developers

  • Fujimoto, Akari (Project Assistant Professor); Takada, Tomokazu (Associate Professor)

    Assisting in Development

  • Okabe, Yoshiyuki (Professor, Chiba University); Ogiso, Toshinobu (Associate Professor)
  • Ichimura,Tarō; Zeniya, Masato (Associate Researcher)
  • Kitazaki, Yūho; Fukuyama, Masami (Production assistant)

* Titles are current with the time of development.

The corpus compilation was supported by the "Refining the Corpus of Historical Japanese with Information on Notation and Bibliographical Form" (2009-2016) project and the "The Construction of Diachronic Corpora and New Developments in Research on the History of Japanese" (2016-) project.

 
 
event
unidic_bnr

日本語をはじめとする言語を分析するための基礎資料として、書き言葉や話し言葉の資料を体系的に収集し、研究用の情報を付与したものです。