We make publicly available the morphologically annotated data (Short Unit Word Data) from magazines of the Meiji and Taishō eras.
Please note that Long Unit Word information is not annotated for any of the magazines in this corpus.
The titles, publication dates, (and volume numbers) of the magazines included in the Meiji Era / Taishō Era Series I: Magazines corpus are as follows:
This corpus has been created by grooming, expanding, and unifying four corpora that had been previously published: Meiroku Zasshi Corpus, Kokumin no Tomo Corpus, Taiyō Corpus, the Corpus of Modern Women's Magazines. It collects a certain number of years' magazine publications representative of each era so that changes in chronological periods can be followed in a close survey of the written language of the Meiji and Taishō eras.
In addition to the Short Unit Word morphological information rendered searchable in the text of this corpus, information on the genre, title of the magazine, article,
and author has been annotated and can be examined in search conditions and examined in the search results through the corpus search application Chūnagon.
Please read the following abstract for this corpus before use:
Please see the following pages for details on the specifications of the four sub-corpora that comprise this corpus:
The Short Unit Word morphological information for this corpus has been annotated in consideratino of a difference between Literary and Colloquial Japanese, according to the annotation guidelines listed below:
Please consult the Annotation Guidelines before use.
Images of the original texts of the magazines can be accessed through the search results from Chūnagon, so that the text of the corpus can be compared to the original documents during use. Please note that images of the original texts for Jogaku Zasshi, Jogaku Sekai, and Fujin Kurabu aren't available.
Presentations of research results using this corpus must include a citation taking the general form of the example below (with appropriate modifications depending on the version and the date of access):
* As long as one of either the version or the date of access is clearly cited, the other can be omitted, as below:
Users will need to access the Corpus of Historical Japanese through the online search engine Chunagon. Completion of a Users Licensing Agreement is required.
Please refer to the following: The Corpus of Historical Japanese: How to apply
* Titles are current with the time of development.
The corpus compilation was supported by the "Design for a Diachronic Corpus" (2009--2016) project and the "The Construction of Diachronic Corpora and New Developments in Research on the History of Japanese" (2016--) project.