Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names
The Landscapes of Injustice project seeks to encode mid-twentieth-century documents by and about the Japanese-Canadian community so they are accessible to modern audiences. The fundamental problem is that some of the kanji used at that time have been replaced since then by different kanji, and other...
Guardado en:
Autor principal: | |
---|---|
Formato: | article |
Lenguaje: | DE EN ES FR IT |
Publicado: |
OpenEdition
2019
|
Materias: | |
Acceso en línea: | https://doaj.org/article/6da9d84d938248c5836b434b77c517e8 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:6da9d84d938248c5836b434b77c517e8 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:6da9d84d938248c5836b434b77c517e82021-12-02T11:28:57ZEncoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names2162-560310.4000/jtei.2301https://doaj.org/article/6da9d84d938248c5836b434b77c517e82019-08-01T00:00:00Zhttp://journals.openedition.org/jtei/2301https://doaj.org/toc/2162-5603The Landscapes of Injustice project seeks to encode mid-twentieth-century documents by and about the Japanese-Canadian community so they are accessible to modern audiences. The fundamental problem is that some of the kanji used at that time have been replaced since then by different kanji, and others have been removed from lists of formally acceptable characters. This report documents our efforts with two technologies designed to address this situation. The first is the Standardized Variation Sequence (SVS) feature of Unicode. Our work revealed that this set of variation sequences does not completely cover the old and new glyph pairs identified by the Japanese authorities, and that the pairs formally identified by the Japanese authorities do not completely cover all the new glyph forms in general use. We turned to TEI’s <charDecl>, <glyph>, and <mapping> elements as a second technology to augment the support provided by Unicode. Lastly, we dealt with the issue of finding suitably qualified people to do the markup. The result is markup which retains the original glyphs and relates them to the modern glyphs, so that in our output products we will be able to support search and display using either form of the glyph.Stewart ArneilOpenEditionarticlekanjiunicodeglyphvariantJapaneseComputer engineering. Computer hardwareTK7885-7895DEENESFRITJournal of the Text Encoding Initiative, Vol 12 (2019) |
institution |
DOAJ |
collection |
DOAJ |
language |
DE EN ES FR IT |
topic |
kanji unicode glyph variant Japanese Computer engineering. Computer hardware TK7885-7895 |
spellingShingle |
kanji unicode glyph variant Japanese Computer engineering. Computer hardware TK7885-7895 Stewart Arneil Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names |
description |
The Landscapes of Injustice project seeks to encode mid-twentieth-century documents by and about the Japanese-Canadian community so they are accessible to modern audiences. The fundamental problem is that some of the kanji used at that time have been replaced since then by different kanji, and others have been removed from lists of formally acceptable characters. This report documents our efforts with two technologies designed to address this situation. The first is the Standardized Variation Sequence (SVS) feature of Unicode. Our work revealed that this set of variation sequences does not completely cover the old and new glyph pairs identified by the Japanese authorities, and that the pairs formally identified by the Japanese authorities do not completely cover all the new glyph forms in general use. We turned to TEI’s <charDecl>, <glyph>, and <mapping> elements as a second technology to augment the support provided by Unicode. Lastly, we dealt with the issue of finding suitably qualified people to do the markup. The result is markup which retains the original glyphs and relates them to the modern glyphs, so that in our output products we will be able to support search and display using either form of the glyph. |
format |
article |
author |
Stewart Arneil |
author_facet |
Stewart Arneil |
author_sort |
Stewart Arneil |
title |
Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names |
title_short |
Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names |
title_full |
Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names |
title_fullStr |
Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names |
title_full_unstemmed |
Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names |
title_sort |
encoding disappearing characters: the case of twentieth-century japanese-canadian names |
publisher |
OpenEdition |
publishDate |
2019 |
url |
https://doaj.org/article/6da9d84d938248c5836b434b77c517e8 |
work_keys_str_mv |
AT stewartarneil encodingdisappearingcharactersthecaseoftwentiethcenturyjapanesecanadiannames |
_version_ |
1718395883716870144 |