Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names

The Landscapes of Injustice project seeks to encode mid-twentieth-century documents by and about the Japanese-Canadian community so they are accessible to modern audiences. The fundamental problem is that some of the kanji used at that time have been replaced since then by different kanji, and other...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autor principal: Stewart Arneil
Formato: article
Lenguaje:DE
EN
ES
FR
IT
Publicado: OpenEdition 2019
Materias:
Acceso en línea:https://doaj.org/article/6da9d84d938248c5836b434b77c517e8
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:6da9d84d938248c5836b434b77c517e8
record_format dspace
spelling oai:doaj.org-article:6da9d84d938248c5836b434b77c517e82021-12-02T11:28:57ZEncoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names2162-560310.4000/jtei.2301https://doaj.org/article/6da9d84d938248c5836b434b77c517e82019-08-01T00:00:00Zhttp://journals.openedition.org/jtei/2301https://doaj.org/toc/2162-5603The Landscapes of Injustice project seeks to encode mid-twentieth-century documents by and about the Japanese-Canadian community so they are accessible to modern audiences. The fundamental problem is that some of the kanji used at that time have been replaced since then by different kanji, and others have been removed from lists of formally acceptable characters. This report documents our efforts with two technologies designed to address this situation. The first is the Standardized Variation Sequence (SVS) feature of Unicode. Our work revealed that this set of variation sequences does not completely cover the old and new glyph pairs identified by the Japanese authorities, and that the pairs formally identified by the Japanese authorities do not completely cover all the new glyph forms in general use. We turned to TEI’s <charDecl>, <glyph>, and <mapping> elements as a second technology to augment the support provided by Unicode. Lastly, we dealt with the issue of finding suitably qualified people to do the markup. The result is markup which retains the original glyphs and relates them to the modern glyphs, so that in our output products we will be able to support search and display using either form of the glyph.Stewart ArneilOpenEditionarticlekanjiunicodeglyphvariantJapaneseComputer engineering. Computer hardwareTK7885-7895DEENESFRITJournal of the Text Encoding Initiative, Vol 12 (2019)
institution DOAJ
collection DOAJ
language DE
EN
ES
FR
IT
topic kanji
unicode
glyph
variant
Japanese
Computer engineering. Computer hardware
TK7885-7895
spellingShingle kanji
unicode
glyph
variant
Japanese
Computer engineering. Computer hardware
TK7885-7895
Stewart Arneil
Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names
description The Landscapes of Injustice project seeks to encode mid-twentieth-century documents by and about the Japanese-Canadian community so they are accessible to modern audiences. The fundamental problem is that some of the kanji used at that time have been replaced since then by different kanji, and others have been removed from lists of formally acceptable characters. This report documents our efforts with two technologies designed to address this situation. The first is the Standardized Variation Sequence (SVS) feature of Unicode. Our work revealed that this set of variation sequences does not completely cover the old and new glyph pairs identified by the Japanese authorities, and that the pairs formally identified by the Japanese authorities do not completely cover all the new glyph forms in general use. We turned to TEI’s <charDecl>, <glyph>, and <mapping> elements as a second technology to augment the support provided by Unicode. Lastly, we dealt with the issue of finding suitably qualified people to do the markup. The result is markup which retains the original glyphs and relates them to the modern glyphs, so that in our output products we will be able to support search and display using either form of the glyph.
format article
author Stewart Arneil
author_facet Stewart Arneil
author_sort Stewart Arneil
title Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names
title_short Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names
title_full Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names
title_fullStr Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names
title_full_unstemmed Encoding Disappearing Characters: The Case of Twentieth-Century Japanese-Canadian Names
title_sort encoding disappearing characters: the case of twentieth-century japanese-canadian names
publisher OpenEdition
publishDate 2019
url https://doaj.org/article/6da9d84d938248c5836b434b77c517e8
work_keys_str_mv AT stewartarneil encodingdisappearingcharactersthecaseoftwentiethcenturyjapanesecanadiannames
_version_ 1718395883716870144