
佐伯 功介*
  • 裁判所書記官研究所講師、(社)日本ローマ字会理事、ソクタイプ研究所長

Sokutaipu Kanji Input Method

by Saeki Kōsuke*
transcribed by Jennifer Chan and Sammi de Guzman
  • Lecturer, Court Clerk Research Institute
    Director, Japan Romaji Society
    Director, Sokutaipu Research Institute
  1. まえがき
  2. ソクタイプ速記の現状
  3. 機械の構造
  4. 速記記号のあらまし
    1. 基本
    2. 略語
    3. 略語の数と割合
  5. 漢字入力への応用
    1. 漢字指定
    2. 漢字コード
  6. 熟語本位への切り換え
    1. 文字よりも単語
    2. 同音語の処理
  7. むすび


  1. Foreword
  2. Current State of Sokutaipu
  3. Keyboard Layout
  4. Shorthand Theory
    1. Basics
    2. Briefs
    3. Proportion of Briefs
  5. Kanji Input Approaches
    1. Kanji Assignments
    2. Kanji Coding
  6. Handling Complex Expressions
    1. Words over Letters
    2. Homophones
  7. Conclusion

1 まえがき



1 Foreword

In recent years, high-speed methods have been developed to input kanji and kana sentences by machine, capable of processing thousands of characters a minute, but most of the input for these machines is done using Kanji teletype-style perforated tape. The current reading speed for Kanji teletype is up to 70 characters per minute, which is suboptimal.

Last year, I published a method for entering kanji using Sokutaipu, but since it was only a summary, I received many questions. Due to page limitations, I won't be able to give enough information this time either, but I will go into a bit more detail.

2 ソクタイプ速記の現状

ソクタイプというのは、現在全国の裁判所で使っている速記の機械である。このソクタイプは、もともと第1次大戦のベレサイユ会議などで使われていたフランスの発明品を、故田中舘愛橘博士が日本に紹介したものであった。日本語を打つためのキー構成や記号、略語の組織を作って、今日の機械を完成させたのは、ソクタイプ研究所の川上 晃氏である。裁判所では昭和25年から速記官の養成を始め、20年間で卒業生1000人を超えた。






2 Current State of Sokutaipu

Sokutaipu is a shorthand machine currently used in courts across the country. It was originally a French invention, used at the Versailles conference at the end of World War I, and was introduced to Japan by the late Dr. Tanakadate Aikitsu. It was Mr. Kawakami Akira of the Sokutaipu Research Institute who created the system of keys, symbols, and abbreviations for writing Japanese and brought the machine to its current form. The court system began training stenographers in 1950, and over 1,000 people have graduated in the past 20 years.

Becoming a stenographer requires two years of training, and students learn all of the writing rules in the first six months. At that point, however, writing speed is only about 70 to 80 words per minute, which is too slow for practical shorthand.

The "70 to 80 wpm" figure above counts words in a sentence written in romaji. When written in kanji and kana, statistically, one word corresponds to roughly two characters, so 80 words is about 160 characters. In the test, a passage is read aloud for 5 minutes at each speed, and the passing mark is an error rate of 2% or less.

Once you have learned how to write, the rest is repeated practice. After passing the 80 wpm test, students progress to 90 wpm, then 100 wpm, and reach 130 to 140 wpm by the end of the year. The speed required for graduation is 170 wpm, but virtually all graduates reach 180 wpm or more, and about half reach 200 wpm or more.

The speed record is 235 words per minute (470 characters per minute). At this point, it's as if the examiner is being tested to see how fast a person can read.
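The conversion between words and characters used above is simple arithmetic. As a minimal sketch, assuming the ratio of roughly two kanji-kana characters per romaji word stated in the text (the function and constant names are hypothetical):

```python
# Ratio given in the text: one romaji word is roughly two kanji-kana characters.
CHARS_PER_WORD = 2

def words_to_chars(words_per_minute: int) -> int:
    """Convert a speed in words per minute to characters per minute."""
    return words_per_minute * CHARS_PER_WORD

# Speeds mentioned in this section: the early test, graduation, and the record.
for wpm in (80, 170, 235):
    print(wpm, "wpm is about", words_to_chars(wpm), "characters per minute")
```

Because the ratio is a statistical average, the character figures are approximations rather than exact counts.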

For practical purposes, being able to reliably capture 150 to 160 words per minute, about as fast as a radio news commentary, is considered sufficient. A speaker may momentarily exceed 200 wpm, but in practice no one sustains 200 wpm for more than 10 seconds.

図1 ソクタイプのキーボード
Figure 1: Sokutaipu keyboard.

3 機械の構造



3 Keyboard Layout

The machine has 21 type bars arranged in a row. Pressing the key connected to a type bar makes that bar stick out about 2 mm and print at its position. Unlike a typewriter, any number of keys can be pressed at the same time, and the corresponding letters are printed on one line. The paper tape is 6 cm wide and 60 m long, folded in a zigzag pattern and placed inside the machine. The 21 characters are lined up across this 6 cm width; whether you press one key or 20, each press and release prints one line.

Each letter is printed at its own fixed position and never moves sideways, so the tape is equivalent to a 21-hole punched tape. Letters are printed so that humans can associate them with words and read them easily, but for a machine they could equally be written as black dots or holes. For example,

H S I OIA (人間)
K I IA K Y(機械)


are identical. The keys are assigned to the ten fingers and arranged in a special shape so that any combination can be pressed (Figure 1). The keys are assigned to fingers as follows:

Finger layout (Japanese version)

Finger layout (English version)
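The equivalence between one stroke and one row of a 21-hole punched tape can be sketched in code. This is only an illustration: the numbering of positions from 0 to 20 and the function names are assumptions, and the actual assignment of letters to positions is not reproduced here.

```python
NUM_POSITIONS = 21  # one fixed print position per type bar

def stroke_to_word(pressed):
    """Pack a set of pressed positions (0..20) into a 21-bit integer."""
    word = 0
    for pos in pressed:
        if not 0 <= pos < NUM_POSITIONS:
            raise ValueError(f"position out of range: {pos}")
        word |= 1 << pos
    return word

def word_to_row(word):
    """Render a 21-bit integer as one tape row: '●' is a hole, '·' is blank."""
    return "".join(
        "●" if (word >> pos) & 1 else "·" for pos in range(NUM_POSITIONS)
    )

# Hypothetical stroke: positions 0, 3, 7, and 20 pressed together.
print(word_to_row(stroke_to_word({0, 3, 7, 20})))
```

Since each position is independent of the others, pressing several keys at once simply sets several bits in the same row.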

4 速記記号のあらまし

4.1 基本

20の文字キーを3つの群に分け、それぞれ左、右、中の群という。左の群は左手の4本の指が受け持つ8つのキーで、その情報数は2⁸ =256だから、五十音はもちろん、拗音、それらの長音(「〜おう」と「〜うう」)および「〜あい」、「〜えい」韻までをふくむ音節が区別される***。左の群の記号はそのまま裏返しにすれば右の群の記号となる。これらの記号はローマ字のように、子音+母音の構造になっているが、速記者の意識にはもはや子音、母音はなく

4 Shorthand Theory

4.1 Basics

The 20 letter keys are divided into three banks: left, right, and center. The left bank has 8 keys operated by the 4 fingers of the left hand, giving 2⁸ = 256 possible chords. This is enough to distinguish not only the fifty basic kana sounds of Japanese but also the contracted sounds (yōon), their long forms (〜おう and 〜うう), and even syllables ending in 〜あい and 〜えい. The chords of the left bank, mirrored, become the chords of the right bank. These chords have a consonant + vowel structure like romaji, but the stenographer no longer thinks of consonants or vowels,

Interpreting Sokutaipu outlines as shapes.




  a. 語の第1音節は左の群で打つ。
  b. つぎの音節が中の群で打てる音ならば必ず中で打つ。
  c. 1打にならない語は残りをまた規則(a)、(b)に準じて左から打ちつぐ。


but rather the shape of the outline and the position of each finger.

The center bank has four keys, so you can hit a small number of sounds (つ, く, い, ん, ち, き, and geminated sounds).
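The chord counts for each bank follow directly from the number of keys. In the sketch below, the right bank's 8 keys are inferred from the text (20 letter keys minus 8 on the left and 4 in the center), and representing "mirroring" as reversing the key order of a bit pattern is an assumption made for illustration.

```python
LEFT_KEYS, CENTER_KEYS, RIGHT_KEYS = 8, 4, 8  # 20 letter keys in total

def chord_count(keys: int) -> int:
    """Number of key combinations on a bank, counting the empty chord."""
    return 2 ** keys

def mirror(chord: int, keys: int = LEFT_KEYS) -> int:
    """Reverse the key order of a chord stored as a bit pattern."""
    mirrored = 0
    for i in range(keys):
        if (chord >> i) & 1:
            mirrored |= 1 << (keys - 1 - i)
    return mirrored

print(chord_count(LEFT_KEYS))    # 256
print(chord_count(CENTER_KEYS))  # 16
print(bin(mirror(0b00000101)))   # 0b10100000
```

Mirroring twice returns the original chord, which is why a stenographer only needs to learn one set of shapes for both hands.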

These three banks comprise what are called the basic sound chords. This is a sort of phonetic alphabet, but when it comes to representing words, there are three important rules:

  a. The first syllable of a word is written with the left bank.
  b. If the next syllable can be written with the center bank, it must be written with the center bank.
  c. For words that do not fit in one stroke, the remainder is written from the left again according to rules (a) and (b).

According to these basic rules, two or three syllables can be written in one stroke. In kana, that is up to 7 characters per stroke, or 4 to 5 on average. Even so, this alone cannot keep up with the speed of human speech, so briefs have been assigned to the most frequently used words.
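The three basic rules can be sketched as a small routine that groups the syllables of one word into strokes. This is a simplified illustration, not the actual Sokutaipu theory: it assumes the word is already split into Sokutaipu syllables (so a long syllable or an 〜あい syllable counts as one item), that "っ" stands in for the geminated sounds, and that a syllable following the left or center bank goes on the right bank, which the text does not state explicitly.

```python
# Sounds the text lists for the center bank; "っ" stands in for geminated sounds.
CENTER_SOUNDS = {"つ", "く", "い", "ん", "ち", "き", "っ"}

def split_into_strokes(syllables):
    """Group the syllables of one word into (left, center, right) strokes."""
    strokes = []
    i = 0
    while i < len(syllables):
        left = syllables[i]        # rule (a): a stroke starts on the left bank
        i += 1
        center = right = None
        if i < len(syllables) and syllables[i] in CENTER_SOUNDS:
            center = syllables[i]  # rule (b): use the center bank when possible
            i += 1
        if i < len(syllables):
            right = syllables[i]   # assumption: the next syllable goes right
            i += 1
        strokes.append((left, center, right))
        # rule (c): the loop starts the remainder from the left again
    return strokes

print(split_into_strokes(["く", "つ"]))
print(split_into_strokes(["き", "かい"]))
```

Under these assumptions, くつ and きかい each fit in a single stroke, while a four-syllable word spills into a second stroke.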

4.2 略語


4.2 Briefs

There are about six types of briefs, and most of them can be distinguished from basic phonetic outlines at a glance. For example,

K T (くつ)


represents くつ according to the phonetic rules, whereas

K T (くべつ)


violates basic rule (b), so it is not a basic phonetic outline but rather an abbreviation (in this case, for 区別). Similarly,

S (すべて)


represents す on the right bank, but according to basic rules (a) and (c) the left bank should not be left open. This is a brief for すべて. These center bank briefs for particles:

T (に)
K (が)
TK (で)
I (は)
N (の)
IN (も)


and their combinations:

T IN (にも)
TK I (では)



are also briefs because the left bank is left open. These are good briefs: each takes one stroke, and because they are written with the thumbs, which happens about once every 10 strokes, the four fingers of each hand can rest and prepare for the next word. This makes it possible to write smoothly and rhythmically.

Although there is not enough space here to discuss each type of brief systematically, they can be roughly divided into one-handed briefs and two-handed briefs. One-handed briefs can be written on either the left or the right bank, and include words that carry little meaning on their own, such as され, られ, たら, たり, かた, きり, しか, and はな. They also include formal words and particles such as だけ, など, こと, から, ます, ある, あります, ました, and ません. Two-handed briefs, on the other hand, use both banks to form a single brief, and most of them are nouns.

4.3 略語の数と割合


  基本で打った行    40%
  片手略語を使った行  40%
  中の群の助詞     10%
  両手略語の行     10%


4.3 Proportion of Briefs

There are currently 475 briefs in total: 227 one-handed and 248 two-handed. Previously there were 231 one-handed and 609 two-handed briefs, but when I examined actual steno tape, the strokes broke down roughly as follows:

  Phonetic outlines      40%
  One-handed briefs      40%
  Center-bank particles  10%
  Two-handed briefs      10%

Looking at these results, the frequency of two-handed briefs is extremely low compared to one-handed briefs. Merely memorizing briefs is not useful; you have to practice them over and over until you can write them almost unconsciously and reflexively. I decided that it would be more advantageous to allocate training time to the basics and one-handed briefs, so about ten years ago I drastically reduced the number of two-handed briefs. This resulted in just under 2% more strokes, but overall the performance of stenographers has massively improved.

5 漢字入力への応用

5.1 漢字指定




  a. 基本はすべてひらがなにする。
  b. 略語で同音異語がなく、普通に漢字で書く語にそのまま漢字にする。たとえば

5 Kanji Input Approaches

5.1 Kanji Assignments

What I have described in section 4 above is not new; it is just a summary of the current Sokutaipu theory. By applying this method, the speed at which ordinary kanji and kana text can be entered into a machine can be increased 5 to 7 times compared to current Kanji teletype punching machines. The purpose of this paper is to introduce this method, and the outline is given below.

Shorthand outlines are a kind of phonetic writing: homophones share the same outline regardless of meaning, and the transcriber chooses the characters from context. A machine cannot easily make that judgment, but if kanji are distinguished explicitly by some suitable method, shorthand can serve as a high-speed kanji input method. Fortunately, the * key in the center bank is rarely used in shorthand, so it can be used to specify kanji. Its location is ideal, as if it had been designed for the purpose.

Therefore, I have decided as follows:

  a. All phonetic outlines are rendered in hiragana.
  b. Briefs that have no homophones, for words normally written in kanji, are rendered directly in kanji. For example:
K I S H 日本 にほん
TK O SK 問題 もんだい
HK I T 技術
  c. 略語で同音異語があればその1つに決める。
  c. If a brief has homophones, it is assigned to one of them.
TK O AS Y 政策 せいさく (製作)
K S Y 証人 しょうにん (商人、承認)
K A T 間接 かんせつ (関節)



  d. *印を利用して漢字1字ずつのコードをつくる。このコードの作り方は、つぎの5.2で別に述べる。
  e. 頻度の高い熟語には、略語のない場合にも熟語としてのコードをつくり、一挙に2字以上をまとめて指定する。最初この種類245語を選んで。

Parentheses indicate homophones that are written with different kanji.

Approximately 250 briefs can be written directly in kanji based on rules (b) and (c).

  d. Use the * key in the center bank to create a code for each individual kanji. How these codes are created is described separately in section 5.2 below.
  e. For frequently occurring compound words, codes are created even when no brief exists, so that two or more characters are specified at once. Initially, 245 words of this kind were selected.
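Rules (a) to (c) amount to a table lookup with a fallback to hiragana. The sketch below is a hedged illustration only: strokes are represented as hypothetical (kind, reading) pairs rather than real outlines, the table is keyed by reading for simplicity, and it contains just the six example words given in the text.

```python
# Hypothetical brief table keyed by reading; entries are the examples in the text.
BRIEFS = {
    "にほん": "日本",      # rule (b): no homophones
    "もんだい": "問題",
    "ぎじゅつ": "技術",
    "せいさく": "政策",    # rule (c): fixed to 政策 rather than 製作
    "しょうにん": "証人",  # rule (c): fixed to 証人 rather than 商人 or 承認
    "かんせつ": "間接",    # rule (c): fixed to 間接 rather than 関節
}

def transcribe(strokes):
    """Turn a list of (kind, reading) strokes into kanji-kana text."""
    out = []
    for kind, reading in strokes:
        if kind == "brief" and reading in BRIEFS:
            out.append(BRIEFS[reading])  # brief for a word written in kanji
        else:
            out.append(reading)          # rule (a): everything else stays kana
    return "".join(out)

print(transcribe([("brief", "にほん"), ("brief", "の"), ("brief", "ぎじゅつ")]))
```

A particle brief such as の has no entry in the table, so it falls through to hiragana, while a phonetic outline for せいさく would stay in kana even though the brief produces 政策.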

5.2 漢字コード


  1. 訓の強い文字は、その訓をキーワードとして、それをソクタイプの普通の打ち方で打つて同時に*を打つ。

5.2 Kanji Coding

One stroke of Sokutaipu (21 bits) has more than 2 million possible values, so specifying a kanji with a single stroke is easy in principle; the difficulty is that a human must memorize the codes well enough to write them quickly. This calls for a method that creates associations between characters and codes. Codes assigned on purely mechanical grounds would be hard for humans to use, so the following three methods are used to create codes that people can actually write.

  1. For characters with a strong kun reading, use the kun reading as the keyword: write it in the normal Sokutaipu way and press the * key at the same time.
K I * I S きし
T K*I つき
  2. 字音からコードをつくるものも250字ほどある。

  3. 熟語をキーワードとするもの。これが約半数である。キーワードはなるべく音訓のどちらか、1字の内部で処理するほうが、オペレータの心理的負担は軽いのであるが、多くの同音漢字を区別するために、その字を含む熟語をキーとすることはやむをえない。1つだけ実例をあげると、「ど」という音の字は「土度怒努」であるが、いちばん流動性の大きい「度」に音の「ど」をあて、「土」は訓の「つち」がよく固まっている。「怒」には「怒号」、「努」には「努力」をキーとする。
  2. Another 250 or so characters are assigned codes based on their on reading.

    Examples: 盆 晚 案 電 度 液 芸 菲 不 会

    As far as possible, these are characters that have little meaning on their own, that are usually used with their on reading, or that mostly appear in combination with other words.

  3. Use compound words as keywords. This accounts for about half of the kanji. The mental burden on the operator is lighter when the keyword is the on or kun reading of the character itself, but to distinguish among many homophonous kanji it is unavoidable to use a compound word containing the character as the key. To give just one example, 土, 度, 怒, and 努 are all read ど. The on reading ど is assigned to 度, which is the most versatile of the four; 土 uses its well-established kun reading, つち; and the remaining two are keyed on compounds that contain them: 怒号 for 怒 and 努力 for 努.
T T *I 土(つち)
THKS * 度(ど)
THKS * IA KH 怒(怒号)
TK* SHKT 努(努力)
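The ど example can be sketched as a keyword table in which the * key switches a stroke from plain kana to a kanji code. This is an illustration under assumptions: the table is keyed by the keyword's reading rather than by the actual outline, and the function and table names are hypothetical.

```python
# Hypothetical code table for the four characters read ど, keyed by keyword reading.
KANJI_CODES = {
    "つち": "土",      # kun reading as keyword
    "ど": "度",        # on reading as keyword
    "どごう": "怒",    # keyword is the compound 怒号
    "どりょく": "努",  # keyword is the compound 努力
}

def decode(keyword: str, star: bool) -> str:
    """Return the kanji for a starred keyword, or plain kana without the star."""
    if star:
        return KANJI_CODES[keyword]  # raises KeyError if no code is defined
    return keyword

print(2 ** 21)                   # number of possible values of one stroke
print(decode("ど", star=True))   # 度
print(decode("ど", star=False))  # ど
```

Each keyword maps to exactly one kanji, which is what lets the operator avoid choosing among homophones at input time.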

6 熟語本位への切り換え

6.1 文字よりも単語






6 Handling Complex Expressions

6.1 Words over Letters

I did a little experiment with the kanji assignment method as explained above, and found that I could easily write about 250 characters (120〜130 words) per minute.

However, there is a huge difference in efficiency between writing kanji one character at a time and writing whole words. Character-by-character input places a heavy burden on the operator, not only mechanically, in the number of strokes, but also mentally, in having to recall each kanji.

From a linguistic point of view, whether reading, writing, or typing Japanese (to say nothing of speaking and listening), it is ideal and natural that individual characters never enter one's consciousness. Even in Chinese, where awareness of characters is strong, characters have to be considered in combination rather than in isolation.

Therefore, starting last year (1969), we changed the unit of kanji specification from individual kanji to words and compounds. Before the switch, approximately 500 high-frequency words were input without being broken down into characters; this time, I expanded that to about 10,000 words. Since 10,000 words cover most of the standard kanji, I think this will make a large difference to operator fatigue and speed.

I simply selected these 10,000 words from a dictionary, so I hope that various aspects will be considered in the future. The number of words is also not limited; whether 20,000 or 3,000, the memory capacity is very high. Some people said it would be difficult to memorize that much, but if you already know all the proper words, you won't have to memorize them at all.

6.2 同音語の処理


6.2 Homophones

One problem that always arises when writing kanji is the handling of homophones. Kanji Sokutaipu provides five methods for telling the machine which kanji to write when the sound alone is not enough. They are all still based on sound but are written in different ways. For lack of space, I will omit a detailed explanation and leave it for another occasion.

7 むすび





7 Conclusion

Once a 21-bit perforated tape containing kanji information is made, I think there will be a method in the future for a machine to receive it and create a regular document, but for now we can translate it into a 12-bit kanji teletape.

As such, it can be connected to existing computer terminals (for example, newspaper companies' Kan-tele and Monotype). Although the translation dictionary and the program are quite large, I think it is a rather light task given the capabilities of today's computers.
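The translation step described above can be pictured as a dictionary lookup from 21-bit strokes to 12-bit codes. The sketch below is purely illustrative: the stroke values and the 12-bit codes are made up, and the names `TRANSLATION` and `translate_tape` are hypothetical rather than part of any real Kanji teletype code table.

```python
from typing import Dict, List, Tuple

STROKE_BITS = 21  # width of one Sokutaipu stroke
CODE_BITS = 12    # width of one Kanji teletype character code

# Made-up dictionary: one stroke may expand to one or more 12-bit codes.
TRANSLATION: Dict[int, Tuple[int, ...]] = {
    5: (0x123,),                    # a stroke standing for a single character
    (1 << 20) | 3: (0x456, 0x789),  # a stroke standing for a two-character word
}

def translate_tape(strokes: List[int]) -> List[int]:
    """Translate a sequence of 21-bit strokes into 12-bit character codes."""
    out: List[int] = []
    for stroke in strokes:
        if not 0 <= stroke < 2 ** STROKE_BITS:
            raise ValueError(f"not a 21-bit stroke: {stroke}")
        for code in TRANSLATION[stroke]:  # KeyError if the stroke is unknown
            if not 0 <= code < 2 ** CODE_BITS:
                raise ValueError(f"not a 12-bit code: {code}")
            out.append(code)
    return out

print([hex(c) for c in translate_tape([5, (1 << 20) | 3])])
```

Because a single stroke can stand for a whole word, the output tape is generally longer than the input tape.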

This method eliminates the problem of slow document creation speed in Japanese. It is expected that the input speed will be five to seven times faster than current Kan-tele input speeds, and could even reach the speed of shorthand.
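The expected gain can be checked against figures given elsewhere in this paper. The sketch below uses only numbers from the text (70 characters per minute for Kanji teletype, 250 characters per minute in the author's experiment, and the 235 wpm shorthand record at two characters per word); the names are hypothetical.

```python
KANTELE_CPM = 70      # "up to 70 characters per minute" from the foreword
EXPERIMENT_CPM = 250  # the author's own trial in section 6.1
RECORD_CPM = 235 * 2  # shorthand record, at two characters per word

def speedup(cpm: float, baseline: float = KANTELE_CPM) -> float:
    """Ratio of an input speed to the Kanji teletype baseline."""
    return cpm / baseline

print([KANTELE_CPM * k for k in (5, 7)])  # [350, 490]
print(round(speedup(EXPERIMENT_CPM), 1))  # 3.6
print(round(speedup(RECORD_CPM), 1))      # 6.7
```

On these figures, a five- to sevenfold gain corresponds to 350 to 490 characters per minute, which is the range of trained shorthand speeds, while the character-by-character experiment reached about 3.6 times the baseline.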

Even in the future, when Japanese is written in Roman letters, I think that Sokutaipu, which would no longer require kanji, will be about twice as fast as writing directly in Roman letters.

T S IOTK INO KH つずけるのでございます
T KSAIOTK IN IAS H ことわらなければならない



This is because the speed increase comes from the kana writing.

(Received on January 17, 1970, then again on May 1 of the same year.)