Top
Best
New

Posted by nabla9 10/27/2024

Character amnesia in China(globalchinapulse.net)
483 points | 528 comments
drivenextfunc 10/27/2024|
As a relatively well-educated Japanese native speaker, I too experience this problem when writing Japanese on paper - being unable to write many kanji characters by hand. I am no exception among Japanese native speakers. While the author seems to interpret this problem as something crucial, I question whether it truly is.

The orthography of Mandarin and Japanese includes an alphabet consisting of thousands of characters, the majority of which comprise dozens of strokes. Although East Asian people have higher IQ scores on average, we are not superhuman - our memory capacity is bound by human limits, and the decreased frequency of actually writing kanji on paper has naturally resulted in our forgetting how to write many of them. Is this surprising?

Furthermore, orthography is not part of language in a fundamental sense - it's merely a useful tool that accompanies a language. Therefore, I do not see the writing system becoming less stable as a significant issue. Consider Korea as an example: they used to use kanji in their orthography but have almost completely eliminated it with virtually no adverse effects. While laypeople often assume orthography is an integral part of a language, this is just not the case from the linguistic perspective.

garou 10/28/2024||
If you consider that a lot of people using the Latin alphabet does use the cellphone autocomplete to check how to write a word used infrequently...

So I would say this text is biased by the "western" view of the writer, something that could be categorized as "Orientalism". A study about this phenomenon is valid, is important. But this post is not a good study.

nogridbag 10/28/2024|||
But autocomplete even for basic words? My wife is Chinese. I'll never forget when she was helping her family write some formal letter in Chinese in Microsoft Word and she simply could not input the numbers 1, 2, and 3 in Chinese because she forgot how. And I know this may be apples and oranges because this is keyboard input versus writing on paper but as a programmer who can type at a moderate pace since I was a kid (~120wpm) this was perplexing for me! And similar to the article, she's an Ivy league grad. Similarly, when she's communicating with her family via WeChat half the time she simply sends audio messages instead of text messages. I'm pretty surprised this method is so popular instead of some voice-to-text google assistant type system.
aragonite 10/28/2024|||
I think there may be some confusion. The standard Chinese characters for 1, 2, and 3 (一, 二, 三) are among the simplest characters in Chinese: literally just one, two and three horizontal strokes. These would be extremely difficult to forget! What your wife was likely trying to write were the special variants (壹, 贰, 叁) that are used on checks, official documents, etc. These were specifically designed to be hard to alter or forge (think the difference between writing "100" versus "ONE HUNDRED" on a check). Even highly educated Chinese people might need to look these up since they are specialized characters not used in everyday writing.
nogridbag 10/28/2024|||
That explains it. Yup these were some sort of official / govt documents. Thanks for the explanation!

Edit. I should have realized that. I just came back from China and my kids were watching a children's show with the following subtitles: "一二一二一二一二一二一二一二一二一二一二". Took me a while to realize the subtitles were not broken. The characters were marching chanting "one two one two..." :)

unscaled 10/28/2024||||
I think this is specifically more an IME (input method software) issue than a typing one. Japanese has similar "official" numbers (壱, 弐, 参, maybe some of the few cases where modern Japanese is more simplified than Simplified Chinese). These numbers couldn't be easier to type. I just type 1, 2, 3 (i.e. the digit keys on top of my keyboard), hit the convert key and select the right character (I also get offered 三, ③, 3⃣,³ and several other options to choose from). That's it.

I tried the same with Google's IME and I couldn't use digits as input, like the Japanese IMEs let you do. I could find the character for 叁 quickly enough, but 壹 was only on the second or third page. Still, I suck at Chinese and I found it.

Now, writing these characters is an entirely different story. I think any character that's rarely written and appears only in one common word runs the risky of being forgotten, even if that word is quite simple and used on a day-to-day basis. A word like 喷嚏 (sneeze) in Chinese or 薔薇 (rose) in Japanese fit the bill.

The Japanese fallback, in case you forgot the character is quite simple: you'd just use either Katakana or Hiragana with different connotations[1]. I'm not quite sure what the fallback would be in Chinese, but I guess that would often be picking another character with a close or same pronunciation, as Chinese speakers often do on purpose as a sort of pun.

I also expect there are still fewer cases of "character amnesia" in China than Japan, since the fallback mechanism is simpler and more standardized in Japan, and children are taught far less Kanji in school than their counterparts in Mainland China, Hong Kong or Taiwan.

[1] While Hiragana gives a familiar connotation, writing the word as バラ in Katakana is "more official", if anything, since names of flora and fauna are conventionally written using Katakana in official contexts, especially when you want to use the exact scientific name. This is the equivalent of using Latin names in Western countries, e.g. Rosa hirtula would be サンショウバラ.

Suppafly 10/29/2024|||
>The standard Chinese characters for 1, 2, and 3 (一, 二, 三) are among the simplest characters in Chinese: literally just one, two and three horizontal strokes.

Does that work for larger numbers, keep adding strokes?

GuB-42 10/30/2024||
No, 4 is 四. Numbers are simple characters, but only 1,2,3 are made by adding strokes.
garou 10/28/2024||||
I am not from Asia so I would trust more what our wife has to say than me. But I would argue that it is common for people living in a country with different language from they native language to forget how to write or even say some simple words. There's a good active effort to learn a new language.
nialv7 10/28/2024|||
It might be surprising but, in terms of written words, sneeze (喷嚏) is not "basic".
Chathamization 10/28/2024||||
That's very much the impression I get. I've never seen pinyin used in Chinese writing, and the Chinese friends I've met have said they've never seen it either (they said they'd probably just look up the character or write a homonym instead, but even then it's pretty rare that it comes to that).

That's not to say it's never done, but it feels like an outlier. As if a friend found a word too hard to understand and drew a picture instead, and then the author wrote an article about how spelling is so difficult that it leads English speakers to draw words instead of writing them.

But the thing that struck me the most was just how confused people were when I asked them about it. It just didn't seem to be anything that was an actual issue for them.

mapt 10/28/2024||||
> "This is such a gratifying experience, in fact, that I have actually kept a list of characters that I have observed Chinese people forget how to write. (A sick, obsessive activity, I know.) I have seen highly literate Chinese people forget how to write certain characters in common words like "tin can", "knee", "screwdriver", "snap" (as in "to snap one's fingers"), "elbow", "ginger", "cushion", "firecracker", and so on. And when I say "forget", I mean that they often cannot even put the first stroke down on the paper. Can you imagine a well-educated native English speaker totally forgetting how to write a word like "knee" or "tin can"? Or even a rarely-seen word like "scabbard" or "ragamuffin"? I was once at a luncheon with three Ph.D. students in the Chinese Department at Peking University, all native Chinese (one from Hong Kong). I happened to have a cold that day, and was trying to write a brief note to a friend canceling an appointment that day. I found that I couldn't remember how to write the character 嚔, as in da penti 打喷嚔 "to sneeze". I asked my three friends how to write the character, and to my surprise, all three of them simply shrugged in sheepish embarrassment. Not one of them could correctly produce the character. Now, Peking University is usually considered the "Harvard of China". Can you imagine three Ph.D. students in English at Harvard forgetting how to write the English word "sneeze"?? Yet this state of affairs is by no means uncommon in China. English is simply orders of magnitude easier to write and remember. No matter how low-frequency the word is, or how unorthodox the spelling, the English speaker can always come up with something, simply because there has to be some correspondence between sound and spelling. One might forget whether "abracadabra" is hyphenated or not, or get the last few letters wrong on "rhinoceros", but even the poorest of spellers can make a reasonable stab at almost anything. By contrast, often even the most well-educated Chinese have no recourse but to throw up their hands and ask someone else in the room how to write some particularly elusive character."

- https://pinyin.info/readings/texts/moser.html

fenomas 10/28/2024||||
Not at all - forgetting kanji just isn't similar to forgetting how to spell English words, as I think TFA made fairly clear. It's the simplest analogy to make, but it's not near enough to draw conclusions from.

The analogy I've used in the past is, you read kanji with your mind but you write them with your hand, so being unable to remember a kanji is more akin to forgetting a guitar chord or a keyboard shortcut - if your hands stop making the motions, you'll eventually forget them.

KittenInABox 10/28/2024||
Most people cannot accurately draw a bicycle.
fenomas 10/28/2024||
Yeah - the other analogy I've used is that everyone can recognize a Starbucks logo, but even if you went to the trouble of learning to accurately draw one, you'd forget if you didn't practice.
riffraff 10/28/2024|||
I am Italian and was taught cursive in elementary school and I can barely remember upper case cursive letters[0] thirty years later.

In my experience, most people of my generation have generally forgot and usually just write "lower case letters but big" or block letters.

So yeah, I don't think there's anything inherently chinese about forgetting writing things you don't use.

[0] https://www.genitorialmente.it/2016/10/alfabeto-corsivo-maiu...

cedws 10/27/2024|||
I'm studying Japanese at the moment and what struck me is how important context is, particularly in reading. You need to know where to read 1-3 letters ahead to read a word and interpret it. That's not really a thing in English - a word is a word, and the individual letters that it's composed of are almost always pronounced the same way.

I think digital is a big crutch for Japanese/Chinese because you have input methods that help you write what you want to say, so you don't actually need to remember how to write kanji as much in daily life.

Terr_ 10/27/2024|||
> You need to know where to read 1-3 letters ahead to read a word and interpret it. That's not really a thing in English

It happens in a English too, where you see a chunk of letters and mis-predict which word they represent in a way which affects its meaning [0], and sometimes that will also affect pronunciation. [1]

An example from the link:

> "The complex houses married and single soldiers and their families."

A reader linearly scanning along doesn't know whether "complex" is an adjective or a noun, and then whether "houses" is a noun or a verb. I'm pretty sure all human languages have similar problems where a certain amount of look-ahead or backtracking is necessary.

For another example to highlight pronunciation changes, consider the ambiguity of:

"I saw the rhino live in the zoo."

That could mean that the rhino was doing the verb of living, in which it rhymes with "give", or it could also mean that the speaker was seeing it in-person, in which case it rhymes with "drive".

[0] https://en.wikipedia.org/wiki/Garden-path_sentence

[1] https://en.wikipedia.org/wiki/Heteronym_(linguistics)

xarope 10/28/2024|||
seems like an opportune time to also talk about buffalo buffalo buffalo buffalo buffalo buffalo buffalo buffalo.

https://en.wikipedia.org/wiki/Buffalo_buffalo_Buffalo_buffal...

unscaled 10/28/2024|||
The Chinese equivalent would be the "The story of Mr. Shi Eating Lions":

https://www.yellowbridge.com/onlinelit/stonelion.php

Both rely on intonation (in addition to volume and pauses) for disambiguation, but the fun trick is that in the Chinese version the intonation is an integral part of the lexeme (i.e. it distinguishes between "words").

But I have to say, these kind of sentences (and full-fledged poems) are quite a different beast from simple cases of garden path sentences or syntactic ambiguity[1]. The poem lion-eating poet and the "buffalo buffalo buffalo..." sentence are both highly contrived and unlikely to be understood correctly on the first few goes even with the perfect prosody. They are cool "language hacks", but they do not occur in daily language and I personally believe (although I guess die-hard generative linguists would disagree) that they don't teach us very much about the language itself (except for what are the cool artistic possibilities it opens).

[1] https://en.wikipedia.org/wiki/Syntactic_ambiguity

jibal 10/28/2024|||
Incorrect capitalization.
coliveira 10/28/2024||||
When this happens in English, teachers will label this as "bad English" and ask you to rewrite. That's how the formal language deals with this problem.
Terr_ 10/29/2024|||
If anything, isn't that an informal solution? It relies on other people to complain that they dislike the sentence, without being able to point to any hard-and-fast rule.
everforward 10/30/2024||
The hard and fast rule is that repeating a word right next to itself is generally frowned up. It comes up with “that” a lot, like “he said that, that led to something else”. Sometimes people are doing something clever with the words, but it’s usually just poor English.
Suppafly 10/29/2024|||
Honestly, it rarely happens in English other than in contrived examples used to demonstrate the concept.
tomsmeding 10/28/2024||||
Yes, this happens in English too, but to find examples like this you have to go to Wikipedia, or wrack your brain and see if you remember one. In Japanese, almost every other word is like this.

I went to the first link in your comment ( https://en.wikipedia.org/wiki/Garden-path_sentence ), selected the Japanese version of the article, and took the first sentence:

> 袋小路文(ふくろこうじぶん)とは、文法的には正しいけれども、誤読が生じやすい書き出しで始まる文のことである。

As is usual for Japanese, this sentence contains a mix of Chinese(-origin) ("kanji", e.g. 袋 小 路 文 法 的) as well as Japanese phonetic ("kana", e.g. ふくろこうじぶん) characters. Usually, when in a multi-kanji word, kanji are pronounced with (a time-changed version of) Chinese pronunciation. For example, 文法 is "bun-pou", not "fumi-nori" or something else. However, the first character of the article title (fukurokoubunji), 袋, is "fukuro" here despite being in a four-kanji word. Further, 小 is "kou" here, which is nonstandard enough that its dictionary entry does not even list it as a possible pronunciation! [1] Then 路文 are both in Chinese pronunciation (ji-bun), but this does not necessarily make sense because the word is not split in two down the middle, but instead as 袋-小路-文 (bag-lane-sentence, where bag-lane is English cul-de-sac / blind alley). [2]

Now fukurokoubunji is a bit of a specialised word, so it might not be a great example. But in the rest of the sentence, we find 文, which is always pronounced "bun" (sentence) here, even when appearing separately, but could also (though more rarely) have been "fumi" (letter) — nothing but semantical context helps distinguish. Then we have 正しい "tada-shi-i", where 正 could have been "sei" as in 正確 "sei-kaku" (accurate) or "shou" as in 正直 "shou-jiki" (honest), but it isn't just because しい come after. Similarly, 生 in 生じやすい is "shou"(-ji-ya-su-i), which is conjugated from the base form 生じる "shou-ji-ru" and could have been "u" (生まれる "u-ma-re-ru") or "sei" (先生 "sen-sei") or "i" (生きる "i-ki-ru") or more (生 is somewhat infamous for having many readings). And I could go on: 書 could be "syo" (文書 "bun-syo") but is "ka" (書き出して "ka-ki-da-shi-te" conjugated from 書く "ka-ku").

This is a bit like the comments elsewhere here noting that the Chinese word for "sneeze" is a bad example because it happens to have so uncommon characters in it — and then people point to examples like "onomatopoeia" and "diarrhoea" as similar tricky examples in English. I can't comment on Chinese, but existence does not necessarily say much about frequency.

[1]: https://jisho.org/search/%E5%B0%8F%20%23kanji — Kun are the Japanese readings (chiisai, ko, o, sa), and On are the Chinese readings (only "shou" in this case)

[2]: This analysis of 袋小路文 is not completely etymologically honest. By the etymology ( https://en.wiktionary.org/wiki/%E5%B0%8F%E8%B7%AF#Etymology_... ), we see that the "kouji" pronunciation of 小路 is really a corruption of ancient "ko-michi", which is a consistent Japanese-Japanese reading of the two characters. However, because "ji" is also an (uncommon) Chinese reading of 路, if you don't know the etymology of the word, the re-analysis is appropriate in the context of how hard it is to read the written language.

naniwaduni 10/28/2024||
> However, because "ji" is also an (uncommon) Chinese reading of 路,

It's not a Chinese reading at all (as you can tell because it's ... wildly out of place with the the actual Chinese-derived readings ろ・る, onyomi are supposed to have semi-regular correspondences with each other and with Chinese Chinese readings). It's really just rendaku of ち, the basic root of fossilized compound みち (with still-salient prefix "honorific" み).

But most importantly, you never really see either 袋 or 小路 and expect them to have any other readings; maybe you'd expect しょうろ if you don't know the latter, but unless you're already literate in a Chinese or are blindly memorizing kanji tables, the other reading of 袋 (たい) probably isn't even salient, because it's one of those kanji that almost always takes its kunyomi even in compounds.

Side note, the line about u-onbin kind of buries the implication that this is a loanword from western Japanese, which is the culprit of several quasi-systematic but unevenly distributed divergences from regular sound changes.

tomsmeding 10/28/2024||
I stand corrected, you clearly know more about this than I do. :) (I'm only an intermediate learner.)

So perhaps my analysis of 袋小路文 wasn't very accurate at all. Yet I hope my point about 正, 生, 書, etc. stands.

naniwaduni 10/28/2024||
It's only, oh, just about the worst writing system since the Hittites or so, yeah.
Tor3 10/29/2024||||
> "The complex houses married and single soldiers and their families."

Wow.. I had to read that sentence three times before I got it right.

Suppafly 10/29/2024||
Maybe because I've seen a similar example used before, but I immediately read it correctly the first time. Honestly these sort of 'problems' only ever seem to occur when specifically created to demonstrate this problem and almost never happen in regular writing.
joe_the_user 10/28/2024|||
"I saw the rhino live in the zoo"

Might also mean; "Noted native-American zoologist 'I Saw The Rhino' lives at the zoo"

jibal 10/28/2024||
No it couldn't.
wkearney99 11/6/2024||
Given shenanigans like "thee stallion" as part of a name... sure it could.
jibal 11/8/2024||
Completely irrelevant. It couldn't because "live" and "lives" are different words.
yongjik 10/27/2024||||
I agree that Chinese/Japanese has it worse, but any language where "Spelling Bee" is a thing cannot be considered phonetic in a conventional sense.
zoky 10/27/2024||
And yet, given the definition and language of origin, most high-level spelling bee participants can make a pretty good guess at spelling a word they may have never seen before.

English is phonetic, it just borrows its pronunciation rules from many differing (and sometimes directly opposed) other languages.

stephen_g 10/28/2024|||
Very true - and every demonstration of “English is hard to spell/pronounce” focuses directly on the exceptions which exaggerates the problem. One analysis I’ve seen puts it that with a single set of rules, 59% of a sample corpus of 5000 English words can be pronounced perfectly from the spelling (of course, there will be regional accent and dialect differences so that percentage will be a bit different for each one) and up to 85% can be pretty close with only slight errors.

Then there’s a percentage where they’re just direct borrowings from other languages and you need to have an idea of how that language pronounces words (especially French), so really only 10-15% or so of English words end up being true exceptions.

1. https://www.zompist.com/spell.html

nl 10/28/2024|||
> a single set of rules, 59% of a sample corpus of 5000 English words can be pronounced perfectly from the spelling

To do this you need to know 56(!) rules.

I think this actually demonstrates how complex English pronunciation actually is.

poincaredisk 10/28/2024||
And you still only get 59% of the way to the correct pronunciation.

As a non native speaker of English, and a native speaker of a phonetic language, I strongly object to the notion that it's easy to guess English word pronunciation by just reading it.

coliveira 10/28/2024||
And that's another reason why there are so many English speakers who don't know how to read properly. It is so much harder to read compared to more sensible languages line German (and many others).
int_19h 10/28/2024||||
Those numbers are very bad, given that proper phonemic orthographies can give you a 90+% confidence with far fewer rules.

There's a simple and consistent way to compare languages in this way too, too: train a neural net to map spelling to pronunciation on one half of the dictionary, then test it on the other half. The more complicated and less consistent the orthography is, the more mistakes it'll make. People have in fact done this exact experiment, and English scores extremely poorly in it; for spelling, closer to Chinese, in fact, than many other European languages: https://aclanthology.org/2021.sigtyp-1.1/

folbec 10/28/2024||||
Maybe it's the right time to once again quote this poem :

https://jochenenglish.de/misc/dearest_creature.pdf

The joy of English pronunciation

George Nolst Trenit´e (1870–1946)

1 The text

Dearest creature in creation

Studying English pronunciation,

I will teach you in my verse

Sounds like corpse, corps, horse and worse.

I will keep you, Susy, busy,

Make your head with heat grow dizzy;

Tear in eye, your dress you’ll tear;

Queer, fair seer, hear my prayer.

Pray, console your loving poet,

Make my coat look new, dear, sew it!

Just compare heart, hear and heard,

Dies and diet, lord and word.

Sword and sward, retain and Britain

(Mind the latter how it’s written).

Made has not the sound of bade,

Say—said, pay—paid, laid but plaid.

Now I surely will not plague you

With such words as vague and ague,

But be careful how you speak,

Say: gush, bush, steak, streak, break, bleak,

Previous, precious, fuchsia, via,

Recipe, pipe, studding-sail, choir;

Woven, oven, how and low,

Script, receipt, shoe, poem, toe.

Say, expecting fraud and trickery:

1

Daughter, laughter and Terpsichore,

Branch, ranch, measles, topsails, aisles,

Missiles, similes, reviles.

... (7 pages of pain follow) ...

and the the Oxford and US pronunciation (at the time, it has changed since) in phonetic.

fennecbutt 10/29/2024||
Huge difference is: English is pretty much THE language that you can butcher and still have people perfectly understand (and hopefully politely correct) you. Even other European (stay mad) languages don't hold up to just how flexible English is in this regard.
Tor3 10/29/2024||
Well yes, that's (I believe) the reason English actually works as an international language, despite being horrible in so many respects (pronunciation, tons of exceptions, etc etc): It also has so much redundancy that even if you get all the grammar wrong the meaning is still there. "I is strongs". When someone knows a tiny bit of English it's often easier to communicate in English than in that person's language, even if you're studying said language. Unfortunately, kind of, but that's how it is.
fennecbutt 11/1/2024||
Yeah exactly. "Me arms big power" would make me go "Oh yeah you do have mighty biceps my dude".

And to the latter point I got that all the time in Japan, but I think main reasons are: they wanna practice, but even more they wanna practice with a native English speaker bc it's a novel experience for em!

Izkata 10/28/2024||||
Oh hurrah, I think that link is what I've been looking for for nearly a decade. I ran across it, or something like it, a long time ago and could never find it again. I don't remember all the special syntax, I think the one I found was written more in plain English with more examples (and I don't think the one I found back then mentioned ghoti either), but can't be sure it's been so long - maybe it was just that page and I don't remember it. It does have around the same number of rules I remember though.
tsimionescu 10/28/2024|||
This is satire, right? 56 rules to get 59% correct pronunciation on a corpus of 5000 words? And these rules don't even include the base sounds - it doesn't tell you how to actually pronounce "m", or "e". So in fact there are more than 70 rules required to get to a base pronunciation (you need to add at least one rule for each letter).
brigandish 10/28/2024|||
"ough" has at least 9 different possible pronunciations, how is that phonetic?
Suppafly 10/29/2024||
>"ough" has at least 9 different possible pronunciations, how is that phonetic?

Does a language stop being phonetic when you have to include other information provided by the rest of the word? I'm not a linguist by any means, but "ough" being pronounced a couple different ways depending how it's used doesn't seem like it'd preclude the language from being considered phonetic in general.

brigandish 11/5/2024||
9 is not a couple, unless you're in a very open relationship - which English words might be - but a language stops being phonetic at the point that the mappings between symbols and sounds are no longer clear and reliable. The most phonetic languages have one-to-one mappings with very few exceptions e.g. Japanese, Spanish, Italian, Finnish.

English, on the other hand, has silent letters, inconsistent mappings even within the same word, exceptions, irregularities, and sounds that are represented by multiple letters and spellings.

English is not a phonetic language except in the sense that it does have mappings between sounds and characters, which would make sense if one were to compare it to a wholly written language like Python, but not any human language.

seanmcdirmid 10/27/2024||||
Fruit flies like a banana. English has its own ambiguity, so it isn’t really that different.

I can only write Chinese via an IME these days. For one, I’m left handed so writing characters was always a struggle since stroke order worked against me, but it’s mostly how I only use Chinese anyways.

I told my wife our kid should learn to write via an IME as well and she was just horrified about that, though. None of the teaching material really supports it.

pfooti 10/28/2024||
Time flies. I can't they're too fast.
wyager 10/27/2024||||
I've been (very) casually learning Japanese for a couple years, and almost every time I think I find something "weird" that Japanese does, I almost immediately think of a very similar example in English.

The alphabet is a pretty awesome invention (alphabet > kana-style syllabary > kanji-style logography) but English writing is at least as complex as JP writing, just in different dimensions.

JP's phonetics, for example, are dead simple compared to English's, but they do a good job making up for it by having a few thousand Kanji.

BeretEnjoyer 10/28/2024||
> JP's phonetics, for example, are dead simple compared to English's

I'm not so sure about that. Do you know about pitch accent?

https://en.wikipedia.org/wiki/Japanese_pitch_accent

Tor3 10/29/2024|||
I'm not a native English speaker, so I don't really know why, or if, there's a problem for native English speakers to learn or "get" pitch accent. For speakers of many other European languages Japanese pitch accent is not tricky. You listen, and then you speak. Just as you would listen to English, and repeat it the same way.

Japanese, despite being extremely logical and so beautiful in so many ways, is still hard to learn for me, and of course learning the writing system is not done in the blink of an eye (unlike the Latin-based writing system we use), but pitch accent isn't really the problem here.

BalinKing 10/28/2024||||
Is that any more complicated than English stress, though? And regardless, Japanese has a very small number of phonemes (compared to English) and extremely restricted phonotactics.
wyager 10/28/2024|||
Yeah, but I don't expect this to be substantively harder than learning most regional accents (could be wrong), and afaik it's also not critical for legibility.
mmarq 10/28/2024||||
In English you have to know a word in order to pronounce it.

The “ou” diphthong in “hound” and “double” or “would” is pronounced differently. Or “ieu” in “lieutenant” vs “lieu”. Or “oo” in “poor” vs “root” Or “berry” in “berry” vs “strawberry”

I could go on forever. There’s no other western language I know of that behaves like that.

cameronh90 10/28/2024|||
English is a quasi-phonetic language in that most words can be mostly pronounced how they're written, but in some cases it inherits the pronunciation of the language the word came from. I'd imagine many English speakers would consider this an undesirable quirk, though.

Indeed, there has been a tendency over the centuries, particularly in the US, to move towards writing words how they sound or pronouncing words how they're written. Lieutenant is an interesting example, since in the UK we pronounce that "lef-tenant" traditionally, but the US moved to the (IMO superior) "lieu-tenant". Nowadays, most young people would probably use the US pronunciation.

I do take some slight umbrage with the implication that some people seem to be making in this thread that language features can't be criticised or that one language can't be better than another. I'm don't see why this would necessarily be true. Even with spoken languages. There are a ton of annoying aspects to English that simply aren't issues in other languages, and I think it's fair to criticise other languages for their failings too. This is especially true of writing systems, which are human inventions rather than something we learn intuitively.

Logographic/logo-syllabic orthographies are harder to learn and remain proficient at than alphabets/abjads, for native speakers and second language learners alike. Alphabets are an innovation that improved on ancient orthographies and enabled a wider range of people to be able to communicate as easily by writing as they do by speaking. Besides the issue mentioned in the article, the writing systems in China/Japan are associated with other issues we rarely see here. Even dictionaries are a non-obvious challenge with logographic languages, which has resulted in several competing ways to sort words.

int_19h 10/28/2024|||
I don't think one can reasonably claim that in English "words are mostly pronounced how they're written". I mean, "i" can stand for /i/, /ɪ/, or /aɪ/, for example (and also for /ə/ if you don't count "ir" as a distinct grapheme). Although vowels at least (mostly) follow some predictable patterns based on syllables - but e.g. it's impossible to say whether "ch" stands for /k/, /tʃ/, /ʃ/, or /x/ without knowing the etymology of the word.
_qbxi 10/28/2024|||
Americans pronounce "lieutenant" closer to the native French pronunciation.
lsaferite 10/28/2024||||
> “ieu” in “lieutenant” vs “lieu”

> “berry” in “berry” vs “strawberry”

Am I misunderstanding the point you are making or is my pronunciation just off? I would pronounce both parts of both examples the same.

mhandley 10/28/2024|||
That's mostly an British English vs American English distinction:

https://dictionary.cambridge.org/pronunciation/english/lieut...

https://dictionary.cambridge.org/pronunciation/english/straw...

wkat4242 10/28/2024|||
Strawberry is often pronounced as Strawbry. Sou the 'e' becomes silent. And Lieutenant as Lutenant (or leftenant in Britain)
Suppafly 10/29/2024||
>Strawberry is often pronounced as Strawbry.

Only in some dialects, not in the standard form.

int_19h 10/28/2024||||
French can be pretty bad. Not as bad as English for reading, but it's much worse for writing because there are so many spelling options for the same thing.
mmarq 10/28/2024||
You are right, but you can read French words without knowing the language, because a written word has a unique correct pronunciation.
Jolter 10/28/2024|||
You have the right idea on “ou” but your other examples don’t make sense.
thfuran 10/27/2024||||
>That's not really a thing in English - a word is a word, and the individual letters that it's composed of are almost always pronounced the same way.

https://en.m.wikipedia.org/wiki/Ough_(orthography)

ReverseCold 10/27/2024||||
> in English - a word is a word, and the individual letters that it's composed of are almost always pronounced the same way

Are you sure about that?

https://en.wikipedia.org/wiki/Ghoti

Izkata 10/28/2024|||
Posted up above, here's a collection of English pronunciation rules that English speakers have internalized so well they can't generally explain them: https://www.zompist.com/spell.html

"Ghoti" is mentioned a few times there, but basically "fish" is a nonsensical pronunciation that breaks several rules. There's a reason (well, a few reasons) why if you ask English speakers how to pronounce "ghoti" and they've never seen it before, they'll probably all guess some variation of "go-tee" or "go-tie".

Suppafly 10/29/2024||||
That's such a dumb example because it claims to follow english rules for those letters while ignoring the actual rules. It makes a somewhat humorous joke, but people pretending that it means anything linguistically are either ignorant or intentionally trying to confuse people.
sorokod 10/27/2024|||
shure!
DiscourseFan 10/28/2024||
reads like it would be pronounced with an aspirated -s- not sh.
maianhvu 10/28/2024||||
Not so much in terms of meaning but in terms of pronunciation, sometimes you also need to read ahead in English to know how a certain word is pronounced. For example: "I read a book yesterday." and "I read a book every night." Depending on the context that follows, "read" is pronounced differently. The same thing happens for "present" and "record". Admittedly, these are exceptions to the rule.
heavyset_go 10/28/2024||||
When teaching reading and English, learning about context clues is one of the ways students are taught to figure out the meaning of words.
grisBeik 10/27/2024||||
> in English - a word is a word, and the individual letters that it's composed of are almost always pronounced the same way

Some context-dependent examples: "read": /ɹid/ vs. /ɹɛd/; "lead": /lid/ vs. /lɛd/ (plumbum); "desert": /ˈdɛz.ɚt/ vs. /dɪˈzɝt/.

Terr_ 10/28/2024||
For more examples:

https://en.wikipedia.org/wiki/Heteronym_(linguistics)

xanderlewis 10/28/2024|||
I think you’ll find all of those things are true of English too.
James_K 10/28/2024|||
People find value in the tradition of writing. If Japanese were to ditch kanji as Korea did, I think there would be some complaints.
jerf 10/28/2024|||
People love to complain about how much work other people can do in order to slightly convenience themselves. And the media loves to run air their complaints, because they are snappy, and photogenic, and easy to pitch as "feel good stories about how much I care for the old ways unlike those lazy sloppy people over there"... even if "I" also find myself forgetting how to write kanji.

It doesn't matter. It won't be a top-down decision. It'll just be a long, slow progression of people slowly realizing that writing in kanji for this character is annoying, so maybe I'll just write it phonetically, and then that character, and then there will be a year or two where there's a phase change and suddenly it's everywhere, even though nobody decided.

And people will complain and whine and moan about the "beauty" of the kanji disappearing. And even though they have a point, it won't matter because the kanji will still be there as much as they ever were, and all one has to do is go study them... but they won't. Because complaining about how other people should keep doing something hard is easy, but actually doing the hard thing yourself is hard, and the vast, vast majority of the complainers won't actually do anything about it other than complain, but take the easier options themselves, just maybe a year or two later than others.

I have no beef with the people taking the easier option. Life is full of things to spend effort on and we can't give maximum effort to all of them. I am annoyed at people who complain about how other people can do vast, vast quantities of work so they can briefly feel slightly better about themselves in some way.

James_K 10/28/2024|||
You'd be surprised how much staying power these things have. The nation and its language are two concepts inherently intertwined. Take the case of Welsh in Wales. It was an almost dead language that no one spoke, but as soon as the Welsh got the ability to self-govern, they enacted laws to mandate all documents and road signs were available in Welsh, required it to be taught in schools, etc. It's very difficult to kill a language in a democratic state because it's a very bad look to oppose laws that "protect the nation's culture". The people who want these things are endlessly pandered to as a result.
jcranmer 10/28/2024||
Welsh is generally highlighted as the example of a successful language revitalism movement, but it's also one of the rare examples of such movements succeeding. By contrast, you can look at Irish--where the need for the language that wasn't English was seen as absolutely essential as part of the (successful) revolution and independence movement--and see that the language revitalism there is more or less a failure. A century after independence, the number of L1 speakers of Irish has gone down, and I believe the Irish government still conducts most of its business using English (despite English officially being the lesser of the official languages) since so few members of government are sufficiently proficient in Irish.
amrocha 10/28/2024|||
Kanji is never going away. I struggle to believe anyone who fully mastered kanji would say something like this.

Kanji is not just “harder”. It’s better.

jerf 10/28/2024|||
I've studied kanji to some degree. I'm not a "master", but I am aware of the way it resolves a lot of ambiguities in Japanese.

But that does not on its own mean that Japanese couldn't evolve out of Kanji. It is not the case that if Kanji goes away, the entire rest of the language MUST stay static. It in fact would not. It would begin a multi-decade process of adjustment to the new issues.

It has happened before in other contexts, and it will happen again. There's a lot of signs that Chinese is on the verge of such a change (on a decadal time scale), which carries somewhat different baggage, but roughly the same amount of it.

What really throws the wrench into the whole thing is computers, and I don't just mean that it will simply speed up or slow down such a change, but that it could send all of this flying out in an entirely new direction. If we're all wearing augmented reality goggles full time in 20 years, what will happen to ideograms if every ideogram you see comes with floating pronunciation guides, and your googles can also translate phonetic spellings transparently in real time back into kanji/ideograms? Could languages like English start growing something like ideograms, presumably descended from modern-day emoji, if computers erase the disadvantages of emoji that cause languages to largely go alphabetic thousands of years ago?

What I absolutely do know is this: In 50 years, no language will be the same as it is today. Guessing what the changes will be, especially in a rapidly evolving novel landscape, is really hard. I don't think kanji/ideograms being seriously diminished is off the table.

James_K 10/28/2024|||
Why is this if you don't mind me asking? I thought that hiragana could already write all the words. What makes kanji so much better than that?
Tor3 10/28/2024|||
In addition to the phoneme problem, it's about readability. Yes, really. The first time I saw わたし written as 私 I just about instantly remembered the latter (it is, after all, used constantly in writing). That kanji is much easier and faster to read than the corresponding hiragana, and it was like that from way back when I had just started learning Japanese. I still have a way to go.. learning a language at my age turns out to be quite slower than when I was younger.. but everything is just easier to read, as soon as one's able to read something in kanji instead of hiragana. The latter is hard and slow to read, even though it's such a simple character system to learn.
fennecbutt 10/29/2024||
Nah as someone that learnt it for 3 years, did a 6 month exchange and then stopped after that I totally disagree.

Not only are kanji needlessly complex because of history, there's also extra work like stroke order (another needlessly "important" thing).

Hira/kata is so much easier, but I ended up giving up the language after I both realised that I wouldn't live there and that they're just romanising so much anyways.

amrocha 10/31/2024||
This is equivalent of saying you studied engineering for 6 months and turns out arches are useless, you can just get rid of arches in all bridges and nothing bad will happen.
Analemma_ 10/28/2024||||
Japanese is very syllable-poor and so there are a colossal number of homonyms and homophones. In speech a lot of these are distinguished by tone and pronunciation, but in writing kanji is the only way to tell them apart. Reading kana-only Japanese is not impossible, but it's a fast path to a headache and leads to huge numbers of ambiguities even in the best case.
int_19h 10/28/2024|||
This just indicates that kana orthography is not phonemic enough; but there's no reason why it couldn't be improved to cover tones etc.
amrocha 10/28/2024|||
The issue is not the writing system. Japanese phonetics are extremely simple. There’s nothing you can do about that.
Tor3 10/28/2024|||
Japanese doesn't have tones, it has pitch accent, and pitch accent applies to words, not phonemes. You would have to invent a system where pitch accent could be indicated for each word. The difference between 橋 (bridge) and 箸 (chopsticks).. the pitch accent is slightly different. But written the same in Hiragana: はし So there would have to be something (wavy line above the text?) to indicate pitch accent. Not sure how that should be done. And then there are the words with little or no pitch accent difference, only context.. in kanji they're different, would be the same in hiragana, so how do you encode that.. compromises would have to be made. I'm sure people have tried to come up with something, somewhere. Maybe.

But then again.. it's that other problem: Reading when there's kanji is much faster. Even for beginners. If you don't understand a word in kanji then it doesn't work, but as soon as you understand it it's way easier and faster to read.

akavi 10/28/2024|||
> You would have to invent a system where pitch accent could be indicated for each word

Really not hard to do. A symbol on the syllable bearing the pitch accent would solve the issue

> And then there are the words with little or no pitch accent difference, only context

What's happened is that effectively a written "shorthand" has emerged that has evolved somewhat separately from how people speak. Losing kanji would mean losing this shorthand, in favor of writing more closely akin to the way people actually speak, but this is how the vast majority of written languages work. Preserving this shorthand seems like thin gruel to justify the complexity of kanji.

Tor3 10/28/2024||
Pitch accent is not accent as in English, it's not any "the" syllable. If you've ever seen any of those videos about it, you'll see these down-up-flat patterns over the whole multi-syllable word. From high to low, from low to high, or low to flat plus/and other variations.

I wouldn't compare kanji to shorthand. Shorthand is typically not easy to read, normal writing is easier. Reading written, fully-spelled English is fast. Reading hiragana is slow (and I've been reading hiragana for a long time)- it's slow, and mentally much harder than reading with kanji. The only issue (and that is of course an issue, but tiny compared to Chinese) is that there's a lot to learn before everything can be read fluently. But reading only hiragana is just.. too hard, for any serious amount of text. It's not hiragana per se, it's the language itself with its limited set of phonemes which contributes to the difficulty.

akavi 10/28/2024||
Pitch accent in Japanese is deterministic based on the mora that is "accented". While it's true the effect of this accent "spreads" across the entire word, you only need to mark a single mora to know the effects word-wide.

> Reading hiragana is slow (and I've been reading hiragana for a long time)- it's slow, and mentally much harder than reading with kanji.

What's the ratio of hiragana-only text that you read compared to Kanji? And does the hiragana text uses spaces between words? My strong suspicion is "low" and "no", respectively. Familiarity breeds comfort with any writing system, and word breaks are a fabulous ergonomic tool for easing reading.

Tor3 10/29/2024||
When I started Japanese a long time ago I would read (small) children's books because all I could read was hiragana. With spaces, for the smallest children. And that was all I read and could read. And yet.. as soon as I could read various words with kanji, the reading got easier and faster.
Suppafly 10/29/2024||
>And yet.. as soon as I could read various words with kanji, the reading got easier and faster.

Could part of that be due to the fact that your vocabulary was also increasing at that time?

Tor3 10/30/2024||
No, it wasn't because of vocabulary, which has only very slowly increased over time. The reading difference is instant and very noticeable. I can't read hiragana fast enough (matching speech) to follow subtitles which are all in hiragana, for example, while I can if there's kanji (though only if I can read it, there's still lots I can't read). This can be changed forth and back and tested with sites like Animelon, for example.
sparkie 10/29/2024|||
> I'm sure people have tried to come up with something, somewhere.

Perhaps related is the abjad used in Arabic and Farsi. Vowels are written with diacritics above or below the main part of the character, which represents a consonant. However, in modern Arabic, the vowels are rarely written and are inferred from context.

The bigger problem for Japanese is the absence of spacing between words. Even if you write everything in hiragana with spacing, it's significantly slower to read than when kanji is present without spacing. The mixing of kana and kanji usually provides a hint as to where word boundaries are, because there are few cases where kana is followed by kanji in the same word (eg お and ご), and kana which follows the kanji are most often a continuation of the word (okurigana) or a particle. Some words are usually written in kana despite having kanji available, and their presence can sometimes make it more difficult to read because they might look ambiguous with a particle or okurigana, and you have to figure out from context what was intended, which slows down reading slightly.

James_K 10/28/2024|||
I can't help but feel these languages are just silly, or at least very badly designed. Maybe in the future, when AI is good enough to translate everything in real time, we will just find a language that is best and teach children that instead. It would save a lot of headaches, and probably also cure dyslexia.
Tor3 10/29/2024||
To call a language silly is.. silly. I don't know Chinese. But for a person like me, Japanese is incredible. It's so extremely logical. Exceptions are almost non-existing. Sentences are modular. Etc. I love it, as a person with a programmer's mind. It has very few phonemes and that's one reason it's hard to "fix" the writing system, but that's also one of its good points, for someone learning the language.

As for "translate in real time", that won't happen because from Japanese to English it would mean to translate before the sentence is done, knowing the intention of the speaker before the speaker says anything. For the simple reason that in Japanese the verb comes at the end while in languages like English it's typically the second word. Using an AI wouldn't be any better than when I used to translate for my wife and the other way around. It works but is hardly satisfactory for anything more than occasionally (speak, wait to hear the translation, speak back, ditto).

A Star Trek universal transparent real-time translator will not happen.

As for dyslexia.. I don't see the connection. Dyslexia is a problem of reading and writing, and it exists independent of the language, and also the writing system (it has been sometimes claimed that Japanese children are less affected by dyslexia than people learning Latin-based languages, and I for some time kind of thought so too.. but I have since seen multiple cases of dyslexia related to Japanese as well, it's the exact same problem)

James_K 10/29/2024||
> it exists independent of the language

Rates of dyslexia are much higher in countries with less phonetic spelling systems. The general conclusion from this is that, while dyslexia may exist at equal rates in countries with phonetic spelling, its effects are diminished to the point where many individuals with it can read unimpaird.

> A Star Trek universal transparent real-time translator will not happen.

I never claimed it would. A delay of a few seconds between speech and translation is acceptable, much the same way actual translators do it.

Tor3 10/30/2024||
I would like to see actual research into dyslexia vs spelling systems, because I've tried to find it and I haven't been able to. Instead I see only claims as the above, which, so far, appear to be based on "common sense", which doesn't actually work here. Common sense says that languages with complicated spelling rules (English, French) should affect dyslectics more than straight-forward languages like Italian and Finnish, but it doesn't, to any noticeable effect.

As an individual I only have anecdotal "evidence", but for what it's worth - I already mentioned that I've seen dyslexia in Japanese children, but not only that - I've also seen that dyslectic bi-lingual children have dyslexia both in Japanese and in their European language.

Unless I see real evidence I'll continue to assume that dyslexia is simply under-reported in e.g. Japan. As has been the case for so many other things - nobody speaks of lactose intolerance in Japan, though it obviously exists.

James_K 10/30/2024||
Yes I was interested in this myself so, before posting what I just wrote, I looked into it and went through the sources on a few papers. I ended up at this fairly authoritative-sounding book which made the claim, though I don't remember the source they cited and I can't be bothered to find it again. The claim made was not that dyslexia wasn't present in other languages, but that its effects were reduced in phonetic ones. The same way that someone in a wheelchair still has broken legs, but can benefit greatly from the installation of ramps.
smallnamespace 10/28/2024||||
This is not a reason for Japanese people to keep Kanji, but Chinese tourists can read Japanese at about 50% comprehension level just due to Kanji without knowing at all how the words are pronounced in Japanese.
amrocha 10/28/2024|||
Another commenter pointed out the ambiguity in Japanese phonetics which is very true.

Imo, the biggest efficiency gain from kanji comes from reading. Meaning is grasped instantly because you don’t need to worry about phonetics. Pronunciation follows a general set of rules, such that even when encountering new words you can guess at how they’re pronounced, while grasping meaning at a glance.

To compare it to latin languages, the difference is like going from reading everything out loud to reading silently.

fennecbutt 10/29/2024||
How does pronunciation follow any rules? There are none that I know of where a given kanji can have several meanings completely independent of one another, there is no structure there.

I'd agree with you if you'd said Korean, where the makeup of the character has direct rules for pronouncing it, if you learn the simple rules then you can read any Korean character - this is the middle ground they should drop kanji for, imo

amrocha 10/31/2024||
The main radical in a character usually dictates how it’s read. General language familiarity tells you which of the readings to use. That’s accurate most of the time, and when it isn’t there’s furigana on the word.

For example, 青 is read as “sei”, and characters that use it as a radical are either read as “sei” or “jou”, such as in 情熱(jounetsu) or 清潔(seiketsu). So when you run into a rare character in a word that uses this same radical, you can assume that it uses a standard reading. For example, the word for fairy, 精霊, isn’t one you run into very often, but when you do you can assume that it’s read as “seirei” based on the radicals, and you’d be correct.

I’m explaining this in length here but with native level proficiency this process happens instantly, as you’re reading.

Japanese should not drop kanji. The only people that think that are foreigners that failed at learning the language. This is not a shared sentiment among japanese speakers.

int_19h 10/28/2024||||
Sure, and there were complaints in Korea, too. Lest we forget, Hangul was developed in 15th century, and was promptly condemned by the educated elites while being enthusiastically adopted by the underclasses. But the elite pushback, going as far as outright bans in some periods, meant that it wouldn't become the standard orthography until 1900s.

I don't think anyone today would seriously argue that Hanja is preferable, though. In retrospect, it's clear that the benefits of easily accessible universal literacy are too substantial to ignore for the sake of tradition.

smallnamespace 10/28/2024||
> I don't think anyone today would seriously argue that Hanja is preferable

It's necessary to use Hanja today in educated contexts because Hangul has too many homophones, and most educated (technical, literary, scientific) vocabulary has a Sinitic origin and therefore are more homophonic than typical Korean words.

int_19h 10/29/2024||
Sure, and lawyers in English-speaking countries similarly use Latin and Old French jargon to reduce ambiguity. But this is a fairly narrow use case that is really more of a specialized notation - it's not used day-to-day even by people who regularly use Hanja professionally.
WillAdams 10/28/2024||||
Hanja still get used in some contexts --- had to memorize ~500 of them when I was studying Korean.
iforgotpassword 10/28/2024||
AFAIK (maybe someone can correct or confirm) it is essential for studying law in Korea. To avoid ambiguity with identically sounding words, Chinese characters are used in law.
coliveira 10/28/2024|||
This is the reason Chinese characters are not going away. It is essential to comprehending written documents, because the Chinese language (and similars) have too many sounds that are the same or very similar for different words. So, if they abolish the characters and use something purely phonetic they'll have to reinvent the whole language to be understandable, especially for anything that is not colloquial.
James_K 10/28/2024|||
This is not a problem in other languages. The word "set" in English has 7 different meanings, yet you rarely struggle to tell which is intended. If the language can be understood when spoken, it can be understood when written phonetically.
coliveira 10/28/2024|||
Other languages are not Chinese. In Chinese a lot of the meaning in the spoken language is conveyed through tones and other conversational cues.
akavi 10/28/2024|||
Tones can be written, and all human spoken communication involves conversational cues, Chinese is not special in this respect.
coliveira 10/28/2024||
If this was so easy, pinyin (a standardized writing) would have replaced characters decades ago!
James_K 10/28/2024|||
But it is that easy. Pinyin has a standard notation for the tones of words. Your position on this matter cannot seriously be "if it were possible to write Chinese phonetically, the Chinese alphabet would no longer exist".
iforgotpassword 10/28/2024||
Their position is "since it is possible to write Chinese phonetically and yet characters didn't go away, there might be more to the story" (than self-proclaimed language experts on HN think)
James_K 10/29/2024||
This is incorrect and shows a basic lack of reading comprehension. Neither I nor anyone else claimed to be a language expert. The post in question said:

> If this [writing tones] was so easy, pinyin (a standardized writing) would have replaced characters decades ago!

Which is saying that the reason Pinyin has not replaced traditional characters because it cannot accurately transcribe Chinese speech.

coliveira 10/29/2024||
As the previous comment says, you're the one with reading comprehension problems. The topic of discussion is the replacement of Chinese characters with a phonetic writing. I said that pinyin already exists and it has not replaced characters, so this cannot be easy as you imagine (just writing down the phonetics of the language).
James_K 10/29/2024||
Which is your perspective, and distinct from the argument just made: that if it was possible to write Chinese tones, the traditional characters would have been replaced. It's obvious that the characters are not replaced due to cultural factors, rather than the inability to come up with a system that can transcribe Chinese speach.
fennecbutt 10/29/2024|||
I'd say nationalism is really the answer there.
James_K 10/28/2024|||
Can you quantify this? From what I understand, Chinese speakers can understand Pinyin text even without the tone marks.
sparkie 10/29/2024||
nd nglsh spkrs cn ndrstnd nglsh wtht wrttn vwls.

Easy to understand for a fluent speaker, but a learner might struggle.

We saw back when we had keypad phones, the youth would write "txt" speak because it was faster to type with 10 digits. I'm pretty sure there was a decline in literacy rate around this time, the youth struggled with spelling because they wrote rarely, but texted frequently. Smartphones fixed that problem, because they provide the full keyboard and auto-correct.

My guess is, if you took the tones out of pinyin, then a generation or two later there would be less literacy. Children would struggle to add the tones even though they know how to speak the word. Writing already contains far less information than speech. Over several more generations, the speech could even change because the written word has lost the tonal information. Compared to the past, we read far more, speak less, and write even less, and most writing has been replaced by typing.

iforgotpassword 10/29/2024|||
Most importantly, you can always pick a simple, predictable sentence, or one with enough redundancy to "prove" that point. Some everyday simple sentences might work in pinyin even without the accents for tones. Try an excerpt from a patent application and I'm sure even with tones you'll fail.

> mchncl lmt xsts t th wdth f sngl xhst prt, t bt 62% f th br dmtr fr rsnbl pstn rng lf.

> Th rd vlv s smpl bt ffctv frm f chck vlv

That's just from a Wikipedia page I have open from earlier. Already quite a bit harder to decipher.

Suppafly 10/29/2024|||
> the youth struggled with spelling because they wrote rarely, but texted frequently

I wonder if there is any evidence of that other than boomers complaining about it.

>Over several more generations, the speech could even change because the written word has lost the tonal information.

That happens automatically with every language already. It's not like a race to the bottom where suddenly no one knows how to communicate though.

>Compared to the past, we read far more, speak less, and write even less, and most writing has been replaced by typing.

That's not necessarily a bad thing.

Suppafly 10/29/2024|||
>This is not a problem in other languages.

I don't have a dog in this fight one way or the other, but it really is surprising that all these pro-kanji comments seem to ignore the concept of context altogether. It's very circular reasoning being used to try and explain why kanji are necessary.

jimmaswell 10/28/2024|||
> they'll have to reinvent the whole language to be understandable

Frankly, the whole language seems like such a mess that maybe they should?

coliveira 10/28/2024||
Good luck convincing 1.5 billion people that they need to reinvent a language they have used for thousands of years in order to satisfy somebody else...
James_K 10/28/2024||
In the Chinese style of government, people don't necessarily need to be convinced of something for it to be implemented.
pas 10/28/2024|||
It seems there's room for "legal innovation" there, by providing definitions early on in various texts to disambiguate, and then sticking to them throughout the text!?

I assume it's already done anyway for some terms. Why isn't this more widespread?

mannykannot 10/28/2024|||
Innovation is quite often resisted by those who have mastered the hard way of doing things, though I have no idea whether this is the case here.
fenomas 10/28/2024|||
I suppose for the same reason that law in English-speaking countries still uses so much Latin?
pas 11/2/2024|||
But that's the opposite of innovation? Basically, instead of describing things in detail drafters opt to use shortcuts, but that's how people end up getting fucked in court by some "technicality".

Innovation would be to just put in the verbiage, precisely define terms, fuck tradition.

hollerith 10/28/2024|||
The Latin (and Old French) words don't require a complicated arrangement to type them on a keyboard.
Tor3 10/29/2024||
For what it's worth, writing Japanese on a normal keyboard is easy, even for me. Fast too. And my wife is super-fast. I have no knowledge of how this is done in Chinese, or Korean for that matter (not to mention other non-Latin languages like Arabic, Thai or Hebrew), but for Japanese it's easy. There are two main ways of doing it, some prefer one, some the other.
nialv7 10/28/2024|||
Yeah, because Japanese without Kanji is at least 2x harder to read.
make3 10/29/2024|||
"Although East Asian people have higher IQ scores on average, we are not superhuman" weird explicit racism as the highest voted comment
bdjsiqoocwk 10/29/2024||
Before calling it racist we could ask him for a reference. After all, it might be a truthful fact.

But yes, it stood out to me too, and I'm confused how you're the only person commenting on it.

I wonder if these people justify having a shitty writing mechanism by being smart. "It's so needlessly complex, but we're smart so we can afford it" when in reality if you're smart you want it as simple as possible.

And then he comes on HN and rationalizes the fact that he can't spell. Ironic.

make3 10/30/2024|||
IQ has been disproved as not an accurate evaluation of intelligence even for problem solving, so asking for a scientific reference for this is like asking a scientific reference for how a certain locally dominant ethnicity have better chakras than another one. they don't, it's just racist.
devvvvvvv 10/30/2024||
The absolute state of orange reddit
Mali- 10/29/2024|||
I found the "we are not superhuman" bit annoying and condescending - replacing with different groups:

"While men are stronger than women on average, we are not superhuman" <- To me, this doesn't seem condescending. Not 100% sure why - perhaps because the difference in the trait (between men and women) is much larger?

"While left-leaning people are smarter than right-leaning on average, we're not superhuman" <- This definitely does seem condescending - perhaps because the skill in question is intelligence, and the statement reads as "You're stupider than me, but please strain your brain to understand".

"While rich people have higher IQ scores on average, we're not superhuman" <- I find this a bit less condescending than the previous one, not sure why. But still annoying for the same reason as previous.

"While whites are stronger than east asians on average, we're not superhuman" <- Again condescending and annoying, but I still think the intelligence statements are more grating.

My conclusion - the statement is irritating because it carries with it an implication people would be surprised Asian people can forget things too. Additionally, it gives the intention that because other groups are stupider, it needs spelling out in simple terms that they're not godly intellects.

jazzyjackson 10/28/2024|||
Maybe it's less of a factor since the standardization of mandarin, but the difference between kanji and an alphabet like Korean and Vietnamese has moved to is that writing with the alphabet leaves an artifact that is only understood by speakers of the same language, whereas kanji can have the same meaning but different spoken words entirely, such that cultures can communicate through written edicts without totally erasing linguistic differences through standardization. So you're right that the individual language/culture doesn't suffer from alphabetization or pinyinification, but I would submit there is change on the level of multicultural interactions, decreasing the mutual intelligibility between cultures for better or worse
bdjsiqoocwk 10/29/2024|||
> Although East Asian people have higher IQ scores on average

Citation needed.

alkyon 10/29/2024|||
Maybe parent was referring to the studies by Lynn, a self-declared "scientific racist".

I think that despite lower IQ scores on average South Korea has been consistently beating Japan in go in the recent years, and more importantly they get rid of hanja (Korean version of kanji) from their writing system.

cubefox 10/29/2024||
[flagged]
tptacek 10/30/2024|||
Ignore the SPLC's own editorializing --- I don't take SPLC's analyses all that seriously either --- and just read the quotes on this page.

https://www.splcenter.org/fighting-hate/extremist-files/indi...

Nobody here is defaming Lynn. You can disagree with the appellation (though I find a number of fact-checked publications claiming that he does describe himself that way), but I don't think it's reasonable to call the argument libelous.

cubefox 10/30/2024||
Calling someone a racist is one of the most defamatory statements one can make. People get cancelled for less. Instead of name calling, people should focus on the empirical question (is there a positive correlation between East Asian ancestry and IQ?) not on trying to undermine the reputation of someone.
tptacek 10/30/2024|||
While I don't agree with any of that, at all, we don't even reach the question, because the point of my comment is that he calls himself that. The SPLC has him in primary source quotes.

I don't think you're going to have an easy time of un-cancelling Richard Lynn. As it stands, your argument comes across more as trying to launder his most inflammatory claims back into the conversation. I'm not interested in debating phrenology, only the more specific question of what terms are and are not reasonably to apply to this person. That's a question we can actually answer empirically with sources available to us.

cubefox 10/31/2024||
> While I don't agree with any of that, at all, we don't even reach the question, because the point of my comment is that he calls himself that.

Apparently he does not. You haven't produced a quote to show that he does.

> I'm not interested in debating phrenology

Correlations between ancestry and IQ have nothing to do with phrenology. If they exist, they are not racist. Facts just are what they are.

tptacek 10/31/2024|||
For whatever it's worth, that was literally what David Duke said. I'm not saying you're David Duke, just that the rhetoric you're deploying isn't persuasive.
jibal 11/1/2024||||
And it's a fact that Lynn is a self-proclaimed "scientific racist", and that most "race scientists" are racist.
jibal 11/1/2024|||
> Apparently he does not. You haven't produced a quote to show that he does.

That's not how it works.

cubefox 11/1/2024||
You accuse someone of saying X, but can't provide evidence they ever said X, then there is no reason to believe you. The burden of proof is on your side.
jibal 11/2/2024||
"Apparently he does not" is a positive claim ... it does not follow even if you were right about not providing evidence--but you're not. And they can and did provide evidence, you simply ignored it. There are plenty reasons for honest people to believe it. Meanwhile, it's an empirical fact that Lynn is a racist, as well as most "race scientists" and their defenders. I won't comment on this further.
cubefox 11/2/2024||
> And they can and did provide evidence, you simply ignored it.

That's simply a lie. They didn't. Nowhere on that website was Lynn quoted as describing himself as a "scientific racist".

> Meanwhile, it's an empirical fact that Lynn is a racist

What would be the evidence for this claim? There could in any case be an association between IQ and East Asian ancestry. Whether there is, is an empirical question, and an empirical hypothesis can't be racist, it is just true or false, or supported/unsupported by the evidence. In the source I provided, Lynn references statistical evidence that supports it.

akerl_ 10/31/2024||||
Calling someone a racist is actually nearly never defamatory, since it’s almost always an opinion based either on disclosed facts or an opinion based on nothing.
cubefox 11/1/2024||
Of course it's defamatory, as many cases of cancellation show.
jibal 11/1/2024|||
He wasn't called a racist here ... the claim was that he labels himself as one.

OTOH there are those who protest too much that "it's just science".

alkyon 10/29/2024|||
Do you expect him to tatoo a word "racist" on his forehead (in kanji)?

That you even dare to quote this pseudo scientific crook is just mind boggling.

Relevant quote:

What is called for here is not genocide, the killing off of the populations of incompetent cultures. But we do need to think realistically in terms of "phasing out" of such peoples.

Calling his output research is abhorrent.

cubefox 10/30/2024||
What is the source of this quote?
jibal 11/1/2024||
The SPLC link above that you didn't read. None of the (sourced) quotes it gives from Lynn are scientific.
cubefox 10/29/2024|||
Here is one citation: https://www.sciencedirect.com/science/article/abs/pii/S10416...

But also in most other IQ tests, Ashkenazi Jewish and East Asian people tend to score the highest.

jibal 11/1/2024||
a) Bad science. b) Unsupported claim. c) Who cares, and why?

As an Ashkenazi Jew with an IQ 3 SD above the mean myself, I focus my attention on that question and have a good grasp of the answer. I also have particular insight into why some Jews have high scores, and how the people who care so much about the average IQs of various populations draw all the wrong conclusions from them because of their ideology. (I would also note that many of those who care so much have lower IQs than a very large fraction of the populations they disparage.)

jibal 10/28/2024|||
"While the author seems to interpret this problem as something crucial"

Does he? Read his last paragraph.

bane 10/28/2024||
My wife is college educated and native Korean, so these are just my observations of her and her friend group's engagement with Chinese characters (Hanja).

Hanja, in daily life, has largely disappeared from colloquial Korean for those under 40 or so. It's still preserved in some formal settings like medicine and law, and is used to appeal to older generations. I've been with my wife long enough to remember when Hanja was still very common to see on newspapers.

There are some small vestigial problem with eliminating from daily life, the large number of monosyllabic Chinese-origin loan words in modern Korean can sometime create ambiguity when written in Hangul. Native Korean speakers will sometimes disambiguate these words by referring to the Hanja, but that's largely disappearing as a habit as well.

Younger Korean generations still learn it in K-12, but it's mostly wasted class time in an already overly crammed education. The kids who focus on it are really geared towards becoming lawyers, and certain kinds of doctors (mostly traditional medicine). STEM focused kids will focus on English instead. As a result there's an active linguistic process occurring where English loan words are slowly replacing Chinese-origin words and concepts in active and modern Korean.

I don't too much about Japanese, but I do have a sense from native speakers that writing the same words in the four major writing systems offers some sense of nuance to how close a reader might be to a concept, or how they might consider it in various ways. From visits there, I did notice the expectation that native speakers could seamlessly read and jump between the systems, often within the same sentence. But I also understand that the pronunciation of Kanji is somewhat nonstandard, and it's not immediately clear how to say something written purely in Kanji (sometimes this is supported by providing explanatory superscripts in another system next to the Kanji). Why persist with this? I suppose it's the nuance that's being conveyed, and this nuance is still prized among native Japanese speakers.

I do get the sense that China has no particular plans on moving away from the system, as it's a unifying source of national identity (and has been for centuries). And they really have very few other options. The main problem is that China is a highly linguistically diverse country, and Chinese offers the ability to transmit ideas instead of sounds which allows speakers of non-mutually-intelligible "dialects" to communicate. Moving to a Latinate system or even to Zhuyin Fuhao (Bopomofo) encodes sounds, not ideas, and risks fracturing the state. It would only become possible if there was a concerted effort, maybe over a couple generations, to Mandarinize and discourage the use of local dialects, but that would also be highly disruptive. Koreans, Japanese (and other adjacent non-Sino languages like Vietnamese, etc.) escaped this either through a higher level of linguistic uniformity, or strong efforts to standardize or teach a national dialect that the writing system (Hangul, Chữ Quốc ngữ, Hiragana, etc.) could amplify.

suchire 10/27/2024||
At one point, apparently it was fashionable amongst teens to type characters by using pinyin and always selecting the first character in the list of options, regardless of the intended actual character. That was essentially phonetic writing, but as a result, texts were incomprehensible to parents (the desired outcome).
whoisburbansky 10/27/2024|
If the texts were incomprehensible to parents, how were they comprehensible to their intended recipients?
Prickle 10/28/2024|||
It's the same logic as writing a sentence like this:

Y U gna be late

It's grammatically completely incorrect. But you can still understand it.

When it comes to chinese/japanese characters, many have the same phonetic reading. So you can do something similar, while selecting the wrong characters.

olalonde 10/28/2024||||
I think it's just easier for beginners (or teenagers) to go from phonetic to meaning. I guess advanced Chinese readers don't even read the words out loud in their head and go directly to meaning. I'm beginner/intermediate at Chinese and surprisingly, I noticed that my pinyin often seems better than many Chinese natives.
neaden 10/28/2024||||
It sounds like the Chinese version of 1337speak.
raincole 10/28/2024||||
The real answer to your question: the most commonly used Chinese input method allows you to type the first pinyin letters only, and the algorithm will figure out the most likely Chinese characters you want.

It's not "the parents" can't read it. It's that people who don't use electronics have a harder time reading it.

raincole 10/28/2024||
(I misread the top comment)
oasisaimlessly 10/27/2024|||
Read the nonsense text aloud and then listen. Presumably with practice, you don't actually to actually speak aloud, and your 'inner monologue' voice is sufficient.
est 10/28/2024||
Chinese characters are not some kind of alphabet. It's like an intermediate language (IL) of mind. Many studentds in China can understand a subject but had the pronouciation completely wrong. In fact many would argue Chinese languages were never unified (mandarin/cantonese/etc) but the scripts were.

Chinese characters also had the benifits of photographic memory, presumably you are trained with the right method. The key is to detach the "listening/speaking" phonetics from the characters, wire your brain directly to visual ideograms along with reading/writing. Plus the grammar don't have conjugation nor declension, without the tense, case, voice, aspect, person, number, gender, mood, animacy, and definiteness and shit, which makes the scripts very fast to parse. I'd argue reading a paragraph of text is extremely fast in Chinese. You can grasp the general meaning from a large chunk of text without sequencially reading every word. It's like one of these novel apps that hightlight important vowel from English sentences for fast reading but still, you have to go to the translation layers of recall - sound - meaning process.

Sadly this art is lost because ideograms are fading in favor of PinYin in cyber world. The rise of shot-vids make literacy an expensive skill.

lolinder 10/28/2024||
> In fact many would argue Chinese languages were never unified (mandarin/cantonese/etc) but the scripts were.

This is, in fact, the default stance held by most non-CCP linguists. If you read what experts in the Chinese language family say, it's basically "Chinese languages are mutually unintelligible and more distinct than the Romance languages, but because the government of China says they're just dialects and we (as linguists) recognize that the line between dialect and language is basically arbitrary, we'll call them dialects so we can just study the languages and avoid getting sucked into nasty political discussions."

As the saying goes, a language is a dialect with an army and a navy—and this works both to define distinct languages that are otherwise mutually intelligible and to merge dialects that aren't.

niceguy1827 10/29/2024|||
> In fact many would argue Chinese languages were never unified (mandarin/cantonese/etc) but the scripts were.

This is the correct understanding, even within mainland China, and across all times. The practice of assigning a "mandarin" based on where the capital is/was dates back at least hundreds of years, if not thousands. You can easily Google "Mandarin in Republic of China" to see the Republic of China's attempt to standardize its mandarin. It's really not a CCP issue.

int_19h 10/28/2024||||
Linguists often use terms like "isolect" these days just to dodge this whole debate and the associated (often very toxic) politics. Not just with Chinese - it's also an issue around e.g. Serbo-Croatian.
GranularRecipe 10/28/2024||||
I think it's also an issue of translation. "Dialect" in English is variety of a language. 方言 (fangyang, translated as dialect) in Chinese is a regional language or speech. Linguistically, Cantonese and other varieties of Chinese are recognised to be part of the Chinese language group / family.

Cf https://baike.baidu.com/item/%E6%B1%89%E8%AF%AD/22488993?fro...

djtango 10/28/2024|||
I'm sure there are plenty of scholarly academics with a whole body of literature to argue otherwise, but I've never bought all the arguments of how different Cantonese and Mandarin are.

To my mind assimilating Cantonese from Mandarin or vice versa is way easier than French <> Italian or Italian <> Spanish. Spanish <> Portuguese is an interesting contender.

Good luck pretending Mandarin and Cantonese are distinct languages while comparing German and French lol.

I say this as someone with a whole lot of Cantonese dna in my heritage before people get all up in arms. I've personally always figured the barriers to learning both Mandarin and Cantonese was cultural and there are plenty of people in Guangzhou who are perfectly bilingual

broken_clock 10/28/2024||
How many of these languages do you speak?

I currently speak and understand English, Spanish, Cantonese and Mandarin to varying degrees.

I was forced to take French for 2 years in high school. Never even took it seriously. But because of that, after 6 weeks of private Spanish tutoring, I was able to hold hour+ long conversations with strangers while backpacking LatAm.

I've spoken Cantonese my entire life (but not truly native level). I took an entire year of college level accelerated "Mandarin for Other Chinese Language Speakers". I took it quite seriously. I'm backpacking China right now. I still can't even talk to anyone for more than a couple minutes without having to use a translator or look up words.

> there are plenty of people in Guangzhou who are perfectly bilingual

There are also plenty who move to Guangzhou and Shenzhen and can't pick up Cantonese at all. Turns out having an authoritarian government force Mandarin on you will make the Cantonese speakers bilingual rather quickly.

djtango 10/28/2024||
I have learned to varying degrees Mandarin Teochew Cantonese Japanese Spanish French German (and Latin lol) I was shocked at how intuitive Italian was when simply walking around Italy after having a grounding in so many adjacent languages (and learning classical music).

Admittedly I am atypical in my exposure to languages and I do enjoy linguistics but it seems to me there's a high initial barrier to the dialects but after the initial wall is overcome it just becomes a mapping exercise and a handful of idioms.

I'd be curious to know which bits of Mandarin you find difficult? Vocab? The grammar is close enough that you have a huge advantage over almost every other language in the world especially for the everyday stuff. Reading and writing, if you know traditional you'll pick up simplified in no time (speaking from experience backpacking through China armed with only a paper dictionary we didn't have smartphones back in my day) the Cantonese tones are quite wild but if you can do tones you have a huge advantage of languages which don't have tones.

If I'm allowed an uncharitable take, my experience has been that a lot of people from China don't feel a drive to learn more languages maybe with the sole exception of English. Maybe it's the result of being in a country of a billion+ that all ostensibly speak the same language. I've always found it so frustrating encountering people who move to the UK to study and they can barely hold a conversation in English despite doing A levels, Bachelor's and Masters in the UK sometimes. For all the complaints that dialects are hard a lot of south east Asian people back in the day would pick up a handful of them and often learn the basics of other languages like Bahasa. This kind of mindset and interaction reminds me more of Europe in the sense that people are more adaptable out of necessity

cyberax 10/28/2024|||
> Chinese characters are not some kind of alphabet.

Yes, they are. Modern Hanzi are a very bad phonetic alphabet.

While a minority of characters are indeed pure logograms (小,大,田,etc.), most modern Chinese words are two-syllabic. And syllables often don't have meaningful connection to the meaning of the word: 东西 ("east-west" literally, but means "a thing, object"), some characters have lost _any_ semantic meaning in most words (“子”), and many more characters can only be used as a part of another word ("bound forms", e.g. "据").

Classical Chinese was more logographic and less phonetic, but modern Chinese is not really close to it anymore.

est 10/28/2024|||
> Modern Hanzi are a very bad phonetic alphabet

alphabets, universally have one common property: they are sortable.

I challenge you to sort Chinese characters.

This is an idea from James Gleick's The Information. The Chinese may never be able to invent morse code alone, because encoding Chinese scripts is extremely hard, even today (think of all those massive code-points in CJK Unicode, with dups and errors)

Chinese text on the Internet may have some emulation of phonemes, but it's never systematically standarized. It just borrows some aspects here or there.

vntok 10/28/2024|||
> I challenge you to sort Chinese characters.

Chinese characters are in fact definitely sortable. There are multiple keys, the most popular ones being by stroke or by sound.

Example: https://en.m.wikipedia.org/wiki/Stroke-based_sorting

unscaled 10/28/2024|||
Chinese dictionaries have been sorted in various ways for at least two millennia, but there are some aspects which make alphabetic order sorting simpler:

1. Less ambiguous order: With classic Kangxi radicals for example, it's not always clear which radical to pick, and there is no clear order when there are multiple characters with the same radical and stroke count. There are other, more modern systems out there, but they all have some ambiguities.

2. Phonetic lookup: If you hear a word and don't know how to write it, you can just try to look it up phonetically. Unless the writing system is extremely perverse (I'm looking at you Ongloti, er, I mean English), you can kinda guess how it's written or at least how it starts. With Chinese characters that is not possible. Sure, Chinese dictionaries often have Pinyin or Zhuyin (Bopomofo) indexes, but Pinyin and Zhuyin are alphabets.

est 10/28/2024|||
good luck dealing with duplicates and hand-written variants.
tsimionescu 10/28/2024||
That's a problem in most alphabets as well. Several Latin letters (and the number symbols we use as well) have significant differences between printed and handwritten versions, and several handwritten versions around (g and z have some of the most variations).
freilanzer 10/28/2024||||
> alphabets, universally have one common property: they are sortable.

Isn't this just an arbitrary order? Why could I not assign numbers to chinese characters and sort them? I know next to nothing about Chinese.

seanc 10/28/2024|||
The sort order of the alphabet symbols is arbitrary, but since all of the words are composed of an ordered set of symbols then sorting the words relative to one another is trivial.
est 10/28/2024|||
> Isn't this just an arbitrary order

yeah but there are very limited number of alphabetical letters and commonly agreed order as a convention.

There's no such a thing in Chinese. For example, you can't easily sort names by A-Z in Chinese except PinYin (or Unicode codepoints for what matters)

fwip 10/28/2024|||
Dictionaries written in Chinese exist. They are in a sorted order, just like English dictionaries, so users can quickly look up the word they have in mind.

https://en.wikipedia.org/wiki/Chinese_character_orders

est 10/29/2024||
The thing is it's sorted only after PinYin is invented, sorta proves the point.

You can't easily compile an encoding out of it, but for alphabets it's intuitive to invent an index for each letter into dash-dots as morse code. It's extremely difficult to do so for Chinese.

Back to the topic, OP talks about "Character amnesia", if you think Chinese characters as emoji, yeah you talk about actions represented in emoji, but you forgot how it was drawn exactly. You can't sort emoji, and emojis don't generally have a sound.

unscaled 10/28/2024|||
Alphabet is a very specific thing: it's a small set of letters (usually less than 30) where each letter usually represents a single phoneme.

Sometimes a letter might represent a phoneme cluster (such as the letters "x" and "j" in English, that usually represent the consonant clusters /ks/ and /dʒ/ respectively). Sometimes there might be some ambiguity, like two letters being used for the same sound (both "c" and "k" can produce the sound /k/ in English) or one letter having two different pronunciations ("c" can be pronounced as either /k/ or /s/).

What distinguishes alphabets from all other similar written systems is that a single letter cannot represent a combination of a consonant and a vowel and that vowels can be independently represented by letters.

Other similar scripts are Abjad (like ancient Hebrew), where letters only represent consonants and vowels are implied from the context. The Ancient Hebrew script (which is different than the square Aramaic alphabet used to write Hebrew after circa 300 BC) is a later variant of the Proto-Canaanite script, an abjad which served as a basis to all later European alphabets (Etruscan, Greek, Latin, Runic and Cyrillic) and other Near Eastern alphabets (such as Aramaic, Arabic and Syriac). The only pre-modern alphabet (or abjad or abugida) I'm aware of that is not derived from Proto-Canaanite is Hangul (which is a true alphabet, unlike the two Japanese Kana).

Modern Hebrew and Arabic are mixed-alphabets, since some vowels can be represented by consonants, but not all of the vowels, and the letters that represent a vowel leave some ambiguity with regards to which vowels they represent (or whether they represent a vowel or a consonant).

The next type of similar system is abugida, which covers most of the Ethiopian, South Asian and South-East Asian scripts (Ge'ez, Devanagari, Tamil, Tibetan, Thai, Burmese, Khmer and many more). These are all probably derived from the Aramaic alphabet. In abugidas most letters represent a consonant that comes with default vowel (e.g. क in Devanagari used to write Hindi represents /ka/), but there are special diacritics that can modify a letter to have a different vowel (e.g. कॆ represents /ke/ in Devanagari) or even insert extra consonants or glides before the vowel. These combined forms together with the diacritics can get fairly irregular (especially in Ge'ez) and consonant clusters can become quite unwieldy and then about 80% of the consonants would just get dropped in Tibetan. But that's the general idea.

Then you've got syllabaries: these are pretty straightforward systems, where every letter represent a combination of a consonant (or a consonant cluster) and a vowel (sometimes a diphthong or a vowel with a glide). These scripts require you to remember more letters, but the combinations are simpler and more regular than most alphabets (let alone abjads and abugidas). This is the kind of writing system you see getting developed independently more often than others: Linear B, Japanese Kana, Cherokee, Vai, Yi.

Chinese characters are none of these. Characters never represent a single consonant or a stand-alone vowel that can combine with another consonant. In fact, bar few exceptions (such as 儿 in Mandarin) every character represents a full syllable and does not combine to form a syllable. But Chinese characters are not syllabaries either, since there are many characters that can be used to write each sound and they are not interchangeable with each other. A specific character has to be used based on the meaning of the word. This is how logographic writing systems works and modern Chinese is logographic language par excellence.

To appreciate that you have to compare Chinese characters with other logographic languages. Let's take Akkadian cuneiform (the writing system used for writing Babylonian and Assyrian) for example.

Cuneiform was first developed to write Sumerian, but this language was mostly dead by the times of Hammurabi (18th century BC), and it was a far-gone relic during the heyday of the Neo-Babylonian Empire of Nebuchadnezzar II. The Akkadians (i.e. the various Eastern Semitic language speakers of Mesopotamia) needed to write their own language with characters that represented Sumerian concepts, and they used the same methods modern Chinese (or Japanese) speakers use today: using a single Sumerian logogram in its own original meaning (but Akkadian pronounciation), transcribing a word using syllables that represent different words with same sounds and combining multiple logograms to form a new meaning. Like Japanese (but unlike Chinese), Akkadian cuneiform characters can represent a multi-syllable word and multiple logograms can combine to a new word with completely different (and unexpected) pronunciation. Akkadian is also commonly using logograms as word classifier (e.g to indicate geographical locations, gender, type of object and many other things[1]). These classifiers were written, but rarely (if ever?) pronounced.

Egyptian hieroglyphs, which I am even less familiar with than cuneiform, seem to have a far more developed system of classification (determinaties). They also seem to exhibit combinations of logograms to denote new meanigns and phonetic writing from a very early stage. In fact, Egyptian hieroglyphs, the quintessential "pictographic" in contemporary imagination, are mostly phonetic. Each hieroglyphs generally represents a cluster of 1-3 consonants, which probably came from the original pronunciation of the word it represented.

But this is like an abjad! And yes, the Proto-Canaanite abjad probably originated in a simplification of Egyptian hieroglyphs. And like abjads, which dveloped into mixed scripts (like Modern Hebrew and Arabic) and developed optional diacritics, Egyptian hieroglyphs also needed a method to disambiguate the multitude of similar-sounding words. And for that reason most phonetic Egyptian words (as far as I know) are accompanied by a logographic determinative [2] (classifer) that signifies whether it's the name of a God, a city, a house, a lotus flower, a lotus bud, another part of the lotus (stem, stalk or rhizome) or foxes skins. Yeah, these classifiers get rather specific. [3]

No system out there (including Japanese Kanji) is exactly like Chinese characters as used in Mandarin Chinese and other modern Chinese languages, but what I want to show here is that even though Modern Chinese is quite different from Classical Chinese, the writing system is still logographic. All logographic systems (including classical Chinese) have some phonetic features, at the very least in order to account for words that have no agreed-upon logogram. But what makes them logographic, is the pervasive use of logograms in a semantic role to disambiguate meanings.

[1] https://sumerianlanguage.tumblr.com/post/167245277900/hi-i-w...

[2] https://www.ucl.ac.uk/museums-static/digitalegypt/writing/sy...

[3] http://web.ff.cuni.cz/ustavy/egyptologie/pdf/Gardiner_signli...

dehrmann 10/28/2024|||
> Chinese characters are not some kind of alphabet. It's like an intermediate language (IL) of mind.

I realized this in Taiwan when I started being able to recognize characters, know what it means in English, and have absolutely no idea what the word is in Mandarin. The written language is almost orthogonal to the spoken one.

saithound 10/28/2024||
> The written language is almost orthogonal to the spoken one.

I'm almost certain that this is true of Chinese script (after all, it was and is used for writing many languages!), but it might not be deducible based on this sort of experience.

I say thus because I had a very similar experience after I had to spend a month in the UAE. Thanks to frequent bilingual signs, I started recognizing common Arabic words, but I had no idea what the words are in Arabic or how to say them. But as far as I know, written Arabic is not at all orthogonal to spoken Arabic, every word is written exactly as it sounds.

cjohnson318 10/28/2024||
Arabic is significantly more difficult for English speakers than a Romance language, but you're still able to draw a straight line from symbols to sounds. Once you learn the alphabet, sounding words out in your head isn't difficult. (Naturally, you will not sound like a native speaker for a long time.)
sandbach 10/29/2024|||
It's true that Chinese words don't inflect, but not all the grammatical categories you list are missing. There are aspect markers like 了 and 正在, and nouns are definite or indefinite even if they're not marked as such by articles: 有 can only have an indefinite object, for example.
djtango 10/28/2024||
> which makes the scripts very fast to parse

Yes my wife is bilingual and she thinks in English but prefers reading in Chinese because it's more terse

ggm 10/27/2024||
Mao and the party nearly adopted pinyin as the national alphabet but stepped back from the brink.

I remember the great Peking->Beijing uplift. Reading "China reconstructs" magazines there were suggestions it was coming, and then it just went away. BBC newsreaders explained it was the new official look. Like Turkey-> Türkiye.

I suspect all syllabery/ideogram scripts have this latent problem. At 2,500 ideograms for "literate" there's a lot of potential to lose non core elements. "Educated" means over 5,000 heading to 10,000 and the complete set is north of 40,000 from what I understand. I can't imagine the investment in time to get there.

jannesan 10/28/2024||
Official standard defines around 8k characters in modern usage. I’d guesstimate native highly educated speaker recognizes maybe 6-7k at most.

https://en.m.wikipedia.org/wiki/List_of_Commonly_Used_Standa...

Terr_ 10/28/2024|||
> nearly adopted pinyin as the national alphabet

Some years ago I saw street-signs in China that had both Mandarin characters and also alphabetical versions, and I couldn't understand why they would go so far and then omit the accent marks.

I wonder if it's been fixed since.

cyberax 10/28/2024|||
It's common. The Pinyin transcription is needed for foreigners to be able to recognize the directions, and so it doesn't need accent marks.
Terr_ 10/28/2024||
Yet then foreigners can't accurately repeat it, like "I'm at the corner of X and Y" or "The hotel is the one on Z road."
umanwizard 10/28/2024||
Even if they had the diacritics, it’s unlikely most foreigners could accurately reproduce the tones anyway.
Symmetry 10/28/2024||
And even ignoring the tones, how many foreigners can pronounce pinyin correctly? It takes a very different approach to representing non-Latin vowel sounds with Latin characters than English does, there's 'c' being pronounced 'ts', etc.
int_19h 10/28/2024||
"c" being pronounced as "ts" would be familiar to most people from Central or Eastern Europe, for example. Even coming from English, "c" can often mean "s" which is at least similar.

I'd say that the most unusual pinyin mappings are "q" and "x", although both have some analogues in European languages as well.

umanwizard 10/28/2024||
X is probably familiar to most people because of Xi Jinping's surname. I agree with the notion that Q is the Pinyin letter that would cause the most trouble for Westerners.
emmelaich 10/28/2024|||
Maybe the same reason English road signs omit punctuation.
Terr_ 10/28/2024||
No, those are not the same, because Chinese is a tonal language. [0] Taking Pinyin [1] and erasing the accent-marks creates ambiguity between several different words.

The English sign-equivalent would be... Well, something so dumb that nobody does it. Like perhaps deleting the ascenders and descenders of letters dbqp so that they look like oooo, which doesn't even help with horizontal space.

[0] https://en.wikipedia.org/wiki/Tone_(linguistics)

[1] https://en.wikipedia.org/wiki/Pinyin

poincaredisk 10/28/2024||
English equivalent would be writing something like "Aldwych", "Leicester Square" or "River Thames" on a sign, and expecting me, a foreigner, to pronounce it correctly.
Terr_ 10/28/2024|||
No, because without tones it can be ambiguous even for native speakers.

If you still don't believe me--or those Wikipedia links I already provided--you test it yourself by finding a native Mandarin speaker. Ask them to decisively determine the meaning and pronunciation of certain Chinese words only from their pinyin with the accents stripped out, such as ma or hua.

There's a store with snacks and produce. Do you want to eat lizi, or do you want to eat lizi? (Don't bother squinting, it's the same letters.)

fennecbutt 10/29/2024|||
Tbf the only reasons these spellings still exist is so that the British can sneer at anybody trying to pronounce them.

After moving here A: they weren't that tricky anyway (shire is a given, the only one I had to learn was *eicester) and B: I just get em to pronounce words in Maori.

mik09 10/31/2024|||
iirc, the official story from the mainland was that during around 1957 in order to boost literacy the govt simplified the characters, something like that...

today i learnt

lazide 10/27/2024|||
China is highly competitive, with a history of using ‘merit’ based admission tests.

The system/barriers were setup (as one’s always are) by incumbents, and the way they did it (while continuing to present it as ‘merit’ based) was to lean heavily on tests that require extensive memorization and tutoring, because only the wealthy can afford it.

This is one obvious sign of that. After all, who has time to memorize 40k different characters?

looping__lui 10/27/2024|||
That seems a bit made up tbh. I’ve worked with a fair number of Chinese overachievers (both in domestic China and abroad) and family background/affluence weren’t even remotely as much of a factor compared to the US or India (except for the IITs). Also, I noticed there were many many cases of brilliant young people rising through the ranks of the academic system in China compared to India for example where teachers often simply would not show up in public schools.

Personal background: I worked and studied in the US, I worked in China and I studied in India

alephnerd 10/27/2024|||
I think they mean historic (as in pre-1911 China).

And they aren't wrong, as even major China scholars like Yasheng Huang and Yuhua Wang point out that the Imperial Civil Service in China was stacked against merit due to structural issues that biased in favor of incumbents.

That said, similar issues continue to persist in China to this day due to the Zhongkao.

Specifically, if you didn't attend a academic high school (which only accepts around 45-55% of Zhongkao takers) you wouldn't be prepared for the Gaokao unless it was out of pocket at a cram school (which are now technically banned, but were anyhow exorbitantly expensive in a country where the median household income is $4,000). And if you couldn't pass the Gaokao, you couldn't attend university.

Furthermore, Academic High Schools tend to be few and far between, yet take the lion's share of resources unlike Vocational High Schools which most Chinese attend - but not the ones you meet in an air conditioned office unless you order from Meituan. In the vocational schools you see similar issues of teacher absence and lack of pedagogy.

The failure of the VET system in China has been a major sticking point in Chinese policymaking recently and a lot of domestic research is being done to understand why it failed [0][1][2]

[0] - https://www.emerald.com/insight/content/doi/10.1108/ET-09-20...

[1] - https://www.ewadirect.com/proceedings/lnep/article/view/9075

[2] - https://www.atlantis-press.com/article/125949173.pdf

looping__lui 10/27/2024||
Thanks for the additional details.

In Germany only about 30% of high-school kids qualify academic high-school (merit-based) and are eligible to apply for university after two-years of exams that determine your final grade (and chances). There are literally only public schools.

I think you describe that it is very similar today in China after they banned the private tutoring a couple of years back. So, we are somehow back to “work hard on not to unequal grounds and your personal merit/luck will determine your future”.

Thanks for sharing additional detail and it resembles what I remember - but it does not seem to be “instrumented in a way to keep certain groups out of academia based on income or social standing”.

Imho a 30% ratio for university graduates is a sweet spot for society. We have great (free) vocational training programs in Germany that empower all those who did not go to university to make money quicker and they more often that not end up in financially better situations. University often pays off only for top university graduates once they pass 35/40 years of age.

alephnerd 10/27/2024|||
The main difference is Germany actually puts money in vocational education and values it socially.

China does not.

China's education spend is only 4.1% of GDP, and much of that is diverted to academic high schools and universities. Furthermore, vocational students are viewed as "bad students" and good faculty prefer to join academic high schools or private high schools due to a better salary and social standing. Furthermore, even SOE factories (the traditional hirer of vocational students) in China now prefer hiring college graduates instead of vocational students because of a glut of college graduates due to cultural shifts in the 2000s-2010s, anti-vocational student bias, and college graduates trying to get a stable "iron rice bowl" (to use the Korean term) government job.

> I think you describe that it is very similar today in China after they banned the private tutoring a couple of years back. So, we are somehow back to “work hard on not to unequal grounds and your personal merit/luck will determine your future”

I think that was the aim of the legislation, but in action it didn't help social mobility much, because upper income households just resorted to private or online tutoring, and everyone ignores it.

It's not like legislators send their kids to vocational schools - they also prefer to send them to academic or top private schools so that they can then attend university (either domestically or abroad).

> but it does not seem to be “instrumented in a way to keep certain groups out of academia based on income or social standing”.

In face value it doesn't, but in action urban academic high schools and primary schools get the lions share of funding, and until recently those are gatekept to people holding a hukou for that city.

This meant that urban migrants who weren't able to convert their rural hukou to an urban hukuo (because that meant losing their rural landholdings which is the only asset of worth they have), and their kids were stuck either in underfunded rural schools or crappy private primary schools in an urban area.

This has a significant impact on social mobility to this day, and is a major reason why China's median household income remains very low. Urban China's median household income is $6,000, but rural China's is around $3,000. This means the bottom half of urban society in China and much of rural China's society does not have much of a chance of upward mobility because they can't pay for private tutoring nor can they afford a good private high school if their kid fails the Zhongkao.

looping__lui 10/27/2024||
Thanks! That’s a lot of additional information. I appreciate the effort!
josefx 10/27/2024|||
> In Germany only about 30% of high-school kids qualify academic high-school (merit-based) and are eligible to apply for university

Overall 50% of the population are eligible to apply for university since completing higher tier education at a vocational school or completing specialized courses at "lesser" schools can also fullfill the acceptance requirements for related fields of study.

looping__lui 10/28/2024||
There are additional paths people can beyond high schools and we have different tiers (allgemeine Hochschulreife; Fachhochschulreife) for different tiers of universities (academic further qualifying for a PhD career / practitioner); as the original post was referring to Chinese high school students specifically I tied it back to Gymnasium where give or take 30% of students go (probably a tad higher today because they lowered the bar).
alephnerd 10/28/2024||
There isn't a significant division between a a Gymnasium and Fachoberschule, because a Fachoberschule student can choose to get an Arbitur as well via an optional 13th year. And because German universities are de facto open entry, an Arbitur is all you need to be admitted.

China's vocational high schools on the other hand don't open the same doors to tertiary education that a Fachoberschule student has because they are structurally segregated away from academic students.

looping__lui 10/29/2024||
Abitur is 30% of students roughly which was the point of my discussion: https://de.m.wikipedia.org/wiki/Abiturientenquote

This should include the students who do Abitur via FOS.

lazide 10/27/2024|||
Made up how?

It’s still more merit based and class mobile than caste anything.

But the tests are the tests and require massive prep, eh?

And the tests require massive amounts of memorization and time to prep for, compared to western tests - even now.

Historically it was orders of magnitude worse though.

looping__lui 10/27/2024||
Kid’s ability to make it through the ranks of academia in my personal anecdotal evidence depends less on financial ability or social standing in China compared to the US or India.

I agree that it is merit driven above anything else - not sure I would agree that this is favoring wealthy kids/certain social class in a more disproportionate way than what we see in India or the US.

Now, I will say that neither university nor high school in China or India imho prepare kinds for success in life or the corporate world. Europe (with a hands-off “you get no help and will just struggle your way through life and learn quick”) and the US (“here are actual experts dedicated to help you succeed) seem imho to produce capable graduates for the corporate world.

lazide 10/27/2024||
Eh, see alephnerds comment above - that lines up with my experience as well.
looping__lui 10/27/2024||
Ok, the point being though (it seems): advantages are not universal geographically across China; I agree, that I also observed that.
lazide 10/27/2024||
In that geographical advantages are expressly tied to income and ‘official ness’ in an area (the Hukou system)?

If one is a poor rural immigrant in an otherwise high end area, you don’t get to send your kids to the nice schools.

alephnerd 10/27/2024|||
The issue with hukou is two-way.

Urban cities of course don't want to hand out urban hukou easily because they don't want an influx of migrants straining social services.

But conversely, a lot of rural hukou holders do not want to give up rural hukou because it gives retirement benefits at 55 and is a requirement in order to keep their rural landholding.

This is why you have migrants with a rural hukou working in urban China but not gaining an urban hukou.

The problem with hukou is fundamentally a social safety net problem - there is little to no social safety net in China, so the "migrant to urban area with rural hukou" is the least bad option out of multiple bad options (keep rural hukou and live in rural China barely eking a living or give up rural hukou and lose the only appreciating asset you had along with benefits at 55).

In essence, the lion's share of Chinese development is overly concentrated in a handful of urban agglomerations, and isn't spreading to rural China where 45-50% of Chinese still live to this day.

nradov 10/27/2024|||
Given the collapsing demographics in China, many of those rural hukou holders are obviously not going to receive the promised retirement benefits. The surplus resources to fund those payments don't exist so either the retirement age will be raised or benefits will be cut (either officially, or unofficially by just not sending payments and ignoring any protests).
alephnerd 10/27/2024||
Retirement benefits are the last thing the CCP touches because tens of millions of Chinese heavily rely on it already.

China is authoritarian, but the CCP absolutely does take public sentiment into account, and policies that have the chance of causing mass protests and discontent do get rolled back.

Zero Covid is a perfect example of this, as it was hastily rolled back after the wave of protests following the apartment fire in Urumqi due to Xinjiang CCP's hard Zero Covid enforcement.

And this is why China had not raised the retirement age until in the past few weeks despite trying for decades, and anyhow kicked that can down the line to 15 years.

nradov 10/27/2024||
I understand the concern over public sentiment but where will the revenue come from to pay those benefits? The ratio of workers to retirees is inevitably going to go way down and it seems unrealistic to expect that the government can borrow its way out of the problem. The retirees will have to take a hit somehow.
alephnerd 10/27/2024||
> where will the revenue come from to pay those benefits

A mixture of bonds/borrowing, federal bailouts, and (painful) corporate tax reform.

This is a major reason why provincial law enforcement has recently begun cracking down on unpaid corporate back taxes recently, because social spend is largely devolved to the provincial level.

The property crisis in China is itself a result of the retirement fund issue, as until recently provincial government's only financial lever was land sales, and retirement funds are largely the domain of provinces following Deng's reforms.

lazide 10/27/2024|||
The hokou/education thing is pretty clearly one way though, correct?
alephnerd 10/27/2024||
Just making urban hukou easier to adopt (which is something that multiple municipalities slowly started doing in the late 2010s) isn't enough to solve the social mobility issue.

Parents won't give up rural hukou if it also means losing your landholding and early retirement stipend benefits.

If you're a migrant worker from a rural household, you are most likely an unskilled laborer and are earning around $300/month, with dad working on a construction site or Meituan and mom working in a factory doing unskilled assembly or service job.

Around $150 is spent on incidentals because living in an urban area is expensive, an additional $100 is sent back to your family (grandma, grandpa, kids because the one child policy was largely ignored in rural China) back home in your rural town, and you might have $50 left over to save for retirement, healthcare, etc.

This is not enough to buy urban property, which is the asset class that appreciated the most in China, and this means the only large asset you have is your rural landholding. Furthermore, that early retirement benefit means you're earning an additional $15-20/month while continuing to work as a laborer or a Meituan delivery driver.

Fundamentally, salaries are too low in China and the social safety net is nonexistent, and this is what is causing the issues like overproduction, deflation, and sagging consumer demand which we are seeing nowadays.

The only way to solve this problem is to either expand the welfare system dramatically (thus incentivizing the bottom half to spend more by having to save less) or increase wages (thus incentivizing the bottom half to spend more by allowing them to save at the same rate while spending more). Working on increasing the quality of life in rural China would also help dramatically.

Sadly, Chinese leadership at the top level continues to ignore social welfare spending and rural China due to financial and moral concerns.

looping__lui 10/28/2024|||
I do not disagree with that stance; but now we are comparing a population of 1 B people and a country the size of US+Europe+more with the US or a European country. E.g., it would sound a bit like comparing all of the opportunities and prospects kids in the EU have to the US and calling the geographic discrepancies “social barriers”. So, the “poor european families don’t get to send their kids to the fancy US schools”.

I don’t disagree there is unfairness - but it exists everywhere and China just has more people than US, EU, Canada, UK combined.

nneonneo 10/27/2024|||
Once you get to ~5000 characters you can read most common texts. Vanishingly few people will know ~40,000 characters; a large fraction of those are obscure or ancient characters that only show up in historical texts.
lazide 10/27/2024||
That is rather the point I was making, wasn’t it?
otabdeveloper4 10/27/2024||
There are only about 200+ Chinese ideograms. Certainly not in the thousands.
nneonneo 10/27/2024|||
This is comically incorrect. Even the article plainly states that you need to know ~1500 characters to be considered literate in Chinese, with sixth graders required to learn ~2500 characters. A quick perusal of practically any Chinese-language website (e.g. https://zh.wikipedia.org/) will quickly disabuse you of the notion that there are "only 200+ Chinese ideograms".

You might be conflating "ideogram" with "radical", i.e. components of characters. There's probably a few hundred of those defined, but they're more like pieces of characters rather than whole ones. Combining radicals produces very different characters that have totally different meanings; learning the radicals alone buys you very little.

There are thousands of characters, and you have to know thousands. There are tens of thousands of characters in existence, although only highly educated folks will know anywhere near 10000 characters.

otabdeveloper4 10/27/2024||
The vast majority of Chinese characters aren't ideograms.
vitus 10/27/2024||||
Um, no. If you're talking about radicals (of which there are generally considered to be 214), yes, but you can't read / write in general if you only know those. Also, of those 214, a good chunk aren't standalone words. You're never going to see 疒 or 丶 by itself.

You won't have the words for "I" or "you". You might be able to read "melon" but not "fruit". You could read "papaya" and "corn" but not "vegetable". You could read "beef" or "lamb" but not "chicken". You could read "small" but not "few". You wouldn't be able to read "hello" or "goodbye", "happy" or "sad". But you'd be able to count 1, 2, 8, 10, 11, 12, 18, 20...

duskwuff 10/27/2024|||
The fact that characters are made up of radicals does make them easier to learn, though. There are ~2000 characters in the core vocabulary, sure - but it's not like you have to learn to write every one of them individually, just like you don't have to learn the spelling of every English word from scratch. There's common patterns.
DiogenesKynikos 10/27/2024||||
Learning the radicals is maybe the first 10% of the effort in learning how to read and write Chinese.

After that, you still have to learn how to combine the radicals in pretty much arbitrary ways to form several thousand characters. The way you combine them is sometimes related to the sound and/or meaning of the radicals, but it's not systematic at all.

The grandparent comment is massively downplaying the difficulty involved in learning to read and write Chinese.

otabdeveloper4 10/27/2024|||
The vast majority of Chinese characters are phonetic compounds, not ideograms. There are only a few hundred ideograms.
vitus 10/27/2024||
That's understating the actual number of distinct components you need to know, but even so, just memorizing the individual ideograms still isn't enough to allow you to read Chinese with meaningful fluency.

    Since the sound changes that had taken place over the two to three thousand
    years since the Old Chinese period have been extensive, in some instances,
    the phono-semantic natures of some compound characters have been
    obliterated, with the phonetic component providing no useful phonetic
    information at all in the modern language. For instance, 逾 (yú; /y³⁵/;
    'exceed'), 輸 (shū; /ʂu⁵⁵/; 'lose', 'donate'), 偷 (tōu; /tʰoʊ̯⁵⁵/; 'steal',
    'get by') share the phonetic 俞 (yú; /y³⁵/; 'agree') but their
    pronunciations bear no resemblance to each other in Standard Chinese or any
    other variety.
https://en.wikipedia.org/wiki/Chinese_character_classificati...

Or, consider 亭 (ting2), 叮 (ding1), 成 (cheng2), 打 (da3) which all supposedly derive their sound from 丁 (ding1) per the analysis at https://github.com/cjkvi/cjkvi-ids/. You can't just memorize the components and read all of these as "ding".

And beyond that, that doesn't help with being able to write. You can't just say "oh, I don't remember exactly how to write this word, but I'm just going to throw in some ideograph with the right phonetic component next to the radical and my reader will just know what I mean." You can't just take 瓦 (pottery) + 平 (ping2) to invent the word for vase. 瓶 (ping2) instead uses 并 (bing1), not to be confused with 井 (jing3).

orwin 10/27/2024|||
I'd say thousands of ideograms composed by hundreds of logograms, and you can be both right (this is actually a very reduced view of what logogram is, but English is quite imprecise on this).
joshdavham 10/27/2024||
> Chinese people are increasingly forgetting how to write characters by hand.

For those who can’t read Chinese here, I’ll just note that this is basically the equivalent of forgetting how to spell certain words in English. For example, I can read just fine, but there are still words I’m not good at spelling.

I’m just noting this so that this problem doesn’t seem totally exotic or specific to Chinese(/Japanese).

naming_the_user 10/27/2024||
Strongly disagree.

If you can remember the word "chocolate" but not the spelling then you can guess it. You might write choklit or choclate or something but you can at least get close.

If you forget what 警察 (police officer, jing3cha2) looks like then you're just completely screwed. Maybe you can remember a radical or two but it's still just going to be wrong and not meaningfully recognisable.

I guess you could write down 景茶 (jing3cha2) and rely on the phonetics, or use a different word if you know one, but it's still wrong on a level that "choclit" just isn't.

shrimptho 10/28/2024|||
The counter-argument would be that the person could just use the pinyin or use a digital device to get them the characters. But as the article pointed out, those are both modern conveniences. Less than 100 years old both. The script absolutely gives no clues otherwise.

Let's put it this way: We know what ancient might egyptian (most likely) sounds like because they gave their writting system the uniliterals, which are pronunciation guides for complex words. We know that they said waw, and we know what a waw sounds like. They did this probably so they would not to explode their character count from hundreds to thousands.

Chathamization 10/28/2024||||
> I guess you could write down 景茶 (jing3cha2) and rely on the phonetics, or use a different word if you know one, but it's still wrong on a level that "choclit" just isn't.

If a non-native did this in their own way would likely look wrong, but Chinese natives do occasionally use phonics to write or to substitute some characters with others.

naming_the_user 10/28/2024||
Yeah I agree with this. It's also super common in Pinyin to just typo and choose the wrong character through autocomplete.

I'm strictly referring to the handwritten language here, basically I don't think there is an analogue in alphabetic languages.

GranularRecipe 10/28/2024||||
I don't think these two forms of'amnesia-induced' typos are so different. "Typos" happen quite often, especially when typing on a smartphone and selecting the wrong character. And people learn to read it correctly. It's sometimes used intentionally, e.g. 歪果仁.
fennecbutt 10/29/2024||||
Exactly this, it was only recently my Italian friend was musing about how even other European languages don't really have this ability to just absoelootleigh fudge staff ind stihl b undurdudd.
nabla9 10/27/2024|||
The severity of the problem seems exotic.

> However, this new digitally induced amnesia is not merely a matter of forgetting a few strokes in a rare character. Highly literate people are forgetting how to write the characters in words like ‘kitchen’ (厨房), ‘lips’ (嘴唇), ‘cough’ (咳嗽), and ‘broom’ (扫帚). Victor Mair (2014) provides a striking example of the severity of the character amnesia problem. The following image is of a shopping list hastily written by a social science researcher from the PRC. The writer of the list struggled to remember the characters in ‘egg’ (鸡蛋), ‘shrimp’ (虾仁), and ‘chives’ (韭菜), and simply resorted to Pinyin.

mlinhares 10/27/2024|||
Completely common problem. Anyone that speaks multiple languages sees this everyday, there are many words in Portuguese/Spanish/English I need the spellchecker for (or even translation) to write because I don't use it as frequently in that specific language.

That is happening a lot with cooking, as I started to take it much more seriously when I moved to the US and now my cooking vocabulary in English is much better/wider than it is in my native Portuguese, so I'll frequently use words in english for stuff I should know in portuguese but don't remember.

marc_abonce 10/28/2024|||
> For those who can’t read Chinese here, I’ll just note that this is basically the equivalent of forgetting how to spell certain words in English.

I can't read Chinese, but I think the article has a better analogy: "most people can easily recognise the musical symbol for treble clef (𝄞), but very few could draw it by memory."

keybored 10/27/2024|||
> I’m just noting this so that this problem doesn’t seem totally exotic or specific to Chinese(/Japanese).

English orthography is exactly that. Exotic.

Imagine having such a strange spelling system that you have competitions where you try to recite spelling. Exotic.

pulsarmx 10/28/2024||
Many people would consider Spanish, my native language, to be much more straightforward when it comes to spelling.

We still have spelling contests at state and nation levels.

lomase 10/28/2024|||
I am from Spain and I have never heard of the national spelling contest.

I guess every word in that contest must have a v/b/y/ll or an h.

keybored 10/28/2024|||
> We still have spelling contests at state and nation levels.

That’s horrible. ;)

adastra22 10/27/2024|||
Except in German this is not a problem. The idea that you could say/hear a word and not know how to spell it doesn't even make sense in that context.
throw_pm23 10/27/2024|||
Nah, it's common in German too. For example, the first parts of "Widerspruch" and "wiedersehen" are said/heard the same, so you just have to learn the spelling. Many, many other examples... Although on the scale of languages German is indeed closer to phonetic spelling than some others.
jbeninger 10/27/2024|||
But if you were asked to spell the words, you'd produce something close to what was expected, rather than drawing a blank. The question "how do you spell wiedersehen" contains in itself a lot of clues.

This feels more like "what's the Unicode character for 'full moon'?" I'd be able to recognize the result as correct, but if I don't know the answer, I just don't know.

(Of course, that goes too far in the other direction. I assume you can draw a few strokes to "get someone started" on a character and they'll pick it up, whereas most people wouldn't recognize the first half of a Unicode code point. As the grandparent poster said, it's an exotic problem that's hard to empathize with in phonetic languages)

e63f67dd-065b 10/27/2024|||
> I assume you can draw a few strokes to "get someone started" on a character and they'll pick it up

In my experience this is not actually the case; I can usually remember a few parts of the character but draw a blank on the rest. You can see the picture of the grocery list that for some characters he got basically half the character right but gave up on the other half (shrimp is the combination of 虫 and 下, you can see he remembered the first half).

I guess there's several levels of character amnesia here, from "I remember half the character" to "I have no clue but I'll recognise it".

naniwaduni 10/28/2024||
> In my experience this is not actually the case; I can usually remember a few parts of the character but draw a blank on the rest. You can see the picture of the grocery list that for some characters he got basically half the character right but gave up on the other half (shrimp is the combination of 虫 and 下, you can see he remembered the first half).

That one's just bizarre, since 虾 is also just the most intuitively obvious choice to form a substitute character if you do forget the right component. If anything, I don't think pinyin substitution is something you do unless you're a highly-educated computer user who deals regularly in Latin script. It's a striking "man bites dog" moment, but the one has been passed around since 2006 (cf. https://pinyin.info/readings/defrancis/chinese_writing_refor...) and is not, as far as I can tell, indicative of any particular trend. Discreet literacy outliers in jobs where you'd expect it are ... a thing in English too: https://www.bbc.com/news/stories-43700153

(It's honestly weirder to see someone write jiu菜 than 9菜 too.)

Suppafly 10/29/2024|||
> it's an exotic problem that's hard to empathize with in phonetic languages

This, it's honestly not helpful to pretend it's the same as misspelling a word in a phonetic language because it's not and it's not even a good analogy to begin to understand the issue.

adrian_b 10/27/2024|||
I do not think that the example is good. "Wider" and "wieder" have different meanings, even if they are probably derived from the same word.

I do not know if this is true in all the German dialects, but at least some pronounce "wider" with short "i" and "wieder with long "i", so they are easy to distinguish when heard (like the difference in English between "fill" and "feel").

English and German appear to have had a similar semantic evolution for this pair of words, because "wider" means "against", while "wieder" means "again", so in both cases a single word has evolved to cover these two different meanings and the variants have become differentiated in pronunciation too.

piojojo 10/27/2024||
I'm a native German speaker and i don't know of a dialect in Germany that would pronounce "Widerstand" with a short "i". Would you mind sharing which dialect you think of?

"ie" is always long. For "i" it depends on the splitting of the word, i think. I don't know if this is a concept in other languages too. I think the rule is that if the "i" is at a split, then it is long, but i'm not sure and there are always exceptions to every rule in German. Consider "Schnit|zel", "Lis|te" (short) vs "Bi|bel", "Wi|der|stand" (long).

adrian_b 10/27/2024||
I have worked for some time in Germany and this is how some coworkers pronounced it, but I do not know where were they from.

From what you say I assume that the literary pronunciation is also with long "i", which is good to know.

Perhaps they were influenced by the different spelling, because I have seen this phenomenon in other countries, where despite a mostly phonetic writing some words were spelled differently than pronounced, for etymological or other reasons, and then the pronunciation of those words by many people has shifted, matching the spelling and not the traditional pronunciation.

joshdavham 10/27/2024||||
> Except in German this is not a problem.

I don't know any German, so I can't comment on this, but I'll add that the concept of a spelling contest (like we have in English) wouldn't make sense in a lot of languages because the spelling of words are so obvious/consistent.

cetu86 10/28/2024||||
If you need an example of a germanic language that has a very regular phonetic spelling I think Dutch is a better example than German. German has a lot of idiosyncrasies, because the spelling tries to preserve the etymology of the words. In Dutch they don't bother trying to preserve a word's history, everything is written as it sounds. (With a handful easy to memorize exceptions)
Suppafly 10/29/2024||
>In Dutch they don't bother trying to preserve a word's history, everything is written as it sounds.

That explains why it looks like a 'silly' version of english to english speakers.

Ekaros 10/27/2024||||
Finnish does this very well too. There is only a few tricky parts, but in general the spelling and pronunciation match. And if they don't, obvious solution is to write as word is pronounced. Which is a drift, but I think it is more desirable way.
Tor3 10/29/2024||
And Italian. Way back in time I used to get phone calls (at work) from Italian customers not speaking English, and I could always write down what they said on the phone, and get it right. Always. I knew very little Italian at the time.
zelphirkalt 10/27/2024|||
I think Spanish is a much better example for write how you speak and still being correct, than German.
Xenoamorphous 10/27/2024|||
Hmmm not so literate people (or just children) make a lot of mistakes writing Spanish because a bunch of letters are pronounced the same. And then the “h” when not preceded by a “c” is silent, which causes issues.

What is true however is that of you learn the pronunciation rules you should be able to read a text correctly even if you have no clue what you’re saying. That’s not true of English for example.

Maken 10/28/2024|||
There are a couple of rules that cause confusion:

- b, v and sound like /b/, because v lost its original pronunciation and w was lent from other languages.

- h lost its sound and became silent (used to be a soft /f/).

- g can sometimes sound like /j/ (there was some pressure to remove these uses).

- x can be an /s/ at the start of a word (due to Greek ancestry).

Those are considered mistakes, but they do not change the pronunciation of the words.

For the concrete rules: https://www.rae.es/ortograf%C3%ADa/valores-fonol%C3%B3gicos-.... You can see the exceptions to "one letter, one sound" are very few.

umanwizard 10/28/2024||
Also, in Latin America (but not most of Spain) z and s are pronounced the same.
lomase 10/28/2024|||
H, V/B, LL/Y are problematic in Spanish and you have to memorize how each word is wrote down.
adastra22 10/27/2024|||
Is it? There is a lot of historical spelling in Spanish, though not as much as English. German, on with the other hand, has its spelling routinely updated every few years.
int_19h 10/28/2024|||
Spanish is indeed somewhat more phonemic in spelling, even though it's not as good as Finnish, Turkish, or Serbo-Croatian.

That said, Spanish and German are both so much better than English that the difference between them can be disregarded in this context. The irony is that almost any major European language (with the notable exception of French) has better spelling than English; and even French, although it's horribly overcomplicated, is more consistent when it comes to reading.

Maken 10/28/2024|||
The historical spelling is quite limited, and mostly for retro-compatibility. Spanish tries to be understandable across a large number of countries and rules must accommodate old Spanish dating from the XV century.
cubefox 10/27/2024|||
The article points out that, because Chinese is not a phonetic language, if you don't know how to write something, you might not be able to write it at all, while in phonetic languages you can always spell something that sounds the same. E.g. "snees" for "sneeze".
almaight 10/28/2024||
This is true in mainland China, but not in Taiwan. There, katakana is used as pinyin, and words can be composed by katakana.
vunderba 10/28/2024||
I'm not sure where you've got this from, but I lived in Taiwan for years - you might be thinking of 注音 which does look similar to kana but is distinct and unrelated.

And the authors article is referring to the writing of the logograph - 注音 is strictly for pronunciation.

cubefox 10/28/2024||
So the question is then whether writing "snees" instead of "sneeze" is like writing sneeze in pinyin (or in "注音"), or not.
timlatim 10/27/2024|||
Even the Chinese Character Heroes competition mentioned in the article seems a lot like the spelling bee in the US, doesn't it? I wonder if the anecdote about the PhD students has a cultural dimension in addition to language proficiency — could the students have refused not because they don't know the characters, but because they aren't fully confident they wouldn't make a mistake?
thinkingemote 10/28/2024||
I think the example of "ampersand" "&" is good.

We have a word describing what it is, we have a symbol of how it looks, and we have a word of how it is pronounced "and". We also tend not to write ampersands down by hand. Its a more common symbol. However unlike the treble clef the meaning and the pronunciation is the same so perhaps the example isn't as good?

derekhsu 10/28/2024||
Yes, I endorse his research. As a Taiwanese, I use Traditional Chinese daily in Taiwan, primarily on computers and mobile phones, but I can’t write many of the Chinese characters he mentioned in the article.

I can share a more personal story: after spending a year studying abroad in Britain, I almost forgot how to write Chinese characters—even my own name—since I hadn't written any for over a year! However, when I returned to Taiwan, I was able to recall most of them within minutes. I consider this a temporary phenomenon that fades quickly with focus and a bit of practice.

jim-jim-jim 10/27/2024||
The Heisig method, which recursively breaks down Chinese characters into patterns with arbitrary meanings, can help you sidestep this problem. You're never dealing with shapes anymore, but rather reconstructing stories from these stroke/meaning pairs. Since patterns consist of subpatterns, you can tweak the level of granularity until a sensible narrative emerges. Just recite that story as you move your pen.

It's a lifesaver as an adult foreign learner, but I don't really see anything preventing native writers of Chinese and Japanese from benefitting from this general process as well. I've wondered if the guys who pass those truly insane 6,000+ character exams have to fall back on some sort of hack at that point.

eloisius 10/27/2024|
As another adult Chinese learner, does something like the Heisig method really help with language acquisition or just memorizing characters? I’m skeptical because of the immense amount of time it takes to learn even without elaborate story construction for each character. I’ve kind of resigned to being a word processor idiot, and only memorizing characters in handwriting as a bi-product of usage.
mbivert 10/27/2024|||
It may be anecdotal, but I once involuntarily trained my memory by trying to recall what I've been doing each day. After a few weeks, my memory noticeably improved.

That's to say, the task may feel insurmontable at first, but if you give time to your body to adjust, it should become easier.

graeme 10/27/2024||
That's very interesting. Did the memory boost persist? If so do you do any maintenance exercises like the original ones or have you noticed other effects?
mbivert 10/27/2024||
It's difficult to say: I think shortly after this "incident", I started doing more maths/physics on my free time, which must have help this boost to persist.

Thinking about it, this recalls me of a Leonardo quote (pertaining visual memory then):

> « I myself have proved it to be of no small use, when in bed in the dark, to recall in fancy the external details of forms previously studied, or other noteworthy things conceived by subtle speculation; and this is certainly an admirable exercise, and useful for impressing things on the memory. »

I remember reading about similar observations regarding visual memory, where students trained to memorize visual information would outperform their peers (observations perhaps in the 1800s/early 1900s, IIRC a woman was in charge of this).

kevin_p 10/28/2024||||
It's designed for Japanese where it probably works better - pretty much all Kanji have multiple pronunciations that can be completely different to each other so it makes some amount of sense to ignore the sound and focus on the shape and the meaning. Much less relevant for Chinese where you can usually tie the character to a single sound and learn all three halves at the same time.
latentsea 10/28/2024||||
I did Heisig's Remembering The Kanji as my first step on my Japanese learning journey. It helped make the written language feel more accessible, and I took it on as a fun challenge, but in hindsight only learning to read is important and learning to write is mostly a waste of time. Learning to read is vital, but being a word processor idiot is how everyone is living their lives for the most part.
Tor3 10/29/2024||
As someone who switched to using a keyboard for writing at the age of 14.. yes. But at least I can write my own language by hand, if I have to, even though it looks horrible (always did). I can draw it instead of writing it, and it looks better (filling in forms and the like).

For Japanese I use a keyboard.. should be fine but.. no. Whenever we're at a table and need to jot something down on paper, or near a whiteboard, I feel like an illiterate person, because I can't write by hand. I can read stuff I can't write. Even hiragana. I never did enough practice. My wife writes down everything so easily.. and I can't. Writing romaji.. argh. I hate that I can't write, by hand, what I can read.

latentsea 10/31/2024||
I think hating that is kind of a choice. Like sure it's bothersome, but if you didn't grow up there and spend a decade in school doing a lot of writing by hand then the only substitute is to do a crap tonne of writing by hand as an adult. The investment in time for such a small pay off as filing out forms being less of a hassle likely isn't worth the full investment of time required. So, if you just accept it for what it is, then don't mind the extra time required when filing out forms, I guess it's not so bad.
adastra22 10/27/2024||||
You've got it backwards. The Heisig method is faster, and less work overall. It takes about 10 seconds or so to set an image in your mind of the scene for a character, then a handful of reviews over the following weeks. Then you never forget it.

Classical methods would have you drilling characters for hours upon hours of wasted time.

throwaway2037 10/28/2024|||

    > It takes about 10 seconds or so to set an image in your mind of the scene for a character, then a handful of reviews over the following weeks. Then you never forget it.
As someone with experience using the Heisig method, I would strongly disagree. Yes, it is a helpful system to ease the burden of memorization, but it does not permanently embed this knowledge into your memory after "a handful of reviews over the following weeks". If this were true, there would be many, many more people who have memorized 4000+ Chinese characters, required for fluency.
Tor3 10/29/2024|||
For me the Heisig method was a great help for learning the first few hundred kanji. Knowing those made it way easier to learn to read words containing said kanji, because the meaning of the kanji more often than not helped remembering what the word meant, and, importantly, recognizing the word.

But after a few hundred kanji the system starts to get so complicated, with more and more elaborate and contrived mnemonics, that IMO it isn't worth continuing that approach. But that's fine, at that point you're already on board and can learn the rest from reading real texts.

(The negative of abandoning Heisig after the first few hundred kanji is that you also stop writing kanji at that point.. which adds to my difficulty of hand-writing Japanese)

adastra22 10/28/2024|||
Well, "it worked for me." Similar reports for many people who have done fast-track Heisig speed runs, such as Heisig himself. It takes about 10 seconds of initial study to fix a story in mind (which is actually quite long--seriously count out 10 seconds slowly and imagine that time spent fully focused on the character at hand), and a review sequence that gets it in your long-term memory.

There are in fact many people who have learned 4000+ Chinese characters, using this or other methods.

eloisius 10/27/2024|||
I mean to say that my current method mostly omits hand writing. I can use a keyboard or phone to write and I can recognize characters fine. But on top of just learning a language, is Heisig so effective that I will be able to also memorize handwriting each character? Or do people measure how useful it is by being able to memorize the strokes for many characters, yet fail to become fluent in the language otherwise?
adastra22 10/27/2024|||
Heisig will have you learn handwriting, yes. Because that is explicitly all you practice in that method.

Aside: Do you learn simplified or traditional? I learned traditional. I would have anyway because my wife is Taiwanese, but I advocate others to do the same because it is arguably the same difficulty if not easier. And going traditional -> simplified is tractable whereas the reverse is not.

Learning the Heisig method is similar: learning from a perspective of handwriting is easier, and you get the ability to read “for free.” It’s a better approach, even if you never need to write by hand.

eloisius 10/27/2024||
Also traditional, and I agree it’s easier to remember them than simplified characters even if they are more complicated to write.
adastra22 10/27/2024||
So you have a reference point for this. When you learn a character by the Heisig method, you go from meaning -> writing. You don’t bother practicing reading -> meaning. It turns out that it’s very easy to go from writing to reading, much like going from traditional to simplified, but the reverse not so much.
jryb 10/28/2024|||
I'm using something similar to Heisig, and I can already tell that while I can list all of the radicals and components in a character, I have no memory of their relative positions. I'm also not trying to learn them and I only using a pinyin input method, but I can't imagine really needing to be able to write by hand ever.
adastra22 10/28/2024||
If you're not actually writing out the characters, you're not using Heisig. You wouldn't have that issue if you were actually writing them out. And it's not the wrote practice, it's the fact that you intrinsically must write one primitive at a time, which solidifies the order in the story. Most Heisig students end up developing slightly different primitive meanings for different placements, or an aspect of the story which controls the layout, for that reason.
cyberax 10/28/2024||||
It surely helps. Mnemonics are a good tool for memorization. You don't have to remember elaborate stories, just something like: "歌" - "older brother lacks singing". It certainly doesn't work with all characters, but it helps.
jim-jim-jim 10/28/2024||||
fwiw I studied Japanese, but I believe most of this still applies.

It's divide and conquer. When you are reading the book, you are indeed just learning the characters. It's a significant ~2-3 month investment that maybe doesn't make sense unless you plan on living and working in the country. But once you've gotten through it, it absolutely feeds back into vocab acquisition, since the characters are now completely unambiguous to you. Much like how Latin/Greek helps with English, you can also work out what entirely new words might mean if you are familiar with their characters.

latentsea 10/28/2024||
I did RTK. I also learned to read around 3k kanji. Turns out it wasn't at all necessary to learn to write that additional 1k Kanji in order to become able to read / distinguish it.

The time is better invested in simply studying how to distinguish visually similar characters. That alone solves the problem directly.

jim-jim-jim 10/28/2024||
I also did all three volumes and found the extra 1,000 to be a waste. Really polluted my Anki.

> The time is better invested in simply studying how to distinguish visually similar characters.

But you still have to know what you're distinguishing between, which might only arise after repeated mistakes. Heading off this frustration directly by studying characters may not have been the best use of my time in absolute terms, but it did wonders for my overall motivation and made me feel like I was doing more than treading water. Pre-Heisig I was reading specific books intended for foreign learners, while afterwards I was just reading the newspaper.

latentsea 10/28/2024||
It's somewhat a shame there isn't heavily curated Anki decks for doing what I call "disambiguation study" where you focus on cards that help you distinguish similar things from one another. It'd really speed things up.

>But you still have to know what you're distinguishing between, which might only arise after repeated mistakes.

I'm learning Korean at the moment and it's particular brutal for this IMHO. Some words have taken a long time to properly understand due to repeatedly mistaking them for very similar words, and there are a lot of these in Korean.

throwaway2037 10/28/2024|||

    > language acquisition or just memorizing characters
What is the difference, in the case of Chinese?
eloisius 10/28/2024||
Characters are not words. Most words are 2- or 4-character combinations. Individually they have fragments of meaning, so you may get a hint, but not enough to understand for sure unless you’ve learned the word.
NoInkling 10/29/2024||
In addition, the Heisig method doesn't teach "meanings" at all, rather "keywords", which in many cases are only kinda sorta vaguely related to their respective characters' (fragment of) meaning. Either way, they're mainly useful as an "anchor" or name to refer to the character by, and in many cases might as well be arbitrary. The real purpose is just to turn an initially indecipherable blob that is a character into something recognizable (and writable, if you care about that). In general, you're not learning meaning to a useful degree until you start seeing or hearing (when you can associate pronunciation) those characters in context, as you imply. Yeah sure, with Heisig alone you'll be able to see 鸡 and know that it (most likely) refers to chicken, but characters like that are a relatively small proportion.
kragen 10/27/2024||
If this seems oddly familiar, you may actually have read it before, and no plagiarism is involved. Moser wrote the beginning of this article in the middle of his classic essay, ”Why Chinese is So Damn Hard” https://pinyin.info/readings/texts/moser.html

> Can you imagine a well-educated native English speaker totally forgetting how to write a word like "knee" or "tin can"? Or even a rarely-seen word like "scabbard" or "ragamuffin"? I was once at a luncheon with three Ph.D. students in the Chinese Department at Peking University, all native Chinese (one from Hong Kong). I happened to have a cold that day, and was trying to write a brief note to a friend canceling an appointment that day. I found that I couldn't remember how to write the character 嚔, as in da penti 打喷嚔 "to sneeze". I asked my three friends how to write the character, and to my surprise, all three of them simply shrugged in sheepish embarrassment. Not one of them could correctly produce the character. Now, Peking University is usually considered the "Harvard of China". Can you imagine three Ph.D. students in English at Harvard forgetting how to write the English word "sneeze"??

This page is from 02004: http://web.archive.org/web/20040811151534/http://pinyin.info.... Possibly the rest of the article is not simply an excerpt from it.

gonzobonzo 10/27/2024||
嚔 is an unusual character, though. Not just in that it's not often used, but also because it's construction is atypical. It would be like Ph.D. students forgetting how to spell "onomatopoeia," which, yes, wouldn't be surprising.
metacritic12 10/28/2024|||
The sneeze example is contrived because in English, sneeze is both phonetic and a word with common occurence.

A better example might involve a common English word with a wierd, non-phonetic spelling. A word that you might imagine it forgivable for even someone who recieved an English PhD to misspell. After all Chinese is a seperate language from English and it is neccessary for it to be evaluated in its own context.

If you think this definetly couldn't happen in English, take a look again at this post -- for it contains eight outright, unambiguous, misspellings of common English words that I would not be surprised if even an English PhD from Harvard made on occassion, especially if your choice of three students were unlucky and they were having embarassingly bad days. (After all, English PhDs isn't the study of spelling, it's the study of literature).

addy34 10/28/2024|||
It may be contrived, but it still highlights the key difference.

Even if sneeze was a word that you were taught once in school and hadn't used for 30 years, you would still likely get close to the correct spelling from the sound (sneaze, snease, sneeze), and seeing the misspelling also helps with recall and to self correct.

This is the "virtual circle" of speaking/listening -> reading -> writing -> referred to by the author, which is not possible with Chinese.

It's true that there are some weird non-phonetic English words that PhDs would likely misspell, but it's not 100% of the language and you still could at least make an attempt.

It's possible to just write Chinese in phonetic form (e.g. pinyin), which bypasses this issue, but you have a secondary problem, which is the extremely narrow range of syllables (~400 * 4/5 tones = 1600-2000), resulting in quite ambiguous text.

bobbylarrybobby 10/28/2024|||
Pinyin should be approximately as ambiguous as the spoken language, i.e. not very (especially if whitespace is used to denote word boundaries)
Chathamization 10/28/2024||
Removing the added information would make it much more difficult to parse, though. Paragraphs don't exist in oral English - or spaces between words, quotation marks, capitalization, etc. - but we still find it much more easy to read properly formatted text than improperly formatted text.

Just because people are able to understand strict phonetic transcriptions, doesn't mean it's a good way to convey information (which is why almost no language relies on just strict phonetic transcriptions).

djtango 10/28/2024||||
As mentioned in another comment, English has its share of words like that too. For example I'm sure diarrhoea can catch people out.

And how many people drink an espresso every day and think it has an x in it.

I knew plenty of elite students who would make classic English blunders like "expresso" or "pacifically"

ndsipa_pomu 10/28/2024|||
> how many people drink an espresso every day and think it has an x in it.

Arguably, "espresso" isn't an english word, but spelling it with an "x" as "expresso" isn't as incorrect as you may think. There's two main theories behind which word to use: "espresso" meaning to "press out" the coffee, or "expresso" meaning "expressly made for the customer" as it's quicker to make than a filter coffee. This is further confused by the Latin root being "exprimire" meaning "to press or squeeze out".

https://www.merriam-webster.com/grammar/espresso-vs-expresso...

labster 10/28/2024||||
Are those blunders or accents? From my point of view you spelled diarrhea incorrectly, unlike how our lord and savior Noah Webster taught us.

Maybe language is fine if it conveys the intended meaning.

djtango 10/28/2024||
Yes a rose by any other name would smell as sweet and well... diarrhoea is diarrhea

My point was addressing "tsinghua students..." and "Harvard students..." unless they were literary scholars or grammarians their wield of the language may be at the level of "educated" but still plenty fallible. I'm sure those of us who did any post grad would have met people who were smart in a given axis and otherwise very ordinary along the other axes.

katbyte 10/28/2024||||
Except most people will get close enough for most other people to understand. English is rather flexible

Not ti mention spelling differences and even all the unique words in different English countries. Or within the uk

djtango 10/28/2024||
Well in other comments, native Chinese speakers brought up that when you forget how to write a character you just write a homonym and readers can guess by the context - which is how Chinese speech works anyway.

So it turns out that humans are rather flexible

kseistrup 10/28/2024|||
Or even more classic English blunders, like not being able to choose correctly between e.g. “their”, “there”, or “they're”.
dayjaby 10/28/2024|||
I don't think that Chinese people have problems knowing how to spell a character as different characters share the same pronunciation (more or less) if they have the same phonetic component[1]. So pinyin helps literally zero.

What is harder is to distinguish the meaning of all these characters. Let's take this set as an example: 里理哩鲤鯉俚娌悝锂鋰

Ok, they are all pronounced the same, but guessing or knowing all their meanings is a different game. "鲤" has to be a fish that's pronounced li3. That might still be easy, but the more abstract the meaning-giving character radical is, the harder it becomes to distinguish all of them.

[1] https://hanzicraft.com/lists/phonetic-sets

kaiwen1 10/28/2024||||
This is not analagous. The sound of every English word give clues – and often precise guidance - on how to write it, but the sound of a Chinese word typically offers no hint of how to write it. If you give me some obscure English word, say "persiflage", I might have no idea what it means, but I can probably spell it. But if you give a Chinese speaker 馘 (góu) in context, almost no one will be able to write it, even if they know the precise meaning ("to cut off the left ear of the slain").
nkrisc 10/28/2024||||
I would guess that most Harvard students could spell those misspelled words in your comment correctly if you asked them to directly. When reading we know what the word is supposed to be and we correct it in our mind to the intended spelling. If anything it’s a testament to the resiliency of phonetic alphabets - but more so it’s illustrative of how English pronunciation has deviated from spellings that were mostly set with the advent of printing in England. Most of the misspellings in your comment involve misplaced letters that are not directly pronounced.

Strictly in terms phonetics, why couldn’t “weird” be spelled “wierd” when English also has “tier”? I’m guessing the Normans are to thank for turning “wyrd” into “weird”.

erremerre 10/28/2024||||
Something like Wednesday or was it Wensday, Wendsday, Wednsday, Wedensday, Wednseday, Wednesdy, Wednesay, Wedsday, Wednseday, or, Wendseday?
galangalalgol 10/28/2024||
I vote for wedensday as it is the closest to the originating Woden's day.
defen 10/28/2024||||
Even with the misspellings it's obvious what words you meant. If someone forgets how to write "嚔" are they just missing a few strokes but it's obvious what they actually meant? Or do they have zero clue what it's supposed to look like?
Chathamization 10/28/2024|||
From what I've heard people say (and what I've seen), the most common way to handle it is to simply write another character that sounds the same. In other words, the characters can be used as phonetic elements when it's needed. It looks weird (in the same way that spelling words phonetically in English can look word), but it's doable.

That's for situations where they had to write something by hand but didn't have their phone with them to check (otherwise they can just spend a second to look at the character), which isn't a common occurrence.

addy34 10/28/2024|||
It depends, but it's not uncommon to completely forget the entire character. If you sort of remember it, then the muscle memory in your hands often helps to finish the character correctly once you start, at least that's what I've found and heard from others.
nkrisc 10/28/2024||||
Therein lies the resiliency of phonetic orthography: despite the misspellings the sound represented by the words did not significantly change thus most readers would never even notice.

If anything it’s a statement on how the orthography of English in particular doesn’t well match the phonetic structure of the language - something due to a confluence of factors in English several hundred years ago, including the rise of printing.

nkrisc 10/28/2024||
Woops, I thought I’d post this comment and it never posted.
specialist 10/28/2024||||
Remember the english -> pirate translator? Simularily, per your graf, a "spell wrecker" tool, witch introduces mispellings and other errs, could be amusing.
ClassyJacket 10/28/2024||||
Well done. I spotted weird on its own but I had to go looking for the others.
carlmr 10/28/2024||||
>It would be like Ph.D. students forgetting how to spell "onomatopoeia,"

A (native) PhD student forgetting how to spell onomatopeia might be normal, but 3 I would say is statistically unlikely.

unscaled 10/28/2024||||
I admit I would have liked it if English language Ph.D candidates had some basic working understanding of Ancient Greek, Latin and Old English to serve as a cornerstone for their studies, but that ship has sailed more than a century ago. If you know Greek, onomatopoeia is not that confusing anymore, and you'd know that if you want to be super-pedantic, the plural of octopus would be octopodes, but never octopi.

Then again, if you knew some Old English, you would know correcting everyone and forcing them to use the "original form" is a silly goose chase. You'd know better than to remind everyone that children is a pesky double plural, and childer (from the old English cildru, and like the German Kinder) is more than enough? Or to remind everyone that's it's not "a newt tail" but rather "an EWT tail" and please mind your ewts and a good day to you to sir.

Or maybe there will be some people who would still keep doing that, who am I fooling...

raincole 10/28/2024||||
The fact there are two variants of 嚔 (嚏 and 嚔) implies that even ancient chinese people found it difficult and they misspelled it.
labster 10/28/2024||||
I would honestly be astonished if a 4 English department PhD students who were all native speakers of English could all not spell onomatopoeia. I’d expect 2 of the 4 could in the absolute worst case.

Spelling is simply not as hard as remembering hanzi, even in English.

wraptile 10/28/2024|||
You can spell as onomatopia and people will understand you just fine. You can't do that in Chinese.
generic92034 10/27/2024|||
Would it not be possible that the three students did not want to embarrass the author by showing their knowledge? I sometimes get that behavior from Chinese and Japanese colleagues when I am supposed to know something but temporarily forgot it (or just cannot access it at the time).
Barrin92 10/27/2024|||
No, they really just forgot the characters. I lived in Japan for the better part of a decade and it isn't at all uncommon. There were many times when I explicitly asked natives to recognize or help me with a Kanji and they struggled as well. I doubt everyone's been systematically lying when you genuinely ask for help.

The reason is just digital input really. Pinyin and Romaji typing have become so common that a lot of people write Hanzi/Kanji by hand less and less and it's so complicated of a skill there's really no other way to get it in your brain other than practice. I even notice it myself, I easily recognize 10x more characters than I can accurately write.

jasonhong 10/27/2024||||
I've seen my wife (Chinese) and her friends (also Chinese) have this same problem with the exact same word "sneeze", so I'm inclined to believe the author.
Chathamization 10/28/2024|||
The fact that 嚔/sneeze is usually the go to example means it ends up becoming the exception that proves the rule. Most other characters are much more easily remembered.
adrianN 10/28/2024||
If you learn languages you quickly notice that knowing the 99% most frequent words still means that you need to look up a word every paragraph or two and that you have trouble expressing yourself. To write Chinese you need to know several thousand characters, forgetting just a few dozen can be quite annoying when you try to write nontrivial texts.
Chathamization 10/28/2024||
Being able to write a character by hand, being able to type it up, and being able to read it are all different things. I doubt many Chinese would be thrown off from reading or typing 打喷嚏.

I actually did a deep dive into the issue of unfamiliar characters coming up when reading, and how people handle them. I won't go into all the details, but the general takeaway is:

1. Unfamiliar characters can actually be quite rare or quite common depending on the material you're reading.

2. It's not much of an issue for people either way.

adrianN 10/28/2024||
Of course, when I used the word „write“ I meant writing by hand.
nojs 10/27/2024|||
Yeah, this particular character seems to cause people problems because it’s not really used anywhere else.
shrimptho 10/28/2024||
I think the shrimp meat example from the researcher daily notes was a bigger tell of the issue.

Because shrimp meat is something I see written out EVERYWHERE.

Chathamization 10/28/2024||
The shrimp example is kind of strange. Like you said, it's an extremely common character, and not a difficult one either. But beyond that, if you look at it he got the radical, 虫, correct. The phonetic element, 下, is a fundamental character that I doubt anyone forgets to write.

It just seems like such an odd outlier example. Like talking about a friend that spells "been" as "bin." I'm sure it could happen, but it's not indicative of a broader trend.

The story was reported by Victor Mair, though, who is extremely opposed to using characters and often exaggerates the issues with them.

Personally, I've seen a lot of Chinese people's written notes, and I don't think I've ever seen them resort to pinyin, even among people that didn't go to college. I just asked a few Chinese friends about this, and they told me they never resort to pinyin either.

int_19h 10/28/2024||
A native English speaker wouldn't have trouble with "been" vs "bin" since these are two different vowel phonemes.
allen_fisher 10/28/2024||||
As a Chinese native speaker, I should admit that I forgot how to write them ("sneeze" characters 喷嚏) before I saw them just now. One of the reason, I think, is they are used quite often in oral Chinese, but rarely in written Chinese. And those characters are not easy to write, as you see.
lobochrome 10/28/2024|||
Happens all the time to Japanese people, too. There's a core set of characters that are well remembered, then there is a set of characters that you can remember most strokes of (so that someone else can read it) and then there's the stuff where you just have no idea.

Of course, you _can_ escape to Hiragana if you're so inclined, but then you would show that you don't know the character - so it's just avoided.

Ambiguity is king in Japan.

MichaelZuo 10/27/2024|||
That’s a pretty good point… if I saw ‘three Ph.D. students in English at Harvard’ forget how to write ‘sneeze’, I would assume they were lunatics pretending, not that they genuinely forgot how to write the word.

Does that imply learning the advanced literary culture, that is usually associated with prestige academia, has a vastly higher threshold in Chinese than in English?

It’s pretty disturbing for language itself to be a potentially retarding force on learning.

djtango 10/28/2024|||
A better example than sneeze might be diarrhoea. In Chinese this is very easy - 拉肚子 - but I'm sure there are English speakers who might forget the spelling for diarrhoea if they're having a bad day.
ralfd 10/28/2024||
Does English not have a native word? It would be „Durchfall“ in German, and the Greek word only be used if one wanted to be fancy.
umanwizard 10/28/2024|||
English speakers perceive it as a native word, not a Greek one.

There are lots of examples of this, where English has a foreign-sounding word for something whereas German has a Germanic one. For example oxygen vs. Sauerstoff.

jan_Inkepa 10/28/2024|||
It just has, uh, diarroeha...yeah I can't spell it either
shrimptho 10/28/2024|||
All three PhD's were perfectly capable of communicating the word for sneeze and also recognize it in writting. They just couldn't write it exactly.

I don't think it has a slowing effect. Except maybe by adding annoying/useless classes for mid/primary students - which is just par for course everywhere else. I can name 3 objectively completely useless classes from my european youth (plus one in college) that were only there because 'culture'.

MichaelZuo 10/28/2024||
How does your assessment of the relative usefulness of ‘classes’ from youth relate to the possible existence of a retarding effect arising from language differences?
capitainenemo 10/28/2024|||
Having read it, the rest of the article was new.
kragen 10/28/2024||
Thanks!
jkbach 10/28/2024|||
Thank you for this. I checked the date on the article twice and still couldn’t shake the strange feeling of deja vu.
kens 10/27/2024||
The article lumps together writing characters slightly incorrectly and failing to come up with the character at all. For instance, the game show contestant wrote the word "烹" with one extra stroke, while the writer of the shopping list writer gave up entirely on the characters for "egg" and "chives". (This is analogous to the difference between misspelling an English word and not being able to think of the word at all.) In the story of three PhD students who couldn't write the characters for "sneeze" (打喷嚏), it's entirely unclear if they were completely stuck, or if they just made small mistakes.

My question is if "character amnesia" describes trivial errors or if people are forgetting characters to a significant extent. In other words, is this article genuine or is it the equivalent of claiming English writers suffer from "word amnesia" because they sometimes need to look up a spelling?

mitthrowaway2 10/27/2024||
> In the story of three PhD students who couldn't write the characters for "sneeze" it's entirely unclear if they were completely stuck, or if they just made small mistakes.

The article says "all three simply shrugged in sheepish embarrassment", which in context, reads clearly to me that they were embarrassed for not knowing where to start, not that they wrote them down with minor errors that they were embarrassed about after checking a dictionary. The rest of the article strongly reinforces this interpretation. For example: "‘lift the pen, forget the character’... this new digitally induced amnesia is not merely a matter of forgetting a few strokes in a rare character..."

animal_spirits 10/27/2024|||
Further in the article he discusses the 'virtuous cycle'; connections between writing, speaking, and reading. With phonetic alphabets, the way something is spoken reinforces the way that it is read, the way it is read reinforces the way that it is written, the way that it is written reinforces the way that it is spoken (for the most part, even English fails this cycle sometimes). However, with character sets the cycle is broken, and the speaker has to learn 3 different memorization techniques.

Regarding your question, there is a difference between not knowing how to write a word and not knowing how to spell a word. If someone in English doesn't know how to spell 'sneeze' or any other word, they can at least come close enough and convey information 'fuzzily' via text using an incorrect spelling. Now that I'm writing this, though, I suppose with character sets like Chinese if you know characters that are close enough you potentially could use other characters to convey the information, like mouth-fart for sneeze or something along these lines. But I don't speak the language, so that is just a theory.

Do Chinese speakers use the language this way if there is a character they don't know how to write?

tdeck 10/27/2024||
> Do Chinese speakers use the language this way if there is a character they don't know how to write?

I'm by no means an expert on the topic but one thing I have noticed in learning Chinese languages is that there are a huge number of homophones. That means there are probably 20 other characters with the same pronunciation for any given syllable that are considered different words (not to get into it here but the conception of a word in Chinese languages can be a bit odd too). It seems to be very common for people to use the character for a similar sounding word or syllable to write slang words or local dialect words that don't have an official character.

adastra22 10/27/2024|||
In the example at the start of the article, 烹, the contestant wrote "child" when the proper grapheme was "complete." That's a non-trivial error, even if it is only one stroke difference.
ben_w 10/27/2024||
> That's a non-trivial error, even if it is only one stroke difference.

As is the difference between "fitter" and "filter" in English.

Chathamization 10/28/2024|||
Even less of a difference. There's no equivalent character, so it's clearly 烹 written with an extra stroke. It's more like the difference between writing "deceive" and decieve."
adastra22 10/27/2024||||
I don’t think so, no. English is phonetic and so some of those spelling errors can come down to your dialect pronouncing the word differently and forgetting the official spelling. That’s now how Chinese characters work though. It tells a story, and a story with “child” instead of “complete” is a vastly different story.
yongjik 10/27/2024||
Well, but as far as I can tell there is no legitimate character created by replacing complete(了) by child(子) in 烹. It does not matter what "story" it tries to tell, if someone writes "boild eggs" I'm not going to conjure up a hypothetical German etymology that might have led to "boild" that later got borrowed into English; I will understand it as a typo of "boiled".
riskable 10/27/2024|||
The equivalent in English would be writing "ghyche" when trying to write the word, "fish". Yeah, that combination of characters/marks can make sounds equivalent to "f" "ih" and "sh" but it's so far off it's laughable.
bloppe 10/27/2024||
> This is analogous to the difference between misspelling an English word and not being able to think of the word at all.

Part of the distinction is that you can always at least misspell the word when using an alphabet. That's why the shopping list used an alphabetic script.

ilaksh 10/27/2024|
If people normally enter characters in their phone or computer rather than actually handwriting, then there is no reason to keep remembering all of the character details.

Just like it doesn't matter that I frequently am uncertain about spelling for some words because we have spell check built into everything.

Do Chinese usually use pinyin to enter characters or what is the normal method? Whatever it is, they don't need to remember the character strokes apparently.

nneonneo 10/27/2024||
Some older folks, particularly those with regional accents or less Pinyin education, stick with handwriting input (which was surprisingly good even ~20 years ago) - drawing the characters with a stylus or a finger.

Most folks these days use Pinyin. The T9 input method (Pinyin, but using a nine-key telephone pad) is popular with folks who grew up using dumbphones.

Finally, voice input is really popular in China. Lots of folks send texts (on WeChat) as short voice messages. WeChat even has a feature to auto-transcribe these voice messages.

euroderf 10/27/2024||
But Pinyin is based on pronunciation, which varies across China. So Pinyin is based on Putonghua ? So Pinyin reinforces the role of putonghua as a project of national unification ? And the pre-eminent role of Beijing ?
throw-the-towel 10/27/2024||
Exactly.
joshdavham 10/27/2024|||
> Do Chinese usually use pinyin to enter characters or what is the normal method?

In mainland China, it’s pinyin, but in Taiwan, they often use Bopomofo: https://en.m.wikipedia.org/wiki/Bopomofo

bonzini 10/27/2024||
Either way it's phonetic; it's not based on either the radicals or the strokes. Even though there are input methods that use those, they are not commonly used.
robjan 10/27/2024||
There are other input methods such as Cangjie or Sucheng which are also popular in Taiwan and Hong Kong. They have much faster WPM since they are based on the structure of the characters. Pre-smartphone there was a Q9 input method with one stroke mapped to each of 8 numbers and a wildcard for if you forgot.
xyzsparetimexyz 10/28/2024||
I doubt that they're faster than pinyin. With pinyin you can type e.g. 'wmdl' and instantly enter '我们到了'.
bigstrat2003 10/27/2024||
> Just like it doesn't matter that I frequently am uncertain about spelling for some words because we have spell check built into everything.

But it does matter that you don't know the spelling for some words. For one, you don't always have a computer to fix it for you. For two, the computer's spelling is often wrong and you need the human knowledge to fill that gap.

e63f67dd-065b 10/27/2024|||
The interesting thing here is that nobody writes chinese one character at a time with pinyin; you almost always type out an entire phrase in pinyin, and usually there's only one meaningfully correct combination of those sounds in terms of characters and meaning (that's how the listener can tell what you're saying, after all, when listening) which will be the first choice in your input software (input software traditionally gives you a first choice with what it thinks the entire phrase is, and gets shorter with the 2nd choice onwards to partially match a phrase.)

The problem is not that people can't recognise the words; it's that we can't write them if given pen a paper. If the phone gets it wrong you just choose the nth choice instead of the first.

anal_reactor 10/27/2024|||
> For one, you don't always have a computer to fix it for you.

You can't be serious. I thought that the "you need to be good at algebra because you won't have a calculator in your pocket" argument died out naturally. I can't recall the last time I actually made hand-written notes.

> For two, the computer's spelling is often wrong and you need the human knowledge to fill that gap.

It happens to me once a month that the computer cannot recognize a correctly spelled word, and virtually never in English.

bigstrat2003 10/27/2024||
> You can't be serious. I thought that the "you need to be good at algebra because you won't have a calculator in your pocket" argument died out naturally. I can't recall the last time I actually made hand-written notes.

Then you are an extreme outlier. Most people write plenty of things down by hand in their day to day life. And there's no reason for the argument to die out, because it's completely correct.

> It happens to me once a month that the computer cannot recognize a correctly spelled word, and virtually never in English.

Again, this seems like an outlier. False positives and false negatives are both quite common in spell checkers.

anal_reactor 10/28/2024||
> Then you are an extreme outlier.

If that were the case then the problem described in the article would be limited to just a few outliers like me. But it's not.

> Again, this seems like an outlier.

No u. The fact that you are on this website means that you most likely are educated, therefore you are likely to use uncommon words unknown to the spellchecker. Most people focus on just a handful of basic words needed for everyday life, and besides that, very few people actually care about correct spelling. All of my friends have at least college degree and 50% of them pay zero attention to correct spelling, I can only assume that average Joe cares even less.

More comments...