Top
Best
New

Posted by abnry 6 days ago

How many of the 170k English words do you know?(vocabowl-870366514258.us-west1.run.app)
501 points | 554 commentspage 5
55555 5 days ago|
I am building in the language learning sector, and this test is almost certainly not accurate (depending on what you want to measure). It's fun and cool though. But basically this is all based on a frequency list, which itself depends on the corpus. I have not been able to find a good corpus of English which is representative of modern spoken English. Spoken english depends on your age range and subculture and and changes every few years. Example: https://observablehq.com/@yurivish/words

Most of the corpuses I've found heavily over-represent newspaper articles and books, obviously. So the frequency ranking is biased towards academic/crime/geopolitics, not spoken english. But even then, it depends what you most commonly speak about!

There's no better way to do it, though. I'm just providing context.

jrrv 6 days ago||
Presumably it's a random batch of words since you can run the test again. I wonder how much the word selection affects the outcome. I got 66,750 with 20/20/15/17/14.

I'm curious how the difficult is chosen because "obfuscate" was included in the hardest difficulty but I would not consider that to me a difficult word.

Also I found that some of the definitions were not completely correct.

rhdunn 6 days ago|
It could be based on things like word frequency. I'd expect obfuscate/obfuscation to be less common outside of programming and RPGs (Vampire the Masquerade).
over190bpm 5 days ago||
I could actually get almost all of the last third correctly by choosing the option that's the longest, has a semicolon, or a coma.

Aside from that, I didn't like that most of the words only had one or at most two definitions that sounded viable.

A lot of these words have either Latin or Greek origins, for most questions you can deduce the correct answer by asking the question: "Which of these would make sense to develop into a separate word through the mostly non-modern history of the language?".

I would enjoy it way more if all four options sounded equally viable, and I couldn't deduce the correct answer without actually being sure about the meaning of the word. I understand that coming up with choices like that for each question is way harder if you actually validate all of them manually.

I got a score of 76000 best estimate with 85 being correct, even though English is not my native language and I'm not that good at it.

throwaway27448 5 days ago||
Would other people define "complacent" as "Smug satisfaction with oneself"? I'm not so sure.

Regardless, this was fun.

ryanar 5 days ago|
yeah I was pretty confused to the answer of that one, I picked it because it was the closest thing that made sense.
teleforce 4 days ago||
Fun facts one third of English words are French words [1].

I was once reading a posted notice in French, and can mostly understand it although I don't speak French.

I suspect most of the modern English words are not even Old English. This makes learning English very difficult [2].

In this regard English is very much like C++. Once you know how to use it can becomes a very useful tools and utilities.

[1] Is English just badly pronounced French?

https://youtu.be/TUL29y0vJ8Q

[2] 10 Reasons English is Ridiculously Hard:

https://youtu.be/DrlX-L4o2KM

thimabi 5 days ago||
I got 68,900 words, with the vast majority of the errors being on the grandmaster level.

As a non-native English speaker, I found that result pretty good! Though being a native Portuguese speaker certainly helped me as many difficult words in English borrow from Latin, and in Portuguese the Latin influence is more pronounced.

waterpowder 6 days ago||
69,250 (91/100) - I think being French helped a lot for the most complex words, as they're basically the same!
RugnirViking 5 days ago||
The harder words are trivia questions an educated English native could get. What I mean by this is they're all words that you'd have a chance of knowing for a reason. Things like defenestrate, antidisestablishmentarianism, hippopotomonstrosesquipedaliophobia. I know these words, but these are not words I know because I've ever had cause to use them. Words can get way harder than this and still be actually used, and not strictly only in a scientific sense. I'm thinking things like "Ginnel" (narrow passage between houses) or "Vamp" (a part of a shoe) or "Moraines" (hilly landscape formed by glaciers) or "Lea" (land used to pasture animals)
sceptic123 6 days ago||
Yarborough is _also_ an English town so I should have got one more
extra88 6 days ago|
Same. Also, proper names should be excluded entirely; the only "Advanced" one I got wrong was a place name.
miki123211 5 days ago|
Non-native speaker of English here, got 81k. Mostly with intelligence, not language skills.

Once you figure out the pattern of "one answer sounds like the requested word, two are opposites, one is unrelated", the test suddenly becomes easy. Not all questions follow that pattern, but many of them do.

Sometimes there are two or three answers that sound like the question, sometimes a word that is clearly an adjective relating to a person (ending in -us) has non-adjective definitions. I don't think there's even a single question where more than two of the answers make sense, even if you've never heard the word before. That leaves very little room for mistakes.

olliepro 5 days ago|
This is the classic pattern of LLM generated MCQs.
More comments...