http://qs1969.pair.com?node_id=11133786


in reply to Re: How to count the vocabulary of an author?
in thread How to count the vocabulary of an author?

"Edit: I'm pretty sure if you include the huge vocabulary of curse words he must have known (due to him being a german and loosing two world wars), i'm pretty sure there are a lot more than 500 words he knew."

That's a very stupid comment.

  • Comment on Re^2: How to count the vocabulary of an author?

Replies are listed 'Best First'.
Re^3: How to count the vocabulary of an author?
by cavac (Parson) on Jun 11, 2021 at 14:16 UTC

    Why? Those "official" counts only count the written word. We often use a different vocabulary when communicating verbally. Some authors intentionally limit their written vocabulary to make their works accessible to a broader public.

    Cursing and other emotional expressions can often emphasize specific meanings of things said. The written equivalent in modern internet terms would be emoticons. Insofar as the Unicode consortium is concerned, these count as written expressions that are meaningful to the context of the discussion.

    It also depends on the cultural context and the circumstances as to when and what forms of cursing are acceptable or sometimes even required in a conversation. English speakers, and especially people in the United States, are much more prudent, compared to some other cultures. There are many groups out there that look to as outsider if you don't use a very frank and curseword-ridden way to talk to them. So, if you want to be accepted as equal (for example, because you need them on your project), you better have to learn their way of communicating - there shorthands, their curses, whathaveyou.

    It's a bit like driving a vehicle. You have areas in the world where everything is very regulated and everyone keeps to the rules. Than you have the seemingly chaotic i-honked-first-so-i-go-first way it works in other areas of the world. If you go there and rent a car, you better learn their ways and honk that horn.

    It's the same way with politicians. They may have voters from different cultural regions and context. So politicians better all those different ways their voters communicate and adapt when visiting the region. "I am one of you" is a big vote seller. But this might not reflect in a politicians writing and official speeches.

    As for Adenauer, a lot of his potential voters were soldiers and people from many different cultural groups. I'm pretty sure he took the time to adapt his vocabulary when meeting with local groups. But as i said, this might not reflect in his publications, as they were for a much broader public.

    Edit: Also take time to watch the Tom Scott video "What counts as a word?"

    perl -e 'use Crypt::Digest::SHA256 qw[sha256_hex]; print substr(sha256_hex("the Answer To Life, The Universe And Everything"), 6, 2), "\n";'
      Would be interesting to know how many new cursewords were introduced in areas like the Philippines and Cuba after being "liberated" by the US ...

        Don't forget the Cuban Missile Crisis. Not only had the cuban people a hard time, there were also a lot of russian soldiers there to teach them new things, especially while they had to take the missiles back down.

        And the day "Bay of Pigs" essentialy became a curse in the U.S.

        perl -e 'use Crypt::Digest::SHA256 qw[sha256_hex]; print substr(sha256_hex("the Answer To Life, The Universe And Everything"), 6, 2), "\n";'