Discipulus has asked for the wisdom of the Perl Monks concerning the following question:

Hello there!

A special friend of mine is tasked to spot 700 neologisms intruduced in German during last ten years. Being Perl so good at data and language processing, being this community populated also by some linguist and a bounch of german ones I hope I can get some useful tip from your part.

In detail, to be helpful in their research using Perl I'd like to be aware of:

I supposed that ideally if I can access a decent corpus of last ten years and decent vocabulary issued 10 years ago I can be able to spit out some difference. Ok no so trivial but feasible.

As last option if 10 german monks want to put here 70 different neologism each one.. I'm done :)

L*

There are no rules, there are no thumbs..
Reinvent the wheel, then learn The Wheel; may be one day you reinvent one of THE WHEELS.

Replies are listed 'Best First'.
Re: [OT] German language resources for neologisms
by haj (Vicar) on Jan 10, 2025 at 10:19 UTC

    A free online resource is the Neologismenwörterbuch, authored by Leibniz-Institut für Deutsche Sprache.

    They have a List of words which they are monitoring currently. I guess some of those might be forgotten soon, but some will stick.

      Thanks haj,

      seems a good resource!

      ..I forgot to ask for some resource with a public API, this could involve me as Perl hacker :)

      L*

      There are no rules, there are no thumbs..
      Reinvent the wheel, then learn The Wheel; may be one day you reinvent one of THE WHEELS.
Re: [OT] German language resources for neologisms
by karlgoethebier (Abbot) on Jan 10, 2025 at 12:02 UTC
Re: [OT] German language resources for neologisms
by LanX (Saint) on Jan 10, 2025 at 12:03 UTC
    3 fairly up to date authoritative sources come to mind
    • duden.de
    • Wiktionary ¹
    • Owid.de

    For instance the entry for Flugscham ° is referencing the other two.

    Regarding downloading corpora or API I dunno. You have to check there.

    At least the OWID is offering a corpora search.

    Cheers Rolf
    (addicted to the Perl Programming Language :)
    see Wikisyntax for the Monastery

    °) shame of flying (due to the environmental impact of aviation)

    ¹) OK probably rather open than authoritative