
Splitting language text into syllables

by programmingzeal (Sexton)
on Jan 29, 2022 at 11:22 UTC

programmingzeal has asked for the wisdom of the Perl Monks concerning the following question:

My goal is similar to the one in the post linked above. That post is pretty old, so there must have been new developments in syllabification techniques since then. I want to split any text in Ukrainian, Russian and English into syllables using Perl. What approach should I use to achieve this? Are there any libraries available, or do I have to do a dictionary lookup? Also, is hyphenation alone enough for syllabification?

Replies are listed 'Best First'.
Re: Splitting language text into syllables
by eyepopslikeamosquito (Bishop) on Jan 29, 2022 at 13:59 UTC

    Just thought I should clarify that this new root-level post seems to be a continuation of two recent necropost replies made on Jan 25, 2022 to an old Perl question, Splitting text into syllables, asked by crenz in 2003. Note that the inimitable Bod made one of these necropost replies at Re: Splitting text into syllables.

    Hope this clarifies more than it confuses. :)

Re: Splitting language text into syllables
by talexb (Chancellor) on Jan 29, 2022 at 17:13 UTC

    A quick Google search turned up this Hyphenation Dictionary Download; I would probably take that information and build it into a data structure that could be frozen and thawed as necessary. Or you could put it into an SQLite database.
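
    A minimal sketch of the freeze/thaw idea, using the core Storable module. The input format here (one word per line, break points marked with "-") is an assumption for illustration, not the actual format of the linked download, and build_table and split_word are hypothetical helper names.

```perl
use strict;
use warnings;
use Storable qw(freeze thaw);

# ASSUMPTION: one entry per line, break points marked with "-".
# The real dictionary file may use a different layout.
my @lines = ('hy-phen-a-tion', 'syl-la-ble', 'dic-tion-ary');

# Build a lookup table: plain word => array ref of its parts.
sub build_table {
    my @entries = @_;
    my %table;
    for my $entry (@entries) {
        my @parts = split /-/, $entry;
        $table{ join '', @parts } = \@parts;
    }
    return \%table;
}

# Look a word up (case-insensitively); returns an empty list if unknown.
sub split_word {
    my ($table, $word) = @_;
    my $parts = $table->{ lc $word };
    return $parts ? @$parts : ();
}

my $table  = build_table(@lines);
my $frozen = freeze($table);    # serialise the structure to a string
my $copy   = thaw($frozen);     # restore it later

print join('-', split_word($copy, 'Syllable')), "\n";
```

    To keep the table on disk between runs, Storable's store and retrieve functions work on a file path instead of a string. Words missing from the dictionary simply return nothing here, so a real solution would still need a fallback for them.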

    Alex / talexb / Toronto

    Thanks PJ. We owe you so much. Groklaw -- RIP -- 2003 to 2013.

Re: Splitting language text into syllables
by perlfan (Vicar) on Feb 04, 2022 at 05:49 UTC

    Search CPAN for "syllable" and you get modules like Lingua::Phonology and Lingua::EN::Syllable. Generally speaking, the "Lingua" namespace is for text and natural language processing.
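
    As a rough illustration of one of these modules: Lingua::EN::Syllable provides a syllable() function that estimates how many syllables an English word has. It does not return the split itself and does not cover Ukrainian or Russian, so treat this as a sketch of what is on offer rather than a full answer. The module is not in core, so the sketch loads it only if installed; estimate_syllables is a hypothetical wrapper name.

```perl
use strict;
use warnings;

# Lingua::EN::Syllable is a CPAN module, not core: load it only if present.
my $have_module = eval { require Lingua::EN::Syllable; 1 };

# Hypothetical wrapper: estimated syllable count, or undef without the module.
sub estimate_syllables {
    my ($word) = @_;
    return unless $have_module;
    return Lingua::EN::Syllable::syllable($word);
}

for my $word (qw(monk monastery syllabification)) {
    my $n = estimate_syllables($word);
    printf "%-16s %s\n", $word, defined $n ? $n : 'n/a (module not installed)';
}
```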
