href\s*=\s*[\"\'] [^\"\']+ [\"\']\s*>   # first href (not captured)
\s*([^<>]+?)\s*                         # text inside first (captured)
####
\s*(.+?)\s*                             # non-greedy capturing of everything till the next
##
\s*(.+?)\s*                             # non-greedy capturing of everything till the next
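
As a rough illustration of how the first two commented pieces of the pattern above behave, here is a minimal sketch in Python. It assumes the pattern is meant for an extended (verbose) regex mode, where unescaped whitespace and `# ...` comments are ignored. The `</a>` terminator, the `link_texts` helper, and the sample HTML are hypothetical additions for the demo, not part of the original pattern.

```python
import re

# Assumption: verbose mode, so spaces between tokens and "# ..." comments
# inside the pattern are ignored by the regex engine.
# Only the first two documented pieces of the pattern are used here.
LINK_TEXT = re.compile(
    r"""
    href\s*=\s*["'] [^"']+ ["']\s*>   # first href (not captured)
    \s*([^<>]+?)\s*                   # text inside first (captured)
    </a>                              # hypothetical: closing tag ends the link text
    """,
    re.VERBOSE,
)


def link_texts(html):
    """Return the trimmed text of every link matched by LINK_TEXT."""
    return LINK_TEXT.findall(html)


# Hypothetical sample input with uneven spacing and both quote styles.
sample = '<a href = "http://example.com/a" >  First link </a> tail <a href=\'/b\'>Second</a>'
print(link_texts(sample))
```

The `\s*` on either side of the lazy group `([^<>]+?)` is what trims the surrounding whitespace from the captured text. Because the group is non-greedy, it stops at the first point where the rest of the pattern can match.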