The reason I don't want to use regexes here is that something like '23' can occur either within a date or as a 'bare' number. If I insert markup tags the way you propose, the number pattern also matches inside text that has already been tagged as a date, so I end up with <date>Oct. <number>23</number></date> rather than <date>Oct. 23</date>.
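To make the problem concrete, here is a minimal sketch of the nesting. The `DATE` and `NUMBER` patterns and the `tag_sequentially` helper are made up for illustration and are not the patterns from the proposal; the sketch only assumes that each kind of entity is tagged in its own substitution pass.

```python
import re

# Hypothetical patterns, for illustration only.
DATE = re.compile(
    r"\b(?:Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)\. \d{1,2}\b"
)
NUMBER = re.compile(r"\b\d+\b")


def tag_sequentially(text):
    """Tag dates first, then numbers, each in a separate pass."""
    text = DATE.sub(lambda m: f"<date>{m.group(0)}</date>", text)
    # The second pass has no idea that '23' already sits inside <date>...</date>.
    text = NUMBER.sub(lambda m: f"<number>{m.group(0)}</number>", text)
    return text


print(tag_sequentially("Oct. 23"))
# <date>Oct. <number>23</number></date>
```

Swapping the order of the two passes does not help in this sketch: the number would be tagged first, and the date pattern would then no longer match the text at all.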