Is it possible to pad integers with zeros using regular expressions?

Shriram V picture Shriram V · Jun 25, 2010 · Viewed 19.5k times · Source

I have a series of numbers of different lengths (varying from 1 to 6 digits) within some text. I want to equalize the lenghts of all these numbers by padding shorter numbers by zeros.

E.g. The following 4 lines -

A1:11
A2:112
A3:223333
A4:1333
A5:19333
A6:4

Should become padded integers

A1:000011
A2:000112
A3:223333
A4:001333
A5:019333
A6:000004

I am using "sed" and the following combersome expression:

sed -e 's/:\([0-9]\{1\}\)\>/:00000\1/' \
    -e 's/:\([0-9]\{2\}\)\>/:0000\1/' \
    -e 's/:\([0-9]\{3\}\)\>/:000\1/' \
    -e 's/:\([0-9]\{4\}\)\>/:00\1/' \
    -e 's/:\([0-9]\{5\}\)\>/:0\1/'

Is it possible to do this in a better expression than this?

Answer

Mark Byers picture Mark Byers · Jun 25, 2010

You can pad it with too many zeros and then keep only the last six digits:

sed -e 's/:/:00000/;s/:0*\([0-9]\{6,\}\)$/:\1/'

Result:

A1:000011
A2:000112
A3:223333
A4:001333
A5:019333
A6:000004

It might be better to use awk though:

awk -F: '{ printf("%s:%06d\n", $1, $2) }'