Regular expression - any text to URL friendly one

Paul picture Paul · Oct 29, 2010 · Viewed 14.5k times · Source

PHP regular expression script to remove anything that is not a alphabetical letter or number 0 to 9 and replace space to a hyphen - change to lowercase make sure there is only one hyphen - between words no -- or --- etc.

For example:

Example: The quick brown fox jumped Result: the-quick-brown-fox-jumped

Example: The quick brown fox jumped! Result: the-quick-brown-fox-jumped

Example: The quick brown fox - jumped! Result: the-quick-brown-fox-jumped

Example: The quick ~`!@#$%^ &*()_+= ------- brown {}|][ :"'; <>?.,/ fox - jumped! Result: the-quick-brown-fox-jumped

Example: The quick 1234567890 ~`!@#$%^ &*()_+= ------- brown {}|][ :"'; <>?.,/ fox - jumped! Result: the-quick-1234567890-brown-fox-jumped


Anybody have idea for the regular expression?

Thanks!

Answer

Gumbo picture Gumbo · Oct 29, 2010

Since you seem to want all sequences of non-alphanumeric characters being replaced by a single hyphen, you can use this:

$str = preg_replace('/[^a-zA-Z0-9]+/', '-', $str);

But this can result in leading or trailing hyphens that can be removed with trim:

$str = trim($str, '-');

And to convert the result into lowercase, use strtolower:

$str = strtolower($str);

So all together:

$str = strtolower($str);
$str = trim($str, '-');
$str = preg_replace('/[^a-z0-9]+/', '-', $str);

Or in a compact one-liner:

$str = strtolower(trim(preg_replace('/[^a-zA-Z0-9]+/', '-', $str), '-'));