Remove special characters using Pentaho - Replace in String

M. Loyyy picture M. Loyyy · Feb 8, 2017 · Viewed 10.3k times · Source

I wanted to remove the special characters like ! @ # $ % ^ * _ = + | \ } { [ ] : ; < > ? / in a string field.

I used the "Replace in String" step and enabled the use RegEx. However, I do not know the right syntax that I will put in "Search" to remove all these characters from the string. If I only put one character in the "Search" it was removed from the string. How can I remove all of these??

This is the picture of how I did it: This is the picture of how I did it

Answer

Wiktor Stribiżew picture Wiktor Stribiżew · Feb 8, 2017

As per documentation, the regex flavor is Java. You may use

\p{Punct}

See the Java regex syntax reference:

\p{Punct} Punctuation: One of !"#$%&'()*+,-./:;<=>?@[]^_`{|}~