One Star

non-alphanumeric characters that aren't special RegExp characters

Can someone please verify for me that I have these characters grouped appropriately, and let me know about the "Unknown" characters (if they are ordinary/allowable or metacharacters/not ordinary/special), please?
I'm trying to figure out what characters we should NOT use (in strings that Talend will be using/calling/whatever) - just to avoid headaches.
Thanks!

Ordinary/Allowable Characters
*Characters that "match themselves" in Regular Expressions
**"Characters other than . $ ^ { } * + ? \ match themselves." (http://regexlib.com/CheatSheet.aspx)
Character (Name)
@ (Ampersand)
# (Number sign)
% (Percent)
- (Hyphen, Minus sign)
_ (Underscore)
= (Equals)
~ (Tilde)
` (Grave accent)
: (Colon)
; (Semicolon)
< (Less than, Left Angle Bracket)
> (Greater than, Right Angle Bracket)
, (Comma)

Metacharacters/Not Ordinary/Special Characters
*These will NOT match themselves
**Unless prefaced by a \
**Except for the backslash (\) character, which must be prefaced by \\\ to come out as a single backslash. (http://www.amk.ca/python/howto/regex/regex.html#SECTION000420000000000000000)
Character (Name)
$ (Dollar Sign)
( (Left Parentheses, Left Round Bracket)
) (Right Parentheses, Right Round Bracket)
* (Asterisk)
? (Question Mark)
\ (Backslash)
^ (Caret)
(Right Square Bracket/Brace)
{ (Left Curly Bracket/Brace)
} (Right Curly Bracket/Brace)
| (Pipe)
+ (Plus)

Unknown
*I don't know which of the two categories, above, the following characters fall into
Character (Name)
. (Period, Full stop)
? (Single quote, Apostrophe)
" (Double quote)

Sources of the above information:
http://www.regular-expressions.info/reference.html
http://regexlib.com/CheatSheet.aspx
http://www.amk.ca/python/howto/regex/regex.html
http://www.fileformat.info/tool/regex.htm
3 REPLIES
One Star

Re: non-alphanumeric characters that aren't special RegExp characters

Can someone please verify for me that I have these characters grouped appropriately, and let me know about the "Unknown" characters (if they are ordinary/allowable or metacharacters/not ordinary/special), please?
I'm trying to figure out what characters we should NOT use (in strings that Talend will be using/calling/whatever) - just to avoid headaches.
Thanks!

pleaseeeee?
One Star

Re: non-alphanumeric characters that aren't special RegExp characters

Hi,
your groups are correct.
For the last three signs: "." (Period, Full stop) => special (matches any character but also will match itself)
The other two do not have any special function in regex.
Bye
Volker
One Star

Re: non-alphanumeric characters that aren't special RegExp characters

ok, cool - just wanted to make sure cause I saw conflicting information about those three characters (between "regular" regular expressions and then java regular expressions)