Quick answer: Match a string consisting of a single character, where that character is alphanumeric (letters, numbers) an underscore (
_) or an asterisk (
\w" means "any word character" which usually means alphanumeric (letters, numbers, regardless of case) plus underscore (_)
^" "anchors" to the beginning of a string, and the "
$" "anchors" To the end of a string, which means that, in this case, the match must start at the beginning of a string and end at the end of the string.
means a character class, which means "match any character contained in the character class".
It is also worth mentioning that normal quoting and escaping rules for strings make it very difficult to enter regular expressions (all the backslashes would need to be escaped with additional backslashes), so in Python there is a special notation which has its own special quoting rules that allow for all of the backslashes to be interpreted properly, and that is what the "
r" at the beginning is for.
Note: Normally an asterisk (
*) means "0 or more of the previous thing" but in the example above, it does not have that meaning, since the asterisk is inside of the character class, so it loses its "special-ness".
As exhuma said, \w is any word-class character (alphanumeric as Jonathan clarifies).
However because it is in square brackets it will match:
So the whole regular expression matches:
so the following would match:
blah z <- matches this line blah
blah * <- matches this line blah