Python Regex Object & Methods

, , …,

Compile Regex Object

re.compile(pattern) → Compiles regex pattern into a object. A regex object can be used by regex object methods. ⁖…).

re.compile(pattern,flags) → use flags. 〔➤ Python Regex Flags

The use of regex object is similar to re module functions. For example, the following 2 have the same result.

Creating a regex object is useful when you need to use the same pattern on many strings. This is more efficient because python doesn't have to compile the regex pattern internally each time.

The following detail regex object methods. Most of them have the same syntax as regex functions, but some are more flexible with optional parameters. 〔➤ Python Regex Functions

Regex Object Methods

search is equivalent to, string). 〔➤ Python Regex Functions, startpos) → match on substring starting at startpos., startpos, endpos) → end at endpos.

The following have the same result:


RegexObj.match(string) → same result as re.match(pattern, string). 〔➤ Python Regex Functions

RegexObj.match(string, startpos) → match on substring starting at startpos.

RegexObj.match(string, startpos, endpos) → end at endpos


RegexObj.split(string, …) → same result as re.split(pattern, string, …). 〔➤ Python Regex Functions


RegexObj.findall(string) → same result as re.findall(pattern, string)

RegexObj.findall(string, startpos) → match on substring starting at startpos.

RegexObj.findall(string, startpos, endpos) → end at endpos.


RegexObj.finditer(string) → same result as re.finditer(pattern, string)

RegexObj.finditer(string, startpos) → match on substring starting at startpos.

RegexObj.finditer(string, startpos, endpos) → end at endpos.


RegexObj.sub( repl, string, …) → same result as re.sub(pattern, string, …).


RegexObj.subn( repl, string, …) → same result as re.subn(pattern, string, …)

Regex Attributes

The following are constants assigned by the module when a pattern object is compiled.


RegexObj.flags → The flags argument used when the regex object was compiled, or 0 if no flags were provided.

import re
patObj = re.compile(ur'\w', re.M|re.U)
print patObj.flags  # prints 40


RegexObj.groupindex → returns a dictionary. The keys are the named capture groups, the values are their order in the pattern. For example, here's a pattern of named group (?P<id>). The dictionary is empty if no symbolic groups were used in the pattern.

#-*- coding: utf-8 -*-
# python

# example of regex .groupindex

import re

# a typical image tag
# <img src="cat.jpg" alt="cat" width="123" height="456">

# a pattern to capture image tag, with named group
xx = re.compile(r'src="(?P<filename>[^"]+)" alt="(?P<alttext>[^"]+)" width="(?P<width>\d+)" height="(?P<height>\d+)"')

print xx.groupindex #  {'height': 4, 'width': 3, 'alttext': 2, 'filename': 1}


RegexObj.pattern → The pattern string from which the regex object was compiled.

blog comments powered by Disqus