0% found this document useful (0 votes)

74 views

Metacharacters in Python

Metacharacters are special characters in regular expressions that define patterns to match in strings. This document discusses the main metacharacters in Python like [], which represents a set of characters; *, which represents zero or more occurrences of a pattern; and |, which represents alternative matching characters. Examples are provided to demonstrate how each metacharacter can be used to extract specific patterns from strings using the re module in Python.

Uploaded by

Anonymous DFpzhrR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

74 views

Metacharacters in Python

Uploaded by

Anonymous DFpzhrR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Metacharacters in Python

Metacharacters are a very important concept of the regular expression that helps us to
solve programming tasks using the Python regex module. In this tutorial, we will learn
about the metacharacters in Python or how we can use them. We will explain each
metacharacter along with their short and simple example. The prerequisite of learning
metacharacters is that you should be familiar with the Python regular expression. If not,
visit our Python regular expression tutorial.

What are Metacharacters in Python?

Metacharacters are part of regular expression and are the special characters that
symbolize regex patterns or formats. Every character is either a metacharacter or a regular
character in a regular expression. However, metacharacters have a special meaning. They
are not used to match any patterns but to define some rules to find the specific pattern
in the statement. Metacharacters are also known as operators, signs, or symbols.

Metacharacter Description Example

[] It represents the set of characters. "[a-z]"

\ It represents the special sequence. "\r"

. It signals that any character is present at some specific place. "Ja.v."

^ It represents the pattern present at the beginning of the string. "^Java"

$ It represents the pattern present at the end of the string. "point"

* It represents zero or more occurrences of a pattern in the string. "hello*"

+ It represents one or more occurrences of a pattern in the string. "hello+"

{} The specified number of occurrences of a pattern the string. "java{2}"

| It represents either this or that character is present. "java|point"

() Capture and group (javatpoint)

[] Square Brackets Metacharacters
The [] square brackets represent the set of the characters. For example - Suppose we want
to get any occurrence of abc letters inside the target string. Or, we want to match the
words that are inside the square bracket to the target string. We can use the [abc] to
match such pattern. The [abc] will match contains any of a, b or c.

We can also specify the range of the characters using the - dash.

o [0-5] - It is same as the [012345].

o [A-E] - It is same as the [ABCDE].
o [a-d] - It is same as the [abcd].

Example -

1. import re
2. str1 = "Python is a most popular programming language. Javatpoint is best resou
rce to learn it."
3. res = re.findall(r"[jtp]", str1)
4. print(res)

Output:

['t', 't', 'p', 'p', 'p', 'J', 't', 'p', 't', 't', 't', 't']

The \ Backslash Metacharacter

The backslash is used to escape the various characters including the metacharacters. It
can also use to represent the special sequence. For example - \d is used to find the any
digit from 0-9.

Let's see another example - Suppose we want to search the match #a, where a is a
characters followed by the # special character.

Below is the table of some special characters which is used with the \.
Characters Description

\s It is used to match a one white space character.

\S It is used to match one non-white space character.

\0 It is used to match a NULL character.

\a It is used to match a bell or alarm.

\d It is used to match one decimal digit, which means from 0 to 9.

\D It is used to match any non-decimal digit.

\n It helps a user to match a new line.

\w It is used to match the alphanumeric [0-9a-zA-Z] characters.

\W It is used to match one non-word character

\b It is used to match a word boundary.

1. import re
2. str1 = "Python is a most popular programming language. Javatpoint is best resou
rce to learn it."
3. res = re.findall(r"\.", str1)
4. print(res)

Output:

['.', '.']

As we can that, it returned the list containing the two (.) dot.

The . Dot Metacharacter

The . dot metacharacter represents any string character apart from the newline character
(\n). It can consist of any letters of uppercase or lowercase, symbols such as dollar ($),
pound (#), mark (!), question mark (?) or colons (:), digits 0 to 9, including whitespace.
1. import re
2.
3. given_string = "Peter likes to \n roam on the road at night"
4. # dot(.) metacharacter to match any character
5. result_match = re.search(r'.', given_string)
6. print(result_match.group())
7.
8. # .+ to match any string except newline
9. result_match = re.search(r'.*', given_string )
10. print(result_match.group())
11.
12. given_string1 = "Peter's mobile number is - 4564\n67"
13. result_match1 = re.search(r'.+', given_string1 )
14. print(result_match1.group())

Output:

P
Peter likes to
Peter's mobile number is - 4564

The ^ Carrot Character

The carrot character returns the matching characters from beginning. For example - We
want first five words from the string, we would use the caret (^) metacharacter. Let's
understand the following example.

Example -

1. import re
2.
3. given_string = "Peter likes to \n roam on the road at night"
4. # dot(.) metacharacter to match any character
5. result_match = re.search(r'^\w{5}', given_string)
6. print(result_match.group())
Output:

Peter

In the above code, we used the \w special sequence which matches any lowercase or
uppercase, numbers, and underscore character. The five inside curly braces specify the
alphanumeric character should be occurring precisely five times.

The caret ( ^ ) to match a pattern at the beginning of each new

line
We can use the caret metacharacter only at the beginning of a string in a single line as it
is not used in multiline matching.

Example -
1. import re
2.
3. given_string = "Peter likes to \nroam on the road at night \nalso likes to eat ice-
creame"
4. # dot(.) metacharacter to match any character
5. result_match = re.search(r"^\w{5}", given_string, re.M)
6. print(result_match.group())

The $ Dollor Metacharacter

This metacharacter is just opposite to the dollor ($) metacharacter. It matches at the end
of the string. In the following example, we will match the ice-cream, which is a present at
the last of the string.

Example -
1. import re
2.
3. given_string = "Peter likes to \nroam on the road at night \nalso likes to eat ice-
cream"
4. # dot(.) metacharacter to match any character
5. result_match = re.search(r"\w{6}$", given_string, re.M)
6. print(result_match.group())
Output:

cream

The * asterisk/star Metacharacter

It is one of the most popular and widely used metacharacters in regular expression
patterns. The * asterisk represents the repetition 0 or more times as possible, meaning it
is a greedy repetition. The following example demonstrates the match of all the numbers
using the asterisk (*) metacharacter.

1. given_string = "Numbers are 1234, 8061,14567, 70453"

Observe that we need to match the two consecutive \d (represent any digit). The thing
should be remembered that at the end of the pattern means zero or more repetitions of
the preceding expression. In this case, we are preceding expression with last \d, not all
two of them. We can set the upper limit as we want. However, the lower limit is zero.

Let's understand the following example.

Example -

1. import re
2.
3. given_string = "Numbers are 1234, 8061,14567, 70453"
4. # dot(.) metacharacter to match any character
5. result_match = re.findall(r"\d\d*", given_string)
6. print(result_match)

Output:

['1234', '8061', '14567', '70453']

The + Plus Metacharacter
It is another popular and widely used metacharacter in regular expression patterns. It
represents the repetition of one or more times with as many repetitions. It means it is a
greedy repetition. In other words, there are 1 or more repetitions of the preceding
expression.

Here the pattern to be matched is \d\d+.

1. import re
2.
3. given_string = "Numbers are 5, 34, 1234, 8061,14567, 70453"
4. # dot(.) metacharacter to match any character
5. result_match = re.findall(r"\d\d+", given_string)
6. print(result_match)

Output:

['34', '1234', '8061', '14567', '70453']

The Pipe (|) Metacharacter

The pipe (|) metacharacter represents the alternative option of matching characters. Let's
understand the following example.

Example -

1. given_string = "This is my number."

2. # dot(.) metacharacter to match any character
3. result_match = re.search(r"i|n", given_string)
4. print(result_match)

Metacharacters plays a significant role in solving Python regex real-world problem, and they
come in a wide range. In this tutorial, we have included almost every metacharacters with the
proper explanation and the coding example.

Grade 7 English Language Examination For First Term
100% (1)
Grade 7 English Language Examination For First Term
7 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Book 3 Language Laboratory Activities
100% (2)
Book 3 Language Laboratory Activities
86 pages
Engage 1 2nd Edition
67% (3)
Engage 1 2nd Edition
115 pages
Lecture 6 Re Basics
No ratings yet
Lecture 6 Re Basics
12 pages
45 The Matching Characters
No ratings yet
45 The Matching Characters
3 pages
Regular Expressions
No ratings yet
Regular Expressions
5 pages
Regular Expression 01
No ratings yet
Regular Expression 01
48 pages
Python Regular Expressions
No ratings yet
Python Regular Expressions
6 pages
Python How To Regex
No ratings yet
Python How To Regex
19 pages
Regular Expression Python
No ratings yet
Regular Expression Python
23 pages
Regular Expression HOWTO: Guido Van Rossum and The Python Development Team
No ratings yet
Regular Expression HOWTO: Guido Van Rossum and The Python Development Team
18 pages
Structuring with regix
No ratings yet
Structuring with regix
49 pages
Regular Expression HOWTO: Guido Van Rossum Fred L. Drake, JR., Editor
No ratings yet
Regular Expression HOWTO: Guido Van Rossum Fred L. Drake, JR., Editor
18 pages
Regular Expression HOWTO: Guido Van Rossum Fred L. Drake, JR., Editor
No ratings yet
Regular Expression HOWTO: Guido Van Rossum Fred L. Drake, JR., Editor
18 pages
Regular Expression HOWTO: Guido Van Rossum and The Python Development Team
No ratings yet
Regular Expression HOWTO: Guido Van Rossum and The Python Development Team
20 pages
Regular Expression
No ratings yet
Regular Expression
18 pages
Lesson 1: An Introduction, and The Abcs
No ratings yet
Lesson 1: An Introduction, and The Abcs
2 pages
Regular Expression HOWTO: Guido Van Rossum Fred L. Drake, JR., Editor
100% (1)
Regular Expression HOWTO: Guido Van Rossum Fred L. Drake, JR., Editor
18 pages
PP_Module-3 Notes
No ratings yet
PP_Module-3 Notes
56 pages
Howto Regex
No ratings yet
Howto Regex
19 pages
Chapter 5 Regular Expressions, Rollover and Frames Regular Expression
No ratings yet
Chapter 5 Regular Expressions, Rollover and Frames Regular Expression
16 pages
Python RegEx
No ratings yet
Python RegEx
8 pages
Regular Exp
No ratings yet
Regular Exp
6 pages
PP - Chapter - 4
No ratings yet
PP - Chapter - 4
15 pages
Howto Regex
No ratings yet
Howto Regex
17 pages
Regular Expressions
No ratings yet
Regular Expressions
9 pages
howto-regex
No ratings yet
howto-regex
20 pages
9.RegEx (1)
No ratings yet
9.RegEx (1)
57 pages
AfroAI Reasearch - 2024
No ratings yet
AfroAI Reasearch - 2024
10 pages
Regx
No ratings yet
Regx
3 pages
Regular Expression QuickRef
No ratings yet
Regular Expression QuickRef
1 page
Howto Regex
No ratings yet
Howto Regex
20 pages
Module 4 RegEX
No ratings yet
Module 4 RegEX
22 pages
Howto Regex PDF
No ratings yet
Howto Regex PDF
20 pages
Regular Expressions
No ratings yet
Regular Expressions
9 pages
Regular Expresions
No ratings yet
Regular Expresions
27 pages
RE Expression Sheet
No ratings yet
RE Expression Sheet
2 pages
Python RegEx
No ratings yet
Python RegEx
1 page
L4 (2)
No ratings yet
L4 (2)
73 pages
Regex Cheat Sheet
No ratings yet
Regex Cheat Sheet
12 pages
Chapter 5 Regular Expression, Rollover and Frames
No ratings yet
Chapter 5 Regular Expression, Rollover and Frames
56 pages
Python Regular Expression
100% (1)
Python Regular Expression
31 pages
Validations php with regex
No ratings yet
Validations php with regex
13 pages
Regular Expressions To Identify A String
No ratings yet
Regular Expressions To Identify A String
1 page
Regex
No ratings yet
Regex
24 pages
Lec 06 - Regular Expression
No ratings yet
Lec 06 - Regular Expression
19 pages
Regular Expressions (Slides)
No ratings yet
Regular Expressions (Slides)
20 pages
CHAPTER 10
No ratings yet
CHAPTER 10
28 pages
Data Analysis Using Python Lab Ex3
No ratings yet
Data Analysis Using Python Lab Ex3
27 pages
Module3 RegularExpressions
No ratings yet
Module3 RegularExpressions
8 pages
2 - Python Strings
No ratings yet
2 - Python Strings
23 pages
PHP - Regular Expressions1
No ratings yet
PHP - Regular Expressions1
6 pages
Natural Language Processing 5
No ratings yet
Natural Language Processing 5
13 pages
UNIT V
No ratings yet
UNIT V
11 pages
Supplement Python Regular Expression
No ratings yet
Supplement Python Regular Expression
6 pages
Module 4 - Regular Expressions1
No ratings yet
Module 4 - Regular Expressions1
37 pages
Regular Expression
No ratings yet
Regular Expression
21 pages
REGEX in Data Analytics
No ratings yet
REGEX in Data Analytics
5 pages
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Ian Talks Regex A-Z
From Everand
Ian Talks Regex A-Z
Ian Eress
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Unit1 DBMS
No ratings yet
Unit1 DBMS
112 pages
Unitiii DBMS
No ratings yet
Unitiii DBMS
62 pages
Unitii DBMS
No ratings yet
Unitii DBMS
89 pages
Operator Precedence Grammar
100% (2)
Operator Precedence Grammar
5 pages
Unitv DBMS
No ratings yet
Unitv DBMS
81 pages
Unit I Introduction To Compiler: Question Bank
No ratings yet
Unit I Introduction To Compiler: Question Bank
7 pages
Normalization in DBMS
No ratings yet
Normalization in DBMS
9 pages
04 - Cache Memory PDF
No ratings yet
04 - Cache Memory PDF
71 pages
Zainhaider COAL
No ratings yet
Zainhaider COAL
43 pages
IT Discretemaths PDF
No ratings yet
IT Discretemaths PDF
195 pages
Dsa PDF
No ratings yet
Dsa PDF
293 pages
Calendar Problem
No ratings yet
Calendar Problem
3 pages
Paper 1 Study Material
50% (2)
Paper 1 Study Material
353 pages
Concept: 7 Day Cycle
No ratings yet
Concept: 7 Day Cycle
20 pages
Entity Relationship (E-R) Modeling
No ratings yet
Entity Relationship (E-R) Modeling
96 pages
Pdfcreator Opensource: The Quick Brown Fox Jumps Over The Lazy Dog. 0123456789
No ratings yet
Pdfcreator Opensource: The Quick Brown Fox Jumps Over The Lazy Dog. 0123456789
1 page
OODA Notes
No ratings yet
OODA Notes
193 pages
Entity Relationship (E-R) Modeling
No ratings yet
Entity Relationship (E-R) Modeling
96 pages
Clipping Algm
No ratings yet
Clipping Algm
14 pages
CG Question Bank
No ratings yet
CG Question Bank
17 pages
University of Palestine: Computer Graphics
No ratings yet
University of Palestine: Computer Graphics
21 pages
Uas Kelas X Semester 1
No ratings yet
Uas Kelas X Semester 1
3 pages
AI - Module IV - Propositionallogic
No ratings yet
AI - Module IV - Propositionallogic
49 pages
Makalah Present and Past Perfect Tense Muhammad Syahrus Tsani
No ratings yet
Makalah Present and Past Perfect Tense Muhammad Syahrus Tsani
9 pages
4 Basic Spatials - The Suffix of Place Position
No ratings yet
4 Basic Spatials - The Suffix of Place Position
7 pages
British vs. American English
No ratings yet
British vs. American English
6 pages
The Banyan Tree
No ratings yet
The Banyan Tree
9 pages
Josel Caraballe & Marielle Ann Oruga Midterm Exam Essay History of The English Language (TTH 5:30-7:00PM)
No ratings yet
Josel Caraballe & Marielle Ann Oruga Midterm Exam Essay History of The English Language (TTH 5:30-7:00PM)
4 pages
16 Tenses
No ratings yet
16 Tenses
6 pages
Unit 1 Getting To Know You: 1. Match The Questions and Answers
No ratings yet
Unit 1 Getting To Know You: 1. Match The Questions and Answers
6 pages
Further Reading Practice: Read The Words/sentences First To Yourself, Then Aloud
No ratings yet
Further Reading Practice: Read The Words/sentences First To Yourself, Then Aloud
16 pages
Present Continuous Vs Past Continuous
No ratings yet
Present Continuous Vs Past Continuous
9 pages
Additional 17 Grammar Patterns For Reading and Writing
No ratings yet
Additional 17 Grammar Patterns For Reading and Writing
7 pages
Phraseology: Phraseology Studies Such Collocations of Words (Phraseologisms
No ratings yet
Phraseology: Phraseology Studies Such Collocations of Words (Phraseologisms
10 pages
Tugas Unit 4
No ratings yet
Tugas Unit 4
12 pages
Back To Grammar Worksheets
No ratings yet
Back To Grammar Worksheets
8 pages
0112-French Online Certificate Course
No ratings yet
0112-French Online Certificate Course
12 pages
Compound Adjectives 4427
0% (2)
Compound Adjectives 4427
3 pages
Vocabulary
No ratings yet
Vocabulary
137 pages
Grade 10 English Worksheet-1
No ratings yet
Grade 10 English Worksheet-1
7 pages
Aptis Speaking About The Weather PDF
No ratings yet
Aptis Speaking About The Weather PDF
1 page
Verb Tenses
No ratings yet
Verb Tenses
8 pages
Building Vocabulary
No ratings yet
Building Vocabulary
7 pages
Stages of Language Acquisition-ArtfredFort
No ratings yet
Stages of Language Acquisition-ArtfredFort
9 pages
G8 Jitesh - English Language Test 1
No ratings yet
G8 Jitesh - English Language Test 1
5 pages
Chapter 1: Teaching Your Tongue To Speak English
No ratings yet
Chapter 1: Teaching Your Tongue To Speak English
36 pages
Taller Del Presente Progresivo
No ratings yet
Taller Del Presente Progresivo
8 pages
Verb Tense and Method
No ratings yet
Verb Tense and Method
5 pages