0% found this document useful (0 votes)

73 views9 pages

University of Mauritius

This document summarizes an assignment for the course AGRI 2081Y - Computational Biology offered at the University of Mauritius, Faculty of Agriculture. The assignment was completed by Marie Natacha Meunier with student ID 1712892 and submitted to the lecturer Dr Shakuntala Baichoo on 25th May 2020. The assignment contains code snippets and answers to computational biology questions involving string manipulation of DNA, RNA and protein sequences.

Uploaded by

grace meunier

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

73 views9 pages

University of Mauritius

Uploaded by

grace meunier

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

UNIVERSITY OF MAURITIUS

FACULTY OF AGRICULTURE
BSc (Hons) Biotechnology

AGRI 2081Y (3) - COMPUTATIONAL BIOLOGY

Name of Student: Marie Natacha Meunier

Student I.D: 1712892

Date: 25th May 2020

Lecturer Name: Dr Shakuntala Baichoo

chain_a = """SSSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKM
FCQLAKTCPVQLWVDSTPPPGTRVRAMAIYKQSQHMTEVV
RRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFR
HSVVVPYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILT
IITLEDSSGNLLGRNSFEVRVCACPGRDRRTEEENLRKKG
EPHHELPPGSTKRALPNNT"""

#Question 1 a

num_lines = chain_a.count ("\n")

print (num_lines)

#Question 1 b
length sequence = len (chain_a) - chain_a.count ("\n")
print (length sequence: ", length)

#Question 1 c
new_chain = chain_a.replace("\n", "")
print("New Chain:",new_chain)

#Question 1 d

count = 0
result=0
for i in chain_a:
if i == 'C':
count = count + 1
print ("Number of C:",count)

#Question 1 e
if "NLRVEYLDDRN" in chain_a:
print("yes found");

pos= chain_a.find("NLRVEYLDDRN")
print("Starting position :",pos);
Question 2

dna_seq = """GGGCTTGTGGCGCGAGCTTCTGAAACTAGGCGGCAGAGGCGGAGCCGCT
GTGGCACTGCTGCGCCTCTGCTGCGCCTCGGGTGTCTTTT
GCGGCGGTGGGTCGCCGCCGGGAGAAGCGTGAGGGGACAG
ATTTGTGACCGGCGCGGTTTTTGTCAGCTTACTCCGGCCA AAAAAGAACTGCACCTCTGGAGCGG""

#Question 2 a

# Count the number of C’s in DNA sequence

no_c = dna_seq.count ("C")

# Count the number of G’s in DNA sequence

no_g = dna_seq.count ("G")

#determine the length of the DNA sequence

dna_length = len(dna_seq)

#compute the GC content

gc_cont = (no_g + no_c)

#Question 2 b

rna_seq = dna_seq.replace("T","U")
#Question 2 c

intron = dna_seq[50:156]
exon1 = dna_seq[0:50]
exon2 = dna_seq[156:]
spliced = exon1+exon2

Question 3
#Question 3 a

clusters = """\
>Cluster 0
0 >YLR106C at 100.00%
>Cluster 50
0 >YPL082C at 100.00%
>Cluster 54
0 >YHL009W-A at 90.80%
1 >YHL009W-B at 100.00%
2 >YJL113W at 98.77%
3 >YJL114W at 97.35%
>Cluster 52
0 >YBR208C at 100.00%
"""

#Question a
result = re.findall(r">Cluster?([ \d.]+)", clusters, re.IGNORECASE |
re.MULTILINE)
#print("ID :",str(result))

#Question b
r = clusters.replace('>Cluster', 'Test')
#print("New :",r)
result = re.findall(r"> ?([A-Za-z0-9-]+)", r, re.IGNORECASE |
re.MULTILINE)
#print("sd :",str(result))

per=re.findall(r"> ?([A-Za-z0-9-]+)", r, re.IGNORECASE | re.MULTILINE)

+ re.findall(r"at ?([\d.]+)", clusters, re.IGNORECASE | re.MULTILINE)
#print("sd :",str(per))

lines = r.split('\n')
#print(lines)
for line in lines:
print(re.findall(r"> ?([A-Za-z0-9-]+)", line, re.IGNORECASE |
re.MULTILINE) + re.findall(r"at ?([\d.]+)", line, re.IGNORECASE |
re.MULTILINE))
#Question 4

("A", "T"): 10.0 / 5.0,

("A", "C"): 10.0 / 7.0,
("A", "G"): 10.0 / 6.0,
("T", "C"): 5.0 / 7.0,
("T", "G"): 5.0 / 6.0,
("C", "G"): 7.0 / 6.0 .
#Question 4 a

#There is no difference between the len(ratios), len(ratios.keys()),

len(ratios.values()) and len(ratios.items()) since all the commands
measure the key values
print len(ratios.keys())
print len(ratios.values())
print len(ratios.items())

#Question 4 b

ratio= ("A", "T"): 10.0 / 5.0, ("C", "G"): 7.0 / 6.0 .

If ("A", "T") in ratios:

print ("yes 'A, T' is found in ratios")
or:
print ("No 'T, A' is not found in ratios")

If ("C", "G") in ratios:

print ("yes 'C, G' is found in ratios")
or:
print ("No 'C, G' is not found in ratios")
#Question 4 c

contains_2 = 2 in ratios.values()
print contains_2

contains_3 = 3 in ratios.values()
print contains_3

#Question 4 d

2 in ("A", "T"):
print (("A", "T"), 2) in ratios.items()

1000 in ("C", "G"):

print (("C", "G"), 1000) in ratios.items()

#Question 4 e

keys = [key_value[0]
for key_value in ratios.items()]
values = [key_value[-1]
for key_value in ratios.items()]
#Question 5

#translate the list:

list = ["A", "T", "T", "A", "G", "T", "C"]

translation=

String="ade tym tym ade gua tym cyt"

str = " ade tym tym ade gua tym cyt "

s = ['A, T, T, A, G, T, C ', 'for', ' ade, tym, tym, ade, gua, tym, cyt ']

print(listToString(s))
#Question 6

A python program to read the file data.fasta

text=""">2HMI:A|PDBID|CHAIN|SEQUENCE

PISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKI

>2HMI:B|PDBID|CHAIN|SEQUENCE

PISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKI

>2HMI:C|PDBID|CHAIN|SEQUENCE

DIQMTQTTSSLSASLGDRVTISCSASQDISSYLNWYQQKPEGTVKLLIYY

>2HMI:D|PDBID|CHAIN|SEQUENCE

QITLKESGPGIVQPSQPFRLTCTFSGFSLSTSGIGVTWIRQPSGKGLEWL

>2HMI:E|PDBID|CHAIN|SEQUENCE

ATGGCGCCCGAACAGGGAC

>2HMI:F|PDBID|CHAIN|SEQUENCE

GTCCCTGTTCGGGCGCCA"""

fastaFile = open('fasta_file.txt')

Biostatistics and Research Methodology
From Everand
Biostatistics and Research Methodology
Dr. G. Nageswara Rao
5/5 (5)
Shogun Method Derek Rake
13% (8)
Shogun Method Derek Rake
33 pages
Fundamentals of Artificial Intelligence - Lab 1: Expert Fundamentals of Artificial Intelligence - Lab 1: Expert Systems Systems
No ratings yet
Fundamentals of Artificial Intelligence - Lab 1: Expert Fundamentals of Artificial Intelligence - Lab 1: Expert Systems Systems
8 pages
Genetic Engineering Essay
No ratings yet
Genetic Engineering Essay
4 pages
Function Solutions
No ratings yet
Function Solutions
10 pages
p3 Python Project
No ratings yet
p3 Python Project
4 pages
BINP16 Programming Exam 2016-10-25 Solutions
No ratings yet
BINP16 Programming Exam 2016-10-25 Solutions
5 pages
IDC306 Assignment 5 MS21009
No ratings yet
IDC306 Assignment 5 MS21009
4 pages
solutionsExerciseMaster11 23
No ratings yet
solutionsExerciseMaster11 23
13 pages
BT3040 - BIOINFORMATICS - Assignment 4: Question 1
No ratings yet
BT3040 - BIOINFORMATICS - Assignment 4: Question 1
9 pages
Lösungen Zu Den Exercises AI Python
No ratings yet
Lösungen Zu Den Exercises AI Python
26 pages
2nd Year
No ratings yet
2nd Year
83 pages
Untitled Document
No ratings yet
Untitled Document
15 pages
p2 Python Project
No ratings yet
p2 Python Project
3 pages
Py 1679789071
No ratings yet
Py 1679789071
2 pages
AI - Programs KP Print
No ratings yet
AI - Programs KP Print
14 pages
ENEL2CM Assignment 2 (2025)
No ratings yet
ENEL2CM Assignment 2 (2025)
15 pages
15CSL76 Students
No ratings yet
15CSL76 Students
18 pages
Aiml Sample Programs
No ratings yet
Aiml Sample Programs
20 pages
J.K. Institute of Applied Physics and Technology: Natural Language Processing Assignment
No ratings yet
J.K. Institute of Applied Physics and Technology: Natural Language Processing Assignment
22 pages
Group17 2
No ratings yet
Group17 2
9 pages
AIML Manual V1-6-83
No ratings yet
AIML Manual V1-6-83
78 pages
PY Exam
No ratings yet
PY Exam
11 pages
AI-tutor Src Quiz System.py at Main · AdityaK-101 AI-tutor
No ratings yet
AI-tutor Src Quiz System.py at Main · AdityaK-101 AI-tutor
13 pages
Ai SRK
No ratings yet
Ai SRK
19 pages
Python Solutions
No ratings yet
Python Solutions
4 pages
Dy Ai Rec
No ratings yet
Dy Ai Rec
24 pages
Artificial Intelligence Lab File
No ratings yet
Artificial Intelligence Lab File
10 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
36 pages
DWM Final Exps
No ratings yet
DWM Final Exps
14 pages
Python
No ratings yet
Python
9 pages
Ex 06 LL
No ratings yet
Ex 06 LL
9 pages
CS3491-AIML Lab Manual
No ratings yet
CS3491-AIML Lab Manual
20 pages
Exam Sample Questions
No ratings yet
Exam Sample Questions
6 pages
31-40 Prolog Program
No ratings yet
31-40 Prolog Program
23 pages
PRGM Aiml
No ratings yet
PRGM Aiml
27 pages
Homework 4
No ratings yet
Homework 4
7 pages
Answers Etc Sip Class 12
No ratings yet
Answers Etc Sip Class 12
9 pages
AIML Lab Manual
No ratings yet
AIML Lab Manual
39 pages
Bioinf575 hw07 Dmeghana
No ratings yet
Bioinf575 hw07 Dmeghana
34 pages
2024 Final Exams Rev Worksheet 1
No ratings yet
2024 Final Exams Rev Worksheet 1
9 pages
Aiml Lab
No ratings yet
Aiml Lab
10 pages
Comp Sci Prac
No ratings yet
Comp Sci Prac
8 pages
0 Aimlfinal
No ratings yet
0 Aimlfinal
24 pages
CSE160 Final 23wi Key
No ratings yet
CSE160 Final 23wi Key
10 pages
Aiml Programs
No ratings yet
Aiml Programs
24 pages
Algo
No ratings yet
Algo
10 pages
Aiml Lab Manual
No ratings yet
Aiml Lab Manual
44 pages
Arduino DLL
No ratings yet
Arduino DLL
13 pages
Shashidhar-18csl76 Final
No ratings yet
Shashidhar-18csl76 Final
19 pages
AI & ML Lab Manual
No ratings yet
AI & ML Lab Manual
25 pages
Aiml Lab Manual New Ucev
No ratings yet
Aiml Lab Manual New Ucev
37 pages
Lab Manual: Spring 2021
No ratings yet
Lab Manual: Spring 2021
33 pages
AI&ML Lab Manual
No ratings yet
AI&ML Lab Manual
50 pages
Aiml Lab Programs PDF
No ratings yet
Aiml Lab Programs PDF
25 pages
Machine Learning Through Python Lab Mannual
No ratings yet
Machine Learning Through Python Lab Mannual
33 pages
Prac1 23bme053
No ratings yet
Prac1 23bme053
32 pages
Aiml Lab Manual 2023
No ratings yet
Aiml Lab Manual 2023
17 pages
Ai Myh
No ratings yet
Ai Myh
8 pages
Quizlet Py
No ratings yet
Quizlet Py
13 pages
Calculus and Statistics
From Everand
Calculus and Statistics
Michael C. Gemignani
4/5 (1)
University of Mauritius
No ratings yet
University of Mauritius
4 pages
Radial Immuno
No ratings yet
Radial Immuno
8 pages
SBSC Agribiotechnology Year Iii: Gmo Test
No ratings yet
SBSC Agribiotechnology Year Iii: Gmo Test
1 page
Intracellular Signal Transduction
No ratings yet
Intracellular Signal Transduction
11 pages
Revision questions-AGRI 2042
No ratings yet
Revision questions-AGRI 2042
1 page
Agarose Gel Electrophoresis
No ratings yet
Agarose Gel Electrophoresis
1 page
KD N Covid To Upload
No ratings yet
KD N Covid To Upload
12 pages
BT Cotton
No ratings yet
BT Cotton
3 pages
Saliva Report
No ratings yet
Saliva Report
11 pages
Genetics Dictionary
No ratings yet
Genetics Dictionary
2 pages
CRANIOFACIAL Anomalies in Children
No ratings yet
CRANIOFACIAL Anomalies in Children
55 pages
Kami Export - BioProtein WK
No ratings yet
Kami Export - BioProtein WK
1 page
PCR Primer Design Workshop V 1
No ratings yet
PCR Primer Design Workshop V 1
63 pages
Pathologic Basis of Veterinary Disease, 4th Edition: Chapter 5 Diseases of Immunity
No ratings yet
Pathologic Basis of Veterinary Disease, 4th Edition: Chapter 5 Diseases of Immunity
88 pages
Human Values in Relation To Evolution
No ratings yet
Human Values in Relation To Evolution
6 pages
Activity Sheet in Earth and Life Science
0% (1)
Activity Sheet in Earth and Life Science
23 pages
Horticulturae 09 01066 v2
No ratings yet
Horticulturae 09 01066 v2
12 pages
Biochemical Pharmacology: Naina Monga, Gurupreet S. Sethi, Kanthi Kiran Kondepudi, Amarjit S. Naura
No ratings yet
Biochemical Pharmacology: Naina Monga, Gurupreet S. Sethi, Kanthi Kiran Kondepudi, Amarjit S. Naura
12 pages
Listado de Especies Maldi Biotyper - Enero 2014
No ratings yet
Listado de Especies Maldi Biotyper - Enero 2014
52 pages
Pmw150a Assignment
No ratings yet
Pmw150a Assignment
9 pages
Ring Species
No ratings yet
Ring Species
10 pages
Learning Activity 4.2 - Investigating DNA Replication
No ratings yet
Learning Activity 4.2 - Investigating DNA Replication
31 pages
PDF Ebook Online Access For Anatomy & Physiology 2nd Edition, (Ebook PDF) Download
100% (7)
PDF Ebook Online Access For Anatomy & Physiology 2nd Edition, (Ebook PDF) Download
65 pages
BCH 801 Anaplerosis & Cataplerosis
No ratings yet
BCH 801 Anaplerosis & Cataplerosis
4 pages
ANTHR 101 - 5 Evolution 1
No ratings yet
ANTHR 101 - 5 Evolution 1
17 pages
BCH 414-1
No ratings yet
BCH 414-1
7 pages
NEET White Paper Classification of Plant Kingdom
No ratings yet
NEET White Paper Classification of Plant Kingdom
6 pages
Polycythemia Vera Report
No ratings yet
Polycythemia Vera Report
31 pages
Introduction To Medical Parasitology
No ratings yet
Introduction To Medical Parasitology
50 pages
Studies On Production of Griseofulvin: Bioprocess Engineering 21 (1999) 489 495 Ó Springer-Verlag 1999
No ratings yet
Studies On Production of Griseofulvin: Bioprocess Engineering 21 (1999) 489 495 Ó Springer-Verlag 1999
7 pages
Gut Tube and Body Cavity..
No ratings yet
Gut Tube and Body Cavity..
2 pages
Lab 9, RFLP
No ratings yet
Lab 9, RFLP
5 pages
Claudio Daniel Stern
No ratings yet
Claudio Daniel Stern
3 pages
Psychsm TB Ch02
100% (1)
Psychsm TB Ch02
35 pages
Fungi
100% (17)
Fungi
68 pages
HSC FRB Bio. Exam-18 - B - Practice With Solve
No ratings yet
HSC FRB Bio. Exam-18 - B - Practice With Solve
8 pages
GMO, Shikimate Pathway Gut Flora and Health
No ratings yet
GMO, Shikimate Pathway Gut Flora and Health
77 pages
JCRImpact Factor 2021
No ratings yet
JCRImpact Factor 2021
526 pages

University of Mauritius

Uploaded by

University of Mauritius

Uploaded by

UNIVERSITY OF MAURITIUS

AGRI 2081Y (3) - COMPUTATIONAL BIOLOGY

Name of Student: Marie Natacha Meunier

Student I.D: 1712892

Date: 25th May 2020

Lecturer Name: Dr Shakuntala Baichoo

num_lines = chain_a.count ("\n")

# Count the number of C’s in DNA sequence

# Count the number of G’s in DNA sequence

#determine the length of the DNA sequence

#compute the GC content

gc_cont = (no_g + no_c)

per=re.findall(r"> ?([A-Za-z0-9-]+)", r, re.IGNORECASE | re.MULTILINE)

("A", "T"): 10.0 / 5.0,

#There is no difference between the len(ratios), len(ratios.keys()),

ratio= ("A", "T"): 10.0 / 5.0, ("C", "G"): 7.0 / 6.0 .

If ("A", "T") in ratios:

If ("C", "G") in ratios:

1000 in ("C", "G"):

#translate the list:

list = ["A", "T", "T", "A", "G", "T", "C"]

String="ade tym tym ade gua tym cyt"

A python program to read the file data.fasta

You might also like