p3 Python Project

This program allows a user to input DNA sequences from a file and either find the consensus sequence or transcribe the sequences to RNA. It contains functions to load the sequences, count nucleotide frequencies, find the consensus, convert DNA to RNA, and output the results to a new file. The main function handles user input to select the option and calls the appropriate functions to analyze the sequences and write the output.
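As a small illustration of what "consensus sequence" means in the description above, the sketch below picks the most common nucleotide in each column of a few equal-length sequences. The function name `consensus` and the sample data are hypothetical and do not come from the uploaded program, which uses its own counting functions.

```python
from collections import Counter


def consensus(sequences):
    """Return the most common nucleotide in each column of equal-length sequences."""
    result = []
    # zip(*sequences) yields one tuple per column.
    for column in zip(*sequences):
        # most_common(1) returns a one-element list: [(nucleotide, count)].
        nucleotide, _count = Counter(column).most_common(1)[0]
        result.append(nucleotide)
    return "".join(result)


# Hypothetical sample input, three sequences of length four.
example = ["ACGT", "ACGA", "TCGT"]
print(consensus(example))
```

On a tie, `Counter.most_common` keeps the nucleotide that appears first in the column, which is only one of several reasonable tie-breaking choices.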

Uploaded by

Daniella Vargas


"""This program asks the user to enter a number of DNA sequences and finds the
consensus sequence. The output is the consensus.

Add the corresponding code to accomplish the requested tasks
"""

##### ADD YOUR NAME, Student ID, and Section number #######
# NAME: DANIELLA VARGAS FIGUEROA
# STUDENT ID:802228453
# SECTION:096
###########################################################

# The function load_data takes an argument, reads the DNA sequences, saves them
# in a list, and returns the list.
# a: is the number of sequences to be input

# Auxiliary functions

def valid_seq(seq):
    # A sequence is valid only if it is non-empty and every letter is A, C, T, or G.
    isvalid = False
    for s in list(seq):
        if (s == 'A') or (s == 'C') or (s == 'T') or (s == 'G'):
            isvalid = True
        else:
            isvalid = False
            break
    return isvalid

# max_nuc() takes four inputs, the nucleotide frequencies in a column, and
# returns a list of two elements containing the most frequent nucleotide
# and its frequency in the column.
# Ties are resolved in the order A, G, C, T so a value is always returned.
def max_nuc(freq_a, freq_g, freq_c, freq_t):
    if freq_a >= freq_g and freq_a >= freq_c and freq_a >= freq_t:
        return ["A", freq_a]
    elif freq_g >= freq_c and freq_g >= freq_t:
        return ["G", freq_g]
    elif freq_c >= freq_t:
        return ["C", freq_c]
    else:
        return ["T", freq_t]

#########################
# load_data() takes two inputs, the file name and the selected option, and
# returns one tuple: first the list of valid sequences, then the option
# (consensus or transcription).
def load_data(filename, option):
    # Assign variables and open the file.
    lst = []
    valid_length = None
    with open(filename, "r") as infile:
        # Read the file line by line.
        for line in infile:
            seq = line.rstrip("\n")
            # Keep the sequence only if it is valid and has the same length
            # as the first accepted sequence.
            if valid_seq(seq) and (valid_length is None
                                   or valid_length == len(seq)):
                lst.append(seq)
                if len(lst) == 1:
                    valid_length = len(lst[0])
    result = (lst, option)
    # Return the result.
    return result

# The function count_nucl_freq takes the list returned by load_data and finds
# the frequencies of the nucleotides in each column.
# a: is a list of DNA sequences
def count_nucl_freq(a):
    # Create an empty list to store the most frequent letter of each column.
    frequencies = []
    # Nothing to count if no valid sequence was loaded.
    if not a:
        return frequencies
    # Use for loops to count the frequency of each letter in each column.
    for i in range(0, len(a[0])):
        columnfrec = [0, 0, 0, 0]
        for j in range(0, len(a)):
            let = a[j][i]
            if let == "A":
                columnfrec[0] = columnfrec[0] + 1
            elif let == "G":
                columnfrec[1] = columnfrec[1] + 1
            elif let == "C":
                columnfrec[2] = columnfrec[2] + 1
            else:
                columnfrec[3] = columnfrec[3] + 1
        # Append the maximum frequency of the column to the list frequencies.
        frequencies.append(
            max_nuc(columnfrec[0], columnfrec[1], columnfrec[2], columnfrec[3]))
    # Return the list.
    return frequencies
# analyze the list by columns
# find nucleotide frequencies
# you will decide what data type, from the ones already explained, works best for
# your implementation
# return frequencies

# The function find_consensus takes the result of count_nucl_freq and returns a
# consensus sequence.
# a: is the list you return in count_nucl_freq
def find_consensus(a):
    # Create an empty string to store the consensus.
    consensusString = ""
    # Loop over the frequency list built before and add the element at index 0
    # (the nucleotide) of each entry to the consensus string.
    for element in a:
        x = element[0]
        consensusString = consensusString + x
    # Open a new file and write the consensus inside it.
    with open("answer.txt", "w") as f:
        f.write(consensusString)
    # Return the consensus sequence.
    return consensusString

# The function convert_seq takes one argument, a DNA sequence.

def convert_seq(a):
    # Create an empty string to store the DNA-to-RNA conversion result.
    result = ""
    # Iterate through the DNA sequence and convert each letter.
    for let in a:
        if let == "A":
            result += "U"
        elif let == "T":
            result += "A"
        elif let == "C":
            result += "G"
        elif let == "G":
            result += "C"
    # Return the string with the converted RNA sequence.
    return result

# convert dna to rna sequences
# return rna sequences

# The function transcript_seq takes one argument, the list of sequences.

def transcript_seq(a):
    # Create an empty list to store the converted RNA sequences.
    rnaseq = []
    with open("answer.txt", "w") as file:
        # Iterate through the DNA sequences and convert each sequence to RNA.
        for seq in a:
            rna = convert_seq(seq)
            file.write(rna + "\n")
            # Append the converted RNA sequence to the list.
            rnaseq.append(rna)
    # Return the list of RNA sequences.
    return rnaseq

# Read list DNA sequences
# return list RNA Sequences

# The function main starts the program, makes the function calls, and writes a
# new file with the consensus or the transcription.
def main():
    filename = input("Write the name of the file: ")
    print('Select option:')
    print('1. Consensus Sequences')
    print('2. Transcriptions Sequences')
    option = int(input(""))
    # Loop until the user enters option 1 or 2.
    while option != 1 and option != 2:
        print("Incorrect input. Only enter 1 or 2.")
        option = int(input(""))
    data = load_data(filename, option)
    # Make the function calls according to the option the user entered.
    if data[1] == 1:
        freq = count_nucl_freq(data[0])
        cons = find_consensus(freq)
    elif data[1] == 2:
        # conv=convert_seq(data[0])
        transcript = transcript_seq(data[0])

# ask the number DNA sequence
# contains the functions call
# function doesn't return anything

if __name__ == "__main__":
    main()
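
The letter-by-letter conversion in convert_seq can also be written with a translation table. The sketch below is one possible alternative, not part of the uploaded program: the names `DNA_TO_RNA` and `convert_seq_table` are hypothetical, and the mapping (A to U, T to A, C to G, G to C) is copied from convert_seq.

```python
# Hypothetical translation table; the mapping mirrors convert_seq.
DNA_TO_RNA = str.maketrans({"A": "U", "T": "A", "C": "G", "G": "C"})


def convert_seq_table(seq):
    """Convert a DNA sequence to RNA using the same mapping as convert_seq."""
    # str.translate replaces every character found in the table and
    # leaves any other character unchanged.
    return seq.translate(DNA_TO_RNA)


print(convert_seq_table("ATCG"))
```

One difference to keep in mind: convert_seq silently drops letters other than A, T, C, and G, while `str.translate` leaves them in place. With input already filtered by valid_seq the two should agree.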
