Osish Bantha Internship
Osish Bantha Internship
Osish Bantha Internship
BACHELOR OF ENGINEERING
IN
INFORMATION SCIENCE AND ENGINEERING
Submitted By
CERTIFICATE
This is to certify that the Internship report entitled “INNOVATION PROJECT ON PYTHON
AND MACHINE LEARNING” carried out by OSISH BANTHA bearing USN: 1RI21IS041
bonafide student of RR Institute of Technology, submitted this report in partial fulfilment for the
award of the degree in Bachelor of Engineering in Information Science and Engineering from
Visvesvaraya Technological University, Belagavi during the academic year 2023-24. It is hereby
certified that all the corrections/suggestions indicated for internal assessment have been incorporated
into the report, which has been submitted to the Department Library. This Internship (21INT68) report
has been approved, as it satisfies the academic requirements in respect of internship (21INT68)
prescribed for the said degree.
I, OSISH BANTHA third year students from Department of Information Science and
Engineering, RR Institute of Technology, Bengaluru, hereby declare that the Internship report
entitled “INNOVATION PROJECT ON PYTHON AND MACHINE LEARNING” has been carried
out by me under the super vision of my Prof. Sowmya J, Assistant Professor Department of
Information Science and Engineering, RR Institute of Technology , Bengaluru and submitted in
partial fulfilment for the award of degree in Bachelor of Engineering in Information Science and
Engineering from Visvesvaraya Technological University, Belagavi during the academic year 2023
- 2024. I further declare that the report has not been submitted to any other University for the award
of any other degree.
PLACE: BENGALURU
DATE:
OSISH BANTHA
(1RI21IS041)
ACKNOWLEDGEMENT
The satisfaction that accompanies the success in completion of this project would be
incomplete without the mention of the people who made it possible, without whose constant
guidance and encouragement would have made my efforts go in vain. I consider myself
privileged to express gratitude and respect towards all those who guided me through the
completion of this project.
I would like to express my sincere gratitude to Dr. Erappa G, Professor and Head,
Department of Information Science and Engineering, RRIT, Bengaluru for giving me the
support and encouragement that was necessary for the completion of this Internship.
I would also like to convey my regards to all faculty members and non-teaching staff
of R R Institute of Technology, Bengaluru for constantly motivating and guiding me in my
journey at RRIT.
Finally, I thank my Parents, Friends and Family members for their co-operation and
their guidance in completing this Internship.
OSISH BANTHA
(1RI21IS041)
ABSTRACT
5 CONCLUSION 17
BIBLIOGRAPHY 18
APPENDIX 19
Appendix A: Abbreviations 19
LIST OF FIGURES
Screenshots
COMPANY PROFILE
Objectives
Their goal is to consistently deliver success to students by going the extra mile.
To help their students meet their technological skills and career opportunities,
they offer the right people, solutions, and services.
By leveraging leading technologies and industry best practices, they provide their
students with the most efficient and effective training.
Operation of the Organization
The race for digital transformation is on. In this globally connected on-demand
world with rapid advancements in internet technologies, businesses worldwide
are under constant pressure to add innovative real-time capabilities to their
applications to respond to market opportunities.
Every business worldwide is building event-driven, real-time applications - from
financial services, transportation, and energy, to retail, healthcare, and Gaming
companies.
Our endeavor is to make it easy to develop innovative real-time applications and
efficient to operate them in production.
We have a proven record of building highly scalable, world-class consulting
processes that offer tremendous business advantages to our clients in the form of
huge cost- benefits, definitive results and consistent project deliveries across the
globe.
We prominently strive to improve your business by delivering the full range of
competencies including operational performance, developing and applying
business strategies to improve financial reports, defining strategic goals and
measure andmanage those goals along with measuring and managing them.
Page | 1
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
Services Offered
Training / Internships form a very important part of students over all development
that's why AICTE and Universities have made it mandatory for every engineer
and MCA to undergo the same, we help students in achieving this goal by helping
them acquire latest skills and provide them with hands on projects.
Page | 2
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
Page | 3
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
CHAPTER – 2
ABOUT THE DEPARTMENT
1. Specific Functionalities of the Department
Our department of tech support majorly focused on manage, maintain and repair
IT systems. The Special functionalities include
Understanding the work to be completed.
Planning the assigned activities in more detail if needed
Completing assigned work withinthe budget,
timeline and quality expectations
Informing the project manager of issues, scope changes, risk
and quality concerns
Proactively communicating status and managing expectation
2.2 Process Adopted
The department aims to first understand the user requirements. Further on,
a basic structure of the product that needs to be built is drawn and
understood. Eventually, the technologies that would best help in
developing the product are understood. If the product has database
requirements, the schema and the database design are worked upon. The
department believes in “Think before you code”- the requirements and
logics are first understood over a paper and then are moved to a code form.
Agile processes generally promote a disciplined project management
process that encourages frequent inspection and adaptation, a leadership
philosophy that encourages teamwork, self-organization and
accountability, a set of engineering best practices intended to allow for
rapid delivery of high-quality software, and a business approach that aligns
development with customer needs and company goals. Agile development
refers to any development process that is aligned with the concepts of the
Agile Manifesto. The Manifesto was developed by a group fourteen
leading figures in the software industry, and reflects their experience of
what approaches do and do not work for software development.
Page | 4
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
3 Testing
Testing was done according to the Corporate Standards. As each component was
being built, Unit testing was performed in order to check if the desired
functionality is obtained. Each component in turn is tested with multiple test
cases to verify if it is properly working. These unit tested components are
integrated with the existing built components and then integration testing is
performed. Here again, multiple test cases are run to ensure the newly built
component runs in co-ordination with the existing components. Unit and
Integration testing are iteratively performed until the complete product is built.
Once the complete product is built, it is again tested against multiple test cases
and all the functionalities.
The product could be working fine in the developer’s environment but might not
necessarily work well in all other environments that the users could be using.
Hence, the product is also tested under multiple environments (Various operating
systems and devices). At every step, if a flaw is observed, the component is
rebuilt to fix the bugs. This way, testing is done hierarchically and iteratively.
Page | 5
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
Page | 6
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
CHAPTER – 3
TASK PERFORMED
In this Internship Machine Learning with Python using AI it was divided into two
parts one is front enddevelopment and one more is backend course.
Training Program
The internship is a platform where the trainees are assigned with the specific task.
In the initial days of the internship, I was trained on the following:
Python Programming
Artificial Intelligence
Machine Learning Algorithms
DATA SET
This section describes, in brief, the data that has been used for the research. Data
from multiple sources was used in this project, the major amount of data was
extracted from public website Yocket (Yocket.com), data regarding the rankings,
fees and enrolment in colleges was obtained from a leading educational
consultancy firm The Mentors Circle in India. Data from both the sources was
integrated together to form a staging data-set. For predicting the chance of a
student getting shortlisted in universities the final data-set was divided into
multiple data- sets each representing a particular university. For predicting the
list of universities suitable for students based on their profile data of all the
students the staging data-set was updated only to have records of students who
had successfully secured admission in the universities. Below table shows the
different features of the data-sets.
GRE Marks scored by the student in GRE
TOEFL Score Marks scored by the student in English Proficiency Test
Ranking The University Ranking
SOP Quality of Statement of Purpose or Statement of Intent
LOR Quality of Letter of Recommendations documents
CGPA Result of the student in their Undergraduate Course
Research Relevant experience in Research field.
Page | 7
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
Page | 8
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
Page | 9
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
CHAPTER – 4
REFLECTION NOTES
1. Experience
According to our internship experience, Knowledge Solutions India offers a
positive work culture and courteous personnel at all levels, from staff to
management. The instructors are knowledgeable in their subjects and treat
everyone fairly. There are no distinctions made between new graduates and
corporate executives, and everyone is treated equally. Every activity, no matter
how difficult or simple, requires a lot of teamwork, and the mood is always
peaceful and welcoming. Because of the excellent communication and support
available, there is a lot of room for self-improvement. Interns were well treated
and educated, and all of our questions and concerns about the training or the
firms were addressed. All in all, Knowledge Solutions India was a great place for
a fresher to start career and also for a corporate to boost his/her career. It has
been a great experience to be an intern in such a reputed organization.
1. Technical Outcomes
4.2.1System Requirements and Specification HARDWARE
REQUIREMENTS:
Processor : x86 or x64
Hard Disk : 216 GB or more.
Ram : 512 MB (minimum), 1 GB(recommended)
SOFTWARE REQUIREMENTS:
Operating System : Windows or Linux
Development Environment : Anaconda Navigator
(Jupiter Notebook or Spyder)
Page | 10
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
1 Existing System
(Bibodi et al. (n.d.)) used multiple machine learning models to create a system
that would help the students to shortlist the universities suitable for them also a
second model was created to help the colleges to decide on enrolment of the
student. Nave Bayes algorithm was used to predict the likelihood of success of
an application, and multiple classification algorithms like Decision Tree,
Random Forest, Nave Bayes and SVM were compared and evaluated based on
their accuracy to select the best candidates for the college.
GRADE system was developed by (Waters and Miikkulainen (2013)) to support
the admission process for the graduate students in the University of Texas Austin
Department of Computer Science. The main objective of the project was to
develop a system that can help the admission committee of the university to take
better and faster decisions. Logistic regression and SVM were used to create the
model, both models performed equally well and the final system was developed
using Logistic regression due to its simplicity. The time required by the admission
committee to review the applications was reduced by 74% but human intervention
was required to make the final decision on status if the application. (Nandeshwar
et al. (2014)) created a similar model to predict the enrolment of the student in
the university based on the factors like SAT score, GPA score, residency race etc.
The Model was created using the Multiple Logistic regression algorithm, it was
able to achieve accuracy rate of 67% only.
2 Disadvantages of the Existing System
Limitation of this system only relied on the GRE,
TOEFL and Undergraduate Score of the student and
missed on taking into consideration other important
factors like SOP and LOR.
The existing system lagged the factor of the research
work in the related field.
This model achieved only 67% accuracy.
Page | 11
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
The principal objective of the research is to help the students who are aspiring to
pursue their education in the USA. The Graduate Admissions Prediction system
will help them to evaluate the chances of success in any university without being
dependent on any education consultancy firm. It will help them in saving a huge
amount of time and money spent in the application process. Also, it will help
them to limit the number of applications made by the students by suggesting
them the best universities where they have high chances of securing admission
thereby by saving the amount of money spent by the students by applying in
universities where they have less chance to secure admit based on their profile.
1. Advantages of the Proposed System
Information about the prediction analysis is clear to
enter all the required information to predict the
admission.
The user interface code will interact with the Linear
Regression, ANN, random forest regressor, decision
tree regressor to provide the users with the required
result.
The ANN algorithm and Linear Regression Algorithm
will be used to determine the chance of the student of
securing admission in a particular university based on
his/her profile.
Once the models have been executed the result will be
provided to the student as the output on the user
interface.
4.1 System Architecture
4.3.1 Data Flow Diagram
The machine learning models are trained with the given dataset. The machine
learning models used in this project are linear regression, artificial neural
network (ANN), random forest regressor, decision tree regressor. Once the
models are trained, the student’s profile details are entered to predict the chances
of getting the admit to the university.
Page | 12
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
Page | 13
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
3 Implementation
1. Modules
1. Exploratory Data Analysis in Machine Learning
2. Data Visualization
3. Training and Testing
4. Train and Evaluate Linear Regression
5. Train and Evaluate Artificial Neural Network
MODULES DESCRIPTION
Exploratory Data Analysis: Performed initial investigations on data so as to
discover patterns, to spot anomalies, to test hypothesis and to check assumptions
with the help of summary statistics and graphical representations.
Data Visualization: Using data visualization, I summarized the data with graphs,
pictures and maps, so that the human mind has an easier time processing and
understanding the given data. Data visualization plays a significant role in the
representation of both small and large data sets, but it is especially useful when
we have large data sets, in which it is impossible to see all of our data, let alone
process and understand it manually.
Training and Testing: In this project, datasets are split into two subsets. The first
subset is known as the training data - it's a portion of our actual dataset that is
fed into the machine learning model to discover and learn patterns. In this way,
it trains our model. The other subset is known as the testing data.
Train and Evaluate Linear Regression: Simple linear regression is an approach
for predicting a quantitative response using a single feature (or "predictor" or
"input variable"). It takes the following form: y=β0+β1x
Page | 14
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
Screenshots
Page | 15
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
Page | 16
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
CHAPTER – 4
CONCLUSION
The major goal of this study was to create a prototype of a system that students
interested in studying in the United States might use. For this study, several
machine learning algorithms were created and used. When compared to the
Logistic regression model, Linear Regression demonstrated to be the greatest fit
for system development. The programme was designed with a basic user interface
to make it interactive and simple to use for non-technical people.
The ultimate goal of the study was met since the approach allows students to save
time and money that they would otherwise spend on education advisors and
application fees for colleges where they have a lower chance of being accepted.
It will also assist students in making better and faster decisions on university
applications.
Page | 17
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
BIBLIOGRAPHY
Bibodi, J., Vadodaria, A., Rawat, A. and Patel, J. (n.d.). Admission
Prediction System Using Machine Learning.
Abdul Fatah S; M, A. H. (2012). Hybrid Recommender System for
Predicting College Admission, pp. 107–113.
College Admission Predictor Journal of Network Communications and
Emerging Technologies (JNCET), Volume 8, Issue 4, April (2018).
Prediction of Admission Process for Gradational Studies using Al
Algorithm by Saurabh Singhal, Ashish Sharma. European Journal of
Molecular &Clinical Medicine Vol 7, Issue 4.
Graduate Admission Prediction Using Machine Learning December 2020
DOI:10.46300/91013.2020.14.13
Page | 18
Aqmenz automation private limited Dept. of ISE, RRIT 2023-24
APPENDIX
Appendix A: Abbreviation
IDE: Jupyter notebook is an open-source IDE that is used to create Jupyter
documents that can be created and shared with live codes. Also, it is a web-based
interactive computational environment. The Jupyter notebook can support
various languages that are popular in data science such as Python, Julia, Scala,
R, etc.
ANN: Artificial Neural networks (ANN) or neural networks are computational
algorithms. It intended to simulate the behavior of biological systems composed
of “neurons”. ANNs are computational models inspired by an animal's central
nervous systems. It is capable of machine learning as well as pattern recognition.
AI: Artificial intelligence (AI) refers to the simulation of human intelligence in
machines that are programmed to think like humans and mimic their actions. The
term may also be applied to any machine that exhibits traits associated with a
human mind such as learning and problem- solving.
ML: Machine learning (ML) is a type of artificial intelligence (AI) that allows
software applications to become more accurate at predicting outcomes without
being explicitly programmed to do so. Machine learning algorithms use historical
data as input to predict new output values.
Page | 19