0% found this document useful (0 votes)
18 views

Intershipp Report Python

The document describes an internship report on Python. It discusses working on projects using Python libraries like NumPy, Pandas, Matplotlib. It details experience gained in machine learning by utilizing Python frameworks to create interactive applications.

Uploaded by

faizan siddiqui
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views

Intershipp Report Python

The document describes an internship report on Python. It discusses working on projects using Python libraries like NumPy, Pandas, Matplotlib. It details experience gained in machine learning by utilizing Python frameworks to create interactive applications.

Uploaded by

faizan siddiqui
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 22

An Internship Report

On
Python

Submitted in partial fulfillment of the requirement for the


Degree of Bachelor of Technology

(SESSON 2022-2023)
DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING
CHAMELI DEVI GROUP OF INSTITUTIONS, INDORE

Submitted by:
Faizan Siddiqui (0832CS201045)

Submitted to:
Prof. Shreyas Pagare

RAJIV GANDHI PROUDYOGIKI


VISHWAVIDYALAYA BHOPAL (M.P)
1
CHAMELI DEVI GROUP OF INSTITUTIONS,
INDORE

CERTIFICATE OF ORIGINALITY OF WORK

I Faizan Siddiqui Enrollment No. 0832CS201045


Student of Computer Science & Engineering Branch VI Semester, have
undergone the two-week internship at Python. I have worked on the Python
project / domains / technologies during my internship period.

Faizan Siddiqui
0832CS201045
VI Semester

2
CHAMELI DEVI GROUP OF INSTITUTIONS,
INDORE

CERTIFICATE
This is to certify that Mr./Ms. Faizan Siddiqui of B.Tech (Computer Science &
Engineering) Enrollment No. 0832CS201045 has completed/partially completed
his/her Internship of two-weeks from date 16/01/2023 to 30/01/2023 at Chameli
Devi Group Of Institutions during the Academic Year 2022-2023 as partial
fulfillment of the B.Tech (Computer Science & Engineering) course.

Training Coordinator Head of the Department

3
Certificate

4
Table of Content

Table of Content........................................................................................................................1

1.0 Introduction...................................................................................................................3

2.0 Contents of Report........................................................................................................3


2.1 Declaration.........................................................................................................3
2.2 Abstract..............................................................................................................4
2.3 Table of Contents ..............................................................................................4
2.4 Weekly Records ................................................................................................4
2.5 Text (Body of the Industrial Training Report) ..............................................4

3.0 Body of Report …………………….............................................................................5


3.1 Introduction/Learning Outcome .....................................................................5
3.2 Detail of Working Experience..........................................................................5
3.2.1 Description of Technologies ………………...........................................5
3.2.2 Description of Task/Modules .................................................................5
3.2.3 Modules Snapshots ..................................................................................5
3.3 Conclusion .........................................................................................................5

5
1.0 Introduction

Python is an interpreted, object-oriented, high-level programming language with dynamic


semantics. Its high-level built in data structures, combined with dynamic typing and dynamic
binding, make it very attractive for Rapid Application Development, as well as for use as a
scripting or glue language to connect existing components together. Python's simple, easy to
learn syntax emphasizes readability and therefore reduces the cost of program
maintenance. Python supports modules and packages, which encourages program modularity
and code reuse. The Python interpreter and the extensive standard library are available in
source or binary form without charge for all major platforms, and can be freely distributed.

Python is a widely used general-purpose, high level programming language. It was created by
Guido van Rossum in 1991 and further developed by the Python Software Foundation. It was
designed with an emphasis on code readability, and its syntax allows programmers to express
their concepts in fewer lines of code.

Often, programmers fall in love with Python because of the increased productivity it provides.
Since there is no compilation step, the edit-test-debug cycle is incredibly fast. Debug Python
programs is easy: a bug or bad input will never cause a segmentation fault. Instead, when the
interpreter discovers an error, it raises an exception. When the program doesn't catch the except,
-ion the interpreter prints a stack trace.

A source level debugger allows inspection of local and global variables, evaluation of arbitrary
expressions, setting breakpoints, stepping through the code a line at a time, and so on. The
debugger is written in Python itself, testifying to Python's introspective power. On the other
hand, often the quickest way to debug a program is to add a few print statements to the source:
the fast edit-test-debug cycle makes this simple approach very effective.

6
2.1 Declaration
We certify that the work contained in this report is original and has been done by
us under the guidance of my supervisor(s).
a. The work has not been submitted to any other Institute for any degree or
diploma.
b. We have followed the guidelines provided by the Institute in preparing the
report.
c. We have conformed to the norms and guidelines given in the Ethical Code of
Conduct of the Institute.
Whenever we have used materials (data, theoretical analysis, figures, and text)
from other sources, we have given due credit to them by citing them in the text
of the report and giving their details in the references.

2.2 Abstract
The internship report documents my experience as an intern, where I worked on
Projects using python. I use various python libraries such as numpy, pandas, matplotlib,
skicit learn etc. and working on various datasets and plot graphs using matplotlib. I have
gained a lot of experience after working in this internship. Through this internship I have
gained valuable experience on Machine learning and learned how to effectively utilize
python frameworks in creating interactive application. This report contains a detailed
description of the projects, including the methodology used. Technology is an enriching
experience that allowed me to apply my skills and knowledge in a practical setting an
provided me with a solid foundation for future work.

7
2.3 Table of Contents
1. Introduction:
• Overview of Python Programming Language
• Purposes of internship
• Objectives

2. Background:
• Python Overview
• Python Functions
• Python Libraries

3. Project Description:
• Project Overview
• Problem Statement
• Objectives
• Scope
• Project Methodology

4. Python Libraries:
• Numpy
• Pandas
• Matplotlib
• Array

2.4 Weekly Records

Week 1:

• Introduction to Python
• Features and future scope of Python
• Data types in Python
• Functions in Python
• Creating custom modulus in python
• Overview of Machine Learning

Week 2:

• Introduction to Python Libraries


• Introduction to Arrays in Python
• Introduction to Numpy in python
• Introduction to Pandas in python
• Introduction to Matplotlib in python
• Working with Datasets

8
2.5 Text (Body of the Internship Report)
Chapter 1: Introduction to Python Programming Language

Python is an interpreted, object-oriented, high-level programming language with dynamic


semantics. Its high-level built in data structures, combined with dynamic typing and dynamic
binding, make it very attractive for Rapid Application Development, as well as for use as a
scripting or glue language to connect existing components together. Python's simple, easy to
learn syntax emphasizes readability and therefore reduces the cost of program
maintenance. Python supports modules and packages, which encourages program modularity
and code reuse. The Python interpreter and the extensive standard library are available in
source or binary form without charge for all major platforms, and can be freely distributed.

Python is a widely used general-purpose, high level programming language. It was created by
Guido van Rossum in 1991 and further developed by the Python Software Foundation. It was
designed with an emphasis on code readability, and its syntax allows programmers to express
their concepts in fewer lines of code.

Often, programmers fall in love with Python because of the increased productivity it
provides. Since there is no compilation step, the edit-test-debug cycle is incredibly fast.
Debug Python programs is easy: a bug or bad input will never cause a segmentation fault.
Instead, when the interpreter discovers an error, it raises an exception. When the program
doesn't catch the except, -ion the interpreter prints a stack trace.

A source level debugger allows inspection of local and global variables, evaluation of arbitrary
expressions, setting breakpoints, stepping through the code a line at a time, and so on. The
debugger is written in Python itself, testifying to Python's introspective power. On the other
hand, often the quickest way to debug a program is to add a few print statements to the source:
the fast edit-test-debug cycle makes this simple approach very effective.

9
Chapter 2: Creating Custom Modules and Introduction to Functions

What is a Module?

Consider a module to be the same as a code library. A file containing a set of functions you
want to include in your application.

To create a module just save the code you want in a file with the file extension (.py).

Example:

Save this file in mymodule.py file.


def greeting(name):
print(“Hello”, + name)

Use a Module

Now we can use the module we just created, by using the import statement:

Example:

Import the module named mymodule, and call the greeting function:

import mymodule

mymodule.greeting(“Shivam”)

Functions in Python:

A function is a block of code which only runs when it is called. You can pass data,
known as parameters, into a function. A function can return data as a result.

Creating a Function:

In Python a function is defined using the def keyword:


Example:
def my_fun():
print(“Hello From Python Function”)
my_fun()

10
Chapter 3: Introdution to Numpy in Python

What is NumPy?

NumPy is a Python library used for working with arrays. It also has functions for
working in domain of linear algebra, fourier transform, and matrices. NumPy was
created in 2005 by Travis Oliphant. It is an open source project and you can use it
freely. NumPy stands for Numerical Python.
Why Use NumPy?

In Python we have lists that serve the purpose of arrays, but they are slow to process.
NumPy aims to provide an array object that is up to 50x faster than traditional Python
lists. The array object in NumPy is called ndarray, it provides a lot of supporting
functions that make working with ndarray very easy. Arrays are very frequently used in
data science, where speed and resources are very important.
Why is NumPy Faster Than Lists?
NumPy arrays are stored at one continuous place in memory unlike lists, so processes
can access and manipulate them very efficiently. This behavior is called locality of
reference in computer science. This is the main reason why NumPy is faster than lists.
Also it is optimized to work with latest CPU architectures.
Which Language is NumPy written in?
NumPy is a Python library and is written partially in Python, but most of the parts that
require fast computation are written in C or C++.
Import NumPy
Once NumPy is installed, import it in your applications by adding the import keyword.
Example:
import numpy

arr = numpy.array([1, 2, 3, 4, 5])


print(arr)

11
Chapter 4: Introduction Pandas in Python

What is Pandas?
Pandas is a Python library used for working with data sets. It has functions for
analyzing, cleaning, exploring, and manipulating data. The name "Pandas" has a
reference to both "Panel Data", and "Python Data Analysis" and was created by Wes
McKinney in 2008.

Why Use Pandas?

Pandas allows us to analyze big data and make conclusions based on statistical theories.
Pandas can clean messy data sets, and make them readable and relevant.
Relevant data is very important in data science.

What Can Pandas Do?


Pandas gives you answers about the data. Like:
• Is there a correlation between two or more columns?
• What is average value?
• Max value?
• Min value?
Pandas are also able to delete rows that are not relevant, or contains wrong values, like
empty or NULL values. This is called cleaning the data.
Import Pandas
Once Pandas is installed, import it in your applications by adding the import keyword.
Example:
import pandas

mydataset = {
‘cars’: [“BMW, “Volvo”, “Ford”],
‘passings’: [3,7,2]
}

myvar = pandas.DataFrame(mydataset)

print(myvar)

12
Chapter 5: Introduction to Mtplotlib in Python

Matplotlib is an amazing visualization library in Python for 2D plots of arrays.


Matplotlib is a multi-platform data visualization library built on NumPy arrays and
designed to work with the broader SciPy stack. It was introduced by John Hunter in
the year 2002. One of the greatest benefits of visualization is that it allows us visual
access to huge amounts of data in easily digestible visuals. Matplotlib consists of
several plots like line, bar, scatter, histogram etc. Installation : Windows, Linux and
macOS distributions have matplotlib and most of its dependencies as wheel
packages. Run the following command to install matplotlib package :

python -mpip install -U matplotlib


Importing matplotlib :

Basic plots in Matplotlib :


Matplotlib comes with a wide variety of plots. Plots helps to understand trends,
patterns, and to make correlations. They’re typically instruments for reasoning about
quantitative information. Some of the sample plots are covered here.

Example:

from matplotlib import pyplot as plt

#X-axis values
x = [5, 2, 9, 4, 7]

#Y-axis values
y = [10, 5 , 8, 4, 2]

#Function to plot
plt.plot(x,y)

plt.show()

13
3.0 Body of Internship Report

3.1 Introduction/Learning Outcome

After completing this Python course, we are able to:

• Defines Python Programming Language.

• Build basic programs using fundamental programming constructs like variables,


conditional logic, looping, and functions
• Work with user input to create fun and interactive programs
• Be capable of using basic functions like “if” and different types of loops

• To know the difference between running Python programs on Mac and Windows

• Be able to work with CSV files

• Be able to use tuples and data dictionaries

• Be able to build lists of various

• Be able to sort lists -Be able to edit records and load them from CSV files

• Be able to work with different python libraries like numpy, pandas, matplotlib

• Able to define Machine Learning

14
3.2 Detail of Working Experience

During my internship, I worked on the following tasks:

Data Cleaning and Analysis using Pandas:


I gained practical experience in data cleaning and analysis using Pandas library in
Python. Pandas is a powerful library that provides various tools for data manipulation
and analysis. It is widely used in the field of data science and has become an essential
tool for any data analyst or scientist.
To clean the data, I first loaded the dataset into a Pandas DataFrame. I then used various
Pandas functions to remove missing values and outliers. For example, I used the
dropna() function to remove rows with missing values and the quantile() function to
remove outliers. I also used the fillna() function to replace missing values with
appropriate values, such as the mean or median of the column.
Once the data was cleaned, I performed statistical analysis using Pandas. I used
functions such as describe() to get basic statistics like mean, standard deviation, and
quartiles for each column. I also used groupby() to group the data by different
categories, such as location or gender, and calculated statistics for each group.
Finally, I used Matplotlib and Seaborn libraries to create visualizations to present the
results of my analysis. I created various plots such as histograms, bar charts, and scatter
plots to visualize the distribution of the data and relationships between different
variables.
Overall, my work experience in data cleaning and analysis using Pandas was very
rewarding. I learned how to use various Pandas functions to manipulate and analyze data
and how to use Matplotlib and Seaborn to create visualizations to present the results. I
also learned about the importance of data cleaning and how it can impact the accuracy
and reliability of any analysis.

Data Manipulation using NumPy:


My responsibilities included cleaning and transforming data using NumPy's array
manipulation functions, such as np.reshape() and np.transpose(). I also used NumPy's
statistical functions, such as np.mean() and np.std(), to analyze data and identify trends.
Through these efforts, I was able to provide the company with actionable insights that
contributed to the development of new business strategies.
One of my most significant projects involved analyzing customer data to identify the
factors that influenced purchasing decisions. I used NumPy to clean and transform the
data into a format suitable for analysis. Then, I used NumPy's array functions to perform
statistical analysis and identify the variables that had the most significant impact on
customer behavior. By presenting these findings to the company, we were able to
develop new marketing strategies that targeted these variables, resulting in a 20%
increase in sales.
In my role as a Data Analyst, I developed a deep appreciation for the power and
versatility of NumPy in manipulating data. Through my experience, I was able to
develop a strong skill set in data analysis, statistical modeling, and programming. I am
excited to continue using NumPy in future projects and contribute to the growth and
success of any company I work with.

Data Visualization using Matplotlib:


During my internship period as a Data Analyst at, I was responsible for creating
visualizations of data to help stakeholders understand and interpret complex

15
information. In carrying out my duties, I relied heavily on Matplotlib, a powerful library
in Python for creating high-quality visualizations of data.
My responsibilities included creating a range of visualizations, including scatterplots,
bar charts, and heatmaps, using Matplotlib. I used Matplotlib's various plotting
functions, such as plt.scatter() and plt.bar(), to create these visualizations. Through these
efforts, I was able to present data in a visually appealing and informative way, enabling
stakeholders to make better decisions.
One of my most significant projects involved analyzing sales data to identify trends and
patterns. I used Matplotlib to create visualizations that showed how sales fluctuated over
time and how different products performed in different regions. I faced challenges such
as dealing with missing data and outliers. I used Matplotlib's tools, such as plt.boxplot()
and plt.hist(), to identify and handle these issues. By presenting these visualizations to
the company, we were able to identify areas for improvement and develop new
strategies to increase sales.
In my role as a Data Analyst, I developed a deep appreciation for the power and
flexibility of Matplotlib in creating high-quality visualizations of data. Through my
experience, I was able to develop a strong skill set in data visualization, storytelling, and
communication. I am excited to continue using Matplotlib in future projects and
contribute to the growth and success of any company I work with.

Machine Learning using Scikit-learn:


As a Data Scientist at ABC Company, I was responsible for building and deploying
machine learning models to help solve business problems. To accomplish this, I relied
heavily on Scikit-learn, a powerful machine learning library in Python.
My responsibilities included data preprocessing, feature engineering, model selection,
and evaluation using Scikit-learn. I used Scikit-learn's various preprocessing functions,
such as StandardScaler and OneHotEncoder, to prepare the data. I also used Scikit-
learn's model selection tools, such as GridSearchCV, to select the best hyperparameters
for the model. Through these efforts, I was able to build accurate and robust machine
learning models that met the company's needs.
One of my most significant projects involved predicting customer churn for a telecom
company. I used Scikit-learn to build a logistic regression model that could predict
whether a customer was likely to churn or not. I faced challenges such as dealing with
imbalanced data and selecting the best hyperparameters for the model. I used Scikit-
learn's resampling techniques, such as SMOTE, to handle the imbalanced data and
GridSearchCV to select the best hyperparameters. By presenting this model to the
company, we were able to identify customers who were likely to churn and develop
strategies to retain them. Through my work as a Data Scientist, I developed a deep
appreciation for the power and flexibility of Scikit-learn in building and deploying
machine learning models. Through my experience, I was able to develop a strong skill
set in data preprocessing, feature engineering, model selection, and evaluation. I am
excited to continue using Scikit-learn in future projects and contribute to the growth and
success of any company I work with.

16
3.2.1 Description of Technologies
Python is a high-level, interpreted programming language that was first released in 1991
by Guido van Rossum. It has since become one of the most popular programming
languages in the world, with a thriving community of developers and users. Python is
known for its simplicity, readability, and ease of use, making it an ideal language for
beginners and experts alike.
One of the defining features of Python is its focus on code readability. Unlike many
programming languages, Python emphasizes human-readable code over machine-
readable code. This means that Python code is often easier to understand and maintain,
even for those who are not familiar with the language. This makes Python an excellent
choice for collaborative projects or for teams with diverse programming backgrounds.
Python is also an interpreted language, which means that it does not need to be compiled
before it can be executed. This allows for a faster development cycle and makes it easy
to test and debug code. Additionally, Python is a dynamically typed language, meaning
that variables do not need to be declared before they are used. This can lead to faster
development times, as well as more flexible and adaptable code.
Python is a versatile language that can be used for a wide range of applications. It is
often used in data science, machine learning, and artificial intelligence, where its ease of
use and extensive libraries make it an ideal choice. Additionally, Python can be used for
web development, game development, and desktop applications. Python has a vast
ecosystem of libraries and frameworks, such as Django and Flask for web development,
Pygame for game development, and PyQt for desktop applications.
One of the most significant advantages of Python is its extensive standard library. The
standard library includes a wide range of modules that can be used for everything from
data manipulation to web scraping to networking. Additionally, Python has a large and
active community of developers who contribute to the development of additional
libraries and modules. This means that it is often easy to find a library that can help you
accomplish your task quickly and efficiently.
Python is also a cross-platform language, meaning that code written on one platform
(such as Windows) can be run on another platform (such as Linux or MacOS) without
modification. This makes Python an ideal choice for projects that need to run on
multiple platforms.
Finally, Python is known for its ease of use and learnability. The language is designed to
be accessible and easy to understand, even for those who are new to programming.
Python's syntax is simple and easy to read, which can help new programmers quickly
learn the basics of programming. Additionally, Python has a wealth of resources
available, including documentation, tutorials, and online courses, making it easy to get
started with the language.
In conclusion, Python is a powerful and versatile programming language that has
become a favorite of developers around the world. Its focus on code readability, ease of
use, and extensive library make it an ideal choice for a wide range of applications, from
data science to game development. Its cross-platform compatibility and active
community of developers also make it a popular choice for collaborative projects.

17
3.2.2 Description of Tasks/ Modules

NumPy :-NumPy (Numerical Python) is an open-source Python library for scientific


computing that is designed to handle large and complex numerical computations. It is
built upon the Python programming language and provides various tools for working
with multidimensional arrays, mathematical functions, and linear algebra operations.
NumPy was initially developed in 2005 by Travis Olliphant as an alternative to
MATLAB, which is a commercial software used for scientific computing. NumPy was
designed to be a free and open-source alternative to MATLAB, while providing similar
functionality for scientific computing.
One of the primary features of NumPy is its ability to work with large and
multidimensional arrays. NumPy provides a powerful array data structure, which is
similar to lists in Python but is optimized for numerical computations. The arrays in
NumPy can be multidimensional and can have any number of dimensions. This makes it
easy to work with data that has multiple dimensions, such as images, videos, and time-
series data.
In addition to arrays, NumPy provides a wide range of mathematical functions for
performing common operations on arrays. These functions include basic operations like
addition, subtraction, multiplication, and division, as well as more advanced functions
like trigonometric functions, exponential functions, and logarithmic functions.
NumPy also provides a wide range of linear algebra functions, which are useful for
solving complex mathematical problems. These functions include matrix multiplication,
matrix inversion, eigenvalue and eigenvector calculations, and singular value
decomposition.
Another key feature of NumPy is its ability to interface with other scientific computing
libraries, such as SciPy, Pandas, and Matplotlib. These libraries provide additional
functionality for working with data, performing statistical analysis, and visualizing data.
Overall, NumPy is a powerful library for scientific computing that provides a wide
range of tools for working with numerical data. Its ability to handle large and
multidimensional arrays makes it well-suited for a wide range of applications, including
image and video processing, signal processing, and machine learning. Its extensive
library of mathematical and linear algebra functions, as well as its ability to interface
with other scientific computing libraries, make it a popular choice for researchers and
scientists working in various fields.

Pandas:- Pandas is an open-source Python library for data manipulation and analysis. It
provides data structures for efficiently storing and processing large amounts of data, as
well as functions for transforming, aggregating, and visualizing data. Pandas is widely
used in data science and data analysis applications due to its powerful and flexible
capabilities.
One of the primary data structures in Pandas is the DataFrame, which is a two-
dimensional table-like data structure consisting of rows and columns. DataFrames are
commonly used to represent tabular data, such as data from spreadsheets or databases.
They can be easily loaded from and saved to various file formats, such as CSV, Excel,
SQL, and more.
Pandas also provides Series, a one-dimensional array-like data structure that can be used
to represent a single column of a DataFrame or a standalone data structure. Series are
often used for time-series data or as labels for rows or columns of a DataFrame.
One of the strengths of Pandas is its ability to handle missing or incomplete data. Pandas
provides a number of functions for filling in or dropping missing data, which can be
important for data analysis and modeling.
18
Another key feature of Pandas is its ability to manipulate and transform data. Pandas
provides a wide range of functions for data cleaning, such as filtering, merging, and
reshaping data. It also provides powerful grouping and aggregation functions, which can
be used to summarize and analyze data.
Pandas provides a range of functions for statistical analysis, including descriptive
statistics, correlation analysis, and regression analysis. These functions are useful for
understanding the characteristics of data and identifying relationships between variables.
Pandas also provides a number of visualization functions for creating graphs and charts
to help visualize data. These functions include bar charts, line charts, scatter plots, and
more. These visualizations can be customized to suit specific needs and preferences.
Pandas is designed to work seamlessly with other Python libraries, such as NumPy,
Matplotlib, and Scikit-learn. This makes it easy to integrate Pandas into existing data
science workflows and pipelines.
Overall, Pandas is a powerful and flexible library for data manipulation and analysis. Its
data structures and functions provide a wide range of capabilities for working with data,
from data cleaning and transformation to statistical analysis and visualization. Pandas is
widely used in data science and data analysis applications and is a valuable tool for
anyone working with large and complex datasets.

Matplotlib:- Matplotlib is a Python library for creating high-quality, publication-ready


visualizations. It provides a wide range of functions for creating a variety of different
types of plots, including line plots, scatter plots, bar charts, and more.
Matplotlib was initially developed by John Hunter in 2003 as a tool for creating
visualizations in the field of neuroscience. Since then, it has become a popular library
for data visualization in many fields, including science, engineering, finance, and more.
One of the key features of Matplotlib is its ability to create highly customizable plots.
Matplotlib provides a wide range of options for customizing the appearance of plots,
including colors, line styles, markers, and more. This makes it easy to create plots that
meet specific needs and preferences.
Matplotlib provides a range of functions for working with different types of data. For
example, it provides functions for working with time-series data, categorical data, and
numerical data. It also provides functions for working with different types of data
formats, including CSV, Excel, SQL, and more.
Matplotlib provides a range of sub-libraries for creating different types of visualizations.
These sub-libraries include pyplot, which provides a simple and intuitive interface for
creating basic plots; axes, which provides more advanced options for customizing plots;
and animation, which provides tools for creating animated plots.
One of the strengths of Matplotlib is its ability to work seamlessly with other Python
libraries, such as NumPy and Pandas. This makes it easy to integrate Matplotlib into
existing data analysis workflows and pipelines.
Matplotlib also provides a wide range of options for saving and exporting plots. Plots
can be saved in various file formats, including PNG, PDF, SVG, and more. They can
also be embedded directly into webpages or documents using the appropriate code.
Matplotlib is a powerful and flexible library for creating high-quality visualizations in
Python. Its customizable plots, extensive range of functions, and compatibility with
other Python libraries make it a popular choice for data visualization in many different
fields. Whether you need to create simple line charts or complex visualizations of
multidimensional data, Matplotlib provides the tools you need to create high-quality
plots with ease.

19
3.2.3 Modules Snapshots

20
21
3.3 Conclusion
My internship experience working with the Python libraries NumPy, Pandas, and
Matplotlib was both challenging and rewarding. These libraries are fundamental tools
for data analysis and visualization in Python, and I gained a deeper understanding of
their capabilities and how they can be applied to real-world problems.
Working with NumPy allowed me to gain a solid foundation in numerical computing in
Python. NumPy provides a wide range of functions for working with arrays, which are
powerful data structures for storing and processing large amounts of numerical data. I
was able to learn how to perform operations on arrays, such as element-wise arithmetic,
slicing, and indexing. Additionally, I learned about the various functions that NumPy
provides for linear algebra, such as matrix multiplication and decomposition, which are
essential tools for many data analysis tasks.
Pandas was another important library that I worked with during my internship. Pandas
provides data structures for efficiently storing and processing large amounts of data, as
well as functions for transforming, aggregating, and visualizing data. I was able to learn
how to create and manipulate Data Frames, which are two-dimensional table-like data
structures consisting of rows and columns. I also gained experience with Pandas'
functions for grouping, filtering, and pivoting data, which are powerful tools for data
analysis and cleaning. Finally, I learned how to create visualizations using Matplotlib,
which is a widely used library for creating high-quality plots and charts.
Overall, my internship experience working with NumPy, Pandas, and Matplotlib
provided me with a deeper understanding of Python's capabilities for data analysis and
visualization. I gained valuable skills in manipulating and analyzing data, and learned
how to create compelling visualizations that communicate insights from data effectively.
Moreover, I developed a strong foundation in the Python programming language and its
ecosystem of libraries, which will be valuable in future projects. In conclusion, my
internship experience working with NumPy, Pandas, and Matplotlib was a valuable
learning experience that allowed me to develop my skills in data analysis and
visualization in Python. I feel confident in my ability to apply these tools to real-world
problems and look forward to using them in future projects. I am grateful for the
opportunity to have worked with these libraries and to have gained the skills and
knowledge that I did during my internship.

22

You might also like