Projet Python

Uploaded by

islemfatmagamha1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views13 pages

Projet Python

Uploaded by

islemfatmagamha1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Startup Investment Predictor :

Leveraging Machine Learning to Identify the Best Startups

to Invest In

Presented by
Ramy Lazghab & Islam Fatma Gamha
Introduction

Why This
Project?
Investors often face challenges in identifying promising
startups due to a lack of structured data and insights.

To develop a machine learning model that predicts the

market worth of startups based on critical factors like the
company age, region, deals flow, markets and products, and
the investments per stage.
Objectives 1. Automate the evaluation of startups using data-

Project
driven techniques.
2. Provide actionable insights on the top startups to

Objectives
invest in.
3. Compare multiple machine learning models for
performance.
4. Offer transparency and reproducibility through clear
data handling and predictions.
Methodology

Process :
We followed a methodology inspired by the CRISP-DM
framework, beginning with an in-depth analysis of the
sector’s specificities to guide our data scraping strategy.

We organized our work into phases by adhering to the

iterative CRISP-DM process. Our initial project assessment
highlighted areas that needed more attention and revealed
misconceptions, enabling us to adjust our approach and
refine our focus for better alignment with the project goals.
Scrapping - Dataset

Dataset
Dataset includes startup attributes like
company name, stage, dealflow, region,
creation date, market value, etc.

Overview
We Scraped the data from AngelList (name,
stage and deals flow).
We augmented our data through the
integration of Gemini.
We got over than 160 Startups data.
Data Preprocessing

Cleaning and
Preparing the
Data

Explain key preprocessing steps:

1. Converting creation date to startup age.

2. Normalization of the numeric data.
3. Handling missing data and one-hot encoding
categorical variables plus TF-IDF for the
textual data.
Data Visualization

Data Analysis
and visualization
Analyzing the key relationships between different
features and gaining an overview of the dataset's
characteristics.
List the models used and why:

1. Linear Regression: For its simplicity and

interpretability.
2. Lasso Regression: To handle feature selection and
regularization.
3. Support Vector Regression (SVR): This captures
complex patterns.
4. Random Forest Regression: For its ability to
handle non-linearity and feature importance
estimation

Machine
Learning Models
Model
Performance
Metrics used: Mean Squared Error (MSE) and R² Score.
Present key results:
Linear Regression achieved the highest R² score of 0.69 and
the lowest MSE.
Compare its performance against other models:

Linear Regression: R² = 0.69

Lasso Regression: R² = 0.10
Support Vector Regression: R² = 0.67
Random Forest Regression: R² = 0.60
What We
Learned
The most influential factors are "Stage, region, and startup
age significantly impact market worth."
Highlight unexpected findings: "Dealflow showed less
influence than initially anticipated."
Benefits of combining models.
Demo
Conclusion
This project provides a reliable, Machine learning can enhance
data-driven solution for startup investor decision-making and
investment decisions. reduce risks.
Thank You

1975 Patchogue-Medford High Yearbook - Part 2 - Activities and Sports
No ratings yet
1975 Patchogue-Medford High Yearbook - Part 2 - Activities and Sports
86 pages
Nervous System Lesson Plan
100% (2)
Nervous System Lesson Plan
9 pages
Am Jetstream Pre-Int Unit 8 Lesson 3
100% (5)
Am Jetstream Pre-Int Unit 8 Lesson 3
3 pages
DATA ANALYSIS AND DATA SCIENCE: Unlock Insights and Drive Innovation with Advanced Analytical Techniques (2024 Guide)
From Everand
DATA ANALYSIS AND DATA SCIENCE: Unlock Insights and Drive Innovation with Advanced Analytical Techniques (2024 Guide)
WINTON CLEM
No ratings yet
Setting Crunchbase For Data Science Prep
No ratings yet
Setting Crunchbase For Data Science Prep
9 pages
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Predictive Analysis of Fuel Prices Using Machine Learning
No ratings yet
Predictive Analysis of Fuel Prices Using Machine Learning
6 pages
Spreadsheets To Cubes (Advanced Data Analytics for Small Medium Business): Data Science
From Everand
Spreadsheets To Cubes (Advanced Data Analytics for Small Medium Business): Data Science
alasdair gilchrist
No ratings yet
IJERT Machine Learning Based Outcome Pre
No ratings yet
IJERT Machine Learning Based Outcome Pre
4 pages
Microsoft Dynamics NAV Administration
From Everand
Microsoft Dynamics NAV Administration
Amit Sachdev
No ratings yet
C++ for Finance: Writing Fast and Reliable Trading Algorithms
From Everand
C++ for Finance: Writing Fast and Reliable Trading Algorithms
Robert Johnson
No ratings yet
DeepSeek for Startups: Turning Insights into Action and Profit
From Everand
DeepSeek for Startups: Turning Insights into Action and Profit
Bill Riley
No ratings yet
Sales Prediction For Big Mart 3.0.pptx MM
No ratings yet
Sales Prediction For Big Mart 3.0.pptx MM
25 pages
Data Quality: Empowering Businesses with Analytics and AI
From Everand
Data Quality: Empowering Businesses with Analytics and AI
Prashanth Southekal
No ratings yet
Smarter Data Science: Succeeding with Enterprise-Grade Data and AI Projects
From Everand
Smarter Data Science: Succeeding with Enterprise-Grade Data and AI Projects
Neal Fishman
No ratings yet
Searching For A Unicorn A Machine Learning Approach
No ratings yet
Searching For A Unicorn A Machine Learning Approach
57 pages
Business Intelligence and Data Mining Techniques
From Everand
Business Intelligence and Data Mining Techniques
Dwaipayan Sethi
No ratings yet
AI ML K6rn1i 54 Merged
No ratings yet
AI ML K6rn1i 54 Merged
6 pages
The MSP’s Guide to the Ultimate Client Experience: Optimizing service efficiency, account management productivity, and client engagement with a modern digital-first approach.
From Everand
The MSP’s Guide to the Ultimate Client Experience: Optimizing service efficiency, account management productivity, and client engagement with a modern digital-first approach.
Jeff Farris
No ratings yet
DeepSeek for Data Analysis: The Future of Data Analysis for Business Professionals
From Everand
DeepSeek for Data Analysis: The Future of Data Analysis for Business Professionals
Mohammod Shaharuzzaman
No ratings yet
Mini Project 2nd
No ratings yet
Mini Project 2nd
32 pages
Making Big Data Work for Your Business: A guide to effective Big Data analytics
From Everand
Making Big Data Work for Your Business: A guide to effective Big Data analytics
Sudhi Sinha
No ratings yet
Select Business Using Machine Learning
No ratings yet
Select Business Using Machine Learning
24 pages
DsNaIT v2.0
No ratings yet
DsNaIT v2.0
43 pages
RP Final
No ratings yet
RP Final
13 pages
Mastering Lead Generation with DeepSeek AI: Unlocking the Future of Customer Acquisition
From Everand
Mastering Lead Generation with DeepSeek AI: Unlocking the Future of Customer Acquisition
Robert Cullen
No ratings yet
FinalPaper SalesPredictionModelforBigMart
No ratings yet
FinalPaper SalesPredictionModelforBigMart
14 pages
Oe Cae 3
No ratings yet
Oe Cae 3
7 pages
Data Analytics and Data Processing Essentials
From Everand
Data Analytics and Data Processing Essentials
gareth thomas
No ratings yet
Laptop Price Pred
No ratings yet
Laptop Price Pred
11 pages
Big Data Visualization
From Everand
Big Data Visualization
James D. Miller
No ratings yet
Business Analytics: Leveraging Data for Insights and Competitive Advantage
From Everand
Business Analytics: Leveraging Data for Insights and Competitive Advantage
Ronald BLaha
No ratings yet
Big Data: Understanding How Data Powers Big Business
From Everand
Big Data: Understanding How Data Powers Big Business
Bill Schmarzo
2/5 (1)
Oracle CRM On Demand Administration Essentials
From Everand
Oracle CRM On Demand Administration Essentials
Padmanabha Rao
No ratings yet
Data Entry Operator: Skills, Software, Career Tips, and Interview Q&A
From Everand
Data Entry Operator: Skills, Software, Career Tips, and Interview Q&A
Sumitra Kumari
No ratings yet
Mastering Lead Generation with DeepSeek AI/ A Comprehensive Guide to Transforming Your Sales Strategy
From Everand
Mastering Lead Generation with DeepSeek AI/ A Comprehensive Guide to Transforming Your Sales Strategy
Robert Cullen
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet
Knight's Microsoft Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server Integration, Analysis, and Reporting Services with Excel and SharePoint
From Everand
Knight's Microsoft Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server Integration, Analysis, and Reporting Services with Excel and SharePoint
Brian Knight
3/5 (1)
IBM Cognos Business Intelligence
From Everand
IBM Cognos Business Intelligence
Dustin Adkison
No ratings yet
Business Analysis : Learn in 24 Hours
From Everand
Business Analysis : Learn in 24 Hours
Alex Nordeen
No ratings yet
Decision Making with Data
From Everand
Decision Making with Data
Ravi Deshpande
No ratings yet
Supervised Learning Research Paper With Images (1)
No ratings yet
Supervised Learning Research Paper With Images (1)
10 pages
Analysis Phase: The Business Leader's Playbook of Software Development, #2
From Everand
Analysis Phase: The Business Leader's Playbook of Software Development, #2
Michael Afar
No ratings yet
CDP Systems and Implementation: Definitive Reference for Developers and Engineers
From Everand
CDP Systems and Implementation: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
From Everand
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
Eric Tome
No ratings yet
Predicting Profit of A Startup Companies Using Machine Learning Algorithms
No ratings yet
Predicting Profit of A Startup Companies Using Machine Learning Algorithms
5 pages
PPIR!1
No ratings yet
PPIR!1
9 pages
Introduction to Business Analytics
From Everand
Introduction to Business Analytics
Dwaipayan Sethi
No ratings yet
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
Stock Prediction Project
No ratings yet
Stock Prediction Project
3 pages
IT Interview Guide for Freshers: Crack your IT interview with confidence
From Everand
IT Interview Guide for Freshers: Crack your IT interview with confidence
Sameer S Paradkar
No ratings yet
Crack the Data Analyst Interview: Real-Time Questions & Expert Answers
From Everand
Crack the Data Analyst Interview: Real-Time Questions & Expert Answers
Yash d.
No ratings yet
DataRobot: Practical Automation for Enterprise AI
From Everand
DataRobot: Practical Automation for Enterprise AI
Richard Johnson
No ratings yet
Data Analytics for Marketing: A practical guide to analyzing marketing data using Python
From Everand
Data Analytics for Marketing: A practical guide to analyzing marketing data using Python
Guilherme Diaz-Bérrio
No ratings yet
ML Project
100% (1)
ML Project
10 pages
Chapter AI PDF
No ratings yet
Chapter AI PDF
20 pages
House Price Prediction: Numpy Pandas Matplotlib Seaborn
No ratings yet
House Price Prediction: Numpy Pandas Matplotlib Seaborn
8 pages
Data Cleaning with Power BI: The definitive guide to transforming dirty data into actionable insights
From Everand
Data Cleaning with Power BI: The definitive guide to transforming dirty data into actionable insights
Gus Frazer
No ratings yet
AI-Driven DAOs: Architecting the Future of Startup Ecosystems
From Everand
AI-Driven DAOs: Architecting the Future of Startup Ecosystems
Engr. Rajib Mazumder
No ratings yet
Application and Technology Rationalization: A Strategic Guide for Midsize to Large Companies: IT and Digital Transformation
From Everand
Application and Technology Rationalization: A Strategic Guide for Midsize to Large Companies: IT and Digital Transformation
Pavi Agrawal
No ratings yet
Ids Case Study
No ratings yet
Ids Case Study
15 pages
The Comprehensive Guide to RPA, IDP, and Workflow Automation: For Business Efficiency and Revenue Growth
From Everand
The Comprehensive Guide to RPA, IDP, and Workflow Automation: For Business Efficiency and Revenue Growth
Rick Spair
No ratings yet
E Commerce Project
No ratings yet
E Commerce Project
12 pages
MKT 837
No ratings yet
MKT 837
197 pages
Pharmacognosy
No ratings yet
Pharmacognosy
10 pages
Labrador, Maria Romina.-G3-Pinagbuhatanes-Nlc-Accomplishment-Report-Wk2
No ratings yet
Labrador, Maria Romina.-G3-Pinagbuhatanes-Nlc-Accomplishment-Report-Wk2
5 pages
CRS SS 1 Week 5
No ratings yet
CRS SS 1 Week 5
12 pages
Undergrad Thesis
100% (1)
Undergrad Thesis
85 pages
Resume Philip Reichle For Teaching - Act
No ratings yet
Resume Philip Reichle For Teaching - Act
2 pages
Wiley India Textbooks Price List - June 2015 PDF
No ratings yet
Wiley India Textbooks Price List - June 2015 PDF
152 pages
Science 4 Parts of Animals For Getting Food
No ratings yet
Science 4 Parts of Animals For Getting Food
6 pages
Class 9 (Physics)
No ratings yet
Class 9 (Physics)
16 pages
Campbell Ashley Resume2
No ratings yet
Campbell Ashley Resume2
2 pages
Dorothy Johnson
No ratings yet
Dorothy Johnson
10 pages
Academic Vacancies
No ratings yet
Academic Vacancies
5 pages
Phyllis Creme, Mary Lea-Writing at University-Open University Press (2008) PDF
100% (1)
Phyllis Creme, Mary Lea-Writing at University-Open University Press (2008) PDF
234 pages
g6 Chapter 2
100% (1)
g6 Chapter 2
13 pages
IBM-MBCET Note College-1
No ratings yet
IBM-MBCET Note College-1
1 page
G3 Chapter 1 Revised
No ratings yet
G3 Chapter 1 Revised
15 pages
Annual Report 2013
No ratings yet
Annual Report 2013
65 pages
Occupational Therapy in Short Term Psychiatry Edited by Moya Willson
No ratings yet
Occupational Therapy in Short Term Psychiatry Edited by Moya Willson
340 pages
Classroom Management 1
No ratings yet
Classroom Management 1
4 pages
Error Analysis - TEFL
No ratings yet
Error Analysis - TEFL
8 pages
Final Poster Presentation
No ratings yet
Final Poster Presentation
1 page
Unit 4
No ratings yet
Unit 4
62 pages
Cultural Safety in Nursing Newzeland
No ratings yet
Cultural Safety in Nursing Newzeland
9 pages
Practical Research
No ratings yet
Practical Research
55 pages
Classroom Management Plan
No ratings yet
Classroom Management Plan
24 pages
1.1 Introduction To DTB
No ratings yet
1.1 Introduction To DTB
20 pages
New Maths GCSE - G19 - Length, Area and Volume Scale Factors PDF
No ratings yet
New Maths GCSE - G19 - Length, Area and Volume Scale Factors PDF
4 pages

Projet Python

Uploaded by

Projet Python

Uploaded by

Startup Investment Predictor :

Leveraging Machine Learning to Identify the Best Startups

To develop a machine learning model that predicts the

We organized our work into phases by adhering to the

Explain key preprocessing steps:

1. Converting creation date to startup age.

1. Linear Regression: For its simplicity and

Linear Regression: R² = 0.69

You might also like