
BigData-V I

The document compares the 5-year total cost of ownership (TCO) of hosting a data and analytics platform on AWS, IBM (on-premise private cloud), Google Cloud, and Microsoft Azure. It provides cost estimates for infrastructure/hosting, data extraction from source systems, application development, and on-premise setup, split into yearly capital (CAPEX) and operating (OPEX) expenditure. Google Cloud has the lowest estimated 5-year TCO at INR 1,153.71 Mn (about INR 1.15 billion), while AWS has the highest at INR 1,362.97 Mn (about INR 1.36 billion); the on-premise IBM option comes to INR 1,336.54 Mn and Microsoft to INR 1,321.27 Mn.

Uploaded by

1977am

AWS

5 Y TCO (INR)
Data Extraction = Data Extraction from Source Systems
App Dev Services = Application Development Services (23 = PM(1) + SA(3) + Dev(15) + QA(4))
On Premise Set Up = Security + Network + LZ

Period    Infra/Cloud Hosting  Data Extraction  App Dev Services  On Premise Set Up  Total INR         Total INR Mn
Y0 CAPEX  0.00                 25,000,000.00    -                 30,000,000.00      55,000,000.00     55.00
Y0 OPEX   97,933,704.22        -                88,200,000.00     15,000,000.00      201,133,704.22    201.13
Y1 CAPEX  -                    25,000,000.00    -                 20,000,000.00      45,000,000.00     45.00
Y1 OPEX   119,036,367.26       2,500,000.00     88,200,000.00     18,000,000.00      227,736,367.26    227.74
Y2 CAPEX  -                    10,000,000.00    -                 10,000,000.00      20,000,000.00     20.00
Y2 OPEX   133,852,798.25       3,500,000.00     88,200,000.00     18,000,000.00      243,552,798.25    243.55
Y3 CAPEX  -                    10,000,000.00    -                 -                  10,000,000.00     10.00
Y3 OPEX   153,226,896.05       4,500,000.00     88,200,000.00     18,000,000.00      263,926,896.05    263.93
Y4 CAPEX  -                    10,000,000.00    -                 -                  10,000,000.00     10.00
Y4 OPEX   174,922,043.45       5,500,000.00     88,200,000.00     18,000,000.00      286,622,043.45    286.62
Total     678,971,809.22       96,000,000.00    441,000,000.00    147,000,000.00     1,362,971,809.22  1,362.97

NOTES
Internal head count cost is not factored into the above cost estimates.
First 2 years 25 data sources, remaining 3 years 10 data sources extracted.
BI Reporting is not included in the above estimates.
Security software like Protegrity costs around 1 Cr (Capex) annually & one time.
Network connectivity to the cloud provider is 1.5 Cr annually for 1 Gbps (Opex).
LZ nodes will cost around 2 Cr in the 1st year and taper down gradually (Capex).
Cost estimates are ballpark and can vary significantly depending upon the choice of platform.
TCO may vary with the INR-USD exchange rate. Current rate: 1 USD = 73 INR.
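The AWS grand total can be cross-checked from the per-line 5-year totals listed in the sheet. The following is a minimal sketch of that arithmetic; the dictionary layout and the helper names `total_inr` and `to_inr_mn` are illustrative assumptions, not part of the original workbook.

```python
from decimal import Decimal

# 5-year totals per cost line for the AWS option, in INR,
# copied from the "Total" column of the sheet.
aws_line_totals = {
    "Infra/Cloud Hosting": Decimal("678971809.22"),
    "Data Extraction from Source Systems": Decimal("96000000.00"),
    "Application Development Services": Decimal("441000000.00"),
    "On Premise Set Up (Security + Network + LZ)": Decimal("147000000.00"),
}


def total_inr(line_totals):
    """Sum the per-line totals (hypothetical helper)."""
    return sum(line_totals.values(), Decimal("0"))


def to_inr_mn(amount_inr):
    """Convert INR to INR millions, rounded to two decimals (hypothetical helper)."""
    return (amount_inr / Decimal("1000000")).quantize(Decimal("0.01"))


aws_total = total_inr(aws_line_totals)
aws_total_mn = to_inr_mn(aws_total)
print(aws_total, aws_total_mn)
```

Using `Decimal` rather than floats keeps the paise-level figures exact, which makes it easier to spot a mistyped cell when the result is compared with the sheet's own total.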
IBM

5 Y TCO (INR)
Data Extraction = Data Extraction from Source Systems
App Dev Services = Application Development Services (23 = PM(1) + SA(3) + Dev(15) + QA(4))
On Premise Set Up = Security + Network + LZ

Period    IBM Infra       Data Extraction  App Dev Services  On Premise Set Up  Data Center Hosting Cost  Total INR         Total INR Mn
Y0 CAPEX  248,397,463.76  25,000,000.00    -                 30,000,000.00      300,000.00                303,697,463.76    303.70
Y0 OPEX   -               -                88,200,000.00     15,000,000.00      8,179,500.00              111,379,500.00    111.38
Y1 CAPEX  65,637,073.23   25,000,000.00    -                 20,000,000.00      180,000.00                110,817,073.23    110.82
Y1 OPEX   -               2,500,000.00     88,200,000.00     18,000,000.00      13,084,788.00             121,784,788.00    121.78
Y2 CAPEX  62,826,346.55   10,000,000.00    -                 10,000,000.00      120,000.00                82,946,346.55     82.95
Y2 OPEX   -               3,500,000.00     88,200,000.00     18,000,000.00      16,356,588.00             126,056,588.00    126.06
Y3 CAPEX  79,985,077.40   10,000,000.00    -                 -                  120,000.00                90,105,077.40     90.11
Y3 OPEX   -               4,500,000.00     88,200,000.00     18,000,000.00      19,628,388.00             130,328,388.00    130.33
Y4 CAPEX  114,708,104.14  10,000,000.00    -                 -                  120,000.00                124,828,104.14    124.83
Y4 OPEX   -               5,500,000.00     88,200,000.00     18,000,000.00      22,900,188.00             134,600,188.00    134.60
Total     571,554,065.08  96,000,000.00    441,000,000.00    147,000,000.00     80,989,452.00             1,336,543,517.08  1,336.54

NOTES
Internal head count cost is not factored into the above cost estimates.
First 2 years 25 data sources, remaining 3 years 10 data sources extracted.
BI Reporting is not included in the above estimates.
Security software like Protegrity costs around 1 Cr (Capex) annually & one time.
Network connectivity to the cloud provider is 1.5 Cr annually for 1 Gbps (Opex).
LZ nodes will cost around 2 Cr in the 1st year and taper down gradually (Capex).
Cost estimates are ballpark and can vary significantly depending upon the choice of platform.
GOOGLE

5 Y TCO (INR)
Data Extraction = Data Extraction from Source Systems
App Dev Services = Application Development Services (23 = PM(1) + SA(3) + Dev(15) + QA(4))
On Premise Set Up = Security + Network + LZ

Period    Infra/Cloud Hosting  Data Extraction  App Dev Services  On Premise Set Up  Total INR         Total INR Mn
Y0 CAPEX  0.00                 25,000,000.00    -                 30,000,000.00      55,000,000.00     55.00
Y0 OPEX   34,586,260.97        -                88,200,000.00     15,000,000.00      137,786,260.97    137.79
Y1 CAPEX  -                    25,000,000.00    -                 20,000,000.00      45,000,000.00     45.00
Y1 OPEX   65,564,770.57        2,500,000.00     88,200,000.00     18,000,000.00      174,264,770.57    174.26
Y2 CAPEX  -                    10,000,000.00    -                 10,000,000.00      20,000,000.00     20.00
Y2 OPEX   89,644,324.59        3,500,000.00     88,200,000.00     18,000,000.00      199,344,324.59    199.34
Y3 CAPEX  -                    10,000,000.00    -                 -                  10,000,000.00     10.00
Y3 OPEX   121,104,524.38       4,500,000.00     88,200,000.00     18,000,000.00      231,804,524.38    231.80
Y4 CAPEX  -                    10,000,000.00    -                 -                  10,000,000.00     10.00
Y4 OPEX   158,807,757.31       5,500,000.00     88,200,000.00     18,000,000.00      270,507,757.31    270.51
Total     469,707,637.81       96,000,000.00    441,000,000.00    147,000,000.00     1,153,707,637.81  1,153.71

NOTES
Internal head count cost is not factored into the above cost estimates.
First 2 years 25 data sources, remaining 3 years 10 data sources extracted.
BI Reporting is not included in the above estimates.
Security software like Protegrity costs around 1 Cr (Capex) annually & one time.
Network connectivity to the cloud provider is 1.5 Cr annually for 1 Gbps (Opex).
LZ nodes will cost around 2 Cr in the 1st year and taper down gradually (Capex).
Cost estimates are ballpark and can vary significantly depending upon the choice of platform.
Microsoft

5 Y TCO (INR)
Data Extraction = Data Extraction from Source Systems
App Dev Services = Application Development Services (23 = PM(1) + SA(3) + Dev(15) + QA(4))
On Premise Set Up = Security + Network + LZ

Period    Infra/Cloud Hosting  Data Extraction  App Dev Services  On Premise Set Up  Total INR         Total INR Mn
Y0 CAPEX  0.00                 25,000,000.00    -                 30,000,000.00      55,000,000.00     55.00
Y0 OPEX   46,978,135.01        -                88,200,000.00     15,000,000.00      150,178,135.01    150.18
Y1 CAPEX  -                    25,000,000.00    -                 20,000,000.00      45,000,000.00     45.00
Y1 OPEX   86,036,686.72        2,500,000.00     88,200,000.00     18,000,000.00      194,736,686.72    194.74
Y2 CAPEX  -                    10,000,000.00    -                 10,000,000.00      20,000,000.00     20.00
Y2 OPEX   117,159,101.45       3,500,000.00     88,200,000.00     18,000,000.00      226,859,101.45    226.86
Y3 CAPEX  -                    10,000,000.00    -                 -                  10,000,000.00     10.00
Y3 OPEX   163,597,613.10       4,500,000.00     88,200,000.00     18,000,000.00      274,297,613.10    274.30
Y4 CAPEX  -                    10,000,000.00    -                 -                  10,000,000.00     10.00
Y4 OPEX   223,500,747.75       5,500,000.00     88,200,000.00     18,000,000.00      335,200,747.75    335.20
Total     637,272,284.04       96,000,000.00    441,000,000.00    147,000,000.00     1,321,272,284.04  1,321.27
5 Y TCO by provider (INR Mln)

Provider   Item   Y0      Y1      Y2      Y3      Y4      Total
AWS        CAPEX  55.00   45.00   20.00   10.00   10.00   140.00
AWS        OPEX   201.13  227.74  243.55  263.93  286.62  1,222.97
AWS        TOTAL  256.13  272.74  263.55  273.93  296.62  1,362.97

IBM        CAPEX  303.70  110.82  82.95   90.11   124.83  712.39
IBM        OPEX   111.38  121.78  126.06  130.33  134.60  624.15
IBM        TOTAL  415.08  232.60  209.00  220.43  259.43  1,336.54

Google     CAPEX  55.00   45.00   20.00   10.00   10.00   140.00
Google     OPEX   137.79  174.26  199.34  231.80  270.51  1,013.71
Google     TOTAL  192.79  219.26  219.34  241.80  280.51  1,153.71

Microsoft  CAPEX  55.00   45.00   20.00   10.00   10.00   140.00
Microsoft  OPEX   150.18  194.74  226.86  274.30  335.20  1,181.27
Microsoft  TOTAL  205.18  239.74  246.86  284.30  345.20  1,321.27

[Chart: cost (INR Mln) per year, Y0 to Y4, for Microsoft, AWS, Google and IBM]
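To compare the four options at a glance, the 5-year CAPEX and OPEX totals from the summary sheet can be added up and ranked. This is a small sketch under the assumption that the summary figures (INR Mln) are taken at face value; the variable names are illustrative.

```python
# (total CAPEX, total OPEX) per provider in INR Mln, from the summary sheet.
capex_opex_inr_mn = {
    "AWS": (140.00, 1222.97),
    "IBM": (712.39, 624.15),
    "Google": (140.00, 1013.71),
    "Microsoft": (140.00, 1181.27),
}

# 5-year TCO = total CAPEX + total OPEX, rounded to the sheet's precision.
tco_inr_mn = {
    name: round(capex + opex, 2)
    for name, (capex, opex) in capex_opex_inr_mn.items()
}

# Rank providers from cheapest to most expensive.
ranked = sorted(tco_inr_mn.items(), key=lambda item: item[1])
cheapest_name, cheapest_cost = ranked[0]

for name, cost in ranked:
    delta = cost - cheapest_cost
    print(f"{name:<10} {cost:>10,.2f}  (+{delta:,.2f} vs {cheapest_name})")
```

On these figures the cloud options differ mainly in OPEX, while the IBM option shifts most of its cost into CAPEX, so a ranking on the 5-year total alone hides a very different cash-flow profile.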
IBM PRIVATE CLOUD
ON PREMISE IBM CLOUD (Cost in INR Mn)
                            YEAR 1          YEAR 2         YEAR 3         YEAR 4         YEAR 5          TOTAL
Cost INR Mln (Vodafone)     146.12          38.61          36.96          47.05          67.48           336.21
Cost INR (Vodafone + Idea)  248,397,463.76  65,637,073.23  62,826,346.55  79,985,077.40  114,708,104.14  -

AWS
AWS Public CLOUD (Cost in INR Mn)
             Year 0          Year 1          Year 2          Year 3          Year 4
Cost USD     $ 1,341,557.59  $ 1,630,635.17  $ 1,833,599.98  $ 2,098,998.58  $ 2,396,192.38
Cost INR     97,933,704.22   119,036,367.26  133,852,798.25  153,226,896.05  174,922,043.45
Cost INR Mn  97.93           119.04          133.85          153.23          174.92

GOOGLE
GCP Public CLOUD
                 Year 0         Year 1         Year 2          Year 3          Year 4
GCP Cost USD     $ 473,784.40   $ 898,147.54   $ 1,228,004.45  $ 1,658,966.09  $ 2,175,448.73
GCP Cost INR     34,586,260.97  65,564,770.57  89,644,324.59   121,104,524.38  158,807,757.31
GCP Cost INR Mn  34.59          65.56          89.64           121.10          158.81

Microsoft
Microsoft Public CLOUD
             Year 0         Year 1          Year 2          Year 3          Year 4
Cost USD     $ 643,536.10   $ 1,178,584.75  $ 1,604,919.20  $ 2,241,063.19  $ 3,061,654.08
Cost INR     46,978,135.01  86,036,686.72   117,159,101.45  163,597,613.10  223,500,747.75
Cost INR Mn  46.98          86.04           117.16          163.60          223.50
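The public-cloud rows relate USD to INR through the exchange rate stated in the notes (1 USD = 73 INR). A minimal sketch of that conversion for the AWS row follows; the function name `usd_to_inr_mn` is an assumption for illustration, and the USD inputs are the rounded values shown in the sheet, so the full INR amounts may differ from the sheet by a fraction of a rupee.

```python
USD_TO_INR = 73  # rate stated in the notes: 1 USD = 73 INR

# AWS hosting cost per year (Year 0 to Year 4) in USD, as shown in the sheet.
aws_cost_usd = [1341557.59, 1630635.17, 1833599.98, 2098998.58, 2396192.38]


def usd_to_inr_mn(amount_usd, rate=USD_TO_INR):
    """Convert a USD amount to INR millions, rounded to two decimals."""
    return round(amount_usd * rate / 1_000_000, 2)


aws_cost_inr_mn = [usd_to_inr_mn(value) for value in aws_cost_usd]
print(aws_cost_inr_mn)
```

Passing a different `rate` gives a quick sensitivity check on the note that TCO varies with the INR-USD rate.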
Vodafone Data Sources
Netezza
MICS (CRM & CS data Mart)
CDR mart (Call data records)
Amdocs (billing data & Credit & Collections data)
Calidus (Distributor Commissions)
SEMS ( Network Experience)
Hyperion (Financial Planning & Accounts reporting )
Nucleus (Call center planning data)
Adobe (Apps and website data)
Eshop (Eshop data)
Flytext (Campaign data)
Twitter or Social data
Facebook data
MyVodafone App data
Network data (Call drop etc)
Location data
Payment Patterns
Call center interaction data
CPOS data
Olympus master data
Instore Interaction & purchase data
Chota Credit or any similar credit products data
Device changes data including master
CDR and RCDR data
Product Consumption data
Usage data (Voice,Data,SMS, Roaming)
Demographic data
Ekyc data
Product master data
Netperform SDK data
Bill dump data
Segmented products promoid data
UPC & MNP
Balance data
deduction-MTR(Mediated Transaction Record)
Recharge
Email
USSD - Activation
Eshop - Activation
SSK - Activation
Network Data - CS Raw (Mobile Circuit Switch network probes)
Network Data - PS Raw (Mobile Packet Switch network probes)
Network Data - CS + PS KPIs
Cell Inventory (Mobile network cell inventory and topology)
Household data (Composition and Profile)
Campaigns communications and responses
Device catalogue (local)
Fixed network probes
Fixed coverage maps
Mobile network coverage maps
Call center calls
MCI (Missed Call Integration)
Product identification for postpaid
Web chat
YOU Broadband data
Vodafone Social Media customer interactions
External Devices (attached to routers)
Subscriber Information
Net Perform Personal data
Net Perform Data
Medallia (Customer feedback through own channels)
App Catalog
Device Catalog (group)
NPS data
CDR for SNA
IDEA Data Sources
Mediation
ER_Non_usage
ER_SCP
ER_MSC
ER_GGSN
ZTE_Non_usage
UCS
Mediation VLR
VAS (MG+CG+SE)
Others
Vtopup
FTA
Pre-pol
Prepaid CRM
Payout - Scheme Studio
Web-portal
Mobile App
MNP
IVR
MYSMS
ICB
ILD
Imagine
Dedup
DND
Prepaid NSMS
CSM
SPS
SDP
SIMEX
IRDCS
SCPDB
FST
USSD
CRM
BSCS
CMS
NSMS
DCOIN
IPOPs
OFS
EBPP
UDR
CMDM
Adobe
TEAM

S No.  Resource            Experience      Count  Skills
1      Big Data Architect  15 to 16 years  1      Big Data/Hadoop stack, Java, Cloud, DevOps, Search, ETL, BI
2      Cloud Architect     15 to 16 years  1      Cloud (AWS/Azure/GCP), Big Data/Hadoop stack, Java, DevOps
3      Tech Lead           8 to 10 years   2      Cloud, Big Data, Java
4      PM                  15 years plus   1      Delivery management and reporting
5      Developers          4 to 8 years    8      Big Data/Hadoop stack, Java, Cloud, Python, NoSQL
6      ETL/BI engineer     5 to 10 years   2      Any ETL tool such as Talend or Ab Initio; SQL
7      Cloud Admin         4 to 6 years    3      Cloud infra management, provisioning
8      DevOps Engineers    4 to 6 years    2      Docker, Ansible, Kubernetes etc.
9      QA                  3 to 6 years    4      Hadoop & Cloud testing; experience in testing frameworks

Team Size 24

NOTES
Architects will collaborate with the entire team.
The above team structure may change as and when we document the scope.
Internal head count is not considered in the above team structure.
The proposed skill set may undergo some change depending upon our choice of Big Data platform.


Big Data Analytics Platform Modules

S No.  Module                               Description
1      ETL                                  Data ingestion
2      Data Security                        Masking, anonymization, encryption etc.
3      Data Processing                      Cleaning, transformation, formatting, storage, compression, harmonization, salting, processing etc.
4      Data Governance                      Audit, tracing, logging, archiving, backup
5      Data Workflows                       Spark & MR workflows for data processing
6      Data Catalogue                       Metadata management and data exploration enablement
7      Data Science Workbench               Workbench integration with DW, Data Marts & Data Catalogue (Zeppelin, Jupyter, AWS Athena, Google BigQuery, Impala)
8      Machine Learning Modeling            ML models development
9      Data Warehouse & Data Marts          Design & development (Hive, HBase, MySQL, Cassandra, Google Spanner, AWS Aurora etc.)
10     ML Models Integration with Big Data  Scoring, integration and deployment on the Big Data platform
11     Application/Web Services             DaaS/AaaS for consumption by internal applications such as Marketing, FlyTxt, CRM, Netezza, DB2, Customer 360
12     Platform Monitoring                  Cluster/cloud/applications monitoring & maintenance
13     ML Models Monitoring                 Monitoring for models evaluation
14     Search                               Solr, Elastic or GSA search. Data should be searchable for fast querying, for example in the Customer 360 portal.
15     Data Visualization                   Tableau, Power BI, QlikView etc.
Vodafone Idea Data Volumes

Year  Usable Data (PB)  Incremental Data  Raw Data (PB)
Y0    1.47                                5.88
Y1    1.95              0.48              7.8
Y2    2.64              0.69              10.56
Y3    3.61              0.97              14.44
Y4    4.68              1.07              18.72

NOTES
Raw Data is 4X of Usable data (3X Replication & 1X Overhead of OS & IO).
Above Usable Data Volume is 5X compression of Actual Data Volume.
Please note that above data estimates are ballpark and likely to change with time as new data sources and applications increase.
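The relationship between the volume columns can be sketched in a few lines: raw data is usable data times four (3X replication plus 1X OS and I/O overhead), and incremental data is the year-on-year change in usable data. The names `RAW_MULTIPLIER`, `raw_pb` and `incremental_pb` are illustrative, not taken from the workbook.

```python
RAW_MULTIPLIER = 4  # 3x replication + 1x OS and I/O overhead, per the notes

# Usable data per year in PB, from the data volumes sheet.
usable_pb = {"Y0": 1.47, "Y1": 1.95, "Y2": 2.64, "Y3": 3.61, "Y4": 4.68}


def raw_pb(usable):
    """Raw storage needed for a given usable volume, in PB."""
    return round(usable * RAW_MULTIPLIER, 2)


def incremental_pb(series):
    """Year-on-year growth in usable data, keyed by the later year."""
    years = list(series)
    return {
        current: round(series[current] - series[previous], 2)
        for previous, current in zip(years, years[1:])
    }


raw_by_year = {year: raw_pb(volume) for year, volume in usable_pb.items()}
incremental_by_year = incremental_pb(usable_pb)
print(raw_by_year)
print(incremental_by_year)
```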
BIG DATA PROJECT PLAN

INFRASTRUCTURE SET UP
Timeline: M1 to M10 (weeks w1 to w4 in each month)

Task                                 Responsibility                     Duration
Raise Demand                         VF (Tech./Biz.)                    1 WK
SOW & proposal                       VF Tech. & Local Vendor            2 WK
SCM Approval and PO process          VF Tech.                           4 WK
Infrastructure Procurement           VF Tech. & Local Vendor            8 WK
Infrastructure installation & setup  VF Tech. & Local Vendor            4 WK
Testing - (Security)                 VF (Tech Security) & Local Vendor  2 WK

Required H/W needs to be ready for S/W installation.


S/W & TOOLS SET UP
Timeline: M1 to M10 (weeks w1 to w4 in each month)

Task                           Responsibility                     Duration
Raise Demand                   VF (Tech./Biz.)                    2 WK
SOW & proposal                 VF Tech. & Local Vendor            3 WK
SCM Approval and PO process    VF Tech.                           4 WK
Software Procurement           VF Tech. & Local Vendor            4 WK
Software Installation & Setup  VF Tech. & Local Vendor            4 WK
Testing - (Security)           VF (Tech Security) & Local Vendor  2 WK

Required S/W needs to be ready for data loading & anonymization.

DATA INGESTION & ANONYMIZATION
Timeline: M1 to M10 (weeks w1 to w4 in each month)

Task                             Responsibility                              Duration
BRS preparation                  VF (Business)
Raising a Demand                 VF Tech./Biz.                               2 WK
FRS & Solution doc Preparation   VF Tech. & Local Vendor                     4 WK
SOW & Proposal                   VF Tech. & Local Vendor                     4 WK
SCM approval & PO Process        VF Tech.                                    3 WK
Design & Development             VF Tech. & Local Vendor                     8 WK
Testing - (Security/SIT/QA/UAT)  VF (Tech. Security, Tech.) & Local Vendor   4 WK
Deployment - Go Live             VF Tech. & Local Vendor                     1 WK
History Load & Anonymization     Local Vendor                                3 WK

Data needs to be available for use case design & development.


USE CASE DESIGN & DEVELOPMENT
Timeline: M1 to M18 (weeks w1 to w4 in each month)

Task                             Responsibility                                   Duration
BRS preparation                  VF (Business)                                    3 WK
Raising a Demand                 VF (Business)                                    2 WK
FRS & Solution doc Preparation   VF Tech./Biz. & Local Vendor                     6 WK
SOW & Proposal                   VF Tech./Biz. & Local Vendor                     4 WK
SCM approval & PO Process        VF Tech./Biz.                                    4 WK
Design & Development             VF Tech./Biz. & Local Vendor                     32 WK
Testing - (Security/SIT/QA/UAT)  VF (Tech. Security, Tech./Biz.) & Local Vendor   6 WK
Deployment - Go Live             VF Tech./Biz. & Local Vendor                     4 WK
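As a rough sanity check on the plan, the task durations of each phase can be summed. The sketch below assumes, purely for illustration, that tasks within a phase run strictly one after another; the original Gantt bars are not recoverable here, so real schedules may overlap tasks and finish sooner. BRS preparation in the data ingestion phase has no stated duration and is left out.

```python
# Task durations in weeks per phase, in the order listed in the plan.
# Assumption (not from the source): tasks in a phase are strictly sequential.
plan_weeks = {
    "INFRASTRUCTURE SET UP": [1, 2, 4, 8, 4, 2],
    "S/W & TOOLS SET UP": [2, 3, 4, 4, 4, 2],
    "DATA INGESTION & ANONYMIZATION": [2, 4, 4, 3, 8, 4, 1, 3],
    "USE CASE DESIGN & DEVELOPMENT": [3, 2, 6, 4, 4, 32, 6, 4],
}

WEEKS_PER_MONTH = 4  # the plan grid shows w1 to w4 under each month

phase_totals = {phase: sum(weeks) for phase, weeks in plan_weeks.items()}

for phase, weeks in phase_totals.items():
    months = weeks / WEEKS_PER_MONTH
    print(f"{phase:<32} {weeks:>3} weeks (~{months:.1f} months)")
```

Under that sequential assumption each of the first three phases fits inside the 10-month grid, and the use case phase fits inside its 18-month grid.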
