Optigrise Technology Solutions LLC, New Jersey
Optigrise Technology Solutions LLC, New Jersey
Business Intelligence
(DW/BI)
Optigrise Technology
Digital SBU
Cloud Services Data, Analytics & Insights Service AI & Cognitive Services Digital Integration Services
• Cloud Consulting Data Strategy, Consulting & Architecture • AI Consulting • Digital Integration
• Cloud Architecture Data Warehouse & Business Intelligence • Data Science & Machine Architecture
• Cloud Migration Operational Databases, OLTP Learning • API Gateway
• Cloud Native Dev
Data Warehouse, OLAP & Data Mart • Conversational AI, NLP, • Micro service
Business Intelligence Chatbot/Virtual Agents • EAI and SOA
• Cloud Testing & Ops
ETL • Voice, Speech & Video • DevOps
MDM/Master Data Management
Big Data & Analytics
• Modern Data Warehouse,
DWaaS
• Big Data & Analytics, Big Data on
cloud, Data Migration
• Data Visualization
• Data Ops, Data Integration & ELT
Focus Areas
“The goal is to turn data into information, and information into insight.” –
Carly Fiorina, former executive, president, and chair of Hewlett-Packard Co.
Typical Approach
Learning
• Separate pipeline – Separate pipeline/data flow
Business Intelligence
data services b/w traditional data engineering, big data & ML
Data Warehouse
teams.
& solutions
Big Data *
• Focus on Data science only – While AI and
are different predictive analytics can solve many use cases, still
organizations have huge amount of data in
from others? relational & structured form. They should
continue to have a strong DW/BI strategy
Our Approach
• Unified approach – Unified tools & process. Data Strategy
• Unified pipeline – Unified pipeline from data
Data Warehouse
traditional DW/BI, Big data, AI & other analytics.
Learning
• Balanced Approach: Balanced approach b/w
traditional DW/BI, Big data & AI
• Data Ops – Bringing in DevOps & Agile principles to
Data projects.
• Cost Optimization - Cost saving on DW/BI, so that
additional savings could be spent on AI & Big Data. Strong Data Foundation
DW & BI
• BI & Analytics design • Visualization & graph • Traditional ETL tools Business Intelligence,
• Dimensional modeling, design/build (Talend, SSIS) Visualization & Dashboards:
OLAP Cube design • Reporting • Big data/data lake related • Talend, Power BI, Qlick
• Self service analytics design/build ELT tools
• BI test • Dashboards • Data integration tools Others:
design/build (Streamsets, Altryx) • GDPR, HIPPA and data
• Testing development privacy consulting
• Data archival
DW/BI architecture – very small organizations
No Staging Area
• Often time in very small organizations & POCs,
Data warehouse does not have separate ‘Staging
Area’
• Data from operational systems are moved directly
to data warehouse
No Data Mart
• Analytics/Visualization/reporting apps directly
query data warehouse.
• Uses Staging
Area
• Departmental
Data Marts based
Tier 1 Tier 2 Tier 3 on business /
subject area.
• OLAP Servers:
OLAP Cubes used
for dimensional
modeling.
• BI/Visualization
tools access data
mart data and
not raw data in
data warehouse.
• PostgreSQL
• DB4o • Microsoft SQL Server
Object
• AWS Quantum Leger Relational • Oracle
Database/QLDB • IBM Db2
(Blockchain database) • MySQL
Specialized Relational /
• Spatial Database • MariaDB
•
Databases RDBMS
GIS Database • Sybase
• Redis
• Neo4J • Memcached
• Tinkerpop/Gremlin • Amazon DynamoDB (Cloud)
• AWS NeptuneDB (Cloud) • Azure CosmosDB (Cloud)
• Azure CosmosDB w/ Graph/ RDF Key value • Aerospike
Gremlin API Database Store/Cache • Riak
• JanusGraph Data Continuum • Oracle Berkley DB
• RDF Stores Polyglot persistence
• MongoDB
• AWS DynamoDB (Cloud)
• ElasticSearch • CouchBase / CouchDB
• Solr Document • Azure CosmosDB (Cloud)
• Search
Marklogic Database • GCP Datastore (Cloud)
• Amazon CloudSearch (Cloud) • RavenDB
• Azure Search (Cloud) • IBM Cloudant (Cloud)
Wide
• InfluxDB Time series Column • Cassandra
• Prometheus Database Store • Hbase
• Amazon Timestream • Azure CossmosDB w/ Cassandra API (Cloud)
(Cloud) • Google Cloud BigTable (Cloud)
Paradigm shift in applications & database technology …
Swiss army knife / One size fit all Approach Micro service styled app. Each micro service uses the database that fits
the purpose. Polyglot persistence.
DBaaS/Cloud databases - Relational, NoSQL, Graph …
Relational / OLTP
Amazon Aurora Azure SQL Database Cloud Spanner Db2 on Cloud
Amazon RDS for Oracle Azure SQL MySQL Cloud SQL (MySQL) Compose for MySQL
Amazon RDS for SQL Server Azure SQL PostgreSQL Cloud SQL (PostgreSQL) Compose for PostgreSQL
Amazon RDS for MySQL Azure SQL MariaDB Cloud SQL (SQL Server)
Amazon RDS for PostgreSQL
Amazon RDS for MariaDB
NoSQL
Key Value Store Amazon DynamoDB Azure CosmosDB w/ etcd API Compose for etcd
Document Database Amazon DocumentDB (with MongoDB Azure CosmosDB w/ SQL API Azure Cloud Firestore Cloudant
compatibility) CosmosDB w/ MongoDB API Compose for MongoDB
Amazon DynamoDB
Column Store Database Azure CosmosDB w/ Cassandra API Cloud Bigtable Compose for ScyllaDB
Timeseries Amazon Timestream
Graph Database Amazon Neptune Azure CosmosDB w/ Gremlin API Compose for JanusGraph
Caching/In memory Store Amazon ElastiCache for Redis Azure Cache for Redis Cloud Memorystore Compose for Redis
Amazon ElastiCache for Memcached
Traditional Data
Data Lake Modern Data Warehouse Next Gen Data Warehouse
Warehouse
Challenge: Traditional data warehouses could not store unstructured Challenge: Traditional data warehouses could not analyze
and semi structured data because they follow strict schema. This unstructured and semi structured data. This restricts their usage
restricts their usage for storing an analyzing data from NoSQL, logs, for storing an analyzing data from NoSQL, logs, IoT data,
IoT data, audio/video files etc, which currently constitutes more than audio/video files etc, which currently constitutes more than 50% of
50% of enterprise data. enterprise data.
Solution: Using data lakes and cloud storage platforms which can Solution: Using big data solutions like Hadoop and Spark which can
store unstructured, semi structured and structured data. analyze unstructured/semi structured data. Also with ML and
graph processing capabilities could be used.
Challenge: Traditional data warehouses face challenges in scaling Challenge: Traditional data warehouses typically inputs data only
which causes performance issues in queries. using batch based traditional ETL /Extract Transform Load method.
This means data could not be analyzed real time.
Solution: Modern data warehouses uses Massively parallel
processing and hybrid shared disk/shared nothing architecture Solution: Big data solutions use streaming to consume data from
for scaling. This ensures their query responses are fast. sources like clickstream, event log, IoT data and real time location
data from mobile devices. They also perform stream analytics on
incoming data to ensure they can provide real time analytics.
Db2 DW – Spark & R Analytics running within core database engine
ETL – Extract Transform Load
Our technology expertise & focus in ETL &
Data Integration
• Informatica - PowerCenter, PowerExchange, Data
Replication
• IBM - IBM InfoSphere Information Server, IBM
InfoSphere Data Replication,
• Microsoft – SQL Server Integration Service / SSIS (On
premise), Azure Data Factory (Cloud)
• Talend - Talend Open Studio, Talend Data Fabric, Talend
Data Management Platform
• Oracle - Oracle Data Integration Platform Cloud, Oracle
GoldenGate (OGG), Oracle GoldenGate Cloud, Oracle
Data Integrator (ODI).
• Apache Nifi (open source)
Cloud Only
• AWS Glue
• Alooma - now part of Google Cloud
• Panopfly – both data integration & light weight data
warehouse. Cloud SaaS solution
• Stitch – Light weight solution
• Azure Data Factory
Talend ETL and Data Integration Platform
Challenge: More often than not, within large enterprises there are thousands on point to point ETL pipelines, which performs data integration
from source system, app databases, COTS/SaaS to data warehouses and other systems. This causes what is called ETL hell or Integration
spaghetti, which is difficult to manage & operate and becomes a huge bottle neck for “digital transformation”. Traditional ETL is also not real time
and can not scale to cope up with the growing data volume.
Solution: Streaming and Messaging based systems like Kafka or Kinesis or Message Bus based architecture could solve these problems. Using
a pub sub based architecture removes the point to point Integration spaghetti. Also modern platforms like Kafka scales extremely well and
can handle real time streaming data from various sources.
Business Intelligence & Visualization
Our technology expertise & focus in
Business Intelligence & Analytics
• Tableau – Tableu on prem and cloud products
• Microsoft – Power BI, SQL Server Reporting
Service (SSRS)
• Qlik - Qlikview
• SAS – SAS platform
• Looker - now part of Google Cloud
• MicroStrategy
• IBM – Cognos
• TIBCO - Spotfire
Power BI
• Business analytics service that
delivers insights to enable fast,
informed decisions
• Could connect to all industry
standard data warehouses.
• Transform data into stunning
visuals and share them with
colleagues on any device.
• Visually explore and analyze
data—on-premises and in the
cloud—all in one view.
• Collaborate on and share
customized dashboards and
interactive reports.
• Scale across your organization
with built-in governance and
security.
• Supports cloud and desktop
versions.
Master Data Management (MDM)
Master Data Management (MDM)
Our technology expertise & focus in
MDM
• Informatica: Informatica MDM, Informatica
MDM Cloud
• IBM: IBM InfoSphere Master Data
Management, IBM Master Data Management
on Cloud
Data Security, Privacy & Compliance
Right to be secured All PII data be secured by pseudonymization or encryption, whether at rest or in transit.
Customers have the right to export their PII data in an encrypted format, such that it can easily be imported into a
Right to portability different IT environment. This could have huge implications in big data ecosystems. For example, a customer could
request to have their telematics data transferred from one insurance carrier to another.
In the post-GDPR world, customers will have the right to request and be shown how and why they were targeted for a
Right to be informed specific marketing campaign.
• Mobile BI
• Cloud based BI solutions
• Self Service BI & Analytics
Sample Profiles
About
Educational Qualification B.E from PQR B.S, M.A
Profession Career - 1.5 years with XYZ Ltd - 1 year with PQR Corp
- 3 years with ASD LLC
Other Tech Stack Scripting, .NET basics AWS AWS, Mongo, Java
Certification - AWS Certified (Associate) Hortonworks Hadoop Certified
Project Experience
Domain Knowledge Retail, CPG Manufacturing, Telecom BFSI, Retail