0% found this document useful (0 votes)
157 views

Rapidminer: Real Data Science, Fast and Simple

RapidMiner is a data science platform that provides unified capabilities for data preparation, modeling, validation, and operationalization. It aims to make data science fast and simple. RapidMiner is the #1 open-source data mining and analytics software, and has over 200,000 engaged users. It offers a lightning fast platform with over 1,500 functions for data preparation and machine learning. RapidMiner provides expertise through its marketplace of consultants, algorithms, and extensions to help users across many industries and domains. Pricing starts at $10,000 per user annually for the Studio Large subscription and $60,000 per instance annually for the Server Large subscription.

Uploaded by

she day
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
157 views

Rapidminer: Real Data Science, Fast and Simple

RapidMiner is a data science platform that provides unified capabilities for data preparation, modeling, validation, and operationalization. It aims to make data science fast and simple. RapidMiner is the #1 open-source data mining and analytics software, and has over 200,000 engaged users. It offers a lightning fast platform with over 1,500 functions for data preparation and machine learning. RapidMiner provides expertise through its marketplace of consultants, algorithms, and extensions to help users across many industries and domains. Pricing starts at $10,000 per user annually for the Studio Large subscription and $60,000 per instance annually for the Server Large subscription.

Uploaded by

she day
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 26

RapidMiner

Overview

Real data science, fast and simple.


RapidMiner Highlights

#1 200,000+ 250+ 50+


By the Engaged
numbers Data Science Global Channel
Community Partners
Platform Clients
Members

Leader Leader #1 Open-Source Innovation Winner


Analysts 2014, 2015, 2016 & 2017 2017 Platform 2015
Gartner Magic Quadrant Predictive Analytics Last five years in a row Wisdom of Crowds for Advanced
for Data Science Platforms & Machine Learning Data Mining & & Predictive Analytics, Big Data
Analytics Software Poll Analytics & End-User Data Prep

CB Insights VENTANA
The AI 100, 2017 RESEARCH
Accolades “100 Startups Using Artificial Intelligence 2016 Technology Innovation
Awards Winner
to Transform Industries” Predictive Analytics

2
Insight Without Action Has No Value

Analytics 3.0*
Predictive & Step Five
Prescriptive

Analytics 2.0
Diagnostic
Proactive
Analytics 1.0
Descriptive Reactive

Passive

Business Intelligence Data Visualization Data Science

Database Analytic Data Marts Big Data

Sums & Counts Drilldown Machine Learning

Historical Information Current Insight Human / Automated Actions


*First referenced by Thomas H Davenport, HBR December 2013 3
High Value Use Cases Need Real Data Science
Retail &
Automotive Life Sciences Government
Consumer Goods

Banking Manufacturing Telco e-Health

Insurance Travel, Transport


Oil & Gas Utilities
& Logistics

Customer Analytics Operational Analytics Risk Analytics


• Customer Acquisition • Channel / Mix • Supply Chain Optimization • Call Center Operations • Credit Scoring • Anti-Money Laundering
• Cross-sell/Upsell Optimization • Manufacturing Operations • Retail Store Operations • Insurance Underwriting • Rogue Trading
• Offer Optimization • Web Analytics • Asset Performance • Predictive Maintenance • Capital Planning • Cyber Security
• Retention & Loyalty • Pricing Optimization Process Engineering • IT Operations • Stress Testing • Compliance
• Win back • … • Capacity Planning • … • Fraud Detection • …

Drive Revenue Reduce Costs Avoid Risks

+50% -34% +46%


New revenue Realized cost Increased
opportunities* savings* profitability*

*Ventana Research Next Generation Predictive Analytics Benchmark Research, 2015


4
Lightning-Fast Unified Platform

Data Prep Model & Validate Operationalize


Speed & optimize ALL data Apply machine learning to Easily deploy & maintain
exploration, blending & rapidly prototype & confidently models and embed
cleansing tasks validate predictive models
Incorporate all analytic results Embed results in all
types of data types of business
• Data selection • Modeling • Model deployment apps & data
• Data Cleaning • Cross validation • Scoring as web service visualization tools
• Data integration • Model Optimization • Model monitoring
• Data formatting • Model Management • Reporting and visualization
• Data exploration • Model Export • Maintenance

5
The RapidMiner Competitive Advantage

Unified
Platform
Prototype – Substantiate – Operationalize –
seamless, high performance orchestration

Lightning Fast #1 Marketplace for


Data Science Data Science Expertise
Powerful, visual & guided use of 1,500 On-demand consultants, algorithms &
data prep and machine learning extensions; global presence & domain
functions & third party libraries expertise in every industry

Real data science, fast and simple.


6
RapidMiner Platform & Pricing
1 year subscription shown

Studio Large Free product versions receive


community support.
Server Large
$10,000 per user $60,000 per instance
Unlimited 10x+ Row limits in Studio apply when using Server Unlimited
performance or Radoop so limiting the data a user can
use. Server Medium
$30,000 per instance
8 8

Studio Medium Radoop Enterprise Server Small


First User $15,000
$5,000 per user $15,000 per instance
Cores 4 4x Each additional User $5,000 Cores 4
performance

Studio Small Executes all 1500+ RapidMiner


functions plus 70+ native Hadoop
$2,500 per user operators
2 2x 2
Studio performance
Server
Free Free
Radoop Free
1 70+ native Hadoop operators only 1
10,000 100,000 1,000,000 Unlimited 10,000 100,000 1,000,000 Unlimited

Data Rows Data Rows

RapidMiner Studio RapidMiner Radoop RapidMiner Server


• Visual Workflow Designer • Execute Data Science Workflows • Collaborate & Share
Seamlessly on Hadoop
• Guided Analytics & Reusable Processes • Compute
• Analysis upon the full breadth &
• Wealth of Predictive Algorithms & Functions • Integrate
variety of stored big data
• Operationalize 7
Get Successful with RapidMiner
Get Started
Jumpstart your enablement and get
started fast with free self-service
1 4
tutorials, videos and the daily demo Get Successful
Utilize the experience and expertise of the
RapidMiner Customer Success Team
Get Guidance • Customer orientation Get Connected &
Attend product workshops and
ask questions of product
2 • Installation support & guidance
• Implementation planning
Contribute
experts as you build your first • Use case, architecture, best practices
• Training, Certification & Services needs
5 Connect to the RapidMiner
community: learn, share, contribute:
machine learning workflows Live Online • 200,000+ member, 34,000+
• Quarterly reviews
Get Educated & Virtual instructor-led
posts
Self-Paced Online • Innumerable external blogs,
Certified Learn when convenient
articles, scientific papers &
3 Develop the essential skills
to be successful with the
Classroom
Face-to-face at our or
books
RapidMiner product suite your office
Community & Blogs

Books
Videos & In-Product Tutorials Webinars Demos & Documentation

8
Systems

RapidMiner Partner Network


Technology Integrators
Value Added
OEM
Resellers

Global
Partners 9
Real data science, fast and simple.

RapidMiner Inc.
10 Milk Street
11th Floor
Boston, MA 02108
rapidminer.com
Boston Budapest Dortmund London @rapidminer
Additional Content

11
RapidMiner Data Science Impact

Bridge the Data Science Skills Gap Operationalize Competitive Advantage

50% Created new revenue


Chief Analytics Officer Chief Executive Officer opportunities*
39%
Empower operational workers to Leverage prescriptive analytics in
consume data science in their all your decisions to achieve
routine decision making better outcomes Increased
Improved 46% profitability*
customer service*

Coding Data Scientist Applied Data Scientist


Accelerate the creation of high- Confidently extract the hidden
95% value data science while value from your data using
faster streamlining low-value tasks intuitive predictive analytics 5-10x data science
capability

Build Better Predictive Models Faster Easily Use Predictive Analytics


*Ventana Research Next-Generation Predictive
Analytics Benchmark Research, 2015
12
The RapidMiner Data Science Platform
RapidMiner Marketplaces
On-demand Innovation & Execution
∞ Extensive Domain Expertise
Expert marketplace of certified RapidMiner skills
Plug-ins, Algorithms, Extensions
Product Marketplace to extend and innovate

RapidMiner Studio RapidMiner Server


Lightning Fast Real Data Science, Code Optional Seamless Deployment, Management &
Collaboration

Data Access Data Exploration Data Prep Modeling Validation Collaboration Computation Scheduling Integration Management
Connect to any data Quickly discover patterns Speed & optimize ALL data Efficiently build and Confidently & accurately Connect to any data Quickly discover Speed & Efficiently build and Confidently & accurately
source, any format, at any or data quality issues exploration, blending & deliver better models estimate model source, any format, at patterns or data quality optimize ALL data deliver better models estimate model
scale cleansing tasks faster performance any scale issues exploration, blending & faster performance
cleansing tasks

RapidMiner Radoop
Simplified, Intelligent Big Data Science & Machine Learning

Simplified Analytics Lightning Fast Broad Data Access Integrated Security Optimized for Hadoop Scalable Processing Spark Execution
Reduces Hadoop complexity Covers complete analytics Eliminate connectivity Ensure security compliance Leverage Hadoop distributed Process in-Hadoop and in- Execute RapidMiner sub -
lifecycle struggles power memory processes in parallel

13
The RapidMiner Platform

RapidMiner Market Place


Industry, Application & ML Extensions

RapidMiner Web Applications


RapidMiner Studio RapidMiner Server
Visual Workflow Designer Collaborate + Compute + Deploy + Maintain

Workflow Builder Data and Process


Web App Portal
Repository

Web Services
Process Execution
User/Group Access Process
Engine
Rights management Scheduler

Process
Integrate using Web Services, JSON, SQL, …
Execution Engine Server Java SE/EE
RapidMiner Radoop Application Application
Compile + Execute in Hadoop
RapidMiner Radoop Databases / Application (BI, ERP,
Compile + Execute in Hadoop Data CRM…) / Portal
warehouses

Incorporate all R / Python / SQL Scripting


Run in multiple
types of data Compute Engines In-Memory H2O / Weka
In-Hadoop & Spark

14
RapidMiner Studio
All-In-One Data Science Workflow Designer

Lightning Fast
Visual interface for rapidly building complete analytic
workflows

Powerful
Rich library of algorithms and functions to build the strongest
possible model for any use case

Open & Extensible


• Open source innovation keeps pace with changing
business needs

15
RapidMiner Server
Operationalization & Collaboration Management

Team Collaboration Frictionless Dynamic & Continuous


Operationalization Model Management
Central repository facilitates sharing Flexible execution options Individual and customizable
of data sources, analytic processes & streamline deployment, processes to check for accuracy
best practices maintenance & embedding of drifts or shifts
analysis

16
RapidMiner Radoop
Extends the RapidMiner’s visual workflow to Hadoop

Hadoop made easy


Translates data science workflows into Hadoop so data scientists
concentrate on analytics not Hadoop programming

In Hadoop Execution
Pushes analytic instructions into Hadoop
for computation

Secure
Complies with Hadoop security standards

17
Sample Use Cases
Payments – Worldwide Telco - Austria
Telco - Austria Telco – Germany
Customer feedback & voice of the Optimize customer support by
customer, churn prevention, text Automated Online Market Research, automatically categorizing
Automated Customer Feedback Text
mining, automated text categorization, Text Analytics, Sentiment Analysis, unstructured data by content and to
Analysis for Automated E-Mail
and sentiment analysis to customer Customer Insight prioritize and reduce response time
Categorization & Routing
support and sat to prevent customer and cost so increasing customer
churn satisfaction

Telco – Europe
Telco - Switzerland Telco – Hungary
CRM applications including Marketing – Germany
optimization of direct marketing Customer Relationship Analytics, Churn
Server & Equipment Load
campaigns, automated generation of Prediction & Prevention, Direct
Forecasting, Predictive product recommendations for cross- Automated Online Market Research,
Marketing Campaign Optimization,
Maintenance, Predicting & selling and up-selling, customer churn Scheduling & Automated Execution of Text & Sentiment Analysis, Customer
Preventing Server & Component prevention, and fraud detection ETL Tasks Insight, Competitive Intelligence
Failures

Market Research -
Worldwide Payments – Worldwide OEM – Europe
Telco – Germany
Sentiment Analysis of online text
Prediction of sales volumes; Fraud Detection & Prevention sources, including social media and
Fraud Detection & Prevention Solutions
CRM optimization; social media for Telecoms
other user generated content for
monitoring and sentiment customer care triage
analysis

18
Sample Customer Use Cases

Multiple Customers, Payments – Worldwide


Partner - Europe
Industries
Sentiment Analysis of online text Smart meter installation optimization
Automated Customer Feedback Text sources, including social media and as a service – maximize first time visit
Analysis for Automated E-Mail / Social other user generated content for success
Media, Categorization, Triage & Routing customer care triage

Market Research – Worldwide


Payments - Russia Org Telco – Europe

Fraud detection in retail network Prediction of sales volumes; CRM CRM applications including optimization
historical data on service usage, optimization; social media monitoring and of direct marketing campaigns,
transaction history, customer profiles, sentiment analysis automated generation of product
usage logs, and known cases of recommendations for cross-selling and
Automated Customer Feedback Text Analysis
fraudulent behavior up-selling, customer churn prevention,
for Automated E-Mail Categorization &
Routing and fraud detection

19
Sample Customer Use Cases

Manufacturing – Production Manufacturing – Predictive


Voice of the Customer
Optimization Maintenance
Automated Customer Feedback Text
Analysis for Automated E-Mail / Social Optimization Of Production Logistics & High Value Assets - Silicon, Cars,
Media, Categorization, Triage & Routing Flows, Quality, Yield, Product Mix, Process Trucks, Aircraft, Turbines, IT
Mining Infrastructure,…

Maximizing Customer
Fraud Detection
Lifetime Value
Fraud detection in retail network
historical data on service usage, CRM applications including optimization
transaction history, customer profiles, of direct marketing campaigns,
usage logs, and known cases of automated generation of product
fraudulent behavior recommendations for cross-selling and
up-selling, customer churn prevention,
and fraud detection

20
Safeguarding Electronic Payments
Anticipating the risk of fraud

The Challenge
• Protecting against fraud and anticipation of risk 7x24
Russia’s • Large and diverse set of partners (merchants) – over 70,0000
Largest electronic • How to classify and check merchant ecommerce sites for payment system compliance?
payment service
RapidMiner Solution
• Analyze, classify and check merchants’ ecommerce sites for compliance
• Utilize text mining with NLP to auto-categorize with high sentiment accuracy
• Mashup the widest data sets - historical data on service usage, transaction history, customer profiles,
usage logs, and known cases of fraudulent behavior
• Detect anomalies, misuse and fraud through operationalized classification model

Outcome
• Only 8-10% of merchant sites now screened manually at 80% confidence threshold
• Accurate automated analysis of high risk sites- 92% correctly classified
• Elimination of false positives - no normal sites classified as high risk
• Time and cost to resolve fraud case radically reduced

21
Repeat Business through Marketing Efficacy
Identify upsell offers through deep customer analytics

The Challenge
Large • Industry with tight margins & intense competition
North American • Broad array of online & mobile channels for customers to place orders
• Goal to improve marketing offers and create more repeat business
restaurant delivery
chain RapidMiner Solution
• Capture a vast array of customer ordering data from multiple online & mobile phone channels
• Use RapidMiner to join & enriched data with 3rd-party demographics & competitive data
• Use data science to assess performance and growth drivers at individual stores & franchise groups
• Results used to tailor coupons & upsell offers to customers

Outcome
• Greater flow of repeat customers, driving growth at individual stores and franchise groups
• Far outpaced the industry: Posted best Q2 & Q3 domestic same-store sales growth of the 25 largest
restaurant chains in the U.S.
• Next steps: RapidMiner Radoop

22
Customer Satisfaction through Quality of Service
Customer experience begins with network quality

The Challenge
• Backend infrastructure footprint & costs increasing yearly
Leading European • Customer satisfaction driven by service quality in areas such as video streaming latency
• Network operation teams must accelerate root cause analysis, reduce time to repair
Telecoms Provider • Data visualization with big data alone cannot provide operationalized insight needed

RapidMiner Solution
• Secure large scale Hortonworks Hadoop Big Data Hub architecture to leverage data lakes
• Correlation of log events with historical log data to preempt service quality degradation
• Through machine learning rapidly predict demand as consumer usage patterns change
• Utilize text mining to optimize help desk ticket triage and processing

Outcome
• Reduce infrastructure requirements (-10%)
• Improved customer retention (2%+)
• IT Operations costs reduced (-30%)

23
Drive Data Science Agility & Cut Costs
Faster development & deployment of customer analytics models

The Challenge
Leading • Existing data science teams looking to replace SAS
North American – Strong dislike of unwieldy SAS platform with the coding & complexity of it’s multiple
applications & user interfaces
Financial Services – Cost of SAS too high
Institution RapidMiner Solution
• Pull together customer data from across a number of internal databases & third-party sources
• Easily incorporate a large library of legacy predictive models written in R & Python
• Small team of 4 data scientists using collaboration features in RapidMiner Server to share data
prep and machine learning processes

Outcome
• Improved upsell opportunities and customer retention
• Speeds the process of data prep, rapid prototyping & validation of models over SAS methods
and coding-only methods
• Expansion into Risk department where data science team doesn’t code in SAS, R or Python

24
Gartner & Forrester – RapidMiner a Clear Leader
2017
Magic Quadrant for PAML Wave
Data Science Platforms

“…a Leader, owing to its market presence, the volume of client inquiries that Gartner “RapidMiner wraps breadth and depth in a beautiful package.
receives about it, its user community, and its well-rounded product that addresses
most data science use cases well.”
RapidMiner invested heavily to revamp visual interface to make it the most
‘Reference customers praised many facets of the platform — its large selection of concise and fluid that we have seen during this evaluation. Add to that,
algorithms, flexible modeling capabilities, data source integration and RapidMiner’s comprehensive set of operators that encapsulate a wide
consequent data preparation. The platform's strength lies not just in particular range of data prep, analytical, and modeling functionality to increase
areas, but also in its all-around consistency.” productivity of data scientists.”
25
Peer Insights – True Expert Validation

Business Software and Services Reviews


Verified software ratings and reviews from your enterprise IT peers

Top Predictive Analytics Products by Enterprise reviewers Reviews for Advanced Analytics Platforms

26

You might also like