
New Formulations for

Predictive Learning
Vladimir Cherkassky
University of Minnesota
cherk001@umn.edu
Tutorial at IJCNN-05
July 31, 2005
Copyright © Vladimir Cherkassky
1
Outline
• Motivation and Background
• Standard Inductive Learning Formulation
• Alternative Formulations
  - non-inductive types of inference
  - non-standard inductive formulations
• Predictive models for interpretation
• Conclusions

2
Motivation:
Importance of Problem Formulation
• Traditional (simplistic) view:
  ‘Useful’ = ‘Predictive’
• May lead to misconceptions:
  - Inductive models are completely data-driven
  - The goal is to design better algorithms

3
Motivation: philosophical
• Karl Popper: science starts from problems, not from observations
• Confucius: learning without thought is useless; thought without learning is dangerous
• What to do vs. how to do it

4
Motivation
• Another view of predictive learning:
  importance of the problem formulation (vs. the algorithm)
• Just a few known formulations
• Thousands of algorithms

5
Background: historical
• The problem of predictive learning:
  given past data + reasonable assumptions,
  estimate an unknown dependency for future predictions
• Driven by applications (not theory)

6
Historical Development
• Statistics (mathematical science)
  Goal: model identification, density estimation
• Neural Networks (empirical science)
  Goal of learning: generalization, risk minimization
• Statistical Learning / VC theory (natural science)
  Goal of learning: generalization for distinct learning problem formulations

7
Standard Inductive Learning
• The learning machine observes samples (x, y) and returns an estimated response ŷ = f(x, w)
• Two modes of inference: identification vs. imitation
• Risk functional: R(w) = ∫ Loss(y, f(x, w)) dP(x, y) → min
[Diagram: Generator of samples → x → Learning Machine → ŷ; System → y]
8
Two Learning Problems
• Learning ~ estimating a mapping x → y (in the sense of risk minimization)
• Binary classification: estimating an indicator function (with 0/1 loss)
• Regression: estimating a real-valued function (with squared loss)
• Assumptions: i.i.d. data, a training/test split, a given loss function
9
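The two standard formulations above differ only in the loss function. A minimal sketch of the corresponding empirical risks (illustrative only; the function names are my own, not from the tutorial):

```python
import numpy as np

def classification_risk(y_true, y_pred):
    """Empirical risk under 0/1 loss: the misclassification rate."""
    return float(np.mean(np.asarray(y_true) != np.asarray(y_pred)))

def regression_risk(y_true, y_pred):
    """Empirical risk under squared loss: the mean squared error."""
    return float(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2))

y = [1, 0, 1, 1]   # true responses
f = [1, 1, 1, 0]   # model outputs f(x_i, w)
print(classification_risk(y, f))  # 0.5
print(regression_risk(y, f))      # 0.5
```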
Contributions of VC-theory
• The goal of learning: system imitation vs. system identification
• Two factors responsible for generalization
• Keep-It-Direct principle (Vapnik, 1995): do not solve a problem of interest by solving a more general (harder) problem as an intermediate step
• Clear distinction between:
  - problem setting
  - solution approach (inductive principle)
  - learning algorithm

10
Alternative Formulations
• Re-examine the assumptions behind standard inductive learning:
  1. Finite training set + large unknown test set
     → non-inductive types of inference (transduction, ...)
  2. Particular loss functions
     → new inductive formulations (application-driven)
  3. Single model
     → multiple model estimation

11
1. Transduction
• How to incorporate unlabeled test data into the learning process
• Estimating a function at given points
  Given: training data (Xi, yi), i = 1, ..., n
  and unlabeled test points Xn+j, j = 1, ..., k
  Estimate: class labels at these test points
  Note: we need to predict only at the given test points Xn+j, not for every possible input X

12
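As an illustration of the transductive setting, here is a minimal sketch that estimates labels only at the given test points. It uses a 1-nearest-neighbor rule as a crude stand-in; it is not the margin-based SVM transduction discussed in the tutorial, and all names are hypothetical.

```python
import numpy as np

def transduce_1nn(X_train, y_train, X_test):
    """Label each of the given test points by its nearest labeled training point.

    No function is estimated over the whole input space -- only the given
    test points receive labels, as in the transductive formulation.
    """
    labels = []
    for x in X_test:
        dist = np.sum((X_train - x) ** 2, axis=1)  # squared distances to training points
        labels.append(y_train[np.argmin(dist)])
    return np.array(labels)

X_train = np.array([[0.0, 0.0], [1.0, 1.0]])
y_train = np.array([0, 1])
X_test = np.array([[0.1, 0.2], [0.9, 0.8]])
print(transduce_1nn(X_train, y_train, X_test))  # [0 1]
```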
Transduction vs Induction
[Diagram: a priori knowledge / assumptions → estimated function (induction); estimated function → predicted output (deduction); training data → predicted output directly (transduction)]
13
Transduction based on size of margin
The problem: find the class label of a test input X

14
Many potential applications
• Prediction of molecular bioactivity for drug discovery
  - Training data ~ 1,909 samples; test data ~ 634 samples
  - Input space ~ 139,351-dimensional
  - Prediction accuracy: SVM induction ~ 74.5%; transduction ~ 82.3%
Ref: J. Weston et al. (2003), KDD Cup 2001 data analysis: prediction of molecular bioactivity for drug design - binding to thrombin, Bioinformatics
15
Beyond Transduction: Selection
• The selection problem
  Given: training data (Xi, yi), i = 1, ..., n
  and unlabeled test points Xn+j, j = 1, ..., k
  Select: a subset of m test points with the highest probability of belonging to one class
  Note: selective inference needs only to select a subset of m test points, rather than assign class labels to all test points.

16
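The selection problem can be sketched as ranking the test points by a model's decision score and keeping the top m; this scoring setup is an illustrative assumption of mine, not a method from the tutorial:

```python
import numpy as np

def select_top_m(scores, m):
    """Return indices of the m test points with the highest decision scores.

    No class labels are assigned to the remaining test points -- only a
    subset is selected, as in the selective-inference formulation.
    """
    return np.argsort(scores)[::-1][:m]

# Hypothetical decision scores (e.g., distances to a separating hyperplane)
scores = np.array([0.2, 0.9, 0.5, -0.3])
print(select_top_m(scores, 2))  # indices of the two most confident points
```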
Hierarchy of Types of Inference
• Identification
• Imitation
• Transduction
• Selection
• ...
Implications: philosophical, human learning

17
2. Application-driven formulations
[Diagram: APPLICATION NEEDS (loss function; input, output, and other variables; training/test data) + admissible models → FORMAL PROBLEM STATEMENT → LEARNING THEORY]
18
Inductive Learning System (revised)
• The learning machine observes samples (x, y) and returns an estimated response ŷ to minimize an application-specific Loss[f(x, w), y]
[Diagram: Generator of samples → x → Learning Machine → ŷ; System → y; application-specific Loss[f(x, w), y]]

19
Application: financial engineering
• Asset management via daily trading: a non-standard learning formulation
[Diagram: input indicators x → PREDICTIVE MODEL y = f(x) → prediction y → TRADING DECISION → Buy/Sell/Hold → MARKET → GAIN/LOSS]

20
Example: timing of mutual funds
• Background: buy-and-hold vs. trading
• Recent scandals in the mutual fund industry
• Daily trading scenario:
[Diagram: Index or Fund ↔ Money Market (sell or buy / buy or sell), driven by a proprietary exchange strategy]


21
Example of Actual Trading
• Improved return + reduced risk/volatility

22
Learning formulation for fund trading
Given:
- daily % price changes of a fund, qi = (pi − pi−1) / pi
- a time series of daily values of input variables xi
- an indicator decision function (1/0 ~ Buy/Sell), yi = f(xi, w)
Objective: maximize the total return over an n-day period:
  Q(w) = Σ_{i=1,...,n} f(xi, w) qi

23
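The objective above can be evaluated directly once the decision function's outputs are known. A minimal sketch (function and variable names are hypothetical):

```python
import numpy as np

def total_return(decisions, q):
    """Q(w) = sum_i f(x_i, w) * q_i: the return captured on days the model is in the fund."""
    return float(np.sum(np.asarray(decisions) * np.asarray(q)))

q = [0.01, -0.02, 0.03]   # daily % price changes q_i
decisions = [1, 0, 1]     # f(x_i, w): 1 = Buy (hold the fund), 0 = Sell (money market)
print(total_return(decisions, q))  # captures day 1 and day 3 gains, skips the day 2 loss
```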
Non-standard inductive formulation
• Maximize the total account value Q(w) = Σ_{i=1,...,n} f(xi, w) qi
• Neither classification nor regression
[Diagram: input indicators x → PREDICTIVE MODEL y = f(x) → prediction y → TRADING DECISION → Buy/Sell/Hold → MARKET → GAIN/LOSS]

24
3. Multiple Model Estimation
• Single-model formulation: estimate the unknown dependency x → y
• Multiple-model approach: the available data can be ‘explained’ using several models

25
Example data sets: Regression
[Figure: (a) two regression models vs. (b) a single complex model]

26
Multiple Model Formulation
• The available (training) data are generated by several (unknown) regression models:
  y = t_m(x) + ξ_m,  x ∈ X_m
• Goals of learning:
  - partition the available data (clustering, segmentation)
  - estimate a model for each subset of the data (supervised learning)
• Assumption: the majority of the data samples can be explained (described) by a single model.

27
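A crude sketch of the multiple-model idea for two linear components: alternate between assigning each sample to the model that fits it better and refitting both models by least squares. This simple heuristic is my own illustrative assumption; it is not the SVM-based algorithm referenced in the tutorial.

```python
import numpy as np

def fit_two_lines(x, y, n_iter=20):
    """Fit two linear models y = a*x + b to mixed data by alternating
    assignment and least-squares refitting (assumes the toy data really
    does contain two linear components, so neither subset goes empty)."""
    A = np.vstack([x, np.ones_like(x)]).T   # design matrix; solution = [slope, intercept]
    assign = y > np.median(y)               # crude initial partition of the samples
    for _ in range(n_iter):
        w1 = np.linalg.lstsq(A[assign], y[assign], rcond=None)[0]
        w0 = np.linalg.lstsq(A[~assign], y[~assign], rcond=None)[0]
        # reassign each sample to the model with the smaller residual
        assign = np.abs(A @ w1 - y) < np.abs(A @ w0 - y)
    return w0, w1, assign

# Toy data: two parallel lines, y = x and y = x + 2
x = np.concatenate([np.linspace(0, 1, 10)] * 2)
y = np.concatenate([np.linspace(0, 1, 10), np.linspace(0, 1, 10) + 2.0])
w0, w1, assign = fit_two_lines(x, y)
print(w0, w1)  # approximately [1, 0] and [1, 2]
```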
Experimental Results: Linear
[Figure, panels (a)-(d): data on [0, 1] with fitted M1 and M2 linear model estimates]
28
Experimental Results: Non-Linear
[Figure, panels (a)-(d): data on [0, 1] with fitted M1 and M2 nonlinear model estimates]

29
Multiple Model Classification
• Single-model approach → one complex model
• Multiple-model approach → two simple models

30
Procedure for MMC
• Initialization: available data = all training samples.
• Step 1: estimate the major model, i.e., apply robust classification to the available data.
  (Here, ‘robustness’ is with respect to variations of data generated by the minor model(s).)
• Step 2: partition the available data (from one class) into two subsets.
• Step 3: remove the subset of data (from one class) classified by the major model from the available data.
• Iterate.

31
Example of MMC: XOR data set
• Training phase

32
Comparison for toy data set
[Figure: (a) MMC hyperplanes H(1) and H(2); (b) RBF-SVM decision boundary]

33
Comparison continued
[Figure (c): SVM with polynomial kernel]
• Prediction accuracy:
  Method   Error   (%SV)
  RBF      0.058   (25.5%)
  Poly     0.067   (26.4%)
  MMC      0.055   (14.5%)
34
Summary for Multiple Model Estimation
• Improvements are due to the novel problem formulation, not to sophisticated algorithms
• Practical learning algorithm based on (linear) SVM
• The resulting model has a hierarchical structure
• Advantages:
  - interpretation
  - no kernel selection

35
Prediction and interpretation
• Many applications are intrinsically difficult to formalize
• Two practical goals of learning:
  - prediction (objective loss function)
  - interpretation/understanding (subjective)
• Most algorithms are developed for predictive settings, but are used for interpretation and human decision making
• Rationale: a good predictive model ~ the true model

36
Example: functional neuroimaging
• Understanding fMRI image data:
  estimate ‘good’ brain activation maps showing brain activity (colored patches) in response to specific tasks
• Measure of goodness: predictability, reproducibility

37
Predictive models for understanding
• Always assume an inductive formulation
• What if transduction yields much better prediction?
• Fundamental problem (classical view):
  - human reasoning ~ logic + induction
  - transduction does not fit this paradigm
• Goal of science: understanding
• Goal of science: perform/act well
38
Conclusions
• Methodological shift: think first about the problem formulation, rather than about learning algorithms
• Importance of problem formulation:
  - for empirical comparisons
  - for the limits of predictive models
• Philosophical impact of Vapnik’s new types of (non-inductive) inference
39
References
• VC theory: V. Vapnik, Statistical Learning Theory, Wiley, NY
• Transduction: V. Vapnik (1998), Statistical Learning Theory, Wiley, + many recent papers
• Timing of mutual funds: E. Zitzewitz (2002), Who cares about shareholders? Arbitrage-proofing mutual funds, Journal of Law, Economics and Organization, 19 (2), pp. 245-280
• Multiple model estimation:
  - Y. Ma and V. Cherkassky (2003), Multiple model classification using SVM-based approach, in Proc. IJCNN
  - V. Cherkassky and Y. Ma (2005), Multiple model regression estimation, IEEE Trans. Neural Networks, 16 (4), pp. 785-798

40
