Professional Documents
Culture Documents
Ján Dolinský
2BridgZ Solutions
jan.dolinsky@2bridgz.com
Statistical Modeling
Introduction
Methodology
Case Studies
2BridgZ Solutions
Introduction
2BridgZ Solutions
Introduction
2BridgZ Solutions
Introduction
2BridgZ Solutions
Methodology
• variable selection
- irrelevant variables may worsen prediction quality of a model,
- collinearity & multicollinearity,
- unstable models
2BridgZ Solutions
Methodology – Variable Selection
10
N N
...
0
0
… 1
0.57 1
2BridgZ Solutions
Methodology – Variable Selection
Best-Subset selection
2BridgZ Solutions
Methodology – Variable Selection
2BridgZ Solutions
Methodology – Variable Selection
● Speed
2BridgZ Solutions
Methodology – Variable Selection
2BridgZ Solutions
Methodology – Model Structure Building
2BridgZ Solutions
Methodology – variable expansion
y=b*x+c
Methodology – variable expansion
y=b*x+c
Methodology – variable expansion
2BridgZ Solutions
Methodology – Model Structure Building
Polynomial terms
Basis Functions (RBF, Thin-Plate-Spline, …)
Fourier functions
Spatio-Temporal mixing (ESN, ...)
Example: RBF terms for non-linear classification
Example: RBF terms for non-linear classification
…
rbf 250
Example: RBF terms for non-linear classification
Example: RBF terms for non-linear classification
SVM MB using IC
methodology MS & CV Automatic MB
generated models 10-20 1
hyper-parameters 2-3 1-0
final dimensionality 120 7
missclas. rate 9.5% 9%
Future research
Information Geometry
Information Criteria