Migrate WeightVector to use tempita #13358


Closed

Conversation

massich
Contributor

@massich massich commented Mar 1, 2019

WeightVector is used in #13346 and has attributes that cannot be fused.

cdef np.ndarray w
cdef np.ndarray aw
cdef double *w_data_ptr
cdef double *aw_data_ptr
cdef double wscale
cdef double average_a
cdef double average_b
cdef int n_features
cdef double sq_norm

This PR uses Tempita to support both float32 and float64.

cross ref: #11000
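To illustrate the approach (a simplified sketch, not the exact PR code — class and attribute names are abbreviated): because `cdef` class attributes cannot use Cython fused types, Tempita instead expands a loop over (suffix, C type) pairs into one concrete class per dtype at build time. The plain-Python snippet below mimics what that expansion produces:

```python
# Simplified illustration of what a Tempita loop over dtypes generates.
# The real template lives in a .pyx.tp file; this just mimics the expansion.
dtypes = [('64', 'double'), ('32', 'float')]

template = """cdef class WeightVector{suffix}:
    cdef {c_type} wscale
    cdef {c_type} sq_norm
"""

generated = "\n".join(template.format(suffix=suffix, c_type=c_type)
                      for suffix, c_type in dtypes)
print(generated)
```

The result is two independent extension classes (`WeightVector64`, `WeightVector32`) rather than one class parameterized by a fused type.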

@massich
Contributor Author

massich commented Mar 1, 2019

num_iter is a double; I think it should be an int. For now I've left it templated as double.

cdef void add_average(self, double *x_data_ptr, int *x_ind_ptr,
int xnnz, double c, double num_iter) nogil

EDIT: num_iter comes from (t - average + 1), so it's not an integer but something else

@@ -22,7 +22,7 @@ from numpy.math cimport INFINITY
 cdef extern from "sgd_fast_helpers.h":
     bint skl_isfinite(double) nogil

-from sklearn.utils.weight_vector cimport WeightVector
+from sklearn.utils.weight_vector cimport WeightVector64 as WeightVector
Contributor Author


This should not be here but in the template. Maybe we have some sort of clash. I need to get back to that

Member


unless you also template sgd_fast, you'll have to deal with both, and switch with

if floating is float:
    do something with WeightVector32
else:
    do something with WeightVector64
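A hypothetical sketch of that compile-time switch (the helper names are illustrative, not from the PR): `floating` is Cython's built-in fused type covering `float` and `double`, and the `floating is float` test is resolved at compile time, so each generated specialization keeps only one branch.

```cython
# Sketch: branching on Cython's built-in fused type `floating`.
from cython cimport floating

cdef void step(floating x):
    if floating is float:
        handle32(x)   # hypothetical helper using WeightVector32
    else:
        handle64(x)   # hypothetical helper using WeightVector64
```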

Member

@jeremiedbb jeremiedbb Mar 4, 2019


Also, this diff doesn't belong in this PR.

@massich massich marked this pull request as ready for review March 1, 2019 16:59
@massich
Contributor Author

massich commented Mar 1, 2019

ping: @NicolasHug can you review as well? Thanks!

Member

@jeremiedbb jeremiedbb left a comment


LGTM


Member

@jnothman jnothman left a comment


Not sure about the import, but this LGTM.

Member

@NicolasHug NicolasHug left a comment


Nitpicks + questions.

Also:

  • is WeightVector32 (going to be) used somewhere else?
  • it'd be nice to have tests ensuring that the .pyx and .pxd files generated by tempita are git-ignored, or at least that we can't modify them accidentally. As @ogrisel noted, these tests should also pass (or be skipped) on pre-compiled wheels, where the Cython sources aren't available.

"""The L2 norm of the weight vector. """
return sqrt(self.sq_norm)

{{endfor}}
Member


Missing newline

cdef void reset_wscale(self) nogil
cdef {{c_type}} norm(self) nogil

{{endfor}}
Member


missing newline

dtypes = [('64', 'double'),
          ('32', 'float')]

def get_dispatch(dtypes):
Member


Sorry, I'm not familiar with tempita (and the tempita doc links are broken): why do you need get_dispatch? Can't you simply iterate over the dtypes list?
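For context (a hypothetical illustration, in plain Python, of the direct iteration the reviewer suggests — Tempita's `{{for}}` can consume a plain list, so a helper would only be needed to derive extra per-dtype values):

```python
# Iterating the dtypes list directly, without a get_dispatch helper:
# each (suffix, c_type) pair yields one specialized class name.
dtypes = [('64', 'double'), ('32', 'float')]

names = [f"WeightVector{suffix}" for suffix, _ in dtypes]
print(names)  # ['WeightVector64', 'WeightVector32']
```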

@glemaitre
Member

@massich Could you resolve the conflicts and address the comments?

@mbatoul
Contributor

mbatoul commented Jul 7, 2021

Hi @massich,

I saw that your PR has stalled.

Are you still interested in continuing this work?

@mbatoul
Contributor

mbatoul commented Jul 7, 2021

Hi @jeremiedbb @NicolasHug,

I started a PR #20481 to continue @massich's work.

I will address your comments there.

@cmarmo cmarmo added the Superseded label (PR has been replaced by a newer PR) and removed the help wanted label Jul 13, 2021
9 participants