FIX instability of _RidgeGCV
#33020
Conversation
ogrisel left a comment:
I haven't checked the code and the math yet, but the fact that the tests pass and the examples still look good is very promising.
I also ran some quick interactive timeit experiments and the performance seems good (including with torch/MPS) compared to main.
However, I found a performance regression for the linear regression panel of the examples/ensemble/plot_stack_predictors.py example:
in main, it runs in less than 0.01s: https://scikit-learn.org/dev/auto_examples/ensemble/plot_stack_predictors.html
Maybe this is because of a bad "auto" policy for that case?
Please also find a first pass of feedback below.
```
@@ -0,0 +1,6 @@
- The leave-one-out errors of :class:`linear_model.RidgeCV` are now numerically
```
It's more than a numerical stability fix for the estimation of the LOO error values. It's also, and more importantly, an improvement in the estimation of the model parameters themselves.
But what is it then? If you suggest an enhancement, what is enhanced apart from the fix? (I ask for my own understanding)
I should have said "a numerical stability fix in the estimation of the model parameters themselves" instead.
sklearn/linear_model/_ridge.py
Outdated
```python
    # svd not implemented for sparse X, fallback to eigen
    if gcv_mode == "svd" and sparse.issparse(X):
        gcv_mode = "eigen"
```
I would rather raise a ValueError here. Most users should stick to the auto mode anyway.
Ah actually, passing gcv_mode="svd" explicitly in main works with sparse inputs but would no longer work in this branch.
So maybe we could instead raise a FutureWarning when gcv_mode="svd" is set explicitly by the user while fitting on sparse data. The warning message could state that this now uses the "eigen" solver instead, and that in the future this combination will raise a ValueError.
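A minimal sketch of what that deprecation path could look like, assuming a helper similar to the existing `_check_gcv_mode`; the exact wording of the message is illustrative, not the PR's actual code:

```python
import warnings

from scipy import sparse


def _check_gcv_mode(X, gcv_mode):
    # Hypothetical handling of the sparse + "svd" combination discussed above.
    if gcv_mode == "svd" and sparse.issparse(X):
        warnings.warn(
            "gcv_mode='svd' is not implemented for sparse X and falls back "
            "to the 'eigen' solver. In a future version, this combination "
            "will raise a ValueError.",
            FutureWarning,
        )
        gcv_mode = "eigen"
    return gcv_mode
```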
Could you please extend the test test_ridge_cv_results_predictions to check that it works both for n_samples > n_features and n_samples < n_features with the auto/eigen solver? See the sketch of one possible parametrization below.
EDIT: and similarly for test_ridge_cv_multioutput_sample_weight and test_ridge_cv_custom_multioutput_scorer.
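For illustration, one hypothetical way to parametrize the shapes; the values are made up, not taken from the PR, and the existing test body is elided:

```python
import pytest


# Hypothetical shape parametrization covering both regimes.
@pytest.mark.parametrize("n_samples, n_features", [(40, 10), (10, 40)])
@pytest.mark.parametrize("gcv_mode", ["auto", "eigen"])
def test_ridge_cv_results_predictions(n_samples, n_features, gcv_mode):
    ...  # existing test body, elided
```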
```python
    return "eigen"

# All other cases ("auto", "eigen", "svd" with sparse X)
# fallbacks to gram (n <= p) or cov (p < n)
```
I think we should raise a warning if the user explicitly asks for svd and fits on sparse data (see the previous discussion on the outdated code here: #33020 (comment)).
```python
@pytest.mark.parametrize("X_shape", [(100, 50), (50, 50), (50, 100)])
@pytest.mark.parametrize("X_container", [np.asarray] + CSR_CONTAINERS)
def test_ridge_gcv_noiseless(alpha, gcv_mode, fit_intercept, X_shape, X_container):
    # Ridge should recover LinearRegression in the noiseless case
```
Suggested change:
```diff
-    # Ridge should recover LinearRegression in the noiseless case
+    # Ridge should recover LinearRegression in the noiseless case and
+    # near-zero alpha.
```
```python
    gcv_ridge = RidgeCV(alphas=alphas, gcv_mode=gcv_mode, fit_intercept=fit_intercept)
    gcv_ridge.fit(X, y)
    assert_allclose(gcv_ridge.coef_, lin_reg.coef_, atol=1e-10)
    assert_allclose(gcv_ridge.intercept_, lin_reg.intercept_, atol=1e-10)
```
Let's update this test to expect a FutureWarning (and later a ValueError) for the gcv_mode="svd" / sparse X combinations.
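A hedged sketch of what that update could look like, reusing the names from the test snippet above and assuming the FutureWarning discussed earlier is implemented; the match pattern is illustrative:

```python
import pytest
from scipy import sparse

if gcv_mode == "svd" and sparse.issparse(X):
    # Sparse X with an explicit "svd" request should warn about the fallback.
    with pytest.warns(FutureWarning, match="eigen"):
        gcv_ridge.fit(X, y)
else:
    gcv_ridge.fit(X, y)
```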
Reference Issues/PRs
Fixes #32459 and supersedes #32506
What does this implement/fix? Explain your changes.
This fixes numerical errors in the small-alpha regime by refactoring the solve/decompose methods:

- `looe` and the primal coefficient `coef` are computed instead of the `c` and `G_inverse_diag`.
- `_compute_gram` and `_compute_cov` reuse the precomputed `X_mean`.
- The computation of `looe` and `coef` is adapted to each method for best numerical stability.
- New `gcv_mode` option `cov` to force use of the eigendecomposition of the covariance matrix.
- New `gcv_mode` option `gram` to force use of the eigendecomposition of the Gram matrix.
The `gcv_mode` options have been modified to always pick the cheaper option:

- `eigen` now switches to the cheaper option: `gram` if n < p and `cov` if p < n.
- `svd`, not implemented for sparse X, falls back to `eigen` when X is sparse.
- In main, `eigen` always picks `gram` and `svd` always falls back to `cov` for sparse X (whatever the shape of X), which can explain the poor benchmark observed in FIX instability of gcv_mode="svd" in RidgeCV #32506.

A small usage sketch follows.
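For illustration only, a quick sketch of the dispatch described above; the shapes and alphas are arbitrary, and the comments state this PR's intended behavior, not a guarantee of the final implementation:

```python
import numpy as np

from sklearn.linear_model import RidgeCV

rng = np.random.RandomState(0)
X = rng.randn(50, 100)  # n_samples < n_features: the Gram matrix is cheaper
y = X @ rng.randn(100)

# With this PR, gcv_mode="auto" (and "eigen") would dispatch to the Gram
# decomposition here; for n_features < n_samples it would use the covariance.
model = RidgeCV(alphas=np.logspace(-8, 2, 11), gcv_mode="auto").fit(X, y)
print(model.alpha_)
```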
TODO

- `_RidgeGCV` docstring, especially math explanations.