deps: adds limits to numpy installs under 3.10 for dataproc #370

Merged: 3 commits, Jul 8, 2025

@chalmerlowe (Collaborator) commented Jul 7, 2025

The Google Dataproc team requested this change.

CONTEXT:

Installing python-bigquery[pandas] pulls in python-db-dtypes-pandas, which in turn depends on numpy. The issue is that db-dtypes attempts to install a version of numpy greater than 2.2.6. Because the Dataproc team must be able to reproduce the Dataproc environment, they cannot allow installs of numpy greater than 2.2.6, so they requested that we add an upper limit.
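
For illustration, an upper limit like this could be expressed as PEP 508 requirement strings with environment markers. This is a minimal sketch, not the diff from this PR: the helper name `numpy_requirements` is hypothetical, and scoping the cap to `python_version == "3.10"` is an assumption based on the PR title and the review discussion.

```python
# Assumption: the cap value comes from the PR description; the exact
# specifier used in the repository is not shown in this conversation.
NUMPY_CAP = "2.2.6"


def numpy_requirements():
    """Return PEP 508 requirement strings for numpy (hypothetical helper)."""
    return [
        # Python 3.10 only: never resolve to a numpy newer than the cap.
        f'numpy<={NUMPY_CAP}; python_version == "3.10"',
        # Every other interpreter: no upper bound is applied.
        'numpy; python_version != "3.10"',
    ]


if __name__ == "__main__":
    for requirement in numpy_requirements():
        print(requirement)
```

A list like this could be passed to `install_requires` in a `setup.py`, so that only Python 3.10 environments are affected by the limit.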

@product-auto-label product-auto-label bot added size: xs Pull request size is extra small. api: bigquery Issues related to the googleapis/python-db-dtypes-pandas API. labels Jul 7, 2025
@product-auto-label product-auto-label bot added size: s Pull request size is small. and removed size: xs Pull request size is extra small. labels Jul 7, 2025
@product-auto-label product-auto-label bot added size: xs Pull request size is extra small. and removed size: s Pull request size is small. labels Jul 7, 2025
@tswast (Collaborator) left a comment

👎 on this change. I suspect dataproc's problem isn't really because of db-dtypes incompatibility but because of older pandas + newer numpy not playing nicely.

I wouldn't want to prevent all 3.10 users from using the latest numpy.

Edit: Discussed offline. NumPy 2.3.0 and later already don't support Python 3.10. While this change feels redundant in that case, it shouldn't do any harm.
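
To make the redundancy concrete, here is a small sketch of how an installer would pick a numpy release on Python 3.10 with and without the cap. The `RELEASES` table is hand-written for illustration (not fetched from PyPI), and `newest_installable` is a hypothetical helper that only handles plain `X.Y.Z` version strings.

```python
# Illustrative, hand-written table (not fetched from PyPI):
# numpy release -> minimum Python version it declares support for.
RELEASES = {
    "2.2.5": (3, 10),
    "2.2.6": (3, 10),
    "2.3.0": (3, 11),
    "2.3.1": (3, 11),
}


def as_tuple(version):
    """Turn 'X.Y.Z' into a tuple of ints so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def newest_installable(python, cap=None):
    """Return the newest release usable on `python`, optionally capped."""
    candidates = [
        version
        for version, min_python in RELEASES.items()
        if python >= min_python
        and (cap is None or as_tuple(version) <= as_tuple(cap))
    ]
    return max(candidates, key=as_tuple) if candidates else None


if __name__ == "__main__":
    print(newest_installable((3, 10)))               # no cap
    print(newest_installable((3, 10), cap="2.2.6"))  # with the cap
    print(newest_installable((3, 11)))               # newer interpreter
```

Under these assumptions, the capped and uncapped results match on Python 3.10, which is why the explicit limit does no harm there.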

@chalmerlowe (Collaborator, Author)

As noted in my internal chat comment, I am talking to the Dataproc team to see if there are alternatives that meet their needs but don't affect other users of this library.

More to come.

@chalmerlowe chalmerlowe marked this pull request as ready for review July 8, 2025 17:19
@chalmerlowe chalmerlowe requested review from a team as code owners July 8, 2025 17:19
@chalmerlowe chalmerlowe requested a review from shollyman July 8, 2025 17:19
@chalmerlowe chalmerlowe merged commit 6d79280 into main Jul 8, 2025
24 checks passed
@chalmerlowe chalmerlowe deleted the exp-pb-v-3-3-6-db-1-42-2 branch July 8, 2025 17:20
Labels
api: bigquery Issues related to the googleapis/python-db-dtypes-pandas API. size: xs Pull request size is extra small.
3 participants