pyarrow dependency should be optional #743
Labels
api: bigquery
Issues related to the googleapis/python-bigquery-sqlalchemy API.
priority: p2
Moderately-important priority. Fix may not be included in next release.
type: bug
Error or flaw in code with unintended results or allowing sub-optimal usage patterns.
pyarrow is an optional dependency of google-cloud-bigquery, but it's made mandatory by python-bigquery-sqlalchemy.
As pyarrow is quite large on disk (100 MB on x86_64 Linux), I don't want to install it when it's unused in my application. AFAICT I don't need `google-cloud-bigquery-storage` either, but that's not huge.

I suggest:

- Removing the mandatory dependencies on `pyarrow` and `google-cloud-bigquery-storage`.
- Adding a `bqstorage` extra that depends on `google-cloud-bigquery[bqstorage]`. That'll respect upstream's version bounds without introducing local bounds that could cause conflicts for users.
- Pointing users who need those packages at `bigquery-sqlalchemy[bqstorage]`.

There's an existing PR at #470, but it looks like it has stalled, so I'm filing this issue to provide a blueprint for someone who's able to do this work.
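For anyone picking this up, the suggested dependency layout could look roughly like the fragment below. This is only a sketch, not the project's actual `setup.py`: the variable names and the unpinned core requirements are assumptions for illustration, and the real file would keep whatever version bounds it already carries for its core dependencies.

```python
# Hypothetical setup.py fragment illustrating the proposal in this issue.
# The package lists are illustrative; version bounds are deliberately omitted.

# Core requirements: pyarrow and google-cloud-bigquery-storage are NOT listed,
# so a plain install no longer pulls them in.
install_requires = [
    "google-cloud-bigquery",
    "sqlalchemy",
]

# Optional extra that defers to upstream's own "bqstorage" extra, so that
# upstream's version bounds for pyarrow and google-cloud-bigquery-storage
# apply and no local bounds are introduced here.
extras_require = {
    "bqstorage": ["google-cloud-bigquery[bqstorage]"],
}

# In a real setup.py these would be passed along, e.g.
# setuptools.setup(..., install_requires=install_requires,
#                  extras_require=extras_require)
```

Under this layout, a plain install would skip pyarrow, and users who want the BigQuery Storage path would install `bigquery-sqlalchemy[bqstorage]` instead.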