-
Notifications
You must be signed in to change notification settings - Fork 1.2k
[Feature Store] Feature group creation: provide a DataCatalogConfig while enabling glue table creation #2916
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
so is this supported or not? |
@clausagerskov Apologies for the delay.
Your conclusions are correct. Currently, we do not allow customers to provide In the mean time, we'll be updating both API documentation and notebook examples to make service expectations clear. Thank you for bringing this to our attention. |
Thanks @simonvdk for the explanation of details. I'm running into the same problem. Basically I want to specify my database and table when the Feature Group is created AND I want the Glue Catalog table created for me. So currently what's the workaround?
If I manually create the glue table.. I'm guessing I can use the Feature Definitions within the Feature Group to 'help me' create the types for the glue table. Here's pseudo code of how I might approach this.. I'm happy for someone to suggest an easier/better alternative. :)
Yes? .. this seems like a lot of work just so that we can place Feature Groups where we want them... |
We are facing something similar, but now by using Iceberg table types. If we set:
we get:
And then, if we indeed turn if off (ie, changing
Which is surprising, because according to the doc, this parameter does not exist. In our case, we don't want to create the table and database in defaults, but in specific ones. |
Hello! I'm facing the same problem, is there a solution for this? |
I encountered the same issue. Due to data governance protocols, I cannot place the features in the default database (sagemaker_featurestore), so I would like to select a different database. However, I encountered the same issues described by the users posted on this thread. A prompt solution would be greatly appreciated! |
Partner team confirmed that they don't see a path to improving the behavior of documentation at this time. |
Uh oh!
There was an error while loading. Please reload this page.
Use case
Create a feature group with automatic glue table creation for the offline store metadata, while configuring the glue data catalog database and table names
Issue encountered
It seems that providing a DataCatalogConfig and setting disable_glue_table_creation to false are mutually exclusive:
But I cannot provide a DataCatalogConfig and enable the glue table creation. Error encountered:
Why this seems to be an issue:
DataCatalogConfig
should not be mutually exclusive with the automatic table creationWays to reproduce issue
Reproduced with AWS SDK (2.50.0) and AWS CLI.
Providing an
OfflineStoreConfig
with bothDisableGlueTableCreation=False
and aDataCatalogConfig
with configured glue database (already created) and a glue table (that does not yet exist) raises the above error. Providing theDataCatalogConfig
withDisableGlueTableCreation=True
does not raise, but the glue table is not created either.Example with AWS CLI:
aws sagemaker create-feature-group --cli-input-json '{"EventTimeFeatureName": "timestamp", "Description": "", "RecordIdentifierFeatureName": "record_id", "FeatureDefinitions": [{"FeatureName": "record_id", "FeatureType": "Integral"}, {"FeatureName": "timestamp", "FeatureType": "String"}], "OfflineStoreConfig": {"S3StorageConfig": {"S3Uri": "s3://my_bucket/my_prefix", "KmsKeyId": "arn:aws:kms:region:account_id:key/key_id"}, "DataCatalogConfig": {"TableName": "my_table", "Catalog": "account_id", "Database": "my_db"}, "DisableGlueTableCreation": false}, "FeatureGroupName": "my-feature-group"}'
Expected output
A clearer documentation about how to configure the offline store data catalog (e.g. with an example in a notebook), and possibly the possibility to configure the data catalog while benefiting from the glue table creation
NB: A similar issue has been opened on the aws-cli repository
The text was updated successfully, but these errors were encountered: