You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Provide the remote resource access to your training data. For automated machine learning experiments running on remote compute, the data needs to be fetched using a `get_data()` function.
66
+
## Access data using TabularDataset function
70
67
71
-
To provide access, you must:
72
-
+ Create a get_data.py file containing a `get_data()` function
73
-
+ Place that file in a directory accessible as an absolute path
68
+
Defined X and y as `TabularDataset`s, which are passed to Automated ML in the AutoMLConfig. `from_delimited_files` by default sets the `infer_column_types` to true, which will infer the columns type automatically.
74
69
75
-
You can encapsulate code to read data from a blob storage or local disk in the get_data.py file. In the following code sample, the data comes from the sklearn package.
70
+
If you do wish to manually set the column types, you can set the `set_column_types` argument to manually set the type of each columns. In the following code sample, the data comes from the sklearn package.
See this [sample notebook](https://github.com/Azure/MachineLearningNotebooks/blob/master/how-to-use-azureml/automated-machine-learning/remote-amlcompute/auto-ml-remote-amlcompute.ipynb) for an additional example of this design pattern.
117
111
118
112
## Configure experiment
119
-
120
113
Specify the settings for`AutoMLConfig`. (See a [full list of parameters](how-to-configure-auto-train.md#configure-experiment) and their possible values.)
0 commit comments