
Implement autoload device extension mechanism (from RFC #122468) #127228

Closed

Conversation

bkowalskiINTEL

These changes implement the RFC from issue #122468. Early testing has been done against the habana_frameworks extension, with the entry point exported according to the RFC discussion.
Things to do:

  • Documentation
  • Unit tests (we are open to suggestions)
  • More in-depth testing (in progress)

Looking forward to your review.
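For context, this is roughly how an out-of-tree extension would export such an entry point. The package and function names below are illustrative stand-ins, not the actual habana_frameworks ones:

# setup.py of a hypothetical out-of-tree backend package
from setuptools import setup

setup(
    name="my_torch_backend",  # illustrative name
    entry_points={
        # the group scanned by the autoload mechanism in this PR
        "torch.backends": [
            "my_torch_backend = my_torch_backend:init_device_backend",
        ],
    },
)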


pytorch-bot bot commented May 27, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/127228

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 2f938df with merge base 1110edb:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.


linux-foundation-easycla bot commented May 27, 2024

CLA: Missing ID, Not Signed

bkowalskiINTEL marked this pull request as draft on May 27, 2024 at 14:39.
@@ -2065,6 +2065,28 @@ def _constrain_as_size(symbol, min: Optional[builtins.int] = None, max: Optional
"""
torch.sym_constrain_range_for_size(symbol, min=min, max=max)

def import_device_backends():

Please move this change to the end of the file.

    for backend in entry_points(group='torch.backends'):
        try:
            backend_hook = backend.load()
            if not hasattr(backend_hook, 'init_custom_backend'):

Please change the name of the function to init_device_backend for consistency.

Collaborator

Why don't we use the official load() function of EntryPoint to get the init function to call?

Sure, we can simplify this to initialize at load.
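One illustrative reading of this exchange, as a sketch (not the final PR code); whether the loaded callable needs to be invoked at all is discussed further down in this thread:

from importlib.metadata import entry_points

for backend in entry_points(group='torch.backends'):
    init_fn = backend.load()  # load() resolves the entry-point target, i.e. the init function
    init_fn()                 # initialize the backend right after loading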


def is_device_backend_autoload_enabled() -> bool:
    var = os.getenv("TORCH_DISABLE_DEVICE_BACKEND_AUTOLOAD")
    return not var in (1, 'True', 'true', 'Yes', 'yes')
Collaborator

nit: suggest doing var.upper() to simplify the code.

Author

Since os.getenv() returns None when the variable isn't set, this is how I would write it:

def is_device_backend_autoload_enabled() -> bool:
    var = os.getenv("TORCH_DISABLE_DEVICE_BACKEND_AUTOLOAD")
    return not str(var).upper() in ('1', 'TRUE', 'YES')

What do you think?

Collaborator

def is_device_backend_autoload_enabled() -> bool:
    var = os.getenv("TORCH_DISABLE_DEVICE_BACKEND_AUTOLOAD")
    return var is None or not str(var).upper() in ('1', 'TRUE', 'YES')
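For illustration, here is how the suggested check behaves for a few values of the environment variable (a standalone sketch based on the snippet above, not the PR code itself):

import os

def is_device_backend_autoload_enabled() -> bool:
    var = os.getenv("TORCH_DISABLE_DEVICE_BACKEND_AUTOLOAD")
    return var is None or str(var).upper() not in ('1', 'TRUE', 'YES')

os.environ.pop("TORCH_DISABLE_DEVICE_BACKEND_AUTOLOAD", None)
print(is_device_backend_autoload_enabled())  # True: variable unset, autoload stays enabled

os.environ["TORCH_DISABLE_DEVICE_BACKEND_AUTOLOAD"] = "1"
print(is_device_backend_autoload_enabled())  # False: autoload disabled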

Comment on lines +2074 to +2082
    for backend in entry_points(group='torch.backends'):
        try:
            backend_hook = backend.load()
            if not hasattr(backend_hook, 'init_custom_backend'):
                print(f"No explicit backend init function for \'{backend_hook.__name__}\' has been found, custom backend can't be loaded.")
                continue
            backend_hook.init_custom_backend()
        except IndexError:
            pass
Contributor

shink commented May 28, 2024

I don't think we need to call the init function, only load it.
Since the init function is implemented in your extension, the extension will be imported automatically once backend.load() is called, and calling it directly might be risky.

Here is a simple example; any suggestions are welcome.

In the torch_npu implementation:

# setup.py
setup(
    entry_points={
        'torch.backends': [
            'torch_npu = torch_npu:_init_device_backend',
        ],
    }
)

# torch_npu/__init__.py
def _init_device_backend():
    pass

In pytorch:

# torch/__init__.py
for plugin in discovered_plugins:
    try:
        # just loads the plugin without calling
        plugin.load()
    except Exception:
        # keep quiet
        pass
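
In the sketch above, discovered_plugins is left undefined; presumably it would come from the same entry-point discovery this PR already uses, along these lines (illustrative only):

from importlib.metadata import entry_points

# hypothetical completion of the sketch: discover every backend registered
# under the 'torch.backends' entry-point group
discovered_plugins = entry_points(group='torch.backends')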

@jczaja

jczaja commented May 29, 2024

@bsochack, @jgong5, @shink Hi, we accidentally deleted the branch used for this PR, so the PR was invalidated. We have recreated it here: #127386. It contains fixes for all of the comments in this review. Please continue reviewing this functionality at the new PR: #127386. We apologize for the inconvenience.
