Metal backend: Add operator implementations #15023

Open

manuelcandales wants to merge 9 commits into gh/manuelcandales/142/head from gh/manuelcandales/143/head

Contributor

manuelcandales commented Oct 10, 2025

Adds working bfloat16/float32 implementations of the following AOTI shim ops:

aoti_torch_mps_mm_out
aoti_torch_mps_convolution
aoti_torch_mps__scaled_dot_product_attention_math_for_mps

Adds a stub implementation of aoti_torch_mps_addmm_out
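
For context on what the stubbed op is expected to compute once implemented: `torch.addmm` evaluates `out = beta * input + alpha * (mat1 @ mat2)`. The sketch below is a hypothetical CPU reference, not code from this PR. The function name `addmm_reference`, the row-major `std::vector<float>` layout, and the requirement that `input` already has the full output shape (no broadcasting) are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference (not part of this PR).
// Computes out = beta * input + alpha * (mat1 @ mat2) for row-major float
// matrices: mat1 is n x k, mat2 is k x p, input and out are n x p.
std::vector<float> addmm_reference(
    const std::vector<float>& input,
    const std::vector<float>& mat1,
    const std::vector<float>& mat2,
    std::size_t n,
    std::size_t k,
    std::size_t p,
    float beta = 1.0f,
    float alpha = 1.0f) {
  assert(input.size() == n * p);
  assert(mat1.size() == n * k);
  assert(mat2.size() == k * p);

  std::vector<float> out(n * p);
  for (std::size_t i = 0; i < n; ++i) {
    for (std::size_t j = 0; j < p; ++j) {
      // Dot product of row i of mat1 with column j of mat2.
      float acc = 0.0f;
      for (std::size_t x = 0; x < k; ++x) {
        acc += mat1[i * k + x] * mat2[x * p + j];
      }
      out[i * p + j] = beta * input[i * p + j] + alpha * acc;
    }
  }
  return out;
}
```

With `beta = 0` and `alpha = 1` this reduces to a plain matrix multiply, so the same reference could in principle be used to spot-check an `mm`-style op as well.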


          Update

3bea537

[ghstack-poisoned]

Contributor Author

manuelcandales commented Oct 10, 2025 (edited)

Stack from ghstack (oldest at bottom):

manuelcandales requested review from cccclai and shoumikhin as code owners

October 10, 2025 21:01

pytorch-bot bot commented Oct 10, 2025 (edited)

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15023

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Cancelled Job

As of commit 61ead64 with merge base 6e0c9f6:

CANCELLED JOB - The following job was cancelled. Please retry:

pull (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

This was referenced Oct 10, 2025

[ET][Metal] Update aoti_common with additional AOTI functions needed by Metal backend #15003

Merged

Add Metal backend Python preprocessing, partitioning, and tests #15015

Merged

Add Metal backend type definitions and utilities #15019

Merged

Add Metal backend core ETMetal runtime. #15020

Open

Metal backend: Add AOTI shims for memory management #15021

Open

Metal backend: Implement the AOTI MPS shim #15022

Open

Add Metal backend build system and runtime integration #15024

Open

meta-cla bot added the CLA Signed label

manuelcandales requested review from larryliu0820 and mergennachin and removed request for cccclai and shoumikhin

October 10, 2025 21:03


          Update

de83a9f

[ghstack-poisoned]

mergennachin reviewed

backends/apple/metal/runtime/shims/et_metal_ops.mm (4 outdated, resolved threads)

backends/apple/metal/runtime/shims/et_metal_ops.mm

Comment on lines +899 to +902

+                        size_t query_size = query_tensor->numel() * element_size;
+                        query_buffer = [device newBufferWithBytes:query_data_ptr
+                                                           length:query_size
+                                                          options:MTLResourceStorageModeShared];

Contributor

mergennachin Oct 12, 2025

How and where do you clean this up?
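
The question is about ownership. `newBufferWithBytes:length:options:` copies the source bytes into a newly allocated `MTLBuffer` that the caller owns, so something has to release it: ARC does this when the last strong reference goes away, while manual reference counting needs an explicit `release`. The thread does not show which applies here. As a loose C++ analogy only, the sketch below ties a copied buffer to a scoped owner so the cleanup point is unambiguous. `StagingBuffer` and `make_staging_buffer` are hypothetical names, not Metal or ExecuTorch APIs.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <memory>

// Hypothetical stand-in for a GPU staging buffer; NOT a Metal type.
struct StagingBuffer {
  std::unique_ptr<unsigned char[]> bytes;
  std::size_t length = 0;
};

// Copies `length` bytes from `src` into a freshly allocated buffer, mirroring
// the copy-on-create behaviour of newBufferWithBytes:length:options:.
// The returned unique_ptr is the single owner; the memory is freed when it
// goes out of scope, so no separate cleanup call is needed.
std::unique_ptr<StagingBuffer> make_staging_buffer(
    const void* src,
    std::size_t length) {
  assert(src != nullptr || length == 0);
  auto buffer = std::make_unique<StagingBuffer>();
  buffer->bytes = std::make_unique<unsigned char[]>(length);
  buffer->length = length;
  if (length > 0) {
    std::memcpy(buffer->bytes.get(), src, length);
  }
  return buffer;
}
```

Because the buffer holds its own copy, later writes to the source tensor data are not visible through it, which is also worth keeping in mind when deciding how long such a buffer should live.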

backends/apple/metal/runtime/shims/et_metal_ops.mm (outdated, resolved)

backends/apple/metal/runtime/shims/et_metal_ops.mm (resolved)

backends/apple/metal/runtime/shims/et_metal_ops.h (resolved)

backends/apple/metal/runtime/shims/et_metal_ops.mm

Comment on lines +1182 to +1183

		// For attention weights, zero-fill the GPU buffer (shared memory allows CPU memset)
		std::memset(attn_contents_ptr, 0, attn_size_bytes);

Contributor

mergennachin Oct 12, 2025

Do you need zero-filling here?

backends/apple/metal/runtime/shims/et_metal_ops.mm

+                      // Set output tensor handles
+                      *ret0 = out_tensor_handle;
+                      *ret1 = attn_tensor_handle;

Contributor

mergennachin Oct 12, 2025

Is ret1 actually populated, or just zeroed?
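
For reference, the math variant of scaled dot-product attention in PyTorch returns two tensors: the attention output and the softmax attention weights. A populated second result would therefore hold rows that each sum to 1, whereas a buffer that is only zero-filled would not. The sketch below is a hypothetical single-head CPU reference, not code from this PR. It assumes no mask, no dropout, the default `1/sqrt(d)` scale, row-major `float` storage, and a value dimension equal to the query/key dimension `d`; `AttentionResult` and `sdpa_reference` are names introduced here for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical result pair, loosely corresponding to (ret0, ret1).
struct AttentionResult {
  std::vector<float> out;   // q_len x d
  std::vector<float> attn;  // q_len x kv_len, each row sums to 1
};

// Hypothetical single-head CPU reference (not part of this PR):
//   attn = softmax(q @ k^T / sqrt(d)),  out = attn @ v
AttentionResult sdpa_reference(
    const std::vector<float>& q,
    const std::vector<float>& k,
    const std::vector<float>& v,
    std::size_t q_len,
    std::size_t kv_len,
    std::size_t d) {
  assert(kv_len > 0 && d > 0);
  assert(q.size() == q_len * d);
  assert(k.size() == kv_len * d);
  assert(v.size() == kv_len * d);

  const float scale = 1.0f / std::sqrt(static_cast<float>(d));
  AttentionResult result;
  result.out.assign(q_len * d, 0.0f);
  result.attn.assign(q_len * kv_len, 0.0f);

  for (std::size_t i = 0; i < q_len; ++i) {
    float* row = result.attn.data() + i * kv_len;

    // Scaled scores of query i against every key.
    for (std::size_t j = 0; j < kv_len; ++j) {
      float dot = 0.0f;
      for (std::size_t c = 0; c < d; ++c) {
        dot += q[i * d + c] * k[j * d + c];
      }
      row[j] = dot * scale;
    }

    // Numerically stable softmax over the row.
    const float max_score = *std::max_element(row, row + kv_len);
    float sum = 0.0f;
    for (std::size_t j = 0; j < kv_len; ++j) {
      row[j] = std::exp(row[j] - max_score);
      sum += row[j];
    }

    // Normalize the weights and accumulate the weighted values.
    for (std::size_t j = 0; j < kv_len; ++j) {
      row[j] /= sum;
      for (std::size_t c = 0; c < d; ++c) {
        result.out[i * d + c] += row[j] * v[j * d + c];
      }
    }
  }
  return result;
}
```

Comparing the second returned tensor against `attn` from a reference like this is one way to tell a populated result from a zero-filled one.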

manuelcandales added 2 commits

October 13, 2025 12:46


          Update

2f092af

[ghstack-poisoned]


          Update

e9b3372

[ghstack-poisoned]

manuelcandales added the release notes: none label

manuelcandales added 2 commits

October 13, 2025 18:47


          Update

aec8796

[ghstack-poisoned]


          Update

3229b92

[ghstack-poisoned]

manuelcandales added a commit to manuelcandales/executorch-1 that referenced this pull request


          Metal backend: Add operator implementations

95fb414

Adds bfloat16/float32 working implementations of the following AOTI shim ops:
 - aoti_torch_mps_mm_out
 - aoti_torch_mps_convolution
 - aoti_torch_mps__scaled_dot_product_attention_math_for_mps

 Adds a stub implementation of aoti_torch_mps_addmm_out


ghstack-source-id: 61b8cc4
ghstack-comment-id: 3392300522
Pull-Request: pytorch#15023

manuelcandales added 3 commits

October 14, 2025 20:57


          Update

7f178d3

[ghstack-poisoned]


          Update

780d883

[ghstack-poisoned]


          Update

61ead64

[ghstack-poisoned]

Labels

CLA Signed release notes: none