When taking a slice of a tensor and using scatter_add_ on MPS operations do nothing when taking slices at non-zero offsets. [pip3] numpy==1.26.4 [pip3] optree==0.13.1 [pip3] torch==2.8.0 ...