profile
viewpoint

czarlos/OpenBitCL 0

BitCoin App

issue commentpytorch/xla

Optimizing the implementation of Longformer

Actually when I looked deeper, I think the einsums were fine, it was only the in-place updates of views that seemed to be very expensive

ibeltagy

comment created time in a month

issue commentpytorch/xla

Optimizing the implementation of Longformer

Looking at the profile the most expensive parts are the atten::unselect operations followed by the reshapes and transposes internal to atten::einsum.

ibeltagy

comment created time in a month

more