Conversation

xrq-phys

This patch ensures that when sageattn is run under

with torch.cuda.stream(stream):
    sageattn(q, k, v)

all kernels are enqueued onto the correct CUDA stream.

@walker-ai

Hi, I'm currently working on CUDA graph support for SageAttention, but I've encountered some output errors. I'd like to know whether this PR is related to my problem: could this stream issue cause correctness errors?
