AdaptiveStepSize currently throws on _GPUStyle() paths (step_rule
dispatch in src/solver.jl). per-problem backtracking does scalar
comparisons that aren't GPU-friendly without rework.
cross-vendor: should work uniformly on Metal / CUDA / AMDGPU once the
broadcast pattern is right.
AdaptiveStepSizecurrently throws on_GPUStyle()paths (step_ruledispatch in
src/solver.jl). per-problem backtracking does scalarcomparisons that aren't GPU-friendly without rework.
Vector{Bool}mask reused across iterationsMARGUERITE_TEST_GROUP=gpucross-vendor: should work uniformly on Metal / CUDA / AMDGPU once the
broadcast pattern is right.