Skip to content

[Feature Request] Qwen3-VL architecture performance optimization #5928

@Dineshkumar-Anandan-ZS0367

Description

What is your request?

I am working on serving the qwen3-vl-8b-it model on max to compare with vLLM serving performance.

Does max has optimised for this?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Needs TriageIssue needs to be routed/triaged to a particular team stillenhancementNew feature or requestmax

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions