Eval bug: llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'dflash' #4

@esmail-mkh

Name and Version

version: 9405 (3764c5a53)
built with MSVC 19.44.35226.0 for Windows AMD64

Operating systems

Windows

GGML backends

CUDA

Hardware

RTX 3070 8GB + 32GB RAM

Models

Qwen 3.6 35B A3B IQ4NL

Problem description & steps to reproduce

When I try to use a DFlash model as the draft model, loading fails with this error:

llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'dflash'

How can I fix it?

First Bad Commit

No response

Relevant log output

print_info: file format = GGUF V3 (latest)
print_info: file type = Q8_0
print_info: file size = 480.40 MiB (8.50 BPW)
llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'dflash'
llama_model_load_from_file_impl: failed to load model
srv load_model: failed to load draft model, 'M:\Qwen\Qwen3.6 35B A3B IQ4NL\Qwen3.6-35B-A3B-DFlash-Q8_0.gguf'
srv operator (): operator (): cleaning up before exit...
main: exiting due to model loading error
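
The architecture name in the error is read from the GGUF file's metadata, so the message suggests this build has no architecture registered under the name 'dflash'. As a way to confirm what the draft file declares, here is a minimal sketch that reads the `general.architecture` key straight from the GGUF header. It assumes a little-endian GGUF v2 or v3 file laid out as in the published GGUF specification. The function name `gguf_architecture` and its helpers are hypothetical names for this illustration, not part of llama.cpp.

```python
import struct

# Fixed-size GGUF metadata value types (type id -> struct format).
# Assumption: little-endian file, type ids as listed in the GGUF spec.
_SCALARS = {0: "<B", 1: "<b", 2: "<H", 3: "<h", 4: "<I", 5: "<i",
            6: "<f", 7: "<?", 10: "<Q", 11: "<q", 12: "<d"}
_STRING, _ARRAY = 8, 9


def _read(f, fmt):
    """Read one fixed-size value from the binary stream."""
    size = struct.calcsize(fmt)
    data = f.read(size)
    if len(data) != size:
        raise ValueError("unexpected end of file")
    return struct.unpack(fmt, data)[0]


def _read_string(f):
    """GGUF string: uint64 byte length followed by UTF-8 bytes."""
    length = _read(f, "<Q")
    data = f.read(length)
    if len(data) != length:
        raise ValueError("unexpected end of file")
    return data.decode("utf-8", errors="replace")


def _read_value(f, vtype):
    """Read (and thereby skip past) one metadata value of the given type."""
    if vtype in _SCALARS:
        return _read(f, _SCALARS[vtype])
    if vtype == _STRING:
        return _read_string(f)
    if vtype == _ARRAY:
        elem_type = _read(f, "<I")
        count = _read(f, "<Q")
        return [_read_value(f, elem_type) for _ in range(count)]
    raise ValueError(f"unknown GGUF value type {vtype}")


def gguf_architecture(f):
    """Return the 'general.architecture' string, or None if the key is absent.

    `f` is a binary file object positioned at the start of the file.
    """
    if f.read(4) != b"GGUF":
        raise ValueError("not a GGUF file")
    version = _read(f, "<I")
    if version < 2:
        raise ValueError("GGUF v1 uses a different header layout")
    _read(f, "<Q")               # tensor count (not needed here)
    kv_count = _read(f, "<Q")    # number of metadata key/value pairs
    for _ in range(kv_count):
        key = _read_string(f)
        value = _read_value(f, _read(f, "<I"))
        if key == "general.architecture":
            return value
    return None
```

To use it, open the draft model in binary mode, for example `with open(path_to_gguf, "rb") as f: print(gguf_architecture(f))`, where `path_to_gguf` is the path to the DFlash file. If it prints `dflash`, the file itself is readable and the open question is whether the llama.cpp build in use supports that architecture.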
