Name and Version
version: 9405 (3764c5a53)
built with MSVC 19.44.35226.0 for Windows AMD64
Operating systems
Windows
GGML backends
CUDA
Hardware
RTX 3070 8GB + 32GB Ram
Models
Qwen 3.6 35B A3B IQ4NL
Problem description & steps to reproduce
when i want to use dflash model there is an error:
llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'dflash'
how can i fix it?
First Bad Commit
No response
Relevant log output
print_info: file format = GGUF V3 (latest)
print_info: file type = Q8_0
print_info: file size = 480.40 MiB (8.50 BPW)
llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'dflash'
llama_model_load_from_file_impl: failed to load model
srv load_model: failed to load draft model, 'M:\Qwen\Qwen3.6 35B A3B IQ4NL\Qwen3.6-35B-A3B-DFlash-Q8_0.gguf'
srv operator (): operator (): cleaning up before exit...
main: exiting due to model loading error
Name and Version
version: 9405 (3764c5a53)
built with MSVC 19.44.35226.0 for Windows AMD64
Operating systems
Windows
GGML backends
CUDA
Hardware
RTX 3070 8GB + 32GB Ram
Models
Qwen 3.6 35B A3B IQ4NL
Problem description & steps to reproduce
when i want to use dflash model there is an error:
llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'dflash'
how can i fix it?
First Bad Commit
No response
Relevant log output
print_info: file format = GGUF V3 (latest)
print_info: file type = Q8_0
print_info: file size = 480.40 MiB (8.50 BPW)
llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'dflash'
llama_model_load_from_file_impl: failed to load model
srv load_model: failed to load draft model, 'M:\Qwen\Qwen3.6 35B A3B IQ4NL\Qwen3.6-35B-A3B-DFlash-Q8_0.gguf'
srv operator (): operator (): cleaning up before exit...
main: exiting due to model loading error