Towards accurate bounding boxes - Image grounding #34

Open
opened 2026-04-20 22:34:06 +01:00 by apb · 1 comment
Owner
- https://huggingface.co/Qwen/Qwen3-VL-8B-Instruct - hf.co/mradermacher/Youtu-VL-4B-Instruct-i1-GGUF:IQ4_XS -
apb changed title from Image grounding to Towards accurate bounding boxes - Image grounding 2026-05-16 22:20:31 +01:00
Author
Owner
0.00.089.049 I srv    load_model: loading model '/home/andy/.lmstudio/models/lmstudio-community/Qwen3.6-27B-GGUF/Qwen3.6-27B-Q8_0.gguf'
2026-05-25 16:06:20 [DEBUG]
 0.02.779.124 W llama_context: n_ctx_seq (8192) < n_ctx_train (262144) -- the full capacity of the model will not be utilized
2026-05-25 16:06:20 [DEBUG]
 0.02.886.556 W sched_reserve: layer 0 is assigned to device CPU but the fused Gated Delta Net tensor is assigned to device Vulkan0 (usually due to missing support)
0.02.886.560 W sched_reserve: fused Gated Delta Net (chunked) not supported, set to disabled
2026-05-25 16:06:20 [DEBUG]
 0.02.914.627 W common_init_from_params: KV cache shifting is not supported for this context, disabling KV cache shifting
0.02.914.639 I common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable)
2026-05-25 16:06:21 [DEBUG]
 0.03.485.598 W load_hparams: Qwen-VL models require at minimum 1024 image tokens to function correctly on grounding tasks
0.03.485.601 W load_hparams: if you encounter problems with accuracy, try adding --image-min-tokens 1024
0.03.485.601 W load_hparams: more info: https://github.com/ggml-org/llama.cpp/issues/16842
2026-05-25 16:06:21 [DEBUG]
 0.03.819.804 I srv    load_model: loaded multimodal model, '/home/andy/.lmstudio/models/lmstudio-community/Qwen3.6-27B-GGUF/mmproj-Qwen3.6-27B-BF16.gguf'
0.03.819.816 I srv    load_model: initializing slots, n_slots = 4
``` 0.00.089.049 I srv load_model: loading model '/home/andy/.lmstudio/models/lmstudio-community/Qwen3.6-27B-GGUF/Qwen3.6-27B-Q8_0.gguf' 2026-05-25 16:06:20 [DEBUG] 0.02.779.124 W llama_context: n_ctx_seq (8192) < n_ctx_train (262144) -- the full capacity of the model will not be utilized 2026-05-25 16:06:20 [DEBUG] 0.02.886.556 W sched_reserve: layer 0 is assigned to device CPU but the fused Gated Delta Net tensor is assigned to device Vulkan0 (usually due to missing support) 0.02.886.560 W sched_reserve: fused Gated Delta Net (chunked) not supported, set to disabled 2026-05-25 16:06:20 [DEBUG] 0.02.914.627 W common_init_from_params: KV cache shifting is not supported for this context, disabling KV cache shifting 0.02.914.639 I common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable) 2026-05-25 16:06:21 [DEBUG] 0.03.485.598 W load_hparams: Qwen-VL models require at minimum 1024 image tokens to function correctly on grounding tasks 0.03.485.601 W load_hparams: if you encounter problems with accuracy, try adding --image-min-tokens 1024 0.03.485.601 W load_hparams: more info: https://github.com/ggml-org/llama.cpp/issues/16842 2026-05-25 16:06:21 [DEBUG] 0.03.819.804 I srv load_model: loaded multimodal model, '/home/andy/.lmstudio/models/lmstudio-community/Qwen3.6-27B-GGUF/mmproj-Qwen3.6-27B-BF16.gguf' 0.03.819.816 I srv load_model: initializing slots, n_slots = 4 ```
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
apb/infrastructure#34
No description provided.