Draft fallback review #200

Draft
cavusmustafa wants to merge 88 commits into ravi9:dev_backend_openvino from cavusmustafa:draft_fallback_review

@cavusmustafa
Collaborator

Draft PR for reviewing changes only (original PR: https://github.com/zhaixuejun1993/llama.cpp/tree/xuejun/openvino-fallback-cpu-v2). This PR will be removed after reviewing.

DO NOT MERGE.

zhaixuejun1993 and others added 30 commits May 20, 2026 16:08
* added translate_1to1_match_1_input function and updated gelu and tanh translations

* Remove unused translation function calls

---------

Co-authored-by: Mustafa Cavus <mustafacavus@intel.com>
* OpenVINO backend: refactor VIEW related operation

* Enable VIEW handling in following ops

* OpenVINO backend does not support GGML_OP_NORM & GGML_OP_L2_NORM with VIEW input, due to an accuracy issue in OpenVINO
wine99 and others added 29 commits May 21, 2026 15:40
…vl-cohere2

Enable arch tests for Qwen3VL and Cohere2 in OpenVINO backend
Enable T5 model for architecture testing in OpenVINO backend
Enable jamba and kimi-linear for architecture tests
…-oss

Fix accuracy issue and enable Arctic and Grok for arch tests
* Initial gemma4 npu support

* temp. fix for gemma4 accuracy bug on npu

* Remove hardcoded names for npu-fold handling

* revert static n tokens for cont translation as it is not needed

* removed unused variable
…der cache. Add environment variable GGML_OPENVINO_ENABLE_CACHE (default: YES). When set to NO, the decoder_cache is bypassed and models are rebuilt from the cgraph on every inference call in both dynamic and static compute paths. This is useful for debugging and verifying correctness without caching interference.
…model_env

Add GGML_OPENVINO_ENABLE_CACHE env var for decoder cache control
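
A minimal sketch of how a YES/NO switch like `GGML_OPENVINO_ENABLE_CACHE` could be read, assuming the behavior described in the commit message above (default YES, NO bypasses the decoder cache). The helper names `parse_yes_no` and `decoder_cache_enabled` are hypothetical and not taken from the PR. Case-insensitive matching and falling back to the default on unrecognized values are also assumptions.

```cpp
#include <algorithm>
#include <cctype>
#include <cstdlib>
#include <string>

// Hypothetical helper (not from the PR): interpret a YES/NO style value.
// A null (unset) or unrecognized value falls back to default_value.
inline bool parse_yes_no(const char * value, bool default_value) {
    if (value == nullptr) {
        return default_value;
    }
    std::string v(value);
    // Assumption: matching is case-insensitive, so "no" and "NO" are equivalent.
    std::transform(v.begin(), v.end(), v.begin(),
                   [](unsigned char c) { return static_cast<char>(std::toupper(c)); });
    if (v == "NO") {
        return false;
    }
    if (v == "YES") {
        return true;
    }
    return default_value;
}

// Hypothetical wrapper: the cache stays enabled unless the variable is set to NO,
// matching the "default: YES" wording of the commit message.
inline bool decoder_cache_enabled() {
    return parse_yes_no(std::getenv("GGML_OPENVINO_ENABLE_CACHE"), true);
}
```

Under these assumptions, a compute path would call `decoder_cache_enabled()` before a cache lookup and rebuild the model from the cgraph when it returns false.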
…_log

Disable debug log printing in OpenVINO backend
…g_src to record the src ggml tensor for OpenVINO dynamic shape inference
ravi9 force-pushed the dev_backend_openvino branch from d4aa38a to 5cdd4f0 on June 2, 2026 19:17

5 participants