
MINIFICPP-2719 - Add multimodal capability to llama.cpp processor #2107

Open
adamdebreceni wants to merge 9 commits into apache:main from adamdebreceni:multimodal_llama

Conversation

@adamdebreceni
Contributor

Thank you for submitting a contribution to Apache NiFi - MiNiFi C++.

In order to streamline the review of the contribution we ask you
to ensure the following steps have been taken:

For all changes:

  • Is there a JIRA ticket associated with this PR? Is it referenced
    in the commit message?

  • Does your PR title start with MINIFICPP-XXXX where XXXX is the JIRA number you are trying to resolve? Pay particular attention to the hyphen "-" character.

  • Has your PR been rebased against the latest commit within the target branch (typically main)?

  • Is your initial contribution a single, squashed commit?

For code changes:

  • If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under ASF 2.0?
  • If applicable, have you updated the LICENSE file?
  • If applicable, have you updated the NOTICE file?

For documentation related changes:

  • Have you ensured that format looks appropriate for the output in which it is rendered?

Note:

Please ensure that once the PR is submitted, you check GitHub Actions CI results for build issues and submit an update to your PR as soon as possible.

@adamdebreceni adamdebreceni marked this pull request as ready for review May 4, 2026 12:18
@lordgamez lordgamez self-requested a review May 5, 2026 11:32
@martinzink martinzink requested review from Copilot and martinzink May 5, 2026 12:27

Copilot AI left a comment


Pull request overview

This PR updates the MiNiFi C++ llama.cpp extension to support multimodal (mtmd) inference, including wiring FlowFile content as “files” into the llama.cpp mtmd pipeline and optionally writing model output to a FlowFile attribute instead of overwriting content.

Changes:

  • Bump the vendored llama.cpp to b8944 and apply a new patch to build mtmd support and fix missing includes.
  • Extend RunLlamaCppInference with multimodal model configuration plus an optional "output to attribute" behavior.
  • Update the LlamaContext interface and DefaultLlamaContext implementation to accept file buffers and perform mtmd tokenization/eval (see the interface sketch below).
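
For orientation, the extended interface presumably looks something like the sketch below. This is an illustrative reconstruction from the change summary above, not the PR's actual declaration; the parameter names and types (files, token_handler) are assumptions.

    // Illustrative sketch of the extended LlamaContext interface; the real
    // declaration in extensions/llamacpp/processors/LlamaContext.h may differ.
    #include <cstddef>
    #include <functional>
    #include <optional>
    #include <string>
    #include <string_view>
    #include <vector>

    class LlamaContext {
     public:
      virtual ~LlamaContext() = default;
      // "files" carries raw FlowFile bytes (e.g. image or audio buffers) that
      // the mtmd pipeline can tokenize alongside the text prompt.
      virtual std::optional<std::string> generate(
          const std::string& prompt,
          const std::vector<std::vector<std::byte>>& files,
          const std::function<void(std::string_view token)>& token_handler) = 0;
    };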

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 8 comments.

Summary per file:

  • thirdparty/llamacpp/mtmd-fix.patch: Adds the mtmd subdirectory to the llama.cpp build, fixes an include, and removes some mtmd tool executables.
  • thirdparty/llamacpp/lu8_macro_fix.patch: Removes an older llama.cpp patch that is no longer applied after the version bump.
  • thirdparty/llamacpp/cpp-23-fixes.patch: Removes an older llama.cpp patch that is no longer applied after the version bump.
  • cmake/LlamaCpp.cmake: Bumps the llama.cpp tag, enables LLAMA_BUILD_COMMON, applies the mtmd patch, and extends include dirs for common/tools/vendor headers.
  • extensions/llamacpp/CMakeLists.txt: Links the extension against mtmd and llama-common in addition to llama.
  • extensions/llamacpp/processors/LlamaContext.h: Extends generate() to accept a list of binary "files" (e.g., images or audio).
  • extensions/llamacpp/processors/DefaultLlamaContext.h: Adds mtmd/chat-template state and updates the constructor and generate() signature for multimodal support.
  • extensions/llamacpp/processors/DefaultLlamaContext.cpp: Implements mtmd initialization, multimodal tokenization/eval, and the updated decode loop.
  • extensions/llamacpp/processors/RunLlamaCppInference.h: Adds MultiModal Model Path and Output Attribute Name properties and stores them in member state.
  • extensions/llamacpp/processors/RunLlamaCppInference.cpp: Passes FlowFile bytes as multimodal files, inserts the mtmd marker, and optionally writes output to an attribute.
  • extensions/llamacpp/tests/RunLlamaCppInferenceTests.cpp: Updates the mock context to match the new generate() signature.


Comment on lines +85 to +87

    if (multimodal_model_path_) {
      input_data_and_prompt.append(mtmd_default_marker());
      files.push_back(std::move(read_result));
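
For reviewers unfamiliar with the mtmd flow: mtmd_default_marker() returns a placeholder string that is later substituted with the media embedding when the prompt is tokenized. Below is a rough sketch of that downstream step, assuming the mtmd API shape of recent llama.cpp; the helper names and the hypothetical file_bytes buffer should be checked against the vendored headers at this tag.

    // Rough sketch, not the PR's code: tokenizing a marker-bearing prompt
    // together with one media buffer. file_bytes is a hypothetical name for
    // the raw FlowFile content moved into "files" above.
    mtmd_input_text text{};
    text.text = input_data_and_prompt.c_str();  // contains mtmd_default_marker()
    text.add_special = true;
    text.parse_special = true;
    mtmd_bitmap* bmp = mtmd_helper_bitmap_init_from_buf(
        multimodal_ctx_,
        reinterpret_cast<const unsigned char*>(file_bytes.data()), file_bytes.size());
    const mtmd_bitmap* bitmaps[] = {bmp};
    mtmd_input_chunks* chunks = mtmd_input_chunks_init();
    int32_t res = mtmd_tokenize(multimodal_ctx_, chunks, &text, bitmaps, 1);
    // ... eval the chunks, then mtmd_bitmap_free(bmp) and mtmd_input_chunks_free(chunks)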
Comment on lines +144 to +146

    if (output_attribute_) {
      session.setAttribute(flow_file, output_attribute_.value(), text);
    } else {

    unique_llama_batch batch;
    int32_t decode_status = 0;
    if (multimodal_ctx_) {
      gsl_Assert(!files.empty());
      auto status = mtmd_helper_eval_chunks(multimodal_ctx_, llama_ctx_, chunks.get(), 0, 0, 1, true, &n_past);
      if (status != 0) {
        throw Exception(PROCESSOR_EXCEPTION, fmt::format("Failed to eval multimodal chunks, error: {}", status));
      }
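
As a reading aid for the positional arguments in that call (a hedged reading of the mtmd helper API as commonly declared; verify against the vendored mtmd headers at this tag):

    // Hedged annotation of the call above; argument names are assumptions.
    auto status = mtmd_helper_eval_chunks(
        multimodal_ctx_,       // mtmd context holding the projector state
        llama_ctx_,            // llama context to evaluate into
        chunks.get(),          // interleaved text/media chunks from mtmd_tokenize
        /*n_past=*/0,          // starting position in the KV cache
        /*seq_id=*/0,          // target sequence id
        /*n_batch=*/1,         // batch size for the text portions
        /*logits_last=*/true,  // request logits for the final token
        &n_past);              // out: position after the evaluated chunks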
Comment on lines +242 to +246

    batch.reset(llama_batch_init(1, 0, 1));
    batch->n_tokens = 1;
    batch->token[0] = new_token_id;
    batch->pos[0] = n_past;
    batch->n_seq_id[0] = 1;
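
The single-token batch above is the usual shape for feeding each sampled token back through the model after the multimodal prefill. A minimal sketch of the surrounding loop follows, assuming sampler_ and vocab_ members and hypothetical n_generated/max_tokens counters; this is illustrative, not the PR's exact decode loop.

    // Illustrative decode loop, not the PR's exact code.
    while (n_generated < max_tokens) {
      decode_status = llama_decode(llama_ctx_, *batch);
      if (decode_status != 0) {
        throw Exception(PROCESSOR_EXCEPTION, "llama_decode failed");
      }
      ++n_past;
      new_token_id = llama_sampler_sample(sampler_, llama_ctx_, -1);
      if (llama_vocab_is_eog(vocab_, new_token_id)) {
        break;  // model emitted an end-of-generation token
      }
      // ...detokenize new_token_id and append the piece to the output text...
      batch->token[0] = new_token_id;
      batch->pos[0] = n_past;
      ++n_generated;
    }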
Comment on lines 84 to +88

    input_data_and_prompt.append("Input data (or flow file content):\n");
    input_data_and_prompt.append({reinterpret_cast<const char*>(read_result.data()), read_result.size()});
    if (multimodal_model_path_) {
      input_data_and_prompt.append(mtmd_default_marker());
      files.push_back(std::move(read_result));
    } else {
Comment on lines +85 to +88

    if (multimodal_model_path_) {
      input_data_and_prompt.append(mtmd_default_marker());
      files.push_back(std::move(read_result));
    } else {
Comment on lines +144 to +148

    if (output_attribute_) {
      session.setAttribute(flow_file, output_attribute_.value(), text);
    } else {
      session.writeBuffer(flow_file, text);
    }
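
Per the file summary above, RunLlamaCppInference.h gains the two properties driving this branch. A hypothetical sketch of their declarations using MiNiFi C++'s PropertyDefinitionBuilder is below; the actual names, descriptions, and builder options in the PR may differ.

    // Hypothetical property declarations; check RunLlamaCppInference.h for
    // the real definitions.
    EXTENSIONAPI static constexpr auto MultiModalModelPath = core::PropertyDefinitionBuilder<>::createProperty("MultiModal Model Path")
        .withDescription("Optional path to a multimodal projector model; when set, FlowFile content is passed as a media file")
        .build();
    EXTENSIONAPI static constexpr auto OutputAttributeName = core::PropertyDefinitionBuilder<>::createProperty("Output Attribute Name")
        .withDescription("When set, the inference result is written to this FlowFile attribute instead of replacing the content")
        .build();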