Change executorch binaries #802

Merged
benITo47 merged 5 commits into main from @bo/SwapBinaries on Feb 11, 2026

@benITo47
Contributor

Description

This PR changes the ExecuTorch binaries to include new tokenizer functionality.

Added:

  • WordPiece model and decoder
  • Support for BERT and RoBERTa tokenization
  • Padding and truncation settings from tokenizer.json are now respected
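
For readers unfamiliar with WordPiece, here is a minimal sketch of the usual greedy longest-match-first split and the matching decoder. It is not the code shipped in the binaries: the function names, the `##` continuation prefix, the `[UNK]` token and the toy vocabulary are all assumptions chosen for illustration.

```python
def wordpiece_encode(word, vocab, unk_token="[UNK]", prefix="##"):
    """Split one pre-tokenized word into pieces, longest match first."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Shrink the candidate from the right until it is in the vocabulary.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                # Pieces that continue a word carry the continuation prefix.
                candidate = prefix + candidate
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            # No piece fits at this position: the whole word is unknown.
            return [unk_token]
        pieces.append(match)
        start = end
    return pieces


def wordpiece_decode(pieces, prefix="##"):
    """Join pieces back into text; continuation pieces attach without a space."""
    text = ""
    for piece in pieces:
        if piece.startswith(prefix):
            text += piece[len(prefix):]
        elif text:
            text += " " + piece
        else:
            text = piece
    return text


# Hypothetical toy vocabulary, for illustration only.
toy_vocab = {"un", "##aff", "##able", "play", "##ing", "[UNK]"}
print(wordpiece_encode("unaffable", toy_vocab))
print(wordpiece_decode(wordpiece_encode("playing", toy_vocab)))
```

A real BERT pipeline also normalizes and pre-tokenizes the text and adds special tokens around the sequence; none of that is shown here.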

Introduces a breaking change?

  • [ ] Yes
  • [x] No

Type of change

  • [x] Bug fix (change which fixes an issue)
  • [ ] New feature (change which adds functionality)
  • [ ] Documentation update (improves or adds clarity to existing documentation)
  • [ ] Other (chores, tests, code style improvements etc.)

Tested on

  • [x] iOS
  • [x] Android

Testing instructions

Run the test suites.
Run all apps that use tokenizers and verify they load and produce proper output (LLM, S2T, T2I, Embeddings etc.)
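
When checking the output by hand, it can help to have a reference for what "respected" padding and truncation should look like. The sketch below is one possible reading, assuming the Hugging Face tokenizer.json layout as I understand it (`truncation.max_length`, `truncation.direction`, `padding.strategy` with a `Fixed` length, `padding.pad_id`, `padding.direction`). The helper name and the sample config are hypothetical, and special tokens are ignored.

```python
import json

# Hypothetical excerpt of a tokenizer.json file; the values are made up.
config = json.loads("""
{
  "truncation": {"direction": "Right", "max_length": 8,
                 "strategy": "LongestFirst", "stride": 0},
  "padding": {"strategy": {"Fixed": 8}, "direction": "Right",
              "pad_id": 0, "pad_type_id": 0, "pad_token": "[PAD]"}
}
""")


def apply_truncation_and_padding(ids, config):
    """Return token ids truncated and padded according to the config."""
    trunc = config.get("truncation")
    if trunc is not None and len(ids) > trunc["max_length"]:
        if trunc.get("direction", "Right") == "Right":
            ids = ids[:trunc["max_length"]]   # drop tokens from the end
        else:
            ids = ids[-trunc["max_length"]:]  # drop tokens from the start

    pad = config.get("padding")
    # Only the fixed-length strategy is handled in this sketch.
    if pad is not None and isinstance(pad.get("strategy"), dict):
        missing = pad["strategy"]["Fixed"] - len(ids)
        if missing > 0:
            filler = [pad["pad_id"]] * missing
            if pad.get("direction", "Right") == "Right":
                ids = ids + filler
            else:
                ids = filler + ids
    return ids


print(apply_truncation_and_padding([101, 7592, 102], config))
print(apply_truncation_and_padding(list(range(1, 13)), config))
```

Comparing the length and trailing ids of an app's tokenizer output against a reference like this is a quick way to spot a config that is being ignored.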

Checklist

  • [x] I have performed a self-review of my code

Additional notes

Running the tests can surface some failures. We couldn't determine why they happen. Calling the failing functions in the example apps yields correct results, so it is probably an issue with the test environment. We decided not to hold this PR because of those failing test cases and will investigate them later.

@benITo47 benITo47 changed the title from "@bo/swap binaries" to "Change executorch binaries" Feb 11, 2026
@msluszniak msluszniak added the "bug fix" label (PRs that are fixing bugs) Feb 11, 2026
@benITo47 benITo47 requested a review from mkopcins February 11, 2026 15:01
@msluszniak
Member

I hit an error in the CLIP embedding example:
[image]
It would be good if someone else also tested this specific app.

@msluszniak msluszniak left a comment

I ran tests and tested all demo apps using tokenizers. All worked as expected :))

@benITo47 benITo47 merged commit 18d7e33 into main Feb 11, 2026
4 checks passed
@benITo47 benITo47 deleted the @bo/SwapBinaries branch February 11, 2026 20:55
benITo47 added a commit that referenced this pull request Feb 12, 2026
## Description

This PR changes binaries to include new tokenizer functionalities.

Added: 
- WordPiece model and decoder
- Support for BERT and RoBERTa tokenization
- Padding and truncation settings from tokenizer.json are now respected

### Introduces a breaking change?

- [ ] Yes
- [x] No

### Type of change

- [x] Bug fix (change which fixes an issue)
- [ ] New feature (change which adds functionality)
- [ ] Documentation update (improves or adds clarity to existing
documentation)
- [ ] Other (chores, tests, code style improvements etc.)

### Tested on

- [x] iOS
- [x] Android

### Testing instructions
Run the test suites. 
Run all apps that use tokenizers and verify they load and produce proper
output (LLM, S2T, T2I, Embeddings etc.)

### Checklist

- [x] I have performed a self-review of my code

### Additional notes
Running the tests can surface some failures. We couldn't determine why
they happen. Calling the failing functions in the example apps yields
correct results, so it is probably an issue with the test environment.
We decided not to hold this PR because of the failing test cases and
will investigate them later.