Skip to content
#

ui-localization

Here are 3 public repositories matching this topic...

Language: All
Filter by language

Demo: Herculis-CUA-GUI-Actioner-4B is a Computer Use Agent (CUA) multimodal model designed for GUI understanding, UI localization, and action execution across web, desktop, and mobile environments

  • Updated Dec 14, 2025
  • Python

A Gradio-based demonstration for the prithivMLmods/Gliese-CUA-Tool-Call-8B model, specialized in GUI element localization. Users upload UI screenshots, provide task instructions (e.g., "Click on the search bar"), and receive predicted click coordinates in Click(x, y) format.

  • Updated Dec 15, 2025
  • Python

Improve this page

Add a description, image, and links to the ui-localization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ui-localization topic, visit your repo's landing page and select "manage topics."

Learn more