You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: tutorials/Fine_Tuning_Vision/01-intro-fine-tuning.md
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,4 @@
1
-
## Fine-Tuning Vision Models Using Ultralytics and Tapis
1
+
## Section 8: Fine-Tuning Vision Models Using Ultralytics and Tapis
2
2
3
3
This Tapis application allows users to fine-tune Vision models such as YOLO using Ultralytics running in a Singularity container on Vista. It is designed to run on High-Performance Computing (HPC) systems via Tapis, leveraging GPU acceleration for training tasks.
4
4
@@ -93,12 +93,12 @@ These parameters define the hardware footprint requested from the Slurm schedule
93
93
94
94
Follow these steps to submit the job using the Tapis UI:
95
95
96
-
### Step 1: Initiate Submission
96
+
### Step 8.1: Initiate Submission
97
97
Navigate to the **Apps** list and select the `ultralytics-fine-tune` app. Click the button to initiate a JSON-based submission.
98
98
99
99

100
100
101
-
### Step 2: Edit the JSON Payload
101
+
### Step 8.2: Edit the JSON Payload
102
102
Paste the job JSON provided below into the editor and click `Submit`.
@@ -148,21 +148,21 @@ These are specific flags passed to the Slurm scheduler on the Vista system:
148
148
149
149

150
150
151
-
### Step 3: Monitor Job Progress
151
+
### Step 8.3: Monitor Job Progress
152
152
After clicking **Submit**, navigate to the **Jobs** tab. You can monitor the status as it moves from `PENDING` to `RUNNING` and finally `FINISHED`.
153
153
154
154

155
155
Since we are running 100 Epochs to fine-tune, this will take around 10-12 minutes to finish.
156
156
157
157
158
-
### Step 4: Job Output and Results location
158
+
### Step 8.4: Job Output and Results location
159
159
Once the job finishes, you should see output similar to the image below.
160
160

161
161
162
162
You should see a train directory with weights. Inside the `weights` directory, you can access the `best.pt model`. This file will be accessible in Jupyter Notebook.
163
163
164
164
165
-
### Step 5: Finding best.pt file from Jupyter
165
+
### Step 8.5: Finding best.pt file from Jupyter
166
166
167
167
Inside your work directory, you should see a `vista` folder. In there you can find the directory named with your `jobUUID`. In the above image the job uuid is highlighted. In the job directory you will find the outputs of the job archived with the same train directory containing the best.pt file.
Once logged in, you can inspect your JSON Web Token (JWT). The JWT is the authentication token that Tapis uses to verify your identity across all API calls. It contains encoded claims about your user session — including your username, tenant, and token expiration. You can view your token in TapisUI to understand how Tapis manages identity and access.
31
31
@@ -37,7 +37,7 @@ A Tapis **system** represents a storage or execution resource — such as a VM o
37
37
38
38
In this tutorial, instead of creating systems from scratch, we will use a **public system** that has been pre-registered for you on TACC's Vista cluster.
39
39
40
-
### Navigate the Systems List
40
+
### Step 2.3: Navigate the Systems List
41
41
42
42
When you log in to TapisUI and click on **Systems** from the left-hand menu, you should see one public system available to you. This system has been pre-registered for the tutorial.
43
43
@@ -46,7 +46,7 @@ When you log in to TapisUI and click on **Systems** from the left-hand menu, you
46
46
{: .note}
47
47
> ⚠️ Note: The system is visible but you are **not yet authenticated** to access files or run jobs on it. You must first add TMS credentials.
48
48
49
-
### Add TMS Credentials for the NAIRR Vista Public System
49
+
### Step 2.4: Add TMS Credentials for the NAIRR Vista Public System
50
50
51
51
To access the public system running on Vista, you need to add **TMS (Trust Management System)** credentials. TMS credentials are temporary credentials generated by the TMS service and stored in the Tapis Security Kernel (SK). They allow Tapis services and applications to securely access external resources on your behalf. Instead of storing permanent usernames or passwords, Tapis retrieves the required credentials from TMS at runtime — improving security by keeping sensitive information encrypted and centrally managed.
52
52
@@ -56,7 +56,7 @@ Click on **Authenticate with TMS Keys** on the Vista system.
56
56
57
57
Once your TMS credentials are added, your system is ready to use.
58
58
59
-
### Use "Go to Home" to Get to Files
59
+
### Step 2.5: Use "Go to Home" to Get to Files
60
60
61
61
After authenticating, you can verify your system access by clicking the **View Files** button, which will take you to the file listing on the Vista system.
62
62
@@ -68,13 +68,13 @@ After authenticating, you can verify your system access by clicking the **View F
68
68
69
69
The Tapis Files service provides a unified interface for managing files across any registered Tapis system. Once you have authenticated to a system, you can browse, upload, download, and manage files directly from TapisUI.
70
70
71
-
### Navigate the Files in Vista System
71
+
### Step 2.6: Navigate the Files in Vista System
72
72
73
73
From the left-hand menu, click on the **Files** tab and select the Vista system. You should see a listing of files on the system. You can navigate directories, view file details, and open files directly.
Try uploading a file to the Vista system using the upload functionality in the Files tab. This confirms that your credentials are working and that you have write access to the system.
80
80
@@ -86,11 +86,11 @@ A Tapis **application** represents all the information required to run a Tapis j
86
86
87
87
In this tutorial, instead of creating an application from scratch, we will use the **FlexServ** public application that has already been registered for you.
88
88
89
-
### Navigate to the Tapis Apps List
89
+
### Step 2.8: Navigate to the Tapis Apps List
90
90
91
91
Click on **Apps** from the left-hand menu in TapisUI. You will see a list of available applications. Toggle between "My Apps" and "Public Apps" to see the full list.
92
92
93
-
### Find the FlexServ Public App
93
+
### Step 2.9: Find the FlexServ Public App
94
94
95
95
Look for the application **FlexServ-vista-nairr version 1.4.0** in the public apps list. FlexServ is a TACC-owned inference server for running AI models on HPC systems. It has been pre-registered as a public app for all users to submit jobs.
## Section 6: Launching the Jupyter Notebook Environment
4
2
5
3
Jupyter is an open source project that provides a webapp interface for writing code and documents. Throughout this tutorial, we will be using a Jupyter Notebook environment for making Tapis User Requests.
6
4
7
-
### Download Notebook
8
-
9
-
Dowload the Jupyter Notebook by clicking the link below:
10
-
[Download Notebook](sentiment_analysis.ipynb)
11
-
12
-
### Starting up your Jupyter Notebook Environment
5
+
### Step 6.1: Starting up your Jupyter Notebook Environment
13
6
14
7
For this tutorial, we will use [TACC's Public JupyterHub](https://public.jupyter.tacc.cloud)
15
8
You may login with your TACC accounts.
16
9
17
-
18
-
### Navigating to the $WORK File System
10
+
### Step 6.2: Navigating to the $WORK File System
19
11
20
12
On successful login, ensure that you have access to a folder, `work`, within the Jupyter file system.
Copy file name to clipboardExpand all lines: tutorials/Tapis_FlexServ/01b-running-flexserv.md
+8-7Lines changed: 8 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,6 @@
1
-
## Adding TMS Credentials on the Vista system.
1
+
## Section 4: Executing Large Models on Vista with Tapis and Flexserv
2
+
3
+
### Step 4.1: Adding TMS Credentials on the Vista system.
2
4
3
5
To access the public system running on Vista, first you will need to add TMS credentials on the system. TMS (Trust Management System) credentials on Tapis systems are temporary credentials generated by the TMS System and stored in the Tapis Security Kernel (SK) that allow services or applications to securely access external resources on behalf of a user. Instead of storing permanent usernames or passwords, Tapis retrieves the required credentials from the TMS service at runtime. This approach improves security by keeping sensitive information encrypted and centrally managed while enabling automated job execution on Tapis systems.
4
6
@@ -14,7 +16,7 @@ You can now view files on Vista, by clicking on the `View Files ` button.
14
16

15
17
16
18
17
-
## Running FlexServ Application on Vista
19
+
### Step 4.2: Running FlexServ Application on Vista
18
20
19
21
The following app runs the FlexServ on TACC's Vista System. For the purposes of this tutorial, the application has already been registered with Tapis and is available as a public app for all users to submit jobs.
20
22
@@ -242,7 +244,7 @@ The following app runs the FlexServ on TACC's Vista System. For the purposes of
242
244
You should see the Flex Server application already registered in your Tapis UI: **FlexServ-vista-nairr version 1.4.0**
### FlexServ API Prompt: YOLO Evaluation Script Generator
7
6
8
7
#### Task Summary:
9
8
To test the capabilities of the FlexServ inference server, we can provide a complex prompt to the Responses API. This prompt asks the AI to generate a complete Python evaluation script that performs Animal detection on the images from the LILA BC Small Animal dataset. This is a large camera-trap image dataset used for wildlife monitoring and ecological research. It contains millions of images captured by automated cameras, including small mammals and many blank triggers, along with annotations describing the detected species. For training object detection models such as YOLO, the dataset can be downloaded in YOLO format, where each image has a corresponding .txt label file containing bounding-box coordinates in the form <class_id> <x_center> <y_center> <width> <height>.
10
9
11
10
---
12
11
13
-
#### On FlexServ UI
12
+
### Exploring the FlexServ UI
13
+
14
+
### Step 7.1: Refresh Model Pool
14
15
15
16
- Refresh the Model pool so you can see public and private models available for you to run.
16
17

17
18
19
+
### Step 7.2: Update the Responses API and Parameters
20
+
18
21
- Copy and paste the following prompt into the FlexServ UI in the `Responses API`, `Input(Markdown)` section, shown in the image below.
19
22
20
23
@@ -72,21 +75,24 @@ After the code, briefly explain how the program works in plain English.
72
75
-`Qwen/Qwen2.5-Coder32B-Instruct-61.0 GB - Text Generation`
73
76
- Make sure the `Streams` is checked.
74
77
- Uncheck `Multi-turn conversation`
78
+
79
+
### Step 7.3: Run the Responses API
80
+
75
81
- Click `Run`. Within a few minutes, you should see the code generation start in the blue box in the Responses API. Wait for it to complete. After completion, you should see output similar to the image below.
76
82
77
83

78
84
79
85
80
-
### On Jupyter:
86
+
### Step 7.4: Running Code Detection On Jupyter
81
87
82
-
Go to the notebook Code-Detection on your Jupyter path.
88
+
Go to the Jupyter notebook Code-Detection on your Jupyter path.
0 commit comments