-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Open
Description
This issue is open to discuss different options for adding GPU build and test to Github workflows.
To enable this feature, SINGA must provide a real or virtual machine with GPU as host machine for running the workflow. Then use the self-hosted runner feature of Github Actions. See also this MLOps video tutorial.
The team need to take some decisions:
- Which machine(s) should we use? (e.g. virtual machine(s) on AWS, dedicated server(s) at NUS, ..)
- Which operating systems we test on? (Only Linux? or also Mac).
- When we run the GPU build and test workflow? (with every pull request? once per day at night? once per week? only on master branch?, ...)
- Should we keep the machines always on? or use them only when the scheduled test is running and shut down them when there is no workflow runs? Assuming we run the GPU build and test only at scheduled time.
- Should we add workflows to run examples and test the Jupyter notebooks? note that some examples may take hours or days to complete the training. But automating the test of examples can be very useful to speed up the development.
What do you think?
Metadata
Metadata
Assignees
Labels
No labels