Skip to content

Releases: OP-DSL/OPS

OPS v2.0.1

09 Mar 18:11
e7801de

Choose a tag to compare

Merge pull request #280 from OP-DSL/develop

Update version and improve documentation content

OPS v2.0.0

06 Mar 23:33

Choose a tag to compare

  • New ops_translator (Primary / most important)

    • A new OPS code-generation engine (ops_translator) replaces the legacy translator.
    • Based on Python/Clang/Fparser/Jinja2 — produces improved C/C++/Fortran backends.
    • Enables broader target support and improved codegen quality (SYCL, HIP, CUDA, OpenMP offload, etc.).
    • All apps (including CloverLeaf, CloverLeaf_3D, wave_test, halfprecision) now use the new translator.
    • Migration notes: translator runtime requires updated Python dependencies (see ops_translator/requirements.txt), and generated output paths/filenames may differ from the legacy translator — review build scripts when upgrading.
  • New back-end & language support:

    • SYCL targets and improved SYCL integration (CPU/GPU).
    • HIP support and HIP/CUDA codegen fixes.
    • Added/expanded OpenMP offload MPI backend.
    • Fortran offload support and improved Fortran templates.
    • OpenCL and OpenACC: no longer supported in this release. Legacy codegen
      and example targets referencing OpenCL/OpenACC remain in the tree for
      historical/reference purposes but are unmaintained. Users should migrate
      to SYCL, HIP, or OpenMP offload paths where possible.
  • Precision & computation features:

    • Half-precision (float16) and mixed-precision support on GPUs and in code generation.
    • Multiblock mixed-precision halo-exchange fixes and enhancements.
  • I/O, tooling & utilities:

    • HDF5 improvements including SoA output mode and robustness fixes for MPI HDF5 writes.
    • Energy and power measurement hooks (RAPL/powercap and GPU power capture).
  • Bug fixes:

    • Fixed SYCL device selection (OPS_SYCL_DEVICE=gpu) crashing on empty device list.
    • Fixed Fortran I/O deadlock when using print/WRITE inside OpenMP parallel regions with Intel Fortran runtime.
  • Build, CI and testing:

    • Numerous CMake and Makefile improvements (F90/CMake additions, device selection, build flags).
    • Added EXTRA_CLEAN_FILES support in Makefile.c_app for app-specific clean targets.
    • Added clean_all_apps.sh utility script.
    • CI/test additions and config changes (expanded tests, GPU_NUMBER config update).
    • Added test.sh for multiple C and Fortran apps.
  • Documentation and developer experience:

    • Major documentation refreshes across the docs, developer guide updates and a new doc/quickstart.md.
    • Added example/test scripts for apps and updated tutorial timings.
    • Updated AUTHORS.

Version 1.00

08 Dec 11:34
e3ba07d

Choose a tag to compare

Merge pull request #160 from OP-DSL/develop

Prepare for the release V1.0.0