Add fltflt rounding and fmod functions by tbensonatl · Pull Request #1129 · NVIDIA/MatX

tbensonatl · 2026-02-27T22:15:20Z

Add support for the following float-float (fltflt) functions:

Round toward nearest, with ties toward even
Truncate toward zero
Truncate toward negative infinity
fmod (floating point remainder)

Also includes are new unit tests and benchmarks for the newly introduced functions.

Also add fltflt_add_same_sign(), with is more efficient than fltflt_add() for the case where we know both inputs have the same sign. Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

copy-pr-bot · 2026-02-27T22:15:23Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

greptile-apps · 2026-02-27T22:19:06Z

Greptile Summary

This PR adds four new float-float arithmetic functions: round-to-nearest (with ties to even), truncate toward zero, floor (truncate toward negative infinity), and fmod (floating-point remainder). The implementation includes proper edge case handling with zero-division guards returning NaN, consistent use of fabsf for float operations, and comprehensive test coverage.

Key changes:

Added fltflt_round_to_nearest(), fltflt_round_toward_zero(), fltflt_floor(), and fltflt_fmod() functions
Added optimized fltflt_add_same_sign() for same-sign operands
Made fltflt constructors constexpr for compile-time evaluation
Comprehensive unit tests covering positive/negative values, zero divisors, and high-precision cases
Performance benchmarks for all new functions

Previous feedback addressed:
All issues from previous review threads have been resolved - zero-division guards are in place, fabsf is used consistently, and comprehensive tests have been added for fmod.

Confidence Score: 5/5

This PR is safe to merge with no identified issues
All previous feedback has been addressed. The implementation includes proper edge case handling, comprehensive test coverage (12+ test cases for fmod alone), performance benchmarks, and follows consistent coding patterns. The code uses appropriate precision functions (fabsf), guards against division by zero, and includes detailed documentation.
No files require special attention

Important Files Changed

Filename	Overview
include/matx/kernels/fltflt.h	Adds four new rounding/fmod functions with proper zero-division guards and consistent use of `fabsf`. All previous feedback addressed.
test/00_misc/FloatFloatTests.cu	Comprehensive test coverage for all new functions including edge cases (negative values, zero divisor, high-precision cases).
bench/00_misc/fltflt_arithmetic.cu	Adds performance benchmarks for new rounding and fmod functions with appropriate test cases.
bench/scripts/run_fltflt_benchmarks.py	Updates benchmark list to include newly added functions (round, trunc, floor, fmod, cast operations).

_{Last reviewed commit: 7a37881}

greptile-apps

_{4 files reviewed, 4 comments}

_{Edit Code Review Agent Settings | Greptile}

cliffburdick · 2026-02-27T23:01:55Z

/build

tbensonatl · 2026-03-02T13:53:56Z

/build

- Add fltflt_fmod unit tests - Updated fltflt_fmod to return {NaN, NaN} in the case of a zero divisor Signed-off-by: Thomas Benson <tbenson@nvidia.com>

tbensonatl · 2026-03-02T19:15:07Z

/build

tbensonatl added 4 commits February 7, 2026 17:39

Add fltflt round, trunc, and fmod functions

0d15692

Also add fltflt_add_same_sign(), with is more efficient than fltflt_add() for the case where we know both inputs have the same sign. Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Add fltflt_floor() function, unit tests, and floor benchmark

ce50521

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Fix fltflt ctor for gcc 8.5

3ade505

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Documentation updates for fltflt header

0651eb6

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

tbensonatl requested a review from cliffburdick February 27, 2026 22:15

tbensonatl self-assigned this Feb 27, 2026

greptile-apps Bot reviewed Feb 27, 2026

View reviewed changes

cliffburdick reviewed Feb 27, 2026

View reviewed changes

Comment thread bench/00_misc/fltflt_arithmetic.cu Outdated

cliffburdick reviewed Feb 27, 2026

View reviewed changes

Comment thread include/matx/kernels/fltflt.h

cliffburdick approved these changes Feb 27, 2026

View reviewed changes

Make const variable constexpr

86bc768

greptile-apps Bot reviewed Mar 2, 2026

View reviewed changes

Comment thread include/matx/kernels/fltflt.h

Comment thread include/matx/kernels/fltflt.h

Comment thread include/matx/kernels/fltflt.h Outdated

Comment thread test/00_misc/FloatFloatTests.cu

Address greptile feedback

7a37881

- Add fltflt_fmod unit tests - Updated fltflt_fmod to return {NaN, NaN} in the case of a zero divisor Signed-off-by: Thomas Benson <tbenson@nvidia.com>

tbensonatl merged commit dd01c29 into main Mar 2, 2026
1 check passed

tbensonatl deleted the feature/add-fltflt-round-fmod branch March 6, 2026 23:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add fltflt rounding and fmod functions#1129

Add fltflt rounding and fmod functions#1129
tbensonatl merged 6 commits intomainfrom
feature/add-fltflt-round-fmod

tbensonatl commented Feb 27, 2026

Uh oh!

copy-pr-bot Bot commented Feb 27, 2026

Uh oh!

greptile-apps Bot commented Feb 27, 2026 •

edited

Loading

Uh oh!

greptile-apps Bot left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

cliffburdick commented Feb 27, 2026

Uh oh!

tbensonatl commented Mar 2, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tbensonatl commented Mar 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tbensonatl commented Feb 27, 2026

Uh oh!

copy-pr-bot Bot commented Feb 27, 2026

Uh oh!

greptile-apps Bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Uh oh!

greptile-apps Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

cliffburdick commented Feb 27, 2026

Uh oh!

tbensonatl commented Mar 2, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tbensonatl commented Mar 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

greptile-apps Bot commented Feb 27, 2026 •

edited

Loading

greptile-apps Bot left a comment •

edited

Loading