Optimize Eq and Ord for LazyByteString using pointer equality by sjakobi · Pull Request #404 · haskell/bytestring

sjakobi · 2021-06-30T12:30:55Z

This is inspired by a discussion in Haskell-Cafe:
https://mail.haskell.org/pipermail/haskell-cafe/2021-June/134073.html

sjakobi · 2021-07-01T21:19:49Z

TODO for myself:

Benchmarking

sjakobi · 2021-07-13T04:45:17Z

Data/ByteString/Lazy/Internal.hs

+  | otherwise = case compare al bl of
+      LT -> a == S.BS bp al && eq as (Chunk (S.BS (S.plusForeignPtr bp al) (bl - al)) bs)
+      EQ -> a == b && eq as bs
+      GT -> S.BS ap bl == b && eq (Chunk (S.BS (S.plusForeignPtr ap bl) (al - bl)) as) bs


In the LT and GT cases, we'd recursively run the pointer equality checks on freshly allocated Chunks, which is totally wasteful: a fresh Chunk can never be pointer-equal to an existing one.

It might be better to call an "inner" function here which doesn't perform the pointer equality check.

cmp has the same issue.
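
The split suggested above could look roughly like the following. This is only a sketch under assumptions, not the code from this PR: `ptrEq`, `eqOuter` and `eqInner` are hypothetical names, and the slicing uses the public `S.take`/`S.drop` instead of `S.BS` and `S.plusForeignPtr`. The pointer check runs only when both tails are original list cells; after a chunk has been split, the recursion stays in the inner function.

```haskell
{-# LANGUAGE MagicHash #-}

import qualified Data.ByteString as S
import Data.ByteString.Lazy.Internal (ByteString (..))
import GHC.Exts (isTrue#, reallyUnsafePtrEquality#)

-- | Hypothetical helper: True only if both arguments are the same heap
-- object. It may return False for identical objects, never the reverse.
ptrEq :: a -> a -> Bool
ptrEq x y = isTrue# (reallyUnsafePtrEquality# x y)

-- | Outer comparison: both arguments are original tails, so a pointer
-- check can succeed and lets us skip the byte comparison.
eqOuter :: ByteString -> ByteString -> Bool
eqOuter as bs
  | ptrEq as bs = True
  | otherwise   = eqInner as bs

-- | Inner comparison: no pointer check, because at least one argument
-- may be a freshly allocated Chunk.
eqInner :: ByteString -> ByteString -> Bool
eqInner Empty Empty = True
eqInner Empty _     = False
eqInner _     Empty = False
eqInner (Chunk a as) (Chunk b bs) =
  case compare al bl of
    -- b is longer: compare its prefix, keep its remainder in a fresh Chunk
    LT -> a == S.take al b && eqInner as (Chunk (S.drop al b) bs)
    -- same length: both tails are original again, so retry the pointer check
    EQ -> a == b && eqOuter as bs
    -- a is longer: symmetric to the LT case
    GT -> S.take bl a == b && eqInner (Chunk (S.drop bl a) as) bs
  where
    al = S.length a
    bl = S.length b
```

A `cmp` for the Ord instance could presumably be split the same way, returning `EQ` from the outer function when the pointers match.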

clyring · 2021-09-22T00:38:17Z

I suspect that sharing-based equality-checks on lazy data structures are inherently at odds with referential transparency due to infinite and partially-defined values. So, it's my opinion that these comparison functions should only be offered from an Unsafe module, if at all.

I will also point out that since the Eq instance for strict ByteString already performs a sharing-based equality check, the existing Eq instance for lazy ByteString should already be pretty fast in most cases where there is a long shared tail. The same does not appear to be true for the Ord instance.
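
For readers who want to observe the sharing this argument relies on, here is a minimal sketch using only the public `Data.ByteString.Unsafe` API. `sharesBuffer` is a hypothetical helper written for illustration; it is not the check the library performs internally.

```haskell
import qualified Data.ByteString as S
import qualified Data.ByteString.Char8 as S8
import Data.ByteString.Unsafe (unsafeUseAsCStringLen)

-- | Hypothetical helper: True if both strict ByteStrings start at the
-- same address and have the same length, i.e. they are the same slice
-- of the same buffer.
sharesBuffer :: S.ByteString -> S.ByteString -> IO Bool
sharesBuffer a b =
  unsafeUseAsCStringLen a $ \(pa, la) ->
    unsafeUseAsCStringLen b $ \(pb, lb) ->
      pure (pa == pb && la == lb)

-- | Two slices of one buffer share storage; an explicit copy does not.
demo :: IO (Bool, Bool)
demo = do
  let s = S8.pack "hello world"
      t = S.drop 6 s   -- slice, no copying
      u = S.drop 6 s   -- same slice again
      c = S.copy t     -- same bytes in a fresh buffer
  shared <- sharesBuffer t u
  copied <- sharesBuffer t c
  pure (shared, copied)
```

When the chunks of two lazy ByteStrings are slices like `t` and `u`, a per-chunk shortcut of this kind can answer without reading the bytes, although the list of chunks is still walked.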

sjakobi · 2021-09-23T10:29:40Z

Thanks for your comments, @clyring!

> I suspect that sharing-based equality-checks on lazy data structures are inherently at odds with referential transparency due to infinite and partially-defined values. So, it's my opinion that these comparison functions should only be offered from an Unsafe module, if at all.

Could you clarify where exactly you see the problem? My intention was that the changed instances would behave just like the old ones. But maybe this won't work out?!

> I will also point out that since the Eq instance for strict ByteString already performs a sharing-based equality check, the existing Eq instance for lazy ByteString should already be pretty fast in most cases where there is a long shared tail. The same does not appear to be true for the Ord instance.

I'm also not convinced yet that this patch will pay off performance-wise.

clyring · 2021-09-23T11:41:37Z

These comparators give the same result as those in the existing instances, as long as at least one argument is a finite, total ByteString. But for infinite and partial ByteStrings, they will (sometimes) produce a result (True for ==, EQ for compare) where the existing instances would produce bottom. For example, let x = LBS8.repeat 'c' in x == x. I'm not sure off the top of my head if any "reasonable" code can actually be broken by this, but I am wary nevertheless.
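
The difference can be made observable with a partial value instead of an infinite one, so that the experiment terminates. The following is a sketch under assumptions: `eqShortcut` is a hypothetical stand-in for a sharing-based comparison, not the code from this PR, and `reallyUnsafePtrEquality#` is allowed to report False for identical objects, so the shortcut is not guaranteed to fire.

```haskell
{-# LANGUAGE MagicHash #-}

import Control.Exception (ErrorCall, evaluate, try)
import qualified Data.ByteString.Char8 as S8
import Data.ByteString.Lazy.Internal (ByteString (..))
import GHC.Exts (isTrue#, reallyUnsafePtrEquality#)

-- | Hypothetical sharing-based equality: answer True for the same heap
-- object without inspecting it, otherwise defer to the Eq instance.
eqShortcut :: ByteString -> ByteString -> Bool
eqShortcut a b = isTrue# (reallyUnsafePtrEquality# a b) || a == b

-- | A partially defined lazy ByteString: one chunk, then a bottom tail.
partial :: ByteString
partial = Chunk (S8.pack "abc") (error "tail is undefined")

-- | Evaluate a comparison, turning a bottom raised via 'error' into Left.
observe :: Bool -> IO (Either ErrorCall Bool)
observe = try . evaluate
```

With the Eq instance as described in this thread, `observe (partial == partial)` should give a `Left`, because the comparison has to force the undefined tail, whereas `observe (eqShortcut partial partial)` can give `Right True`.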

Optimize Eq and Ord for LazyByteString using pointer equality

5e38b9f

Bodigrim approved these changes Jun 30, 2021

sjakobi mentioned this pull request Nov 1, 2025

Use pointer equality when comparing keys haskell-unordered-containers/unordered-containers#77

Open

sjakobi wants to merge 1 commit into master from sjakobi/lazy-ptr-eq
