Fix TS_2DIFF float/double page compatibility by hongzhi-gao · Pull Request #796 · apache/tsfile

hongzhi-gao · 2026-04-30T07:15:01Z

Summary

Fix C++ TS_2DIFF float/double page layout to match Java overflow-page encoding.
Fix Java table-read time decoder selection to use actual time chunk encoding (instead of global default), preventing mis-decoding on PLAIN time chunks.
Harden googletest zip handling in CMake: reject/remove empty or invalid zip files and only accept valid ZIP magic before enabling tests.

Encoding Layout

Legacy raw block (old C++ path):
[ TS_2DIFF inner block ]

Java-compatible overflow page (fixed C++ path):
[ outer_magic(varint=2147483646) ]
[ count(varint) ]
[ bitmap_under ]
[ bitmap_over ]
[ inner_max_point(varint=2) ]
[ TS_2DIFF inner block payload ]

Test Input

Float values: 3.123456768E20f, NaN
Expected encoded hex (Java baseline):

FE FF FF FF 07 02 00 03 02 00 00 00 01 00 00 00 00 1E 38 8A AA 61 87 75 56

Verification

C++ encoded output matched the Java baseline byte-for-byte.
Java table query can read both C++-generated and Java-generated files

…idation. Align C++ float/double TS_2DIFF flush/read behavior with Java overflow-page layout, and prevent 0-byte/corrupt googletest archives from being treated as successful downloads during test configuration.

codecov-commenter · 2026-04-30T07:39:29Z

Codecov Report

❌ Patch coverage is 78.53659% with 44 lines in your changes missing coverage. Please review.
✅ Project coverage is 61.58%. Comparing base (86ec4b9) to head (9ecdd9a).
⚠️ Report is 1 commits behind head on develop.

Files with missing lines	Patch %	Lines
cpp/src/encoding/ts2diff_decoder.h	55.55%	36 Missing ⚠️
cpp/src/encoding/ts2diff_encoder.h	92.72%	8 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #796      +/-   ##
===========================================
+ Coverage    61.49%   61.58%   +0.08%     
===========================================
  Files          731      731              
  Lines        45581    45874     +293     
  Branches      6787     6880      +93     
===========================================
+ Hits         28029    28250     +221     
- Misses       16560    16613      +53     
- Partials       992     1011      +19

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Add optional TSFILE_OPTIMIZATION_FLAGS overrides in project/examples CMake while removing hardcoded optimization settings so library builds follow common CMake integration behavior.

Use Java-compatible default maxPoint handling for float/double TS_2DIFF encode/decode while preserving legacy raw-block compatibility, and add a byte-level regression test for the documented float/NaN hex sequence.

Use the actual time chunk encoding instead of global default when building time decoders in table read paths, preventing wrong TS_2DIFF decoding on PLAIN time chunks.

Drop nonessential explanatory comments in TS_2DIFF codec changes and remove unused defaultTimeDecoder from AbstractChunkReader to keep the PR focused for review.

Merge upstream/develop into fix/cpp-ts2diff and resolve CMake conflicts. Use configured time encoding when reading non-time chunks, keep chunk header encoding for aligned/table time chunks, and align test writers with the encoders they use.

Mirror TestFloatJavaDefaultHexCompatibility with the same overflow-page inputs so C++ double encoder output stays byte-for-byte compatible with Java.

jt2594838 · 2026-06-03T09:50:37Z

+        constexpr uint32_t kJavaOverflowMagic =
+            2147483647u;  // Integer.MAX_VALUE
+        constexpr uint32_t kJavaValueOverflowMagic =
+            2147483646u;  // Integer.MAX_VALUE - 1


kJavaOverflowMagic -> FLAG_SCALED_VALUE_OVERFLOW
kJavaValueOverflowMagic -> FLAG_ORIGINAL_VALUE_OVERFLOW

Renamed as suggested

jt2594838 · 2026-06-03T09:52:34Z

+        std::vector<uint8_t> underflow_bitmap(
+            static_cast<size_t>(num_values / 8 + 1), 0);
+        std::vector<uint8_t> value_overflow_bitmap(
+            static_cast<size_t>(num_values / 8 + 1), 0);


May use overflow/underflow consistently.

Apply review feedback: FLAG_SCALED_VALUE_OVERFLOW / FLAG_ORIGINAL_VALUE_OVERFLOW, and consistent underflow/overflow bitmap names in encoder and decoder.

Fix TS_2DIFF float/double page compatibility and harden gtest zip val…

247b4ed

…idation. Align C++ float/double TS_2DIFF flush/read behavior with Java overflow-page layout, and prevent 0-byte/corrupt googletest archives from being treated as successful downloads during test configuration.

hongzhi-gao added 6 commits April 30, 2026 15:44

Make tsfile-cpp optimization flags inherit from caller by default.

b0a4f9e

Add optional TSFILE_OPTIMIZATION_FLAGS overrides in project/examples CMake while removing hardcoded optimization settings so library builds follow common CMake integration behavior.

Align C++ float/double TS_2DIFF default semantics with Java.

13bb80e

Use Java-compatible default maxPoint handling for float/double TS_2DIFF encode/decode while preserving legacy raw-block compatibility, and add a byte-level regression test for the documented float/NaN hex sequence.

Fix table time decoder selection for chunk readers.

fa39650

Use the actual time chunk encoding instead of global default when building time decoders in table read paths, preventing wrong TS_2DIFF decoding on PLAIN time chunks.

Trim redundant comments and remove dead time decoder field.

24f4367

Drop nonessential explanatory comments in TS_2DIFF codec changes and remove unused defaultTimeDecoder from AbstractChunkReader to keep the PR focused for review.

Add Java baseline hex test for double TS_2DIFF encoding.

787b50c

Mirror TestFloatJavaDefaultHexCompatibility with the same overflow-page inputs so C++ double encoder output stays byte-for-byte compatible with Java.

jt2594838 approved these changes Jun 4, 2026

View reviewed changes

Rename TS_2DIFF Java overflow flags and align bitmap naming.

9ecdd9a

Apply review feedback: FLAG_SCALED_VALUE_OVERFLOW / FLAG_ORIGINAL_VALUE_OVERFLOW, and consistent underflow/overflow bitmap names in encoder and decoder.

jt2594838 approved these changes Jun 4, 2026

View reviewed changes

jt2594838 merged commit 2a864c5 into develop Jun 4, 2026
38 checks passed

jt2594838 deleted the fix/cpp-ts2diff branch June 4, 2026 08:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix TS_2DIFF float/double page compatibility#796

Fix TS_2DIFF float/double page compatibility#796
jt2594838 merged 8 commits into
developfrom
fix/cpp-ts2diff

hongzhi-gao commented Apr 30, 2026 •

edited

Loading

Uh oh!

codecov-commenter commented Apr 30, 2026 •

edited

Loading

Uh oh!

jt2594838 Jun 3, 2026

Uh oh!

hongzhi-gao Jun 4, 2026

Uh oh!

jt2594838 Jun 3, 2026

Uh oh!

hongzhi-gao Jun 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hongzhi-gao commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Encoding Layout

Test Input

Verification

Uh oh!

codecov-commenter commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

jt2594838 Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

hongzhi-gao Jun 4, 2026

Choose a reason for hiding this comment

Uh oh!

jt2594838 Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

hongzhi-gao Jun 4, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hongzhi-gao commented Apr 30, 2026 •

edited

Loading

codecov-commenter commented Apr 30, 2026 •

edited

Loading