TXT to Parquet decoder
The TXT to Parquet decoder stores the decoded output as Parquet files. For more information the Parquet format itself, see: https://parquet.apache.org/
Note
All major programming languages have Parquet support, see: https://arrow.apache.org/docs/
Tool support
Examples of some specific tools/languages supporting the Parquet format:
Tad (Parquet file viewer)
ClickHouse (Parquet as database)
Grafana (Requires ClickHouse server)
Output
The Parquet output data-schema always uses the following structure:
One timestamp value (
t) using datatype Int64 (MICROS) and snappy compressionOne or more signal values using datatype double / NULL and snappy compression
The row-group-size is set to 1 000 000 (1e6).
The signal names are constructed from the database used for decoding, as in the example below:
t Speed SpeedAccuracy SpeedValid
____________________ _____ _____________ __________
22-Apr-2022 14:14:43 0.01 2.006 1
22-Apr-2022 14:14:44 0.01 2.152 1
22-Apr-2022 14:14:45 0.01 2.290 1
If specific values exceed the MIN/MAX as defined in the database, they are included in the output as NULL.
Warning
Output records are skipped if all values are NULL.
Changelog
# Changelog
All notable changes to this project will be documented in this file.
## [25.12.15]
### Fixed
- Fixes issue where invalid signal values could cause later values in the same column to be shifted into the wrong rows.
## [25.08.20]
### Fixed
- Improved support for scientific notation in DBC files (now also supports e.g. 1.0E+10)
- Issue with J1939 TP repeated first-frames fixed
## [24.12.19]
### Changed
- Parquet writer updated to version 18.0.0
### Fixed
- Parquet writer performance restored (issue introduced in 24.10.17)
- Fix for packed error frames in MUX-TP
## [24.10.17]
### Added
- Support for transport protocols (ISO-TP, J1939-21, NMEA-TP, MUX-TP)
### Changed
- Versioning schema from SemVer to CalVer
- Default verbosity level changed to 2
- Type of out-of-range signal values changed from double *NaN* to *NULL* (change to parquet schema)
- Restriction on max 5 DBC-files per interface/channel removed
### Fixed
- Fix for order of messages with same timestamp
### Known-issues
- Reduced parquet write speed
## [2.3.2]
### Fixed
- Issue with merge of PGN source/destination addresses sometimes resulting in invalid output files
Download
Windows AMD64 / x86-64 (64-bit)
- 25.12.15.zip (MD5:
e0b1b5c23b5a8e89df4c36db7163f0c7) - 25.08.20.zip (MD5:
80addc9e5befce39bef0fec9c054a418) - 24.12.19.zip (MD5:
9bac68a1eb1b0741f3eafbe9005ba6a8) - 24.10.17.zip (MD5:
9358addfa78af557947f0374708fa072) - 2.4.0.zip (MD5:
c0eb390675562de6cfbf2deff7f74060)
Linux AMD64 / x86-64 (64-bit)
- 25.12.15.zip (MD5:
ef73f40a98216780bc3c523cea361827) - 25.08.20.zip (MD5:
57f8921ffad9912aa2412b345165d64c) - 24.12.19.zip (MD5:
e3db1cb14405bd5d3434665a278c577d) - 24.10.17.zip (MD5:
ba201e6dba4fb1a84260ff845f956f51)
Linux ARM64 (64-bit)
- 25.12.15.zip (MD5:
9b553ebce014517f132c3383e5b7a1dc) - 25.08.20.zip (MD5:
972d081ec47e6a29c549114e3157689d) - 24.12.19.zip (MD5:
6c0ac3d328f9bd8f95d9d9da5d41845c) - 24.10.17.zip (MD5:
af16d7a2dfd11bce4a0df832b6a6e92e)