Apache Avro Adds ZStandard Support
Written by Kay Ewbank   
Friday, 31 May 2019

Apache Avro 1.9 is available, with an updated JSON reader, smaller size, and support for ZStandard compression.

avro

Avro is a data serialization framework for high volume, high performance, high throughput, data processing systems. It uses JSON for defining data types and protocols, and serializes data in a compact binary format. Avro was developed by the same team that developed Hadoop, and it is used by data processing systems including Hadoop, Spark and Kafka.

Avro can be used as a toolkit via remote procedure calls (RPCs). The reference implementation of Avro is developed and released as a Java library. Avro has language binding support for C, C++, C#, Go, Haskell, Java, Perl, PHP, Python, Ruby, Scala, and TypeScript. It uses a fast and compact binary data format, has a language independent schema definition format. and uses a container file to store and persist data.

The improvements to the new release are designed to make it leaner and faster. The changes begin with the fact that the new version is built with Java 8 by default. This means the Joda time library is no longer needed for handling logical date and time values, as Java 8 handles dates and times natively.

The next improvement changes the way JSON reading is done. In previous versions this was carried out using Java's Jackson library. In this release the old Codehaus Jackson has been replaced by FasterXML's Jackson 2.9. In addition to being faster, this has security improvements. The developers have also removed Jackson classes from public API. 

Support for Facebook's ZStandard compression has also been added. ZStandard is a real-time compression algorithm, providing high compression ratios.

Elsewhere, in the drive to make Acro leaner, multiple dependencies were removed, including guava, paranamer, commons-codec, and commons-logging. Apache Avro is compiled and tested with Java 11.

avro

More Information

Avro Website

Related Articles

KSQL Adds Avro Support

Hadoop 3 Adds HDFS Erasure Coding

Serialization Will Go From Java - Sometime  

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


Apache Fury Adds Optimized Serializers For Scala
31/10/2024

Apache Fury has been updated to add GraalVM native images and with optimized serializers for Scala collection. The update also reduces Scala collection serialization cost via the use of  encoding [ ... ]



Edera Releases Open Source Container Benchmark And Scanner
07/11/2024

Edera has released Am I Isolated, an open source container security benchmark that probes users runtime environments and tests for container isolation.


More News

espbook

 

Comments




or email your comment to: comments@i-programmer.info

Last Updated ( Friday, 31 May 2019 )